ArXiv Bans Papers Containing Unchecked AI Output

ArXiv will impose a one-year ban on authors whose submissions contain "incontrovertible evidence" of unchecked large-language-model output, according to statements by Thomas Dietterich, chair of arXiv's computer science section reported by The Verge and TechCrunch. Examples that can trigger the penalty include hallucinated references, chatbot prompt artifacts, and fabricated data tables, sources say. TechCrunch and The Verge report the penalty is followed by a requirement that subsequent arXiv submissions be first accepted by a reputable peer-reviewed venue. Reporting by The Next Web and Science.org cites large-scale audits showing fabricated citations have increased sharply in recent years, framing the change as a response to rising low-quality AI-generated preprints.
What happened
According to reporting in The Verge and TechCrunch, Thomas Dietterich, chair of arXiv's computer science section, clarified that submissions containing "incontrovertible evidence that the authors did not check the results of LLM generation" will trigger a one-year ban for the authors. The Verge and TechCrunch say the policy targets clear signs such as hallucinated references, leftover chatbot instructions, or fabricated tables. Both outlets report that after a one-year ban authors must have subsequent arXiv submissions accepted by a reputable peer-reviewed venue before posting on the platform. TechCrunch and The Verge also report that moderators must flag issues and section chairs must confirm the evidence before the penalty is applied, and that authors will have an appeal option.
Technical details
Editorial analysis - technical context: The specific failure modes highlighted in reporting - fabricated citations, unvetted paste-in of LLM output, and visible prompt or meta-comments - are consistent with known LLM hallucination patterns and with documented cases in scholarly literature. Public coverage lists common indicators that moderators can detect manually, including:
- •hallucinated or non-existent references cited as if real,
- •visible chatbot prompts or meta-comments left in the text,
- •placeholder notes in tables or figures such as "fill in with real numbers."
These are symptoms of low-effort or careless integration of large language models into manuscript drafting. The policy as described does not ban use of LLMs for drafting or editing; sources emphasize the penalty targets unverified LLM output.
Context and significance
Multiple outlets place arXiv's move in a broader pattern of platforms adapting governance to AI-written content. Reporting by The Next Web cites a Columbia University audit that found fabricated citations rose steeply after 2023; Science.org coverage frames arXiv's action as an attempt to reduce a growing volume of low-quality, AI-generated submissions. Observers in media coverage note that arXiv functions as a primary distribution channel in fast-moving fields, which magnifies the risk that erroneous preprints will propagate quickly through citations and downstream work.
What to watch
For practitioners: Watch how enforcement plays out in practice - frequency of moderator flags, rate of confirmed cases, and appeal outcomes reported by arXiv. Also monitor whether other preprint servers or journals adopt similar penalties or introduce technical screening (for example, automated checks for citation validity or metadata anomalies). Finally, track follow-up studies quantifying the prevalence of fabricated references and other LLM-related errors in preprints versus peer-reviewed literature.
Scoring Rationale
ArXiv's policy affects how research circulates in fast-moving fields and raises operational questions for practitioners and repositories. The decision is notable for its potential to reduce propagation of AI-generated errors while shifting verification burdens.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

