What happened
According to reporting in The Verge and TechCrunch, Thomas Dietterich, chair of arXiv's computer science section, clarified that submissions containing "incontrovertible evidence that the authors did not check the results of LLM generation" will trigger a one-year ban for the authors. The Verge and TechCrunch say the policy targets clear signs such as hallucinated references, leftover chatbot instructions, or fabricated tables. Both outlets report that after a one-year ban authors must have subsequent arXiv submissions accepted by a reputable peer-reviewed venue before posting on the platform. TechCrunch and The Verge also report that moderators must flag issues and section chairs must confirm the evidence before the penalty is applied, and that authors will have an appeal option.
Technical details
Editorial analysis - technical context: The specific failure modes highlighted in reporting - fabricated citations, unvetted paste-in of LLM output, and visible prompt or meta-comments - are consistent with known LLM hallucination patterns and with documented cases in scholarly literature. Public coverage lists common indicators that moderators can detect manually, including:
- •hallucinated or non-existent references cited as if real,
- •visible chatbot prompts or meta-comments left in the text,
- •placeholder notes in tables or figures such as "fill in with real numbers."
These are symptoms of low-effort or careless integration of large language models into manuscript drafting. The policy as described does not ban use of LLMs for drafting or editing; sources emphasize the penalty targets unverified LLM output.
Context and significance
Multiple outlets place arXiv's move in a broader pattern of platforms adapting governance to AI-written content. Reporting by The Next Web cites a Columbia University audit that found fabricated citations rose steeply after 2023; Science.org coverage frames arXiv's action as an attempt to reduce a growing volume of low-quality, AI-generated submissions. Observers in media coverage note that arXiv functions as a primary distribution channel in fast-moving fields, which magnifies the risk that erroneous preprints will propagate quickly through citations and downstream work.
What to watch
For practitioners: Watch how enforcement plays out in practice - frequency of moderator flags, rate of confirmed cases, and appeal outcomes reported by arXiv. Also monitor whether other preprint servers or journals adopt similar penalties or introduce technical screening (for example, automated checks for citation validity or metadata anomalies). Finally, track follow-up studies quantifying the prevalence of fabricated references and other LLM-related errors in preprints versus peer-reviewed literature.
Key Points
- 1Preprint-platform governance actions aim to reduce low-quality AI-generated papers but can shift verification work onto moderators and downstream reviewers.
- 2Visible LLM failure modes - fake citations, prompt remnants, placeholder data - are simple to detect manually but widespread enough to affect literature reliability.
- 3Audits reporting sharp rises in fabricated citations increase pressure on repositories and journals to combine policy and technical screening.
Scoring Rationale
ArXiv's policy affects how research circulates in fast-moving fields and raises operational questions for practitioners and repositories. The decision is notable for its potential to reduce propagation of AI-generated errors while shifting verification burdens.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems


