Policy & Regulationarxivpreprintsai generated textresearch integrity

ArXiv Bans Papers Containing Unchecked AI Output

|May 19, 2026|By LDS Team

7.4

Relevance Score

ArXiv Bans Papers Containing Unchecked AI Output — Photo: sciencealert.com · rights & takedowns

ArXiv will impose a one-year ban on authors whose submissions contain "incontrovertible evidence" of unchecked large-language-model output, according to statements by Thomas Dietterich, chair of arXiv's computer science section reported by The Verge and TechCrunch. Examples that can trigger the penalty include hallucinated references, chatbot prompt artifacts, and fabricated data tables, sources say. TechCrunch and The Verge report the penalty is followed by a requirement that subsequent arXiv submissions be first accepted by a reputable peer-reviewed venue. Reporting by The Next Web and Science.org cites large-scale audits showing fabricated citations have increased sharply in recent years, framing the change as a response to rising low-quality AI-generated preprints.

What happened

According to reporting in The Verge and TechCrunch, Thomas Dietterich, chair of arXiv's computer science section, clarified that submissions containing "incontrovertible evidence that the authors did not check the results of LLM generation" will trigger a one-year ban for the authors. The Verge and TechCrunch say the policy targets clear signs such as hallucinated references, leftover chatbot instructions, or fabricated tables. Both outlets report that after a one-year ban authors must have subsequent arXiv submissions accepted by a reputable peer-reviewed venue before posting on the platform. TechCrunch and The Verge also report that moderators must flag issues and section chairs must confirm the evidence before the penalty is applied, and that authors will have an appeal option.

Technical details

Editorial analysis - technical context

The specific failure modes highlighted in reporting - fabricated citations, unvetted paste-in of LLM output, and visible prompt or meta-comments - are consistent with known LLM hallucination patterns and with documented cases in scholarly literature. Public coverage lists common indicators that moderators can detect manually, including:

•hallucinated or non-existent references cited as if real,
•visible chatbot prompts or meta-comments left in the text,
•placeholder notes in tables or figures such as "fill in with real numbers."

These are symptoms of low-effort or careless integration of large language models into manuscript drafting. The policy as described does not ban use of LLMs for drafting or editing; sources emphasize the penalty targets unverified LLM output.

Context and significance

Multiple outlets place arXiv's move in a broader pattern of platforms adapting governance to AI-written content. Reporting by The Next Web cites a Columbia University audit that found fabricated citations rose steeply after 2023; Science.org coverage frames arXiv's action as an attempt to reduce a growing volume of low-quality, AI-generated submissions. Observers in media coverage note that arXiv functions as a primary distribution channel in fast-moving fields, which magnifies the risk that erroneous preprints will propagate quickly through citations and downstream work.

What to watch

For practitioners

Watch how enforcement plays out in practice - frequency of moderator flags, rate of confirmed cases, and appeal outcomes reported by arXiv. Also monitor whether other preprint servers or journals adopt similar penalties or introduce technical screening (for example, automated checks for citation validity or metadata anomalies). Finally, track follow-up studies quantifying the prevalence of fabricated references and other LLM-related errors in preprints versus peer-reviewed literature.

Key Points

1Preprint-platform governance actions aim to reduce low-quality AI-generated papers but can shift verification work onto moderators and downstream reviewers.
2Visible LLM failure modes - fake citations, prompt remnants, placeholder data - are simple to detect manually but widespread enough to affect literature reliability.
3Audits reporting sharp rises in fabricated citations increase pressure on repositories and journals to combine policy and technical screening.

Scoring Rationale

ArXiv's policy affects how research circulates in fast-moving fields and raises operational questions for practitioners and repositories. The decision is notable for its potential to reduce propagation of AI-generated errors while shifting verification burdens.

MoreAI Research news

Sources

Primary source and supporting public references used for this report.

12 sources

Primary sourcesciencealert.com'AI Slop' Is Flooding Science Publishing, And One Major Site Is Fighting Back

View 11 more sources

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

What happened

Technical details

Editorial analysis - technical context

•hallucinated or non-existent references cited as if real,
•visible chatbot prompts or meta-comments left in the text,
•placeholder notes in tables or figures such as "fill in with real numbers."

Context and significance

What to watch

For practitioners

Key Points

1Preprint-platform governance actions aim to reduce low-quality AI-generated papers but can shift verification work onto moderators and downstream reviewers.

2Visible LLM failure modes - fake citations, prompt remnants, placeholder data - are simple to detect manually but widespread enough to affect literature reliability.

3Audits reporting sharp rises in fabricated citations increase pressure on repositories and journals to combine policy and technical screening.

ArXiv Bans Papers Containing Unchecked AI Output

What happened

Technical details

Editorial analysis - technical context

Context and significance

What to watch

For practitioners

Key Points

Scoring Rationale

Sources

More AI & Data Science News

White House Launches Gold Eagle Vulnerability Clearinghouse

Anthropic and OpenAI Increase Federal Lobbying Spending in Q2

Claude Code Hooks Automate Checks and Guardrails

UVA and Clemson Researchers Introduce Hospital AI Framework

ArXiv Bans Papers Containing Unchecked AI Output

What happened

Technical details

Editorial analysis - technical context

Context and significance

What to watch

For practitioners

Key Points

Scoring Rationale

Sources

More AI & Data Science News

White House Launches Gold Eagle Vulnerability Clearinghouse

Anthropic and OpenAI Increase Federal Lobbying Spending in Q2

Claude Code Hooks Automate Checks and Guardrails

UVA and Clemson Researchers Introduce Hospital AI Framework