ArXiv will ban authors for a year if they let AI do all the work
At a glance:
- ArXiv, the preprint repository that underpins research in computer science and math, is imposing a one-year ban on authors who submit papers with incontrovertible evidence they did not verify LLM-generated content.
- Evidence of unchecked AI output includes hallucinated references and visible LLM comments; after the ban, authors must get their work accepted by a reputable peer-reviewed venue before resubmitting to arXiv.
- The policy, announced by CS section chair Thomas Dietterich, is not a blanket prohibition on large language models but an accountability mandate — authors remain fully responsible for inappropriate language, plagiarized content, biased content, errors, incorrect references, or misleading material regardless of how it was generated.
Why arXiv is tightening the rules
ArXiv, pronounced "archive," has become one of the primary channels through which scientific research circulates long before it reaches formal journals. Fields such as computer science and mathematics depend on it for rapid dissemination of findings, and the repository itself has turned into a barometer for trends in scholarly output. That centrality is precisely what makes the flood of low-quality, AI-generated papers so damaging.
In response, arXiv has already rolled out measures such as requiring first-time submitters to obtain an endorsement from an established author. Now the organization is going further. On Thursday, Thomas Dietterich, chair of arXiv's computer science section, posted that submissions containing "incontrovertible evidence" authors did not check LLM-generated results cannot be trusted. His language is unambiguous: if the evidence is there, the paper is tainted.
The kinds of evidence Dietterich flagged include "hallucinated references" — citations that simply do not exist — and comments to or from the large language model visible in the submission. These are the telltale signs that a researcher copy-pasted output without scrutiny.
What the penalty looks like
The consequences are steep. Authors found to have submitted unchecked LLM output face a one-year ban from arXiv. When they return, every subsequent arXiv submission must first be accepted by a reputable peer-reviewed venue before it can be posted to the repository.
Dietterich stressed in a conversation with 404 Media that this is a "one-strike" rule. Moderators must flag the issue and section chairs must confirm the evidence before the penalty is imposed, and authors retain the right to appeal the decision. The process is designed to catch clear-cut cases rather than punish researchers who use LLMs as a drafting tool and then verify their work.
"This isn't an outright prohibition on using LLMs," Dietterich said. "It's an insistence that authors take full responsibility for the content, irrespective of how the contents are generated." In practice, that means if a researcher copies inappropriate language, plagiarized content, biased content, errors, mistakes, incorrect references, or misleading content directly from an LLM, they are still on the hook.
The bigger picture: AI-generated citations are spreading
The crackdown comes as recent peer-reviewed research has documented a rise in fabricated citations within biomedical papers, a trend researchers attribute to large language models. LLMs are notorious for generating plausible-sounding but nonexistent references, and when those citations slip into manuscripts they can mislead other scientists and pollute literature databases.
It is worth noting that scientists are not the only ones caught out. Across academia, industry, and even journalism, fabricated references generated by AI have surfaced in published work, prompting broader conversations about verification standards. For arXiv, the stakes are especially high because its papers are often treated as quasi-final by readers who skip the subsequent journal publication step.
ArXiv's independence could strengthen enforcement
The policy change also arrives as arXiv transitions from being hosted by Cornell University — a arrangement that lasted more than two decades — to operating as an independent nonprofit. That structural shift is expected to give the organization more financial flexibility to address issues such as AI-generated slop, a term widely used to describe low-effort, machine-produced content that clutters search results and databases.
Greater autonomy could mean faster policy decisions, more dedicated moderation resources, and the ability to invest in tooling that detects AI-generated text or fabricated references at scale. For the research community that depends on arXiv for timely access to cutting-edge work, the combination of stricter rules and a more empowered governance structure signals that the platform is taking the quality problem seriously.
What researchers should watch next
For authors who rely on arXiv for early distribution of their work, the message is clear: use large language models if you wish, but verify everything. Check every reference, re-read every paragraph for bias or incoherence, and remove any LLM interaction traces before submission. The one-year ban is a significant reputational and logistical cost — especially for early-career researchers whose next arXiv posting may need to clear a peer-reviewed journal first.
The policy is also likely to evolve. As detection tools improve and the community's understanding of AI-assisted writing matures, arXiv may refine what counts as "incontrovertible evidence" or adjust the penalty framework. Researchers should monitor announcements from section chairs and the arXiv moderation team for updates.
FAQ
What qualifies as "incontrovertible evidence" of unchecked LLM output on arXiv?
Is arXiv banning the use of large language models in research papers?
What happens after an author receives a one-year ban from arXiv?
More in the feed
Prepared by the editorial stack from public data and external sources.
Original article