Cornell and UCLA Study Finds 146,900 Fake Citations in Science Papers

The integrity of the global scientific record hinges on a simple premise: that every reference cited is a tangible, verifiable piece of evidence. When a researcher cites a study, they are inviting the reader to follow a paper trail of truth. However, a new study conducted by researchers affiliated with Cornell and UCLA suggests that this trail is increasingly being paved with ghosts. By analyzing 111 million references across 2.5 million scientific papers, the team identified 146,900 citations that appear to be products of AI hallucinations—references that simply do not exist.

The Mechanics of Academic Fabrication

The study highlights a fundamental tension between the convenience of Large Language Models (LLMs) and the rigor of peer-reviewed science. LLMs like ChatGPT and Gemini are designed to generate fluent, plausible prose, but they lack a functional tether to reality, frequently inventing book titles, journals, and articles to satisfy a prompt. While researchers have always grappled with "sloppy science" and human error, the scale of this issue has shifted. By comparing modern papers against data from before 2023, the authors found a statistically significant spike in non-existent references following the widespread adoption of LLMs.

This phenomenon represents a departure from traditional academic misconduct. Previously, falsification was often a deliberate, labor-intensive act. Now, the ease of automation allows for the rapid, mass-production of "meaningless noise" that dilutes the collective body of human knowledge. As Usha Haley, a professor of management at Wichita State University, notes, this trend threatens the very foundation of cumulative knowledge, particularly as it gains traction among early-career scholars who may be over-relying on automated drafting tools.

The Breadth of the Contamination

The researchers focused their audit on four major scientific repositories: arXiv, bioRxiv, SSRN, and PubMed Central. These platforms serve as vital clearinghouses where scientists post pre-prints to allow for immediate global collaboration and scrutiny. Because these repositories prioritize speed and accessibility, the influx of hallucinated citations poses a systemic risk to the visibility of legitimate research.

It is important to note a key limitation: the study distinguishes between simple, human-made typographical errors and genuine AI-generated fabrications, yet the sheer volume of unmatched references suggests the latter is the primary driver of the recent surge. The researchers observed that these bad citations were not concentrated in a handful of rogue papers but were instead scattered across a wide array of submissions. This distribution pattern indicates that the issue is not isolated to a few bad actors, but rather suggests a systemic normalization of unverified AI usage during the writing process.

Guarding the Scientific Record

The scientific community is not standing idle in the face of this erosion. Steinn Sigurdsson, scientific director at arXiv, has been vocal about the risk, characterizing the influx of AI-generated content as a dilution of scientific truth that misdirects researchers. In response, arXiv has begun implementing stricter policies, recently announcing that it will ban authors who submit work containing hallucinated citations or unchecked AI content.

The next measure of success for these repositories will be the trend line of unmatched citations in future repository audits. As institutions tighten their submission guidelines, the efficacy of these bans will be determined by whether the frequency of non-existent references begins to plateau or decline. Ultimately, the survival of the scientific method depends on whether researchers can successfully disentangle the power of artificial intelligence from the necessity of human verification.