The article linked below which discusses the integrity of the peer-review process, strongly suggests that peer-reviewed research is at risk, specifically due to the rise of Generative AI.
The article reports on a controversy at a major international AI conference where a significant portion of the peer reviews were found to be generated by artificial intelligence.
Key Points from the Article's Context
- AI-Generated Reviews: An analysis found that approximately 21% of manuscript reviews for the conference were reportedly written fully by AI, and over half showed signs of AI assistance.
- Risk to Integrity: This development raises serious concerns about the integrity and reliability of the peer-review process, which is the cornerstone of scientific quality control. If reviews are being created by AI without proper human oversight, the quality of published research is jeopardized.
- The Broader Context: The use of AI in peer review is part of a larger, evolving challenge facing scientific publishing, which includes an exponential increase in manuscript submissions and a struggle to find enough qualified human reviewers.
https://www.nature.com/articles/d41586-025-03506-6Quote
Major AI conference flooded with peer reviews written fully by AI
Controversy has erupted after 21% of manuscript reviews for an international AI conference were found to be generated by artificial intelligence.
What can researchers do if they suspect that their manuscripts have been peer reviewed using artificial intelligence (AI)? Dozens of academics have raised concerns on social media about manuscripts and peer reviews submitted to the organizers of next year’s International Conference on Learning Representations (ICLR), an annual gathering of specialists in machine learning. Among other things, they flagged hallucinated citations and suspiciously long and vague feedback on their work.
Graham Neubig, an AI researcher at Carnegie Mellon University in Pittsburgh, Pennsylvania, was one of those who received peer reviews that seemed to have been produced using large language models (LLMs). The reports, he says, were “very verbose with lots of bullet points” and requested analyses that were not “the standard statistical analyses that reviewers ask for in typical AI or machine-learning papers.”
But Neubig needed help proving that the reports were AI-generated. So, he posted on X (formerly Twitter) and offered a reward for anyone who could scan all the conference submissions and their peer reviews for AI-generated text. The next day, he got a response from Max Spero, chief executive of Pangram Labs in New York City, which develops tools to detect AI-generated text. Pangram screened all 19,490 studies and 75,800 peer reviews submitted for ICLR 2026, which will take place in Rio de Janeiro, Brazil, in April. Neubig and more than 11,000 other AI researchers will be attending.
Pangram’s analysis revealed that around 21% of the ICLR peer reviews were fully AI-generated, and more than half contained signs of AI use. The findings were posted online by Pangram Labs. “People were suspicious, but they didn’t have any concrete proof,” says Spero. “Over the course of 12 hours, we wrote some code to parse out all of the text content from these paper submissions,” he adds.
The conference organizers say they will now use automated tools to assess whether submissions and peer reviews breached policies on using AI in submissions and peer reviews. This is the first time that the conference has faced this issue at scale, says Bharath Hariharan, a computer scientist at Cornell University in Ithaca, New York, and senior programme chair for ICLR 2026. “After we go through all this process … that will give us a better notion of trust.”