AI models are using material from retracted scientific papers

Recent studies confirm that leading AI models, including popular chatbots and specialized research tools, frequently incorporate and reference content from retracted scientific papers without disclosing their invalid status. This widespread issue, demonstrated across platforms like ChatGPT, Elicit, and Perplexity, raises significant concerns about the reliability and trustworthiness of AI-generated information, particularly in critical domains such as medical advice and scientific research. The problem poses substantial risks to the integrity of AI applications and could complicate institutional investments in AI for scientific advancement, despite some developers beginning to integrate retraction data to address the challenge.

Analysis

Recent studies reveal a systemic flaw across the artificial intelligence sector, where prominent AI models, including OpenAI's ChatGPT and specialized research tools from Elicit, Perplexity, and Consensus, are incorporating information from retracted scientific papers without indicating their invalid status. A study on ChatGPT (GPT-4o) found it referenced retracted papers in approximately 24% of test cases, while initial tests by MIT Technology Review showed other research tools like Consensus and Ai2 ScholarQA cited invalid papers in 86% and 81% of cases, respectively. This issue, reflected in the strongly negative sentiment score (-0.6), poses a significant integrity risk to AI applications, particularly in critical fields like medicine and science where the US National Science Foundation is investing $75 million. The corporate response is fragmented, creating potential for differentiation; Consensus (CCSI) has proactively integrated retraction data, improving its results significantly and earning a positive per-ticker sentiment (0.5), whereas peers have been slower to act or have downplayed accuracy claims. The problem is compounded by a lack of comprehensive retraction databases and inconsistent publisher standards, suggesting this is a persistent industry-wide challenge rather than an easily rectifiable bug.

AllMind

AllMind

AI models are using material from retracted scientific papers

Analysis

AllMind AI Terminal

Market Sentiment

Key Decisions for Investors