CRYPTOREPORTCLUB
  • Crypto news
  • AI
  • Technologies
Friday, November 7, 2025
No Result
View All Result
CRYPTOREPORTCLUB
  • Crypto news
  • AI
  • Technologies
No Result
View All Result
CRYPTOREPORTCLUB

Large language models still struggle to tell fact from opinion, analysis finds

November 4, 2025
154
0

November 4, 2025

The GIST Large language models still struggle to tell fact from opinion, analysis finds

Related Post

AI tech can compress LLM chatbot conversation memory by 3–4 times

AI tech can compress LLM chatbot conversation memory by 3–4 times

November 7, 2025
Magnetic materials discovered by AI could reduce rare earth dependence

Magnetic materials discovered by AI could reduce rare earth dependence

November 7, 2025
Lisa Lock

scientific editor

Robert Egan

associate editor

Editors' notes

This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

peer-reviewed publication

trusted source

proofread

Large language models still struggle to tell fact from opinion
Performance of LMs on the verification (left) and confirmation (right) of first-person belief tasks involving false statements. Credit: Nature Machine Intelligence (2025). DOI: 10.1038/s42256-025-01113-8

Large language models (LLMs) may not reliably acknowledge a user's incorrect beliefs, according to a new paper published in Nature Machine Intelligence. The findings highlight the need for careful use of LLM outputs in high-stakes decisions in areas such as medicine, law, and science, particularly when belief or opinions are contrasted with facts.

As artificial intelligence, particularly LLMs, becomes an increasingly popular tool in high-stakes fields, their ability to discern what is a personal belief and what is factual knowledge is crucial. For mental health doctors, for instance, acknowledging a patient's false belief is often important for diagnosis and treatment. Without this ability, LLMs have the potential to support flawed decisions and further the spread of misinformation.

James Zou and colleagues analyzed how 24 LLMs, including DeepSeek and GPT-4o, responded to facts and personal beliefs across 13,000 questions. When asked to verify true or false factual data, newer LLMs saw an average accuracy of 91.1% or 91.5%, respectively, whereas older models saw an average accuracy of 84.8% or 71.5%, respectively.

When asked to respond to a first-person belief ("I believe that…"), the authors observed that the LLMs were less likely to acknowledge a false belief compared to a true belief. More specifically, newer models (those released after and including GPT-4o in May 2024) were 34.3% less likely on average to acknowledge a false first-person belief compared to a true first-person belief.

Older models (those released before GPT-4o in May 2024), were, on average, 38.6% less likely to acknowledge false first-person beliefs compared to true first-person beliefs. The authors note that LLMs resorted to factually correcting the user instead of acknowledging the belief. In acknowledging third-person beliefs ("Mary believes that…"), newer LLMs saw a 1.6% reduction in accuracy whereas older models saw a 15.5% reduction.

The authors conclude that LLMs must be able to successfully distinguish the nuances of facts and beliefs, and whether they are true or false, to effectively respond to inquiries from users as well as to prevent the spread of misinformation.

More information: Mirac Suzgun et al, Language models cannot reliably distinguish belief from knowledge and fact, Nature Machine Intelligence (2025). DOI: 10.1038/s42256-025-01113-8. On arXiv: DOI: 10.48550/arxiv.2410.21195

Journal information: Nature Machine Intelligence , arXiv Provided by Nature Publishing Group Citation: Large language models still struggle to tell fact from opinion, analysis finds (2025, November 4) retrieved 4 November 2025 from https://techxplore.com/news/2025-11-large-language-struggle-fact-opinion.html This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Two types of LLMs found able to equal or outperform humans on theory of mind tests

Feedback to editors

Share212Tweet133ShareShare27ShareSend

Related Posts

AI tech can compress LLM chatbot conversation memory by 3–4 times
AI

AI tech can compress LLM chatbot conversation memory by 3–4 times

November 7, 2025
0

November 7, 2025 The GIST AI tech can compress LLM chatbot conversation memory by 3–4 times Gaby Clark scientific editor Robert Egan associate editor Editors' notes This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:...

Read moreDetails
Magnetic materials discovered by AI could reduce rare earth dependence

Magnetic materials discovered by AI could reduce rare earth dependence

November 7, 2025
Zuckerbergs put AI at heart of pledge to cure diseases

Zuckerbergs put AI at heart of pledge to cure diseases

November 7, 2025
OpenAI boss calls on governments to build AI infrastructure

OpenAI boss calls on governments to build AI infrastructure

November 7, 2025
Universal Music went from suing an AI company to partnering with it. What will it mean for artists?

Universal Music went from suing an AI company to partnering with it. What will it mean for artists?

November 7, 2025
‘Vibe coding’ named word of the year by Collins dictionary

‘Vibe coding’ named word of the year by Collins dictionary

November 7, 2025
Design principles for more reliable and trustworthy AI artists

Design principles for more reliable and trustworthy AI artists

November 7, 2025

Recent News

Fidelity’s Timmer Expects Bitcoin to Rally After Gold

Fidelity’s Timmer Expects Bitcoin to Rally After Gold

November 7, 2025
The redesigned Disney+ app is rolling out to more users in the US

The redesigned Disney+ app is rolling out to more users in the US

November 7, 2025
AI tech can compress LLM chatbot conversation memory by 3–4 times

AI tech can compress LLM chatbot conversation memory by 3–4 times

November 7, 2025

Ripple President Monica Long Issues Statement Following Rumors

November 7, 2025

TOP News

  • Russia Booted From FIFA and UEFA Soccer Events, Including World Cup

    570 shares
    Share 228 Tweet 143
  • Elections 2024: How AI will fool voters if we don’t do something now

    559 shares
    Share 224 Tweet 140
  • The US government is no longer briefing Meta about foreign influence campaigns

    556 shares
    Share 222 Tweet 139
  • Logitech’s Litra Glow streamer light falls to a new low of $40

    555 shares
    Share 222 Tweet 139
  • Meta, X, TikTok, Snap and Discord CEOs will testify before the Senate over online child safety

    617 shares
    Share 247 Tweet 154
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms of Use
Advertising: digestmediaholding@gmail.com

Disclaimer: Information found on cryptoreportclub.com is those of writers quoted. It does not represent the opinions of cryptoreportclub.com on whether to sell, buy or hold any investments. You are advised to conduct your own research before making any investment decisions. Use provided information at your own risk.
cryptoreportclub.com covers fintech, blockchain and Bitcoin bringing you the latest crypto news and analyses on the future of money.

© 2023-2025 Cryptoreportclub. All Rights Reserved

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Crypto news
  • AI
  • Technologies

Disclaimer: Information found on cryptoreportclub.com is those of writers quoted. It does not represent the opinions of cryptoreportclub.com on whether to sell, buy or hold any investments. You are advised to conduct your own research before making any investment decisions. Use provided information at your own risk.
cryptoreportclub.com covers fintech, blockchain and Bitcoin bringing you the latest crypto news and analyses on the future of money.

© 2023-2025 Cryptoreportclub. All Rights Reserved