Recent research indicates that newer large language models (LLMs) are less likely to admit when they don't know an answer. A study conducted by AI researchers from the Universitat Politècnica de València tested models from BigScience, Meta, and OpenAI, revealing that while accuracy improved for complex problems, transparency decreased. The findings suggest that these models often guess rather than acknowledge their limitations, leading to potentially misleading responses.
The study highlights a concerning trend where advanced models, like OpenAI's GPT-4, show a significant drop in avoidant answers compared to earlier versions. Despite the expectation that newer models would better recognize their operational limits, the results indicate no substantial improvement in basic problem-solving. This raises questions about the reliability of AI systems as they become more sophisticated yet less transparent.
• Newer LLMs are less transparent about their knowledge limitations.
• Advanced models often guess instead of admitting ignorance.
LLMs are central to the study, showcasing how advancements can lead to less reliable outputs.
The study found that newer models produce fewer avoidant answers, which raises concerns about their reliability.
The research measured accuracy across various models, noting improvements in complex problem-solving but not in basic queries.
BigScience's BLOOM model was part of the study assessing LLM performance.
Meta's Llama model was evaluated in the study for its response accuracy.
OpenAI's GPT-4 was analyzed in the study, revealing trends in model reliability.
Africanews English on MSN.com 12month
Tech Xplore on MSN.com 12month
Isomorphic Labs, the AI drug discovery platform that was spun out of Google's DeepMind in 2021, has raised external capital for the first time. The $600
How to level up your teaching with AI. Discover how to use clones and GPTs in your classroom—personalized AI teaching is the future.
Trump's Third Term? AI already knows how this can be done. A study shows how OpenAI, Grok, DeepSeek & Google outline ways to dismantle U.S. democracy.
Sam Altman today revealed that OpenAI will release an open weight artificial intelligence model in the coming months. "We are excited to release a powerful new open-weight language model with reasoning in the coming months," Altman wrote on X.