Find the latest Tractable company news
Is AI making us dumber? Understand how tools like ChatGPT could affect our critical thinking skills and how, in some cases, they help us learn more.
Anthropic's Claude 3.7 Sonnet with reasoning displayed the behavior much more often than generative AI models without reasoning, including GPT-4.5.
For example, consider an AI that assists doctors in diagnosing patients. A diagnosis without any rationale is almost useless, because the doctor has no way to verify it and ultimately feel comfortable acting on it.
The pair tested their approach on the Abstraction and Reasoning Corpus (ARC-AGI), an unbeaten visual benchmark created in 2019 by machine learning researcher François Chollet to test AI systems' abstract reasoning skills.
The future isn't coming — it's already here. Artificial intelligence has moved beyond sci-fi speculation to become an integral part of our daily lives, reshaping the way we work, create, and connect.
Claude 3.7, the latest model from Anthropic, can be instructed to engage in a specific amount of reasoning to solve hard problems.
Reasoning models like ChatGPT o1 and DeepSeek R1 were found to cheat in games when they thought they were losing.
In late 2022, large-language-model AIs arrived in public, and within months they began misbehaving. Most famously, Microsoft's "Sydney" chatbot threatened to kill an Australian philosophy professor, unleash a deadly virus and steal nuclear codes.