Microsoft's new rStar-Math technique upgrades small models to outperform OpenAI's o1-preview at math problems

Microsoft has introduced rStar-Math, a new reasoning technique designed to enhance the performance of small language models (SLMs) in solving math problems. This technique reportedly allows these models to achieve results comparable to, and in some cases better than, OpenAI's o1-preview model. The research, conducted in collaboration with Peking University and Tsinghua University, demonstrates significant improvements across various smaller models, including Microsoft's Phi-3 mini and Alibaba's Qwen models.

The rStar-Math technique uses Monte Carlo Tree Search (MCTS) to break complex math problems into simpler steps, enabling smaller models to self-evolve and improve their reasoning capabilities. With rStar-Math, the Qwen2.5-Math-7B model reached 90% accuracy on the MATH benchmark, surpassing OpenAI's o1-preview. The result highlights the potential of smaller models to deliver high performance at a lower computational cost than larger systems.
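To make the search idea concrete, here is a minimal sketch of a generic MCTS loop (selection, expansion, simulation, backpropagation) on a toy arithmetic puzzle. It is not the rStar-Math implementation: the puzzle, the `STEPS` table, the constants, and the `mcts` function are all hypothetical and chosen only for illustration, and random rollouts stand in for the language model that would propose and score reasoning steps.

```python
import math
import random

# Hypothetical toy task (an assumption, not from the article):
# reach TARGET from START by applying at most MAX_STEPS operations.
START, TARGET, MAX_STEPS = 1, 10, 4
STEPS = {"+1": lambda x: x + 1, "*2": lambda x: x * 2, "+3": lambda x: x + 3}


class Node:
    def __init__(self, value, depth, parent=None, step=None):
        self.value, self.depth = value, depth
        self.parent, self.step = parent, step
        self.children = []
        self.visits = 0
        self.total_reward = 0.0

    def is_terminal(self):
        return self.value == TARGET or self.depth == MAX_STEPS

    def fully_expanded(self):
        return len(self.children) == len(STEPS)


def uct(child, parent_visits, c=1.4):
    # Average reward plus an exploration bonus for rarely visited children.
    exploit = child.total_reward / child.visits
    explore = c * math.sqrt(math.log(parent_visits) / child.visits)
    return exploit + explore


def rollout(value, depth, rng):
    # Simulation: apply random steps until the target or the step limit.
    while value != TARGET and depth < MAX_STEPS:
        value = STEPS[rng.choice(list(STEPS))](value)
        depth += 1
    return 1.0 if value == TARGET else 0.0


def mcts(iterations=500, seed=0):
    rng = random.Random(seed)
    root = Node(START, 0)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend through fully expanded nodes by UCT score.
        while not node.is_terminal() and node.fully_expanded():
            node = max(node.children, key=lambda ch: uct(ch, node.visits))
        # 2. Expansion: add one untried step as a new child.
        if not node.is_terminal():
            tried = {ch.step for ch in node.children}
            step = rng.choice([s for s in STEPS if s not in tried])
            child = Node(STEPS[step](node.value), node.depth + 1, node, step)
            node.children.append(child)
            node = child
        # 3. Simulation: estimate how promising the new node is.
        reward = rollout(node.value, node.depth, rng)
        # 4. Backpropagation: update statistics along the path to the root.
        while node is not None:
            node.visits += 1
            node.total_reward += reward
            node = node.parent
    # Read out the most-visited path as the proposed step-by-step solution.
    path, node = [], root
    while node.children:
        node = max(node.children, key=lambda ch: ch.visits)
        path.append(node.step)
    return path, node.value


if __name__ == "__main__":
    steps, final_value = mcts()
    print(steps, final_value)
```

With enough iterations the most-visited path usually ends at the target, though the search is randomized and offers no guarantee. The point of the sketch is the shape of the loop: each step of a solution is a node, and statistics from many attempts decide which partial solutions deserve more exploration.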

Key AI Highlights in this Article

• rStar-Math technique enhances small models' math problem-solving capabilities.

• Qwen2.5-Math-7B model achieved 90% accuracy on the MATH benchmark, outperforming OpenAI's o1-preview.

Key AI Terms Mentioned in this Article

Small Language Models (SLMs)

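SLMs are language models with comparatively few parameters, such as the 7-billion-parameter Qwen2.5-Math-7B. They are cheaper to run than larger systems and can still perform well in specific applications like math reasoning.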

Monte Carlo Tree Search (MCTS)

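MCTS is a search method that explores a tree of candidate steps, using the outcomes of repeated simulated attempts to decide which branches to explore further. In rStar-Math it is used to break a math problem into smaller steps, mimicking step-by-step human reasoning.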

Self-evolution

Self-evolution refers to the process where models improve their performance through iterative training and feedback.
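As a loose illustration of iterative training and feedback, the toy loop below samples candidate solutions, keeps only those a checker verifies, and nudges a simple "policy" toward the steps that appeared in verified solutions. Everything here is an assumption made for the sketch: the puzzle, the `STEPS` table, the weight-update rule, and the function names are hypothetical, and this is not the training procedure used in rStar-Math.

```python
import random

# Hypothetical toy task (an assumption): pick LENGTH steps that turn START into TARGET.
STEPS = {"+1": lambda x: x + 1, "*2": lambda x: x * 2, "+3": lambda x: x + 3}
START, TARGET, LENGTH = 1, 10, 3


def sample_solution(weights, rng):
    # The "policy" is just one sampling weight per step name.
    names = list(STEPS)
    return rng.choices(names, weights=[weights[n] for n in names], k=LENGTH)


def is_correct(solution):
    # Verifier: replay the steps and compare against the target.
    value = START
    for name in solution:
        value = STEPS[name](value)
    return value == TARGET


def self_evolve(rounds=4, samples=200, seed=0):
    rng = random.Random(seed)
    weights = {name: 1.0 for name in STEPS}  # round 0: uniform policy
    history = []
    for _ in range(rounds):
        attempts = [sample_solution(weights, rng) for _ in range(samples)]
        verified = [s for s in attempts if is_correct(s)]
        history.append(len(verified) / samples)  # success rate this round
        # "Training": raise the weight of every step seen in a verified solution.
        for solution in verified:
            for name in solution:
                weights[name] += 0.05
    return weights, history


if __name__ == "__main__":
    final_weights, success_rates = self_evolve()
    print(final_weights)
    print(success_rates)
```

In this toy the success rate tends to rise across rounds because verified solutions feed back into the sampling weights, but sampling noise means any single run may not improve monotonically.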

Companies Mentioned in this Article

Microsoft

Microsoft is advancing AI research with the rStar-Math technique, enhancing small model capabilities.

Alibaba

Alibaba's Qwen models were part of the study, demonstrating significant performance improvements with rStar-Math.


