Microsoft's new AI agent can control software and robots

Full Article
Microsoft's new AI agent can control software and robots

Microsoft Research has launched Magma, an innovative AI foundation model that integrates visual and language processing to control software and robotic systems. This model is touted as the first of its kind to not only process multimodal data but also act upon it, potentially revolutionizing interactions in both digital and physical environments. The development of Magma is a collaborative effort involving several prestigious universities and aims to enhance the capabilities of AI in executing complex tasks autonomously.

Magma is positioned as a significant advancement towards agentic AI, which can autonomously plan and execute tasks based on user-defined goals. Unlike previous models that required separate systems for perception and control, Magma combines these functions into a single framework, showcasing its potential in real-world applications. If successful, this model could redefine how AI assistants operate, moving beyond simple text interactions to more complex, autonomous actions.

• Magma integrates visual and language processing for advanced AI capabilities.

• The model aims to enhance autonomous task execution in real-world applications.

Key AI Terms Mentioned in this Article

Magma

Magma is an integrated AI model that combines visual and language processing for controlling software and robotics.

Agentic AI

Agentic AI refers to systems capable of autonomously planning and executing tasks on behalf of users.

Multimodal AI

Multimodal AI processes and acts on various types of data, such as text, images, and video.

Companies Mentioned in this Article

Microsoft

Microsoft is leading the development of Magma, aiming to enhance AI's capabilities in software and robotics.

OpenAI

OpenAI is exploring agentic AI through projects like Operator, which performs UI tasks autonomously.

Get Email Alerts for AI News

By creating an email alert, you agree to AIleap's Terms of Service and Privacy Policy. You can pause or unsubscribe from email alerts at any time.

Latest Articles

Alphabet's AI drug discovery platform Isomorphic Labs raises $600M from Thrive
TechCrunch 6month

Isomorphic Labs, the AI drug discovery platform that was spun out of Google's DeepMind in 2021, has raised external capital for the first time. The $600

AI In Education - Up-level Your Teaching With AI By Cloning Yourself
Forbes 6month

How to level up your teaching with AI. Discover how to use clones and GPTs in your classroom—personalized AI teaching is the future.

Trump's Third Term - How AI Can Help To Overthrow The US Government
Forbes 6month

Trump's Third Term? AI already knows how this can be done. A study shows how OpenAI, Grok, DeepSeek & Google outline ways to dismantle U.S. democracy.

Sam Altman Says OpenAI Will Release an 'Open Weight' AI Model This Summer
Wired 6month

Sam Altman today revealed that OpenAI will release an open weight artificial intelligence model in the coming months. "We are excited to release a powerful new open-weight language model with reasoning in the coming months," Altman wrote on X.

Popular Topics