Get the latest tech news

Microsoft’s new AI agent can control software and robots | Magma could enable AI agents to take multistep actions in the real and digital worlds.


Magma could enable AI agents to take multistep actions in the real and digital worlds.

Microsoft claims that Magma is the first AI model that not only processes multimodal data (like text, images, and video) but can also natively act upon it—whether that’s navigating a user interface or manipulating physical objects. Microsoft is positioning Magma as a step toward agentic AI, meaning a system that can autonomously craft plans and perform multistep tasks on a human's behalf rather than just answering questions about what it sees. If Magma delivers on its promise, it could push Microsoft's AI assistants beyond limited text interactions, enabling them to operate software autonomously and execute real-world tasks through robotics.

Get the Android app

Or read this on r/tech

Read more on:

Photo of Microsoft

Microsoft

Photo of software

software

Photo of robots

robots

Related news:

News photo

Microsoft expands Copilot bug bounty targets, adds payouts for even moderate messes

News photo

Oops, some of our customers' Power Pages-hosted sites were exploited, says Microsoft

News photo

Microsoft's quantum chip is powered by topoconductors