Get the latest tech news
Microsoft’s new AI agent can control software and robots | Magma could enable AI agents to take multistep actions in the real and digital worlds.
Magma could enable AI agents to take multistep actions in the real and digital worlds.
Microsoft claims that Magma is the first AI model that not only processes multimodal data (like text, images, and video) but can also natively act upon it—whether that’s navigating a user interface or manipulating physical objects. Microsoft is positioning Magma as a step toward agentic AI, meaning a system that can autonomously craft plans and perform multistep tasks on a human's behalf rather than just answering questions about what it sees. If Magma delivers on its promise, it could push Microsoft's AI assistants beyond limited text interactions, enabling them to operate software autonomously and execute real-world tasks through robotics.
Or read this on r/tech