Get the latest tech news
The Most Capable Open Source AI Model Yet Could Supercharge AI Agents
A compact and fully open source visual AI model will make it easier for AI to take control of your computer—hopefully in a good way.
This means it can make sense of a computer screen, potentially helping an AI agent perform tasks such as browsing the web, navigating through file directories, and drafting documents. “Having an open source, multimodal model means that any startup or researcher that has an idea can try to do it,” says Ofir Press, a postdoc at Princeton University who works on AI agents. Press says that the fact that Molmo is open source means that developers will be more easily able to fine-tune their agents for specific tasks, such as working with spreadsheets, by providing additional training data.
Or read this on Wired