Get the latest tech news

Anthropic’s Computer Use mode shows strengths and limitations in new study


Claude can perform impressively complex tasks, but it will also make stupid mistakes from time to time.

A new study by Show Lab at the National University of Singapore provides an overview of what we can expect from the current generation of graphical user interface (GUI) agents. Office productivity tasks test the agent’s ability to perform common operations such as formatting documents, sending emails and creating presentations. However, tools like Claude Computer Use can help product teams explore ideas and iterate over different solutions to a problem without investing time and money in developing new features or services to automate tasks.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of limitations

limitations

Photo of new study

new study

Photo of strengths

strengths

Related news:

News photo

Political abuse on X is a global, widespread, and cross-partisan phenomenon, suggests new study | New study suggests that individuals on social media platform, ‘X’, who deviate from their party norms are quickly treated as if they were a political enemy.

News photo

OpenAI and others seek new path to smarter AI as current methods hit limitations

News photo

New Study On Moons of Uranus Raises Chance of Life