Get the latest tech news

Composo helps enterprises monitor how well AI apps work


AI and the large language models (LLMs) that power them have a ton of useful applications, but for all their promise, they're not very reliable. No one

London-based startup Composo feels it has a headstart in trying to solve that problem, thanks to its custom models that can help enterprises evaluate the accuracy and quality of apps that are powered by LLMs. The company’s similar to Agenta, Freeplay, Humanloop and LangSmith, which all claim to offer a more solid, LLM-based alternative to human testing, checklists and existing observability tools. That’s notable because this widens the scope of its potential market — you don’t have to be a developer to use it, and domain experts and executives can evaluate AI apps for inconsistencies, quality and accuracy themselves.

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of enterprises

enterprises

Photo of Composo

Composo

Related news:

News photo

Pendulum’s AI-driven platform helps enterprises better predict supply and demand

News photo

DeepSeek-R1 is a boon for enterprises — making AI apps cheaper, easier to build, and more innovative

News photo

Doti gives enterprises a flexible AI-powered search experience to unlock their data silos