Get the latest tech news

Show HN: Paramount – Human Evals of AI Customer Support

Agent accuracy measurements for LLMs. Contribute to ask-fini/paramount development by creating an account on GitHub.

Paramount lets your expert agents evaluate AI chats, enabling: quality assurance ground truth capturing automated regression testing In order to set up successfully, define which input and output parameters represent the chat list used in the LLM.

Get the Android app

Or read this on Hacker News