Get the latest tech news
Show HN: Paramount – Human Evals of AI Customer Support
Agent accuracy measurements for LLMs. Contribute to ask-fini/paramount development by creating an account on GitHub.
Paramount lets your expert agents evaluate AI chats, enabling: quality assurance ground truth capturing automated regression testing In order to set up successfully, define which input and output parameters represent the chat list used in the LLM.
Or read this on Hacker News