Evaluating LLMs for my personal use case
My life is not a math Olympiad
Programming - “Write a bash script to ..”
Sysadmin - “With curl how do I ..”
Technical explanations - “Explain underlay networks in a data center”
General knowledge and creative tasks - “Recipe for blackened seasoning”

The set of models I chose to evaluate was based on my past experience with them, various leaderboards, and their cost on OpenRouter. To access their best models via the API, OpenAI now requires you to complete a Know Your Customer (KYC) process similar to opening a bank account.