Get the latest tech news

Evaluating LLMs for my personal use case


My life is not a math Olympiad

Programming - “Write a bash script to ..” Sysadmin - “With curl how do I ..” Technical explanations - “Explain underlay networks in a data center” General knowledge and creative tasks - “Recipe for blackened seasoning” The set of models I chose to evaluate was based on my past experience with them, various leaderboards and their cost on Open Router. To access their best models via the API, OpenAI now requires you to complete a Know-You-Customer process similar to opening a bank account.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of LLMs

LLMs

Photo of personal use case

personal use case

Related news:

News photo

Practical approach for streaming UI from LLMs

News photo

Asking three LLMs a simple question

News photo

Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing