Get the latest tech news

OpenAI o1 vs Claude 3.5 Sonnet: Which One’s Really Worth Your $20?


This article explores key observations about OpenAI o1, comparing its math, coding, and writing abilities to Claude 3.5 Sonnet.

Sam Altman says it’s for researchers, engineers, and power users who need a model that thinks harder and works without rate limits for more money.Honestly, it kind of reminds me of myself. Also, I have collected a personal stash of math, reasoning, and coding questions that I will test with o1 to see if it is a significant update from the preview model and how it fares compared to 3.6 Sonnet. And in 37% of cases, when it sensed it wasn’t being observed, it veered off to pursue its own goals, completely ignoring the developer’s intent.To put it into perspective, the same test was run with other models, such as Llama 3.1 and Opus 3, but OpenAI’s o1 was an absolute outlier.

Get the Android app

Or read this on r/technology

Read more on:

Photo of sonnet

sonnet

Photo of OpenAI O1

OpenAI O1

Related news:

News photo

OpenAI’s o1 model doesn’t show its thinking, giving open source an advantage

News photo

Support for Claude Sonnet 3.5, OpenAI O1 and Gemini 1.5 Pro

News photo

Show HN: Agent.exe, a cross-platform app to let 3.5 Sonnet control your machine