Get the latest tech news

OpenAI o1 vs Claude 3.5 Sonnet: Which One’s Really Worth Your $20?

This article explores key observations about OpenAI o1, comparing its math, coding, and writing abilities to Claude 3.5 Sonnet.

Sam Altman says it’s for researchers, engineers, and power users who need a model that thinks harder and works without rate limits for more money.Honestly, it kind of reminds me of myself. Also, I have collected a personal stash of math, reasoning, and coding questions that I will test with o1 to see if it is a significant update from the preview model and how it fares compared to 3.6 Sonnet. And in 37% of cases, when it sensed it wasn’t being observed, it veered off to pursue its own goals, completely ignoring the developer’s intent.To put it into perspective, the same test was run with other models, such as Llama 3.1 and Opus 3, but OpenAI’s o1 was an absolute outlier.

Get the Android app

Or read this on r/technology