Get the latest tech news
This Week in AI: Maybe we should ignore AI benchmarks for now
In this edition of This Week in AI, we talk about Grok 3 and how little AI benchmarks mean to the average AI user.
The benchmark consists of over 1,400 freelance software engineering tasks that range from bug fixes and feature deployments to “manager-level” technical implementation proposals. Step-Audio supports Chinese, English, and Japanese and lets users adjust the emotion and even dialect of the synthetic audio it creates, including singing. Founded in 2023, Stepfun reportedly recently closed a funding round worth several hundred million dollars from a host of investors that include Chinese state-owned private equity firms.
Or read this on TechCrunch