
This Week in AI: Maybe we should ignore AI benchmarks for now


In this edition of This Week in AI, we talk about Grok 3 and how little AI benchmarks mean to the average AI user.

The benchmark consists of over 1,400 freelance software engineering tasks that range from bug fixes and feature deployments to “manager-level” technical implementation proposals.

Step-Audio supports Chinese, English, and Japanese and lets users adjust the emotion and even the dialect of the synthetic audio it creates, including singing. Founded in 2023, Stepfun reportedly closed a funding round recently worth several hundred million dollars from a host of investors, including Chinese state-owned private equity firms.

Or read this on TechCrunch
