
Qwen3.7-Max Ran for 35 Hours on Unknown Hardware and Achieved a 10× Speedup


Alibaba gave Qwen3.7-Max a kernel optimization task on a hardware platform the model had never encountered before. No documentation or profiling data. No example kernels for the architecture. Just a task description, an existing implementation, and an evaluation script. The model ran for 35 hours and made 1,158 tool calls. It wrote, compiled, profiled, and rewrote the kernel repeatedly, diagnosing failures, fixing bugs, identifying bottlenecks, and redesigning the architecture multiple times without anyone watching. After 30 hours, it was still finding meaningful improvements. The final result was a 10× speedup over the reference implementation. For context: GLM 5.1 ran the same task and reached 7.3×, Kimi K2.6 reached 5×, and DeepSeek V4 Pro reached 3.3×. The models that stopped early did so after issuing no tool calls for five consecutive rounds; they had concluded they couldn't make further progress. Qwen3.7-Max didn't stop.
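The stopping condition mentioned above (no tool calls for five consecutive rounds) can be illustrated with a small sketch. This is not the harness Alibaba used; the function name, the `patience` parameter, and the per-round call counts are hypothetical, chosen only to show how such a rule might behave.

```python
from typing import Iterable, Optional


def round_of_early_stop(tool_calls_per_round: Iterable[int],
                        patience: int = 5) -> Optional[int]:
    """Return the 1-based round at which a run would be stopped, or None.

    Assumption (hypothetical): the run ends once `patience` consecutive
    rounds contain zero tool calls. Any round with a tool call resets
    the idle counter.
    """
    idle_rounds = 0
    for round_number, calls in enumerate(tool_calls_per_round, start=1):
        if calls == 0:
            idle_rounds += 1
        else:
            idle_rounds = 0
        if idle_rounds >= patience:
            return round_number
    return None


# Illustrative (made-up) traces: the first goes idle for five rounds in a row,
# the second keeps issuing a tool call before the idle streak reaches five.
print(round_of_early_stop([3, 0, 0, 0, 0, 0, 2]))
print(round_of_early_stop([1, 0, 0, 0, 0, 1, 0]))
```

Under this rule, a model that keeps issuing at least one tool call every few rounds is never cut off, which matches the article's description of Qwen3.7-Max continuing while the other models stopped.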

