Orthrus-Qwen3: up to 7.8× tokens per forward pass on Qwen3, identical output distribution


Fast, lossless LLM inference via dual-view diffusion decoding (GitHub repository: chiennv2000/orthrus).

Related news:

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

Qwen3-Max Thinking beats Gemini 3 Pro and GPT-5.2 on Humanity's Last Exam (with search)

Qwen3-VL can scan two-hour videos and pinpoint nearly every detail