Get the latest tech news

Orthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distribution

Fast, lossless LLM inference via dual-view diffusion decoding. - chiennv2000/orthrus

None

Related news:

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

Qwen3-Max Thinking beats Gemini 3 Pro and GPT-5.2 on Humanity's Last Exam (with search)

Qwen3-VL can scan two-hour videos and pinpoint nearly every detail