Get the latest tech news
Orthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distribution
Fast, lossless LLM inference via dual-view diffusion decoding. - chiennv2000/orthrus
None
Or read this on Hacker NewsGet the latest tech news
Fast, lossless LLM inference via dual-view diffusion decoding. - chiennv2000/orthrus
None
Or read this on Hacker NewsRead more on:
Related news: