Get the latest tech news

Moondream 3 Preview: Frontier-level reasoning at a blazing speed


Think deeper, detect smarter, run just as fast.

Sorting produce, or detecting missing herd animals from a drone, or recognizing security incidents - none of these tasks can be built without fast vision inference. We do not use a separate context-length extension phase during training, instead opting to interleave long-context samples while pretraining with a default context length of 4096 tokens. It was trained with load-balancing and router orthogonality losses to help similar tokens specialize together early on, then had load balancing disabled in post-training to avoid catastrophic forgetting from distribution shift.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of preview

preview

Photo of Moondream

Moondream

Photo of blazing speed

blazing speed

Related news:

News photo

Reviewing iOS 26 for power users: Reminders, Preview, and more | Ars Technica

News photo

iOS 26 adds two brand new apps to your iPhone’s Home Screen [Apple Games, Preview]

News photo

Microsoft's first preview of Visual Studio 2026: Deeper AI and a design refresh