14× faster embeddings: how we rebuilt the ONNX path in Manticore


Released in Manticore Search 27.1.5, the new ONNX Runtime backend makes auto-embeddings ~14× faster on average than the previous SentenceTransformers/Candle path, measured on the same hardware with the same model and the same weights. The margin holds whether you run 1 client thread or 32.
