
Asymmetric Quantization: Near-Lossless Retrieval with 97% Storage Reduction


Late interaction makes retrieval more precise but turns every document into many vectors. Asymmetric quantization keeps query vectors at full precision and stores each document vector as binary signs, one bit per dimension. That cuts corpus storage 32x (about 97%) while NDCG@10 stays nearly level, at 89.65 vs 90.26.
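The idea can be illustrated with a minimal NumPy sketch. It is not the article's implementation: the function names `quantize_docs` and `maxsim_score` are hypothetical, the MaxSim scoring rule is an assumption about what "late interaction" means here, and the random vectors stand in for real token embeddings. The 32x figure in the sketch assumes float32 document vectors as the baseline.

```python
import numpy as np


def quantize_docs(doc_vecs: np.ndarray) -> np.ndarray:
    """Keep only the sign of each dimension, packed 8 dimensions per byte."""
    bits = (doc_vecs > 0).astype(np.uint8)
    return np.packbits(bits, axis=-1)


def maxsim_score(query_vecs: np.ndarray, packed_doc: np.ndarray, dim: int) -> float:
    """Score full-precision query vectors against sign-only document vectors.

    Assumed scoring rule: for each query vector take the best-matching
    document vector, then sum those maxima (MaxSim).
    """
    bits = np.unpackbits(packed_doc, axis=-1, count=dim)
    signs = bits.astype(np.float32) * 2.0 - 1.0  # map {0, 1} to {-1, +1}
    sims = query_vecs @ signs.T                  # shape: (n_query, n_doc)
    return float(sims.max(axis=1).sum())


# Illustrative data only: random vectors in place of real embeddings.
rng = np.random.default_rng(0)
dim = 128
doc = rng.standard_normal((50, dim)).astype(np.float32)
query = rng.standard_normal((8, dim)).astype(np.float32)

packed = quantize_docs(doc)
ratio = doc.nbytes / packed.nbytes  # float32 (32 bits) down to 1 bit per dimension
score = maxsim_score(query, packed, dim)
```

Only the document side is compressed, which is what makes the scheme asymmetric: the corpus is stored once and dominates storage, while queries are short-lived and can stay in float32 at no storage cost.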

