Asymmetric Quantization: Near-Lossless Retrieval with 97% Storage Reduction
Late interaction makes retrieval more precise but turns every document into many vectors. Asymmetric quantization keeps query vectors precise and stores document vectors as binary signs, cutting corpus storage 32x (about 97%) while NDCG@10 holds at 89.65 versus 90.26 for the unquantized baseline.
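To make the idea concrete, here is a minimal sketch of the scheme, not the article's actual implementation. It assumes float32 embeddings, a ColBERT-style MaxSim scoring rule for the late interaction step, and NumPy for bit packing; the function names, the 128-dimension size, and the random data are all hypothetical.

```python
import numpy as np

def binarize_docs(doc_vecs: np.ndarray) -> np.ndarray:
    """Keep only the sign of each component, packed 8 dimensions per byte."""
    return np.packbits(doc_vecs > 0, axis=-1)

def maxsim_asymmetric(query_vecs: np.ndarray, packed_doc: np.ndarray, dim: int) -> float:
    """Score float query vectors against sign-only document vectors.

    Assumed scoring rule (MaxSim): for each query token take the best
    matching document token, then sum over query tokens.
    """
    bits = np.unpackbits(packed_doc, axis=-1, count=dim)
    signs = bits.astype(np.float32) * 2.0 - 1.0   # map {0, 1} to {-1, +1}
    sims = query_vecs @ signs.T                   # (query tokens, doc tokens)
    return float(sims.max(axis=1).sum())

# Hypothetical data: one document of 50 token vectors, one query of 8.
rng = np.random.default_rng(0)
dim = 128
doc = rng.standard_normal((50, dim)).astype(np.float32)
query = rng.standard_normal((8, dim)).astype(np.float32)

packed = binarize_docs(doc)
ratio = doc.nbytes / packed.nbytes   # float32 is 32 bits per dimension, signs are 1
score = maxsim_asymmetric(query, packed, dim)
```

The asymmetry is that only the document side is quantized: the query stays in float32, so each dot product still weights dimensions by the query's magnitudes even though the document contributes only signs. The 32x figure follows directly from replacing 32 bits per dimension with 1.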