Get the latest tech news

Don't use cosine similarity carelessly


Cosine similarity - the duct tape of AI. Convenient but often misused. Let's find out how to use it better.

But with vectors, we can chart entities and relationships between them — both to provide as a structured input to a machine learning models, and on its own, to find similar items. Sure, it sticks everything together — images, text, audio, code — but like real duct tape, it's a quick fix that often masks deeper problems rather than solving them. Photo from Python Summit 2024 Warsaw - a laptop of Piotr Migdał showing "Don't use cosine similarity" talk front slide.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Cosine Similarity

Cosine Similarity

Related news:

News photo

A not so fast implementation of cosine similarity in C++ and SIMD

News photo

Alternatives to cosine similarity

News photo

How does cosine similarity work?