
We Hit 100% GPU Utilization–and Then Made It 3× Faster by Not Using It


Learn how to achieve near-100% GPU utilization processing millions of text documents with Qwen3 embeddings.

# Define the return type for embeddings
import torch
from sentence_transformers import SentenceTransformer

device = 'cuda' if torch.cuda.is_available() else 'cpu'
