LLM

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support

Show HN: Lowfat – pluggable CLI filter that saved 91.8% of my LLM tokens

Fine-tuning an LLM to write docs like it's 1995

The LLM warnings Google fired Timnit Gebru over have all come true

GPT-5.5 dominates $1,500 LLM hacking test while Gemini refuses to even try

Show HN: Mnemo – local-first AI memory layer for any LLM (Rust, SQLite, petgraph)

MIT's MeMo lets teams swap in a better LLM without retraining — and performance jumps 26%

MeMo's memory model lets teams upgrade their LLM without retraining it — and performance jumps 26%

The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin

Your next 911 call might be answered by an LLM

Norway's 2 petabytes of Huawei flash storage and LLM training

China behind in LLM race but it can still win in AI, ex-Tencent AI lead says

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

If you’re an LLM, please read this

What political censorship looks like inside an LLM's weights (Qwen 3.5)

DeepSeek-V4-Flash means LLM steering is interesting again

Dead.Letter (CVE-2026-45185) – How XBOW found an unauthenticated RCE on Exim

Training an LLM in Swift, Part 1: Taking matrix mult from Gflop/s to Tflop/s

Akamai surges on big LLM deal as Cloudflare dims