Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit
OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support
Show HN: Lowfat – pluggable CLI filter that saved 91.8% of my LLM tokens
Fine-tuning an LLM to write docs like it's 1995
The LLM warnings Google fired Timnit Gebru over have all come true
GPT-5.5 dominates $1,500 LLM hacking test while Gemini refuses to even try
Show HN: Mnemo – local-first AI memory layer for any LLM (Rust, SQLite,petgraph)
MIT's MeMo lets teams swap in a better LLM without retraining — and performance jumps 26%
MeMo's memory model lets teams upgrade their LLM without retraining it — and performance jumps 26%
The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin
Your next 911 call might be answered by an LLM
Norway's 2 petabytes of Huawei flash storage and LLM training
China behind in LLM race but it can still win in AI, ex-Tencent AI lead says
768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second
If you’re an LLM, please read this
What political censorship looks like inside an LLM's weights (Qwen 3.5)
DeepSeek-V4-Flash means LLM steering is interesting again
Dead.Letter (CVE-2026-45185) – How XBOW found an unauthenticated RCE on Exim
Training an LLM in Swift, Part 1: Taking matrix mult from Gflop/s to Tflop/s