tokens

Read news on tokens with our app.

Read more in the app

$0.09 and $290.12 are both the price of 1M output tokens

Does Speaking to Agents Like Cavemen Save 65% of Tokens? We Test

I burned all my tokens researching how to save tokens

Running Gemma 4 26B at 5 tokens/sec on a 13-year-old Xeon with no GPU

The real prices of frontier models

Claude Code sends 33k tokens before reading the prompt; OpenCode sends 7k

Show HN: Docx-CLI: agents read/edit Word docs using 1/2 the time and tokens

Karp: Anthropic/OpenAI are stealing customer IP and their tokens have low value

Show HN: Recall – Local project memory for Claude Code

Show HN: CleverCrow: give tokens to your favorite projects

Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes

MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 tokens per second

What Are Tokens in LLMs?

Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering

Show HN: Lowfat – pluggable CLI filter that saved 91.8% of my LLM tokens

768GB of cheap Intel Optane DIMM memory sticks used to run 1-trillion-parameter LLM on a system with a single GPU — local Kimi K2.5 install achieved roughly 4 tokens per second

FBI warns Kali365 phishing kit is stealing Microsoft OAuth tokens at scale

America's top cyber-defense agency left a GitHub repo open with with passwords, keys, tokens – and incredibly obvious filenames

America's top cyber-defense agency left a GitHub repo open with passwords, keys, tokens – and incredibly obvious filenames

Using AI to click around on a website burns 45x as many tokens as just using APIs