Read news on language models with our app.
Read more in the app
Language models pack billions of concepts into 12k dimensions
Why language models hallucinate
How attention sinks keep language models stable
LangExtract: Python library for extracting structured data from language models
The Dangers of Stochastic Parrots: Can Language Models Be Too Big?
AbsenceBench: Language models can't tell what's missing
Anthropic researchers teach language models to fine-tune themselves
How much do language models memorize?
Type-constrained code generation with language models
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
Liquid: Language models are scalable and unified multi-modal generators
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB
Circuit Tracing: Revealing Computational Graphs in Language Models (Anthropic)
VLMaterial: Procedural Material Generation with Large Vision-Language Models
TopoNets: High-Performing Vision and Language Models with Brain-Like Topography
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023)
Letting Language Models Write My Website
PaliGemma 2: Powerful Vision-Language Models, Simple Fine-Tuning
AMD Open-Source 1B OLMo Language Models
Training Language Models to Self-Correct via Reinforcement Learning