Read news on language models with our app.
Read more in the app
Pretraining Language Models via Neural Cellular Automata
Tree Search Distillation for Language Models Using PPO
India Becomes World’s Most Active Market for Large-Language Models as AI Firms Eye Growth
Heretic: Automatic censorship removal for language models
Language models are injective and hence invertible
Antislop: A framework for eliminating repetitive patterns in language models
OpenTSLM: Language models that understand time series
Language models pack billions of concepts into 12k dimensions
Why language models hallucinate
How attention sinks keep language models stable
LangExtract: Python library for extracting structured data from language models
The Dangers of Stochastic Parrots: Can Language Models Be Too Big?
AbsenceBench: Language models can't tell what's missing
Anthropic researchers teach language models to fine-tune themselves
How much do language models memorize?
Type-constrained code generation with language models
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
Liquid: Language models are scalable and unified multi-modal generators
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB
Circuit Tracing: Revealing Computational Graphs in Language Models (Anthropic)