language models

Language models pack billions of concepts into 12k dimensions

Why language models hallucinate

How attention sinks keep language models stable

LangExtract: Python library for extracting structured data from language models

The Dangers of Stochastic Parrots: Can Language Models Be Too Big?

AbsenceBench: Language models can't tell what's missing

Anthropic researchers teach language models to fine-tune themselves

How much do language models memorize?

Type-constrained code generation with language models

Block Diffusion: Interpolating Autoregressive and Diffusion Language Models

Liquid: Language models are scalable and unified multi-modal generators

Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB

Circuit Tracing: Revealing Computational Graphs in Language Models (Anthropic)

VLMaterial: Procedural Material Generation with Large Vision-Language Models

TopoNets: High-Performing Vision and Language Models with Brain-Like Topography

TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023)

Letting Language Models Write My Website

PaliGemma 2: Powerful Vision-Language Models, Simple Fine-Tuning

AMD Open-Source 1B OLMo Language Models

Training Language Models to Self-Correct via Reinforcement Learning