Get the latest tech news

Reinforcement learning, explained with a minimum of math and jargon


To create reliable agents, AI companies had to go beyond predicting the next token.

“Over the past week, developers around the world have begun building ‘autonomous agents’ that work with large language models (LLMs) such as OpenAI’s GPT-4 to solve complex problems,” Mark Sullivan wrote for Fast Company. In the process, I hope to give readers an intuitive understanding of how reinforcement learning helped to enable the new generation of agentic AI systems that began to appear in the second half of 2024. You’d need to convert every principle of good driving—including subtle considerations like following distances, taking turns at intersections, and when it’s OK to cross a double yellow line—into explicit mathematical formulas.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Minimum

Minimum

Photo of math

math

Photo of jargon

jargon

Related news:

News photo

Hospital cyber attacks cost $600K/hour. Here’s how AI is changing the math

News photo

Painting with Math: A Gentle Study of Raymarching (2023)

News photo

As a programmer, get your math sorted