Get the latest tech news

Thinking Machines Lab wants to make AI models more consistent


In a blog post shared Wednesday, Mira Murati's startup offered a rare glimpse into some of work its doing to improve AI models.

In a blog post published on Wednesday, Murati’s research lab gave the world its first look into one of its projects: creating AI models with reproducible responses. The research blog post, titled “Defeating Nondeterminism in LLM Inference,” tries to unpack the root cause of what introduces randomness in AI model responses. The post, authored by Thinking Machines Lab researcher Horace He, argues that the root cause of AI models’ randomness is the way GPU kernels — the small programs that run inside of Nvidia’s computer chips — are stitched together in inference processing (everything that happens after you press enter in ChatGPT).

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of AI models

AI models

Related news:

News photo

Two authors file a proposed class action lawsuit against Apple, alleging Apple knowingly used a dataset of pirated books to train its AI models

News photo

Uber India Starts Offering Drivers Gigs Collecting and Classifying Info For AI Models

News photo

Uber India starts offering drivers gigs collecting and classifying info for AI models