Get the latest tech news

The "it" in AI models is the dataset


’ve been at OpenAI for almost a year now. In that time, I’ve trained a lot of generative models.

As I’ve spent these hours observing the effects of tweaking various model configurations and hyperparameters, one thing that has struck me is the similarities in between all the training runs. Sufficiently large diffusion conv-unets produce the same images as ViT generators. It implies that model behavior is not determined by architecture, hyperparameters, or optimizer choices.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of AI models

AI models

Photo of dataset

dataset

Related news:

News photo

Geoffrey Hinton: Open sourcing AI models akin to open sourcing nuclear weapons

News photo

Meta Releases Llama 3 AI Models, Claiming Top Performance

News photo

Overture Maps Foundation Releases Beta of Its First Open Map Dataset