Get the latest tech news

The model is the product


Old data, new models

Google’s Gemini and Perplexity’s chat assistants also offer “Deep Research” features, but neither has published any literature on how they optimized their models or systems for the task or any substaintial quantitative evaluations (…) We will make an assumption that the fine-tuning work done is non-substantial. You can count all theses companies on your hands: Prime Intellect, Moondream, Arcee, Nous, Pleias, Jina, the HuggingFace pretraining team (actually tiny)… Along with a few more academic actors (Allen AI, Eleuther…) they build and support most of the current open infrastructure for training. I believe the message comes straight from Sam Altman and will likely result in some adjustment in the next YC batch but pinpoint to a larger shift: soon the big labs select partners won't be API customers but associated contractors involved in the earlier training stage.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of product

product

Photo of model

model

Related news:

News photo

Mac Studio With M3 Ultra Runs Massive DeepSeek R1 AI Model Locally

News photo

Baidu Releases Reasoning AI Model to Take On DeepSeek

News photo

iPhone 17 Pro Max Rumors Allegedly Refer to 'iPhone 17 Ultra' Model