Get the latest tech news

Scaling Laws, Carefully


Scaling laws are one of the most critical empirical findings in deep learning. The observation is simple in form: the training loss $L$ decreases predictably as we scale up model size $N$, dataset size $D$, and compute $C$, following a power-law curve, which appears as a straight line on a log-log plot. We can view scaling laws as a framework for describing the relationship between compute, loss, model size and data; at its core, it is about how to allocate precious compute optimally between $N$ and $D$.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of laws

laws

Related news:

News photo

How to build a virtual cell and biology scaling laws

News photo

Grok AI broke Canada’s laws, created 6K sexual deepfakes per hour: probe

News photo

Meta Deploys ‘Momfluencers’ to Counter Child Safety Criticism: Meta is using Instagram influencer moms to soften its image on child safety amid scrutiny and legal pressure over its impact on minors. Some influencers are advocating for Meta top priority: laws forcing Apple and Google to verify ages