Get the latest tech news

Maintaining large-scale AI capacity at Meta


Meta is currently operating many data centers with GPU training clusters across the world. Our data centers are the backbone of our operations, meticulously designed to support the scaling demands …

Our increased focus on AI was driven both by its rise in driving business outcomes and the huge growth in these types of workloads’ computational needs. They can be as small as a single GPU running for a couple minutes, while generative AI jobs can have trillions of parameters and often span thousands of hosts that need to work together and are very sensitive to interruptions. We also try to stay as current and flexible as possible with the software stack; in the event of firmware upgrades, this allows us to utilize new features or reduce failure rates.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Meta

Meta

Photo of scale AI capacity

scale AI capacity

Related news:

News photo

Meta pauses AI models launch in Europe due to Irish request

News photo

Meta accused of trying to discredit ad researchers

News photo

Apple and Meta may face official EU charges for failing to allow marketplace competition