Get the latest tech news

Beyond GPT architecture: Why Google’s Diffusion approach could reshape LLM deployment


Gemini Diffusion is also useful for tasks such as refactoring code, adding new features to applications, or converting an existing codebase to a different language.

(Editor’s note: We’ll be unpacking paradigm shifts like diffusion-based language models—and what it takes to run them in production—at VB Transform, June 24–25 in San Francisco, alongside Google DeepMind, LinkedIn and other enterprise AI leaders.) In an interview with VentureBeat, Brendan O’Donoghue, research scientist at Google DeepMind and one of the leads on the Gemini Diffusion project, elaborated on some of the advantages of diffusion-based techniques when compared to autoregression. BenchmarkTypeGemini DiffusionGemini 2.0 Flash-Lite LiveCodeBench (v6)Code30.9%28.5%BigCodeBenchCode45.4%45.8%LBPP (v2)Code56.8%56.0%SWE-Bench Verified*Code22.9%28.5%HumanEvalCode89.6%90.2%MBPPCode76.0%75.8%GPQA DiamondScience40.4%56.5%AIME 2025Mathematics23.3%20.0%BIG-Bench Extra HardReasoning15.0%21.0%Global MMLU (Lite)Multilingual69.1%79.0%* Non-agentic evaluation (single turn edit only), max prompt length of 32K.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of Google

Google

Photo of GPT

GPT

Photo of LLM deployment

LLM deployment

Related news:

News photo

Google’s Plan to Buy Security Firm Wiz Gets Antitrust Review

News photo

Google's new AI Search test might help you understand confusing topics quicker

News photo

Do you trust Xi with your 'private' browsing data? Apple, Google stores still offer China-based VPNs, report says