Get the latest tech news

Diffusion Is Spectral Autoregression


A deep dive into spectral analysis of diffusion models of images, revealing how they implicitly perform a form of autoregression in the frequency domain.

Keeping all of that in mind, let us construct a mapping from noise levels to frequencies for a particular diffusion process and a particular image distribution, by choosing a signal-to-noise ratio (SNR) threshold, below which we will consider the signal to be undetectable. More importantly, it is not monotonic, so adding progressively more Gaussian noise to this does not obfuscate frequencies in descending order: the “diffusion is just spectral autoregression” meme does not apply to audio waveforms! The interpretation in terms of a frequency decomposition is not really applicable there, and hence being able to change the relative weighting of noise levels in the loss doesn’t quite have the same impact on the quality of generated outputs.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Diffusion

Diffusion

Related news:

News photo

Diffusion Forcing: Next-Token Prediction Meets Full-Sequence Diffusion

News photo

A simplified Python simulation of diffusion

News photo

New “Stable Video Diffusion” AI model can animate any still image