Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Problem
Framing
Diffusion models had a variational formulation but had not shown GAN-class sample fidelity. The paper closes that gap with an -prediction parameterization of the reverse Gaussian chain, reaching CIFAR-10 IS and FID while remaining a likelihood model.
Currently Used Methods
Foundational and direct antecedents
- @DeepUnsupervisedLearningusing2015 — variational diffusion with a fixed Gaussian corruption process.
- Limitation in context: lacked a practical reverse parameterization with strong image fidelity.
- @dinhNVP2017 — exact-likelihood flow modeling with invertible transformations.
- Limitation in context: did not match DDPM's perceptual quality on these image benchmarks.
- @songScoreSDE2020 — multi-noise score matching with Langevin-style sampling.
- Limitation in context: not the same discrete variational diffusion chain.
- @ronnebergerUNet2015 — U-Net encoder–decoder backbone for dense image prediction.
- Limitation in context: needs timestep conditioning for reverse-process denoising.
Proposed Method
Architecture
The reverse model is a U-Net-like Wide-ResNet with group normalization, shared weights across timesteps, sinusoidal timestep embeddings, and self-attention at . The model uses four resolutions; the model uses six.

Loss / Objective
Training uses the simplified noise-prediction objective at a random timestep.
Sampling Rule / Algorithm
Sampling starts from Gaussian noise and applies one reverse Gaussian step per timestep.
Training Procedure
- Diffusion length: .
- Forward schedule: linear, to .
- Batch size: 128 on CIFAR-10; 64 on larger images.
- Optimizer: Adam.
- Learning rate: .
- EMA decay: 0.9999.
- CIFAR-10 dropout: 0.1.
- Reverse variances: fixed, not learned.
Evaluation
Datasets
- CIFAR-10, unconditional,
- CelebA-HQ,
- LSUN Bedroom,
- LSUN Church,
- LSUN Cat,
Metrics
- Inception Score
- FID
- Negative log-likelihood in bits/dim
- Rate-distortion RMSE
Headline results
- CIFAR-10 unconditional: IS , FID .
- CIFAR-10 test-set FID: .
- LSUN Bedroom : FID .
- LSUN Church : FID .
- CIFAR-10 best-sample model: rate bits/dim, distortion bits/dim.

Ablations
- Objective: full variational bound improves likelihood; improves sample quality.
- Reverse target: predicting works only with variational-bound training.
- Variance parameterization: learned diagonal variance destabilizes training and worsens fidelity.
- Sampling length: fixes generation cost at 1000 network evaluations.
Method Strengths and Weaknesses
Strengths
- Reaches CIFAR-10 FID , beating many published image generators.
- Uses a simple Gaussian reverse chain with fixed variances.
- Connects diffusion training to denoising score matching.
- Shows progressive generation and compression behavior across timesteps.
Weaknesses
- Sampling needs neural evaluations.
- Likelihood trails stronger exact-likelihood image models.
- Learned reverse variances hurt stability and fidelity.
- Best likelihood objective differs from best sample-quality objective.
Suggestions from the authors
- Shorten diffusion chains for faster sampling.
- Find objectives that improve likelihood and sample quality together.
- Explain the progressive lossy coding bias more precisely.
- Explore alternative diffusion lengths and forward processes.
Links
Prior Papers
- @DeepUnsupervisedLearningusing2015 — introduces variational diffusion modeling that DDPM makes practical.
- @songScoreSDE2020 — gives the score-matching view tied to DDPM's -prediction objective.
- @ronnebergerUNet2015 — supplies the U-Net backbone adapted for reverse-process denoising.
- @dinhNVP2017 — provides a likelihood-based image-modeling baseline that contrasts with DDPM's fidelity tradeoff.
Further Papers
- @DenoisingDiffusionImplicitModels2020 — replaces the long Markov chain with faster implicit sampling trajectories.
- @dhariwalDiffusionBeatGANs2021 — strengthens training and sampling until diffusion beats GANs on fidelity.
- @nicholImprovedDDPM2021 — extends DDPM with learned variances and faster sampling.
- @rombachLatentDiffusion2022 — moves DDPM-style diffusion to latent space for scalable synthesis.
- @ClassifierFreeDiffusionGuidance2022 — adds the guidance rule that becomes standard in conditional diffusion.
- @GenerativeInverseDesignof2023 — applies diffusion-style generation to inverse design problems.