A curated list of awesome discrete diffusion models resources.
This repo is maintained by Subham Sahoo, Yingheng Wang, and Yair Schiff. Feel free to send pull requests to add more papers! Papers must be added in a chronological sequence, with the most recent accepted papers taking precedence over unaccepted papers. Please use the following format:
{paper-name}, {conference} {year} [[link-to-the-abstract-page], [code-if-available]]
- Introductory Materials
- Topic areas
- Getting started with Diffusion Language Models, 2024.
- Diffusion Language Models, 2023 [URL]
- My notes on discrete denoising diffusion models (D3PMs), 2022 [URL]
- Simple and Effective Masked Diffusion Language Models, NeurIPS 2024 [arXiv, code]
- Simplified and Generalized Masked Diffusion for Discrete Data, NeurIPS 2024 [arXiv]
- Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution, ICML 2024 [arXiv, code]
- Think While You Generate: Discrete Diffusion with Planned Denoising, arXiv 2024 [arXiv, code]
- Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data, arXiv 2024 [arXiv, code]
- Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning, ICLR 2023 [arXiv, code]
- DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models, ICLR 2023 [arXiv, code]
- FiLM: Fill-in Language Models for Any-Order Generation, arXiv 2023 [arXiv, code]
- A Continuous Time Framework for Discrete Denoising Models, NeurIPS 2022 [arXiv, code]
- Autoregressive Diffusion Models, ICLR 2022 [arXiv]
- EdiT5: Semi-Autoregressive Text Editing with T5 Warm-Start, arXiv 2022 [arXiv, code]
- Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions, NeurIPS 2021 [arXiv, code]
- Structured Denoising Diffusion Models in Discrete State-Spaces, NeurIPS 2021 [arXiv, code]
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control, ACL 2023 [arXiv, code]
- Diffusion-LM Improves Controllable Text Generation, NeurIPS 2022 [arXiv, code]
- Self-conditioned Embedding Diffusion for Text Generation, NeurIPS 2022 [arXiv]
- Continuous Diffusion for Categorical Data, arXiv 2022 [arXiv]
- Discrete Flow Matching, NeurIPS 2024 [arXiv]
- Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design, ICML 2024 [arXiv]
- Beyond Autoregression: Fast LLMs via Self-Distillation Through Time, arXiv 2024 [arXiv]
- Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling, arXiv 2024 [arXiv]
- Informed Correctors for Discrete Diffusion Models, arXiv 2024 [arXiv]
- Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models, arXiv 2024 [arXiv]
- Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction, arXiv 2024 [arXiv]
- Unlocking Guidance for Discrete State-Space Diffusion and Flow Models, arXiv 2024 [arXiv]
- Protein Design with Guided Discrete Diffusion, NeurIPS 2023 [arXiv, code]
- Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion, CORR 2024 [arXiv, code]
- DINOISER: Diffused Conditional Sequence Learning By Manipulating Noises, TACL 2024 [arXiv, code]
- DiffusER: Discrete Diffusion via Edit-based Reconstruction, ICLR 2023 [arXiv, code]
- A Cheaper and Better Diffusion Language Model with Soft-Masked Noise, EMNLP 2023 [arXiv, code]
- DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models, ACL 2023 [arXiv, code]
- Discrete Copula Diffusion, arXiv 2024 [arXiv]
- Formulating Discrete Probability Flow Through Optimal Transport, NeurIPS 2023 [arXiv, code]
- Categorical SDEs with Simplex Diffusion, arXiv 2022 [arXiv]
- Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design, arXiv 2024 [arXiv, code]
- Scaling Diffusion Language Models via Adaptation from Autoregressive Models, arXiv 2024 [arXiv]
- Scaling up Masked Diffusion Models on Text, arXiv 2024 [arXiv]
- Likelihood-Based Diffusion Language Models, NeurIPS 2023 [arXiv, code]
- Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning, arXiv 2023 [arXiv, code]