Vignac, ClémentKrawczuk, IgorSiraudin, AntoineWang, BohanCevher, VolkanFrossard, Pascal2023-02-232023-02-232023-02-232023https://infoscience.epfl.ch/handle/20.500.14299/195050This work introduces DiGress, a discrete denoising diffusion model for generating graphs with categorical node and edge attributes. Our model utilizes a discrete diffusion process that progressively edits graphs with noise, through the process of adding or removing edges and changing the categories. A graph transformer network is trained to revert this process, simplifying the problem of distribution learning over graphs into a sequence of node and edge classification tasks. We further improve sample quality by introducing a Markovian noise model that preserves the marginal distribution of node and edge types during diffusion, and by incorporating auxiliary graph-theoretic features. A procedure for conditioning the generation on graph-level features is also proposed. DiGress achieves state-of-theart performance on molecular and non-molecular datasets, with up to 3x validity improvement on a planar graph dataset. It is also the first model to scale to the large GuacaMol dataset containing 1.3M drug-like molecules without the use of molecule-specific representations.ML-AIDiGress: Discrete Denoising diffusion for graph generationtext::conference output::conference proceedings::conference paper