FreeU: Free Lunch in Diffusion U-Net
Paper
• 2309.11497
• Published
• 66
Imagic: Text-Based Real Image Editing with Diffusion Models
Paper
• 2210.09276
• Published
• 1
On Architectural Compression of Text-to-Image Diffusion Models
Paper
• 2305.15798
• Published
• 5
Wuerstchen: Efficient Pretraining of Text-to-Image Models
Paper
• 2306.00637
• Published
• 13
CLIP-KD: An Empirical Study of Distilling CLIP Models
Paper
• 2307.12732
• Published
Online Clustered Codebook
Paper
• 2307.15139
• Published
• 1
Residual Denoising Diffusion Models
Paper
• 2308.13712
• Published
• 3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based
Text-to-Image Generation
Paper
• 2309.06380
• Published
• 33
Restart Sampling for Improving Generative Processes
Paper
• 2306.14878
• Published
• 5
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Paper
• 2306.07280
• Published
• 25
Emu: Enhancing Image Generation Models Using Photogenic Needles in a
Haystack
Paper
• 2309.15807
• Published
• 34
Finite Scalar Quantization: VQ-VAE Made Simple
Paper
• 2309.15505
• Published
• 24
Muse: Text-To-Image Generation via Masked Generative Transformers
Paper
• 2301.00704
• Published
PixArt-α: Fast Training of Diffusion Transformer for
Photorealistic Text-to-Image Synthesis
Paper
• 2310.00426
• Published
• 61
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model
Statistics
Paper
• 2310.13268
• Published
• 18
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons
Images
Paper
• 2310.16825
• Published
• 36
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of
Experts And Frequency-augmented Decoder Approach
Paper
• 2310.12004
• Published
• 2
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial
Understanding
Paper
• 2310.15308
• Published
• 23
Matryoshka Diffusion Models
Paper
• 2310.15111
• Published
• 45
Beyond U: Making Diffusion Models Faster & Lighter
Paper
• 2310.20092
• Published
• 12
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion
Models
Paper
• 2311.04145
• Published
• 34
Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion
Models
Paper
• 2309.14068
• Published
• 1
Denoising Diffusion Step-aware Models
Paper
• 2310.03337
• Published
• 1
DiffNAS: Bootstrapping Diffusion Models by Prompting for Better
Architectures
Paper
• 2310.04750
• Published
• 1
PIXART-δ: Fast and Controllable Image Generation with Latent
Consistency Models
Paper
• 2401.05252
• Published
• 49
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and
Generating with Multimodal LLMs
Paper
• 2401.11708
• Published
• 30
UNIMO-G: Unified Image Generation through Multimodal Conditional
Diffusion
Paper
• 2401.13388
• Published
• 13
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Paper
• 2401.14404
• Published
• 18
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
Matching
Paper
• 2404.03653
• Published
• 35