t2i - a Solaren Collection

Solaren 's Collections

lm

t2i

updated Apr 7, 2024

FreeU: Free Lunch in Diffusion U-Net

Paper • 2309.11497 • Published Sep 20, 2023 • 66
Imagic: Text-Based Real Image Editing with Diffusion Models

Paper • 2210.09276 • Published Oct 17, 2022 • 1
On Architectural Compression of Text-to-Image Diffusion Models

Paper • 2305.15798 • Published May 25, 2023 • 5
Wuerstchen: Efficient Pretraining of Text-to-Image Models

Paper • 2306.00637 • Published Jun 1, 2023 • 13
CLIP-KD: An Empirical Study of Distilling CLIP Models

Paper • 2307.12732 • Published Jul 24, 2023
Online Clustered Codebook

Paper • 2307.15139 • Published Jul 27, 2023 • 1
Residual Denoising Diffusion Models

Paper • 2308.13712 • Published Aug 25, 2023 • 3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 33
Restart Sampling for Improving Generative Processes

Paper • 2306.14878 • Published Jun 26, 2023 • 5
Controlling Text-to-Image Diffusion by Orthogonal Finetuning

Paper • 2306.07280 • Published Jun 12, 2023 • 25
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

Paper • 2309.15807 • Published Sep 27, 2023 • 34
Finite Scalar Quantization: VQ-VAE Made Simple

Paper • 2309.15505 • Published Sep 27, 2023 • 24
Muse: Text-To-Image Generation via Masked Generative Transformers

Paper • 2301.00704 • Published Jan 2, 2023
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Paper • 2310.00426 • Published Sep 30, 2023 • 61
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics

Paper • 2310.13268 • Published Oct 20, 2023 • 18
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images

Paper • 2310.16825 • Published Oct 25, 2023 • 36
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach

Paper • 2310.12004 • Published Oct 18, 2023 • 2
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

Paper • 2310.15308 • Published Oct 23, 2023 • 23
Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 45
Beyond U: Making Diffusion Models Faster & Lighter

Paper • 2310.20092 • Published Oct 31, 2023 • 12
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Paper • 2311.04145 • Published Nov 7, 2023 • 34
Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models

Paper • 2309.14068 • Published Sep 25, 2023 • 1
Denoising Diffusion Step-aware Models

Paper • 2310.03337 • Published Oct 5, 2023 • 1
DiffNAS: Bootstrapping Diffusion Models by Prompting for Better Architectures

Paper • 2310.04750 • Published Oct 7, 2023 • 1
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models

Paper • 2401.05252 • Published Jan 10, 2024 • 49
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Paper • 2401.11708 • Published Jan 22, 2024 • 30
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion

Paper • 2401.13388 • Published Jan 24, 2024 • 13
Deconstructing Denoising Diffusion Models for Self-Supervised Learning

Paper • 2401.14404 • Published Jan 25, 2024 • 18
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Paper • 2404.03653 • Published Apr 4, 2024 • 35