Glance: Accelerating Diffusion Models with 1 Sample Paper • 2512.02899 • Published about 1 month ago • 28
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation Paper • 2511.11434 • Published Nov 14, 2025 • 44
Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated Nov 19, 2025 • 30
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation Paper • 2511.02778 • Published Nov 4, 2025 • 101
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback Paper • 2511.01678 • Published Nov 3, 2025 • 35
From Charts to Code: A Hierarchical Benchmark for Multimodal Models Paper • 2510.17932 • Published Oct 20, 2025 • 7
Paper2Video: Automatic Video Generation from Scientific Papers Paper • 2510.05096 • Published Oct 6, 2025 • 118
V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models Paper • 2504.06148 • Published Apr 8, 2025 • 13
Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models Paper • 2503.20198 • Published Mar 26, 2025 • 4