I made a Hugging Face Space for SCAIL-2 🤗 Reference character + driving motion → animated result. A simple demo to explore the paper’s core workflow with curated examples. 👉 fffiloni/SCAIL-2
Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization Paper • 2606.11180 • Published 24 days ago • 35
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning Paper • 2606.10804 • Published 25 days ago • 52
MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold Paper • 2606.13376 • Published 23 days ago • 15
World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible Paper • 2606.13652 • Published 23 days ago • 16
OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 23 days ago • 113
PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory Paper • 2606.16449 • Published 19 days ago • 6
Memento: Reconstruct to Remember for Consistent Long Video Generation Paper • 2606.14667 • Published 22 days ago • 18
DreamX-World 1.0: A General-Purpose Interactive World Model Paper • 2606.16993 • Published 19 days ago • 113
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 25 days ago • 130
Holo-World: Unified Camera, Object and Weather Control for Video World Model Paper • 2606.20083 • Published 16 days ago • 11
Improving Text-to-Music Generation with Human Preference Rewards Paper • 2606.21670 • Published 15 days ago • 1
Libretto: Giving LLM Agents a Sense of Musical Structure Paper • 2606.22708 • Published 13 days ago • 2
MeshFlow: Mesh Generation with Equivariant Flow Matching Paper • 2606.23489 • Published 12 days ago • 3
Vera: A Layered Diffusion Model for Content-Preserving Video Editing Paper • 2606.23610 • Published 12 days ago • 11
ShutterMuse: Capture-Time Photography Guidance with MLLMs Paper • 2606.25763 • Published 10 days ago • 46
DomainShuttle: Freeform Open Domain Subject-driven Text-to-video Generation Paper • 2606.26058 • Published 10 days ago • 67
Confidence-Aware Tool Orchestration for Robust Video Understanding Paper • 2606.26904 • Published 9 days ago • 11