CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding • Paper • 2602.01785 • Published 6 days ago • 91
Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives • Paper • 2601.20833 • Published 10 days ago • 172
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines • Paper • 2601.09465 • Published 24 days ago • 41
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning • Paper • 2601.06943 • Published 27 days ago • 210
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length • Paper • 2512.04677 • Published Dec 4, 2025 • 170
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation • Paper • 2511.19320 • Published Nov 24, 2025 • 42
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models • Paper • 2503.07605 • Published Mar 10, 2025 • 66
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning • Paper • 2312.03849 • Published Dec 6, 2023 • 7
HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces • Paper • 2312.03160 • Published Dec 5, 2023 • 8
Self-conditioned Image Generation via Generating Representations • Paper • 2312.03701 • Published Dec 6, 2023 • 9
MagicStick: Controllable Video Editing via Control Handle Transformations • Paper • 2312.03047 • Published Dec 5, 2023 • 11