Marco-MoE Collection A suit of multilingual MoE models with highly-sparse architectures • 4 items • Updated 2 days ago • 8
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 10 days ago • 125
PrismAudio: Decomposed Chain-of-Thoughts and Multi-dimensional Rewards for Video-to-Audio Generation Paper • 2511.18833 • Published Nov 24, 2025 • 5
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 13 days ago • 120
dots.mocr Collection Multimodal OCR: Parse Anything from Documents • 2 items • Updated 17 days ago • 7
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published 19 days ago • 306
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published 20 days ago • 184
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published 25 days ago • 152