ABot-M0.5: Unified Mobility-and-Manipulation World Action Model Paper • 2607.00678 • Published 1 day ago • 9
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment Paper • 2604.12012 • Published Apr 13 • 15
The Hitchhiker's Guide to Agentic AI: From Foundations to Systems Paper • 2606.24937 • Published 11 days ago • 17
Seed2.0 Model Card: Towards Intelligence Frontier for Real-World Complexity Paper • 2607.00248 • Published 3 days ago • 11
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 234
MuSViT: A Foundation Vision Model for Sheet Music Representation Paper • 2606.31811 • Published 3 days ago • 4
BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language Paper • 2606.30319 • Published 4 days ago • 7
Scenes as Objects, Not Primitives: Instance-Structured 3D Tokenization from Unposed Views Paper • 2606.29513 • Published 5 days ago • 42
Agentic Abstention: Do Agents Know When to Stop Instead of Act? Paper • 2606.28733 • Published 6 days ago • 135
Qwen-RobotNav Technical Report: A Scalable Navigation Model Designed for an Agentic Navigation System Paper • 2606.18112 • Published 15 days ago • 22
Qwen-RobotManip Technical Report: Alignment Unlocks Scale for Robotic Manipulation Foundation Models Paper • 2606.17846 • Published 16 days ago • 26
TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents Paper • 2606.28480 • Published 7 days ago • 44
OSWorld2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks Paper • 2606.29537 • Published 5 days ago • 17
Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis Paper • 2606.29814 • Published 4 days ago • 9
Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent Paper • 2606.30616 • Published 4 days ago • 81