LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 5 days ago • 127
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 23 days ago • 69
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9, 2025 • 105
Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning Paper • 2601.22297 • Published Jan 29 • 2
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 228
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following Paper • 2511.21662 • Published Nov 26, 2025 • 11
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model Paper • 2509.00676 • Published Aug 31, 2025 • 85
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs Paper • 2506.10128 • Published Jun 11, 2025 • 22
Cosmos-Reason1 Collection ⚠️ The latest version of Cosmos Reason is now live! https://huggingface.co/collections/nvidia/cosmos-reason2 • 5 items • Updated 1 day ago • 42
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published Apr 21, 2025 • 69
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 90
Eagle Collection Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input. • 17 items • Updated 1 day ago • 45
LLaVA-Critic Collection as a general evaluator for assessing model performance • 6 items • Updated Oct 6, 2024 • 10
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper • 2410.02712 • Published Oct 3, 2024 • 37