-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 29 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 14 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 44 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 23
Collections
Discover the best community collections!
Collections including paper arxiv:2511.18423
-
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Paper • 2511.11007 • Published • 15 -
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 25 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
MemEvolve: Meta-Evolution of Agent Memory Systems
Paper • 2512.18746 • Published • 27
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 125 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 67
-
Memory in the Age of AI Agents
Paper • 2512.13564 • Published • 131 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 17 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 127
-
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Paper • 2511.16043 • Published • 108 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
Paper • 2511.19900 • Published • 48 -
MobiAgent: A Systematic Framework for Customizable Mobile Agents
Paper • 2509.00531 • Published • 7
-
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 25 -
OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists
Paper • 2511.16931 • Published • 7 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
Paper • 2511.11793 • Published • 165
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 29 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 14 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 44 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 23
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 125 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 67
-
Memory in the Age of AI Agents
Paper • 2512.13564 • Published • 131 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 17 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 127
-
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Paper • 2511.16043 • Published • 108 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
Paper • 2511.19900 • Published • 48 -
MobiAgent: A Systematic Framework for Customizable Mobile Agents
Paper • 2509.00531 • Published • 7
-
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Paper • 2511.11007 • Published • 15 -
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 25 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
MemEvolve: Meta-Evolution of Agent Memory Systems
Paper • 2512.18746 • Published • 27
-
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 25 -
OmniScientist: Toward a Co-evolving Ecosystem of Human and AI Scientists
Paper • 2511.16931 • Published • 7 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
Paper • 2511.11793 • Published • 165