849

Avi

avahal

AI & ML interests

LLMs

Recent Activity

commented on a paper about 9 hours ago

Video-Based Reward Modeling for Computer-Use Agents

commented on a paper about 9 hours ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

commented on a paper about 9 hours ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

View all activity

Organizations

None yet

commented 4 papers about 9 hours ago

Video-Based Reward Modeling for Computer-Use Agents

Paper • 2603.10178 • Published 6 days ago • 36 •

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published 4 days ago • 45 •

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published 4 days ago • 56 •

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published 4 days ago • 78 •

commented 4 papers about 10 hours ago

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

Paper • 2603.06577 • Published 10 days ago • 43 •

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published 6 days ago • 45 •

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published 6 days ago • 62 •

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published 13 days ago • 137 •

commented 4 papers about 11 hours ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published 12 days ago • 159 •

commented 2 papers 6 days ago

Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published 12 days ago • 38 •

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published 10 days ago • 83 •

commented 5 papers 7 days ago

WildActor: Unconstrained Identity-Preserving Video Generation

Paper • 2603.00586 • Published 16 days ago • 34 •

Progressive Residual Warmup for Language Model Pretraining

Paper • 2603.05369 • Published 11 days ago • 33 •

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Paper • 2603.05438 • Published 11 days ago • 36 •

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published 11 days ago • 54 •

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 10 days ago • 105 •

commented a paper 10 days ago

RoboPocket: Improve Robot Policies Instantly with Your Phone

Paper • 2603.05504 • Published 11 days ago • 31 •

Avi

AI & ML interests

Recent Activity

Organizations

avahal's activity