KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance • Paper • 2604.12627 • Published 12 days ago • 98
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe • Paper • 2604.13016 • Published 12 days ago • 85
AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks • Paper • 2604.01487 • Published 25 days ago • 10
SLEA-RL: Step-Level Experience Augmented Reinforcement Learning for Multi-Turn Agentic Training • Paper • 2603.18079 • Published Mar 18 • 1