arxiv:2604.18486
Tianyi Jiang
LumosJiang
AI & ML interests
None yet
Recent Activity
upvoted a paper about 18 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards
updated a model 9 days ago
LumosJiang/qwen3-8b-base-sft-open-thoughts-114k-800steps
published a model 9 days ago
LumosJiang/qwen3-8b-base-sft-open-thoughts-114k-800steps
Organizations
None yet