Ayush Tanwar
Ayush342
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization new activity
6 months ago
mradermacher/Bitnet-Llama-70M-GGUF:Base model updated
a Space over 1 year ago
Ayush342/test