hyunseoki/verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1 Text Generation • 8B • Updated 29 days ago • 425
hyunseoki/verl-math-transfer-7bi-to-3bi-fix05-pool7to1 Text Generation • 8B • Updated 29 days ago • 325
tianrui6641/omnicoder_local9b_blackwell_coregen_hdlfix_v2_hf_r64_epoch1-merged-bf16 Image-Text-to-Text • 9B • Updated 28 days ago • 2
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged Text Generation • 7B • Updated 27 days ago • 2.34k
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged Text Generation • 7B • Updated 27 days ago • 82
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4 Text Generation • Updated 27 days ago • 32
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4_merged Text Generation • 2B • Updated 27 days ago • 79
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-16 Text Generation • Updated 27 days ago • 32
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-16_merged Text Generation • 2B • Updated 27 days ago • 75
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4 Text Generation • Updated 27 days ago • 31
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4_merged Text Generation • 2B • Updated 27 days ago • 79
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-16_merged Text Generation • 2B • Updated 27 days ago • 72
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-16 Text Generation • Updated 27 days ago • 30