Data Mining and Information Systems Lab
dmis-lab
Med-PRM
This collection hosts the Med-PRM series introduced in the paper "Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards".
- dmis-lab/llama-3.1-medprm-reward-v1.0
  Text Generation • Updated • 150 • 16
- dmis-lab/llama-3.1-medprm-reward-raw-training-set
  Viewer • Updated • 11.7k • 7
- dmis-lab/llama-3.1-medprm-reward-training-set
  Viewer • Updated • 11.7k • 30 • 9
- dmis-lab/llama-3.1-medprm-reward-raw-test-set
  Viewer • Updated • 5.47k • 7
Outlier-Safe Pre-Training (OSP)
A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework.
models 54
dmis-lab/OSP-1.4B-100B-Shampoo-SSNorm-EmbProj
1B • Updated • 1 • 4
dmis-lab/OSP-1.4B-100B-Shampoo-SSNorm
1B • Updated • 3
dmis-lab/OSP-1.4B-100B-Muon-SSNorm-EmbProj
1B • Updated • 4 • 4
dmis-lab/OSP-1.4B-100B-Muon-EmbProj
1B • Updated • 1 • 3
dmis-lab/OSP-1.4B-100B-Muon-SSNorm
1B • Updated • 1 • 3
dmis-lab/OSP-1.4B-100B-Muon-Only
1B • Updated • 3 • 3
dmis-lab/OSP-1.4B-100B-Muon
1B • Updated • 3
dmis-lab/OSP-1.4B-100B-Adam
1B • Updated • 4 • 3
dmis-lab/OSP-1.4B-1T-Muon-SSNorm-EmbProj
1B • Updated • 4 • 4
dmis-lab/OSP-1.4B-1T-Adam
1B • Updated • 5 • 3
datasets 10
dmis-lab/llama-3.1-medprm-reward-raw-test-set
Viewer • Updated • 5.47k • 7
dmis-lab/llama-3.1-medprm-reward-raw-training-set
Viewer • Updated • 11.7k • 7
dmis-lab/llama-3.1-medprm-reward-test-set
Updated • 18 • 2
dmis-lab/llama-3.1-medprm-reward-training-set
Viewer • Updated • 11.7k • 30 • 9
dmis-lab/TemporalHead
Viewer • Updated • 11 • 132 • 1
dmis-lab/meerkat-instructions
Viewer • Updated • 440k • 119 • 10
dmis-lab/RF-Collection
Preview • Updated • 164 • 1
dmis-lab/ChroKnowBench
Preview • Updated • 294 • 7
dmis-lab/ETHIC
Viewer • Updated • 1.99k • 65 • 7
dmis-lab/MedLFQA
Viewer • Updated • 4.95k • 58 • 17