Hugging Face
RLAIF/dpo_answer_reddit_judge_1e-6_0.02_4B_8B
Safetensors
main
8.84 GB
1 contributor
History: 2 commits
AngelRaychev · Upload folder using huggingface_hub · 59f94de · verified · 4 months ago
global_step_468 · Upload folder using huggingface_hub · 4 months ago
.gitattributes · 1.59 kB · Upload folder using huggingface_hub · 4 months ago