baohao
/

SAGE-light_Llama-3.2-3B-Instruct

Model card Files Files and versions

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Paper: https://huggingface.co/papers/2602.03143

Training set: https://huggingface.co/datasets/baohao/sage_train

Validation set: https://huggingface.co/datasets/baohao/sage_validation

Github: https://github.com/BaohaoLiao/SAGE.git

Downloads last month: 3

Safetensors

Model size

4B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for baohao/SAGE-light_Llama-3.2-3B-Instruct

Quantizations

Collection including baohao/SAGE-light_Llama-3.2-3B-Instruct

SAGE

Self-Hinting Language Models Enhance Reinforcement Learning • 23 items • Updated 21 days ago • 2

Paper for baohao/SAGE-light_Llama-3.2-3B-Instruct

Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published Feb 3 • 31