Knowledge Distillation - Students
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • Updated Sep 25, 2024 • 6.91M • 6.07k
Qwen/Qwen2.5-7B-Instruct • Text Generation • 8B • Updated Jan 12, 2025 • 8.5M • 1.36k
Knowledge Distillation - Teachers
meta-llama/Llama-3.1-70B-Instruct • Text Generation • 71B • Updated Dec 15, 2024 • 444k • 925
Qwen/Qwen2.5-72B-Instruct • Text Generation • 73B • Updated Jan 12, 2025 • 328k • 952