deepseek-ai/DeepSeek-V3.1-Terminus • Text Generation • 685B • Updated Sep 29, 2025 • 5.84k • 360
Qwen/Qwen3-Next-80B-A3B-Instruct • Text Generation • 81B • Updated Sep 17, 2025 • 1.11M • 930
Running • FineVision: Open Data is All You Need • 218 • A new open-source dataset for training VLMs
google/embeddinggemma-300m • Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.18M • 1.46k
Paused • Featured • Qwen Image Edit • 807 • Edit and enhance images based on descriptive instructions
ngxson/Home-Cook-Mistral-Small-Omni-24B-2507-GGUF • Any-to-Any • 24B • Updated Jul 28, 2025 • 1.19k • 27
Running • The Ultra-Scale Playbook • 3.67k • The ultimate guide to training LLM on large GPU Clusters