Running 5 Transformers Model Architectures 📐 5 Browse and filter transformer model architecture diagrams
view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 23 days ago • 79
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 Text Generation • 561B • Updated 22 days ago • 147k • • 256
view article Article Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining nvidia • 28 days ago • 17
view article Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • May 25 • 125
Running on CPU Upgrade Featured 409 ML Intern 🤖 409 Explore machine learning tasks via an interactive web app
The ATOM Report: Measuring the Open Language Model Ecosystem Paper • 2604.07190 • Published Apr 8 • 5