Bringing BERT into modernity via both architecture changes and scaling
-
answerdotai/ModernBERT-base
Fill-Mask • 0.1B • Updated • 980k • 997 -
answerdotai/ModernBERT-large
Fill-Mask • Updated • 191k • 455 -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 161