Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 57
Article Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text Oct 20, 2025 • 34
HUME: Measuring the Human-Model Performance Gap in Text Embedding Task Paper • 2510.10062 • Published Oct 11, 2025 • 8
Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models Aug 4, 2025 • 29
Dynaword: From One-shot to Continuously Developed Datasets Paper • 2508.02271 • Published Aug 4, 2025 • 14
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 43
Danish Text Datasets Collection These include high-quality Danish text datasets for pre-training, fine-tuning, etc. • 16 items • Updated Dec 15, 2024 • 3
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks Paper • 2406.12925 • Published Jun 14, 2024 • 25
DANSK and DaCy 2.6.0: Domain Generalization of Danish Named Entity Recognition Paper • 2402.18209 • Published Feb 28, 2024 • 1
State-of-the-art Danish Models Collection These models constitute state-of-the-art models for Danish within their respective domain (highlighted below the model). • 18 items • Updated Nov 4, 2025 • 16
ScandEval: A Benchmark for Scandinavian Natural Language Processing Paper • 2304.00906 • Published Apr 3, 2023 • 4
Danish Benchmarks Collection Benchmarks for evaluating Danish Models. • 2 items • Updated Jun 9, 2024 • 4