LitBench SAA-Lab/LitBench-Train Viewer • Updated Jul 7, 2025 • 43.8k • 131 • 3 SAA-Lab/LitBench-Rationales Viewer • Updated May 16, 2025 • 43.7k • 299 SAA-Lab/LitBench-Test Viewer • Updated Jul 7, 2025 • 2.38k • 47 LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1, 2025 • 4
LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1, 2025 • 4
The Sound of Syntax Models SAA-Lab/Qwen2.5-Omni-7B-UltraSuite 11B • Updated May 9, 2025 SAA-Lab/Qwen2.5-Omni-3B-UltraSuite 6B • Updated May 10, 2025 • 4 SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite 8B • Updated May 10, 2025 • 6 • 2 SAA-Lab/Qwen2.5-Omni-7B-UltraSuite-woA 11B • Updated May 14, 2025 • 6
Creative-Writing-Verifier Data and model for SAA-Lab/writingprompts-pairwise-test Viewer • Updated Mar 9, 2025 • 1k • 8 SAA-Lab/writingprompts-pairwise-train Viewer • Updated Mar 9, 2025 • 19.7k • 7 SAA-Lab/wp_test_0421 Viewer • Updated Apr 21, 2025 • 6.22k • 2 • 1 SAA-Lab/wp_train_0421 Viewer • Updated Apr 21, 2025 • 48.8k • 5
SLPHelmDatasets SAA-Lab/SLPHelmDataset Viewer • Updated May 15, 2025 • 19.4k • 13.2k SAA-Lab/SLPHelmUltraSuitePlus Viewer • Updated Sep 14, 2025 • 926 • 21 SAA-Lab/SLPHelm Viewer • Updated Oct 4, 2025 • 28.6k • 11
WP Test SAA-Lab/test_jan25 Viewer • Updated May 9, 2025 • 155 • 2 SAA-Lab/test-jan24 Viewer • Updated May 9, 2025 • 796 • 1 SAA-Lab/test_march23 Viewer • Updated May 9, 2025 • 1.9k • 2 SAA-Lab/test_oct23 Viewer • Updated May 9, 2025 • 1k • 2
LitBench SAA-Lab/LitBench-Train Viewer • Updated Jul 7, 2025 • 43.8k • 131 • 3 SAA-Lab/LitBench-Rationales Viewer • Updated May 16, 2025 • 43.7k • 299 SAA-Lab/LitBench-Test Viewer • Updated Jul 7, 2025 • 2.38k • 47 LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1, 2025 • 4
LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing Paper • 2507.00769 • Published Jul 1, 2025 • 4
SLPHelmDatasets SAA-Lab/SLPHelmDataset Viewer • Updated May 15, 2025 • 19.4k • 13.2k SAA-Lab/SLPHelmUltraSuitePlus Viewer • Updated Sep 14, 2025 • 926 • 21 SAA-Lab/SLPHelm Viewer • Updated Oct 4, 2025 • 28.6k • 11
The Sound of Syntax Models SAA-Lab/Qwen2.5-Omni-7B-UltraSuite 11B • Updated May 9, 2025 SAA-Lab/Qwen2.5-Omni-3B-UltraSuite 6B • Updated May 10, 2025 • 4 SAA-Lab/Qwen2-Audio-7B-Instruct-Ultrasuite 8B • Updated May 10, 2025 • 6 • 2 SAA-Lab/Qwen2.5-Omni-7B-UltraSuite-woA 11B • Updated May 14, 2025 • 6
WP Test SAA-Lab/test_jan25 Viewer • Updated May 9, 2025 • 155 • 2 SAA-Lab/test-jan24 Viewer • Updated May 9, 2025 • 796 • 1 SAA-Lab/test_march23 Viewer • Updated May 9, 2025 • 1.9k • 2 SAA-Lab/test_oct23 Viewer • Updated May 9, 2025 • 1k • 2
Creative-Writing-Verifier Data and model for SAA-Lab/writingprompts-pairwise-test Viewer • Updated Mar 9, 2025 • 1k • 8 SAA-Lab/writingprompts-pairwise-train Viewer • Updated Mar 9, 2025 • 19.7k • 7 SAA-Lab/wp_test_0421 Viewer • Updated Apr 21, 2025 • 6.22k • 2 • 1 SAA-Lab/wp_train_0421 Viewer • Updated Apr 21, 2025 • 48.8k • 5