Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
datasets
21
ScaleAI/MCP-Atlas
Viewer
•
Updated
•
500
•
267
•
6
ScaleAI/audiomc
Viewer
•
Updated
•
452
•
257
•
2
ScaleAI/robotics-meerkat
Updated
•
3
ScaleAI/VisualToolBench
Viewer
•
Updated
•
1.2k
•
123
•
1
ScaleAI/SA2_bowlstack0
Viewer
•
Updated
•
200
•
12
ScaleAI/dummy_mcp
Viewer
•
Updated
•
16
•
9
ScaleAI/PRBench
Viewer
•
Updated
•
1.65k
•
448
•
6
ScaleAI/researchrubrics
Viewer
•
Updated
•
101
•
73
•
12
ScaleAI/swe-oec-claude-expert
Viewer
•
Updated
•
1.27k
•
52
•
1
ScaleAI/TutorBench
Viewer
•
Updated
•
1.47k
•
439
•
2