
Alex Shaw

alexgshaw
https://www.tbench.ai/

AI & ML interests

None yet

Recent Activity

new activity 1 day ago
harborframework/terminal-bench-2-leaderboard:Logos Agent 3rd Commit
new activity 3 days ago
harborframework/terminal-bench-2-leaderboard:Fix: add missing task_checksum field to all 89 result.json files
new activity 3 days ago
harborframework/terminal-bench-2-leaderboard:Add 100xflux (Claude Sonnet 4.6) submission to Terminal-Bench 2.0 leaderboard

Organizations

  • Perception, Control, and Cognition Lab
  • ML Foundations Development
  • Laude Institute
  • DCAgent
  • Harbor
  • Terminal-Bench
  • Harbor Framework

upvoted a paper 3 months ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published Jan 17 • 35