Patronus AI

Team

company

Verified

https://patronus.ai

patronusai

Activity Feed Request to join this org

AI & ML interests

LLM Evaluation

Recent Activity

vgtomahawk published a model 1 day ago

PatronusAI/Qwen3-4B-Instruct-2507-CE-s39T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1534Feb14

patronus-bartek updated a model 4 days ago

PatronusAI/Qwen3-4B-Instruct-2507-CE-s39T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1534Feb14

vgtomahawk published a model 6 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L4-M-Ep1-6e-5-Q32-65536-1012Feb13

View all activity

Papers

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

View all Papers

vgtomahawk

published a model 1 day ago

PatronusAI/Qwen3-4B-Instruct-2507-CE-s39T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1534Feb14

4B • Updated 4 days ago • 48

patronus-bartek

updated a model 4 days ago

PatronusAI/Qwen3-4B-Instruct-2507-CE-s39T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1534Feb14

4B • Updated 4 days ago • 48

vgtomahawk

published a model 6 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L4-M-Ep1-6e-5-Q32-65536-1012Feb13

4B • Updated 6 days ago • 22

patronus-bartek

updated a model 6 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L4-M-Ep1-6e-5-Q32-65536-1012Feb13

4B • Updated 6 days ago • 22

vgtomahawk

published a model 9 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0942Feb10

4B • Updated 9 days ago • 92

patronus-bartek

updated a model 9 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0942Feb10

4B • Updated 9 days ago • 92

vgtomahawk

published a model 11 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-331-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0823Feb06

4B • Updated 12 days ago • 29

patronus-bartek

updated a model 12 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-331-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0823Feb06

4B • Updated 12 days ago • 29

vgtomahawk

published a model 12 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0259Feb06

4B • Updated 13 days ago • 21

patronus-bartek

updated a model 13 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150-GPT41Tea-notR-L16-M-Ep1-6e-5-Q32-65536-0259Feb06

4B • Updated 13 days ago • 21

vgtomahawk

published a model 13 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150-GPT41Tea-notROnly-Merge-6e-5-Q32-65536-1609Feb05

4B • Updated 15 days ago • 20

patronus-bartek

updated a model 15 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-150-GPT41Tea-notROnly-Merge-6e-5-Q32-65536-1609Feb05

4B • Updated 15 days ago • 20

vgtomahawk

published a model 16 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-50-GPT41Tea-notROnly-Merge-6e-5-Q4-32768-1633Feb04

4B • Updated 16 days ago • 9.94k

patronus-bartek

updated a model 16 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Car-50-GPT41Tea-notROnly-Merge-6e-5-Q4-32768-1633Feb04

4B • Updated 16 days ago • 9.94k

DarshanDeshpande

submitted a paper to Daily Papers 21 days ago

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis

Paper • 2601.20103 • Published 24 days ago • 1

DarshanDeshpande

published a dataset 22 days ago

PatronusAI/trace-dataset

Viewer • Updated 22 days ago • 517 • 23 • 2

DarshanDeshpande

updated a dataset 22 days ago

PatronusAI/trace-dataset

Viewer • Updated 22 days ago • 517 • 23 • 2

vgtomahawk

published a model 28 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Tau2-32-GPT41Teach-notROnly-Merge-6e-5-Q4-32768-1445Jan22

4B • Updated 29 days ago • 106

patronus-bartek

updated a model 29 days ago

PatronusAI/Qwen3-4B-Instruct-2507-Tau2-32-GPT41Teach-notROnly-Merge-6e-5-Q4-32768-1445Jan22

4B • Updated 29 days ago • 106

DarshanDeshpande

authored a paper 4 months ago

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

Paper • 2510.01353 • Published Oct 1, 2025 • 3

AI & ML interests

Recent Activity

Papers

Team members 16

PatronusAI's activity