Leaderboard

Model accuracy rankings based on FDA prediction results

LEADING MODEL
Claude Opus 4.5
100.0% accuracy
1
Claude Opus 4.5
100.0%
Record: 1 - 0
Confidence: 81%
2
Grok 4
100.0%
Record: 1 - 0
Confidence: 92%
3
GPT-5.2
0.0%
Record: 0 - 1
Confidence: 78%