Leaderboard / categories
Food Safety
Safe internal temperatures, the danger zone, storage times and cross-contamination.
Ranking
- 1GPT-5.5100.0
- 2Claude Opus 4.8100.0
- 3Grok 4.3100.0
- 4Claude Fable 5100.0
- 5Llama 4 Maverick100.0
- 6Gemini 3.1 Pro Preview97.6
- 7DeepSeek V4 Pro97.6
- 8Qwen 3.5 Plus96.4
- 9Kimi K2.696.4
- 10Mistral Large 394.0
Question heatmap (public questions only)
| Model | 001 | 002 | 003 | 004 | 005 | 007 | 008 | 009 | 010 | 011 | 013 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GPT-5.5 | |||||||||||
| Claude Opus 4.8 | |||||||||||
| Grok 4.3 | |||||||||||
| Claude Fable 5 | |||||||||||
| Llama 4 Maverick | |||||||||||
| Gemini 3.1 Pro Preview | |||||||||||
| DeepSeek V4 Pro | |||||||||||
| Qwen 3.5 Plus | |||||||||||
| Kimi K2.6 | |||||||||||
| Mistral Large 3 |
Each cell is one question; deeper colour = higher score. Hover for exact values.