Leaderboard / categories

Conversions

Volume, weight and temperature conversions across kitchen units and locales.

Ranking

Question heatmap (public questions only)

Model	001	002	003	004	005	006	007	008	009	010	011	012	013	014	015	016	017	018	019	020	021	022	023	024	025	026	027	028
GPT-5.4 Mini
Grok 4.3
GPT-5.5
Claude Fable 5
Claude Opus 4.8
Gemini 3.5 Flash
Gemini 3.1 Pro Preview
Kimi K2.6
Qwen 3.5 Plus
Claude Sonnet 4.6
DeepSeek V4 Pro
Mistral Large 3
Llama 4 Maverick

Each cell is one question; deeper colour = higher score. Hover for exact values.