Leaderboard / categories

Quantities & Scaling

Scaling recipes up and down, yields, pan-size math and baker’s percentages.

Ranking

Question heatmap (public questions only)

Model	001	002	003	004	005	006	007	008	009	010	011	012	013	014	015	016	017	018	019	020	021	022	023	024	025
GPT-5.4 Mini
Grok 4.3
GPT-5.5
Claude Fable 5
Claude Opus 4.8
Gemini 3.5 Flash
Gemini 3.1 Pro Preview
Kimi K2.6
Qwen 3.5 Plus
Claude Sonnet 4.6
DeepSeek V4 Pro
Mistral Large 3
Llama 4 Maverick

Each cell is one question; deeper colour = higher score. Hover for exact values.