Substitutions

Ingredient swaps with correct ratios, including allergen-aware alternatives.

Ranking

Question heatmap (public questions only)

Heatmap rows are models; columns are questions 001 to 021. Models shown:

Grok 4.3
GPT-5.5
Claude Sonnet 4.6
Claude Opus 4.8
Claude Fable 5
GPT-5.4 Mini
DeepSeek V4 Pro
Kimi K2.6
Qwen 3.5 Plus
Mistral Large 3
Gemini 3.5 Flash
Llama 4 Maverick
Gemini 3.1 Pro Preview

Each cell is one question; deeper colour = higher score.