Gemini 3.1 Pro vs Llama 4 Maverick

A detailed comparison of Gemini 3.1 Pro (Google) and Llama 4 Maverick (Meta) across pricing, performance, and features.

Pricing Comparison

Metric	Gemini 3.1 Pro	Llama 4 Maverick	Difference (Llama vs. Gemini)
Input / 1M tokens	$2.00	$0.31	-85%
Output / 1M tokens	$12.00	$0.85	-93%
Context window	1M	1M	—
Max output	64K	32K	—
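
The Difference column can be reproduced from the listed prices. The sketch below is a minimal illustration, assuming the base per-1M-token rates from the table and ignoring any tiered or provider-specific pricing; `request_cost` and `percent_difference` are hypothetical helper names, not part of any vendor SDK.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
PRICES = {
    "Gemini 3.1 Pro": {"input": 2.00, "output": 12.00},
    "Llama 4 Maverick": {"input": 0.31, "output": 0.85},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the flat base rates."""
    rates = PRICES[model]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


def percent_difference(base: float, other: float) -> float:
    """Return how much `other` differs from `base`, in percent (unrounded)."""
    return (other - base) / base * 100


if __name__ == "__main__":
    for kind in ("input", "output"):
        diff = percent_difference(
            PRICES["Gemini 3.1 Pro"][kind], PRICES["Llama 4 Maverick"][kind]
        )
        print(f"{kind}: {diff:.1f}%")
```

The table rounds these values to whole percentages. For a real budget, replace the token counts with measurements from your own workload, since the input-to-output ratio changes the gap noticeably.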

Benchmark Comparison

Benchmark	Gemini 3.1 Pro	Llama 4 Maverick
MMLU-Pro	91%	80.5%
HumanEval	95%	90.2%
GPQA Diamond	94.3%	—

Capabilities

Capability	Gemini 3.1 Pro	Llama 4 Maverick
audio	✓	✗
code	✓	✓
reasoning	✓	✗
text	✓	✓
tool-use	✓	✗
vision	✓	✓

Gemini 3.1 Pro Strengths

✓ #1 on 12 of 18 tracked benchmarks
✓ 94.3% on GPQA Diamond, the highest of any model
✓ Same price as Gemini 3 Pro (free upgrade)
✓ 1M context with configurable thinking levels

Gemini 3.1 Pro Weaknesses

✗ Still in preview
✗ Context-tiered pricing ($4.00 input / $18.00 output per 1M tokens above 200K tokens of context)
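
To see how the context tier affects a bill, here is a rough sketch. It assumes the higher rate applies to the entire request once the prompt exceeds 200K tokens; the page does not state whether the surcharge covers the whole request or only the excess, so treat that rule as an assumption and check the provider's pricing page. `gemini_cost` is a hypothetical helper name.

```python
# Rates in USD per 1M tokens as (input, output), taken from this page.
BASE_RATES = (2.00, 12.00)
LONG_CONTEXT_RATES = (4.00, 18.00)
TIER_THRESHOLD = 200_000  # prompt tokens


def gemini_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one Gemini 3.1 Pro request.

    Assumption: once the prompt is above the threshold, the long-context
    rates apply to every token in the request, not just the excess.
    """
    if input_tokens > TIER_THRESHOLD:
        in_rate, out_rate = LONG_CONTEXT_RATES
    else:
        in_rate, out_rate = BASE_RATES
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    print(gemini_cost(100_000, 10_000))  # below the threshold
    print(gemini_cost(300_000, 10_000))  # above the threshold
```

Under this assumption, a prompt just over the threshold costs about twice as much on the input side as one just under it, so splitting or trimming long prompts can matter for cost.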

Llama 4 Maverick Strengths

✓ Open-weight and self-hostable
✓ 1M context window
✓ Competitively priced through API providers

Llama 4 Maverick Weaknesses

✗ Requires significant compute to self-host
✗ Fewer tool-use capabilities than proprietary models

Quick Verdict

Best value: Llama 4 Maverick is the more affordable option at $0.31 input / $0.85 output per 1M tokens, compared with $2.00 / $12.00 for Gemini 3.1 Pro.

Higher benchmarks: Gemini 3.1 Pro scores higher on both benchmarks where the two models have results (MMLU-Pro and HumanEval) and averages 93.4% across its three listed benchmarks.
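
The 93.4% average covers three Gemini scores, while Llama 4 Maverick has only two listed results, so the averages are not directly comparable. The sketch below recomputes them from the benchmark table and also restricts the comparison to benchmarks both models report; the function names are hypothetical and the data is limited to what this page shows.

```python
# Scores in percent, copied from the benchmark table; None marks a missing result.
SCORES = {
    "MMLU-Pro": {"Gemini 3.1 Pro": 91.0, "Llama 4 Maverick": 80.5},
    "HumanEval": {"Gemini 3.1 Pro": 95.0, "Llama 4 Maverick": 90.2},
    "GPQA": {"Gemini 3.1 Pro": 94.3, "Llama 4 Maverick": None},
}


def shared_benchmarks() -> list[str]:
    """Return the benchmarks for which every model has a score."""
    return [
        name
        for name, row in SCORES.items()
        if all(score is not None for score in row.values())
    ]


def average(model: str, benchmarks=None) -> float:
    """Average a model's available scores over the given benchmarks."""
    names = list(SCORES) if benchmarks is None else benchmarks
    values = [SCORES[n][model] for n in names if SCORES[n][model] is not None]
    return sum(values) / len(values)


if __name__ == "__main__":
    for model in ("Gemini 3.1 Pro", "Llama 4 Maverick"):
        print(model, f"all: {average(model):.1f}",
              f"shared: {average(model, shared_benchmarks()):.1f}")
```

Comparing only the shared benchmarks avoids crediting a model for a test the other has no reported score on.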

Choose Llama 4 Maverick if cost matters most. Choose Gemini 3.1 Pro if you need the best possible quality for complex tasks.
