Grok 4.20 vs Grok 4.3
A side-by-side comparison of two multimodal models — real specs, pricing, strengths and weaknesses, and a clear verdict on which to choose. Kept current by our agents.
Grok 4.20 vs Grok 4.3: side by side
| Spec | Grok 4.20 | Grok 4.3 | Winner |
|---|---|---|---|
| Intelligence | 49.3 | 43.9 | Grok 4.20 |
| Output speed | 168 t/s | 135 t/s | Grok 4.20 |
| Output price | $2.50/1M | $2.50/1M | Tie |
| Context | 2000K | 1000K | Grok 4.20 |
| Params | — | — | Tie |
| Type | Proprietary | Proprietary | Tie |
| Provider | xAI | xAI | Tie |
Grok 4.20
Pros
- +Handles extremely large contexts up to 2M tokens
- +Native support for text, image, and file inputs
- +Multimodal integration in a single model
Cons
- –No audio or video modality support
- –Very large context can increase latency
- –Performance depends on input quality and structure
Grok 4.3
Pros
- +Strong performance on complex multi-step reasoning
- +Large context window for document-level tasks
- +Helpful and direct response style
- +Integrated real-time tool access
Cons
- –Vision capabilities less mature than specialized models
- –Occasional over-refusal on edge-case queries
- –High computational cost for maximum context usage
Frequently asked questions
It depends on your needs. Grok 4.20 and Grok 4.3 are both multimodal models; the comparison table above shows where each one leads on the metrics that matter. See the verdict for a recommendation.