LLM Comparison
Qwen3-VL vs Gemma 4 26B A4B
Side-by-side specs, pricing & capabilities · Updated April 2026
Add to comparison
2/6 modelsSame tier:
| Organization | ||
| OpenTools Score | 80 200 | |
| Family | Qwen3 | Gemma |
| Status | Current | Current |
| Release Date | Apr 2025 | Apr 2026 |
| Context Window | 131K tokens | 262K tokens |
| Input Price | $0.20/M tokens | $0.08/M tokens |
| Output Price | $0.60/M tokens | $0.35/M tokens |
| Pricing Notes | — | Cache read: $0.0100/M tokens |
| Capabilities | textvisioncodetool-use | textvisionvideocode |
| Max Output | 8K tokens | — |
| API Identifier | qwen-vl-max | google/gemma-4-26b-a4b-it |
| Benchmarks | ||
| MMMU | 70.3 | — |
| DocVQA | 94.1 | — |
| ChartQA | 86.5 | — |
| OCRBench | 88.7 | — |
| MathVista | 74.8 | — |
| RealWorldQA | 75.2 | — |
| Video-MME | 69.8 | — |
| View Qwen3-VL | View Gemma 4 26B A4B | |
Cost Calculator
Enter your expected monthly token usage to compare costs.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| Gemma 4 26B A4BCheapest | $0.08 | $0.18 | $0.26 | — |
| Qwen3-VL | $0.20 | $0.30 | $0.50 | +96% |
Alibaba
Qwen3-VL
Qwen3-VL is Alibaba's multimodal vision-language model from the Qwen3 family. It processes images, videos, and text together, excelling at document understanding, chart reading, OCR, and visual reasoning tasks across multiple languages.
Gemma 4 26B A4B
Gemma 4 26B A4B is a multimodal llm from Google. Supports up to 262,144 token context window. Available from $0.08/M input tokens.
More Comparisons
Looking for more AI models?
Browse All LLMs