LLM Comparison
Qwen3-VL vs Qwen3.5-9B
Side-by-side specs, pricing & capabilities · Updated April 2026
| | Qwen3-VL | Qwen3.5-9B |
|---|---|---|
| Organization | Alibaba | Alibaba |
| OpenTools Score | 80 200 | 54 430 |
| Family | Qwen3 | Qwen |
| Status | Current | Current |
| Release Date | Apr 2025 | Mar 2026 |
| Context Window | 131K tokens | 262K tokens |
| Input Price | $0.20/M tokens | $0.10/M tokens |
| Output Price | $0.60/M tokens | $0.15/M tokens |
| Capabilities | text, vision, code, tool-use | text, vision, video, code |
| Max Output | 8K tokens | — |
| API Identifier | qwen-vl-max | qwen/qwen3.5-9b |
| Benchmarks | | |
| MMMU | 70.3 | — |
| DocVQA | 94.1 | — |
| ChartQA | 86.5 | — |
| OCRBench | 88.7 | — |
| MathVista | 74.8 | — |
| RealWorldQA | 75.2 | — |
| Video-MME | 69.8 | — |
| MMLU | — | 87.5 |
| HumanEval | — | 88.5 |
Cost Calculator
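Estimated monthly cost at the listed per-token prices. The example figures below are consistent with a usage of 1M input tokens and 0.5M output tokens per month.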
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| Qwen3.5-9B (Cheapest) | $0.10 | $0.08 | $0.18 | — |
| Qwen3-VL | $0.20 | $0.30 | $0.50 | +186% |
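The arithmetic behind the table can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual calculator: the `ModelPrice` class and the `monthly_cost` and `compare` helpers are hypothetical names, and the usage of 1M input and 0.5M output tokens per month is an assumption inferred from the figures shown. The prices come from the spec table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """Per-token pricing for one model (hypothetical helper type)."""
    name: str
    input_per_m: float   # USD per million input tokens
    output_per_m: float  # USD per million output tokens


def monthly_cost(model: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Return the monthly cost in USD for the given token volumes."""
    return (input_tokens / 1_000_000) * model.input_per_m \
        + (output_tokens / 1_000_000) * model.output_per_m


def compare(models, input_tokens: int, output_tokens: int):
    """Return (name, cost, percent above cheapest) tuples, cheapest first."""
    costs = [(m.name, monthly_cost(m, input_tokens, output_tokens)) for m in models]
    costs.sort(key=lambda pair: pair[1])
    best = costs[0][1]
    return [(name, cost, (cost / best - 1) * 100) for name, cost in costs]


# Prices taken from the comparison table above.
QWEN3_VL = ModelPrice("Qwen3-VL", input_per_m=0.20, output_per_m=0.60)
QWEN35_9B = ModelPrice("Qwen3.5-9B", input_per_m=0.10, output_per_m=0.15)

# Assumed usage: 1M input tokens and 0.5M output tokens per month.
for name, cost, pct_over in compare([QWEN3_VL, QWEN35_9B], 1_000_000, 500_000):
    print(f"{name}: ${cost:.3f}/mo (+{pct_over:.0f}% vs cheapest)")
```

Under these assumptions the unrounded total for Qwen3.5-9B is $0.175, which the table displays as $0.18. The +186% figure follows from the unrounded value (0.50 / 0.175), not from the rounded one.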
Alibaba
Qwen3-VL
Qwen3-VL is Alibaba's multimodal vision-language model from the Qwen3 family. It processes images, videos, and text together, excelling at document understanding, chart reading, OCR, and visual reasoning tasks across multiple languages.
Alibaba
Qwen3.5-9B
Qwen3.5-9B is a multimodal LLM from Alibaba. It supports a context window of up to 262,144 tokens, scores 87.5% on MMLU, and is priced from $0.10/M input tokens.