LLM Comparison
Qwen3-VL vs ERNIE 4.5 VL 424B A47B
Side-by-side specs, pricing & capabilities · Updated April 2026
| | Qwen3-VL | ERNIE 4.5 VL 424B A47B |
|---|---|---|
| Organization | Alibaba | Baidu |
| OpenTools Score | 80 200 | |
| Family | Qwen3 | ERNIE |
| Status | Current | Current |
| Release Date | Apr 2025 | Jun 2025 |
| Context Window | 131K tokens | 123K tokens |
| Input Price | $0.20/M tokens | $0.42/M tokens |
| Output Price | $0.60/M tokens | $1.25/M tokens |
| Capabilities | text, vision, code, tool-use | text, vision, code |
| Max Output | 8K tokens | 16K tokens |
| API Identifier | qwen-vl-max | baidu/ernie-4.5-vl-424b-a47b |
| Benchmarks | | |
| MMMU | 70.3 | — |
| DocVQA | 94.1 | — |
| ChartQA | 86.5 | — |
| OCRBench | 88.7 | — |
| MathVista | 74.8 | — |
| RealWorldQA | 75.2 | — |
| Video-MME | 69.8 | — |
Cost Calculator
Monthly cost depends on your expected token usage. The figures below are consistent with 1M input tokens and 0.5M output tokens per month.
| Model | Input | Output | Total / mo | vs Best |
|---|---|---|---|---|
| Qwen3-VL (cheapest) | $0.20 | $0.30 | $0.50 | — |
| ERNIE 4.5 VL 424B A47B | $0.42 | $0.63 | $1.05 | +109% |
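
The calculator's arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual implementation: the per-million prices come from the spec table above, while the `Pricing` class, the function names, and the example usage of 1M input and 0.5M output tokens are assumptions chosen because they reproduce the table's figures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-model prices in USD per 1M tokens (hypothetical helper type)."""
    name: str
    input_per_m: float
    output_per_m: float


# Prices taken from the spec table above.
MODELS = [
    Pricing("Qwen3-VL", 0.20, 0.60),
    Pricing("ERNIE 4.5 VL 424B A47B", 0.42, 1.25),
]


def monthly_cost(p: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Total monthly cost in USD for the given token volumes."""
    return (input_tokens * p.input_per_m + output_tokens * p.output_per_m) / 1_000_000


def compare(input_tokens: int, output_tokens: int) -> dict[str, tuple[float, float]]:
    """Map each model name to (monthly cost, percent above the cheapest model)."""
    costs = {m.name: monthly_cost(m, input_tokens, output_tokens) for m in MODELS}
    best = min(costs.values())
    return {
        name: (cost, (cost / best - 1) * 100 if best > 0 else 0.0)
        for name, cost in costs.items()
    }


if __name__ == "__main__":
    # Assumed usage: 1M input tokens and 0.5M output tokens per month.
    for name, (cost, pct) in compare(1_000_000, 500_000).items():
        print(f"{name}: ${cost:.3f}/mo (+{pct:.0f}% vs best)")
```

With the assumed usage, Qwen3-VL comes to about $0.50 and ERNIE 4.5 VL 424B A47B to about $1.045 per month, in line with the table's rounded totals and the +109% difference.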
Alibaba
Qwen3-VL
Qwen3-VL is Alibaba's multimodal vision-language model from the Qwen3 family. It processes images, videos, and text together, excelling at document understanding, chart reading, OCR, and visual reasoning tasks across multiple languages.
Baidu
ERNIE 4.5 VL 424B A47B
ERNIE 4.5 VL 424B A47B is a multimodal LLM from Baidu. It supports a context window of up to 123,000 tokens and is available from $0.42/M input tokens.