Multimodal
Providers: azure, inference, nvidia, openrouter, github-models, cloudflare-ai-gateway, vercel
Llama Family
Open Source
Released: Sep 2024
Updated: Feb 2026
Llama-3.2-11B-Vision-Instruct
Llama-3.2-11B-Vision-Instruct is a multimodal model.
Vision
Audio
Tool Use
Reasoning
Citations
Key Specs
Context: 131k tokens
Max Output: 16k tokens
Best Price (Input / Output): Free / Free per 1M tokens
Pricing Comparison
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Image (per 1k images) |
|---|---|---|---|
| github-models | Free | Free | - |
| nvidia | Free | Free | - |
| openrouter | Free | Free | - |
| cloudflare-ai-gateway | $0.05 | $0.68 | - |
| inference | $0.06 | $0.06 | - |
| vercel | $0.16 | $0.16 | - |
| azure | $0.37 | $0.37 | - |
Prices are per 1 million tokens unless otherwise noted. Image pricing is per 1000 images if applicable.
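
As a rough illustration of how the per-1M-token prices above translate into a bill, the following sketch estimates the cost of a workload for a given provider. The prices are copied from the table (Free is treated as 0.0); the `PRICES` dictionary and the `estimate_cost` function are hypothetical names introduced only for this example, and the sketch ignores image pricing, rate limits, and any usage caps a provider may apply to free tiers.

```python
# Per-1M-token prices in USD as (input, output), copied from the table above.
# "Free" is represented as 0.0. This mapping is illustrative, not a provider API.
PRICES = {
    "github-models": (0.0, 0.0),
    "nvidia": (0.0, 0.0),
    "openrouter": (0.0, 0.0),
    "cloudflare-ai-gateway": (0.05, 0.68),
    "inference": (0.06, 0.06),
    "vercel": (0.16, 0.16),
    "azure": (0.37, 0.37),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper: applies the listed per-1M-token rates linearly.
    """
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 1M output tokens.
    for name in PRICES:
        cost = estimate_cost(name, 2_000_000, 1_000_000)
        print(f"{name}: ${cost:.2f}")
```

Because cloudflare-ai-gateway prices input and output differently, its ranking against the flat-rate providers depends on the input-to-output ratio of the workload, so it can be worth running an estimate like this with realistic token counts rather than comparing a single column.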