Multimodal
cohere
Command Family
Open Source
Released: Jul 2025 Updated: Feb 2026

Command A Vision

Command A Vision is a multimodal model in Cohere's Command family.

Vision
Audio
Tool Use
Reasoning
Citations

Key Specs

Context: 128k
Max Output: 8k
Best Price (Input / Output): $2.50 / $10.00 per 1M tokens

Pricing Comparison

Provider   Input (1M)   Output (1M)   Image (1k)
cohere     $2.50        $10.00        -

Prices are per 1 million tokens unless otherwise noted. Image pricing is per 1000 images if applicable.
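
As a rough illustration of how the per-million-token prices translate into a per-request cost, the following minimal sketch applies the listed rates to a token count. The function name and the example request sizes are hypothetical, and the sketch counts text tokens only: no image price is listed, and how image inputs are billed is not stated here.

```python
from decimal import Decimal

# Listed prices, in USD per 1 million tokens.
INPUT_PRICE_PER_M = Decimal("2.50")
OUTPUT_PRICE_PER_M = Decimal("10.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed prices.

    Hypothetical helper for illustration; it ignores image inputs and
    any provider-side discounts or minimums.
    """
    return (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M


# Hypothetical request: 100,000 input tokens and 8,000 output tokens.
cost = estimate_cost(100_000, 8_000)
print(f"${cost:.4f}")  # prints "$0.3300"
```

Decimal is used instead of float so that small per-request amounts sum without rounding drift when aggregated over many requests.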