Claude 3.5 Haiku

Claude 3.5 Haiku, the next generation of Anthropic's fastest and most cost-effective model, is optimal for use cases where speed and affordability matter.

Try in Vertex AI View model card in Model Garden

Property Description
Model ID claude-3-5-haiku@20241022
Token limits
Maximum input tokens 200,000
Maximum output tokens 8,000
Capabilities
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date July 2024
Versions
  • claude-3-5-haiku@20241022
    • Launch stage: Generally available
    • Release date: October 22, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

  • United States
    • us-east5

ML processing

  • United States
    • Multi-region
Quota limits
  • us-east5:
    • QPM: 80
    • TPM: 350,000 (input and output)
    • Context length: 200,000
Pricing See Pricing.