Anthropic’s Claude 3.5 Haiku is Finally Here

4 months ago 41
  • Published on December 12, 2024
  • In AI News

And Opus 3.5 has finished its training. 

Anthropic Launches Claude 2.1, Surpasses GPT-4 Turbo in Context Length

As predicted by AIM, Anthropic is officially rolling out Claude Haiku 3.5 on Claude.ai. Haiku 3.5 is the latest iteration of Anthropic’s smallest and fastest model. 

Haiku 3.5 also surpasses Claude 3 Opus, which is Anthropic’s largest model, albeit from the previous generation. However, Haiku 3.5 doesn’t seem to support images as inputs yet, and Claude defaults back to Haiku 3 after uploading one.

Haiku 3.5 is available on both the Claude web and the mobile app. The model was announced a month ago and was made available via the API on Amazon Bedrock and Google Cloud’s Vertex AI.  A few days ago, Anthropic also announced that they’re optimising Claude 3.5 Haiku on AWS Trainium 2

“By enabling latency optimisation, Claude 3.5 Haiku can deliver up to 60% faster inference speed—making it the ideal choice for use cases ranging from code completions to real-time content moderation and chatbots,” said Anthropic in the announcement. 

Anthropic also lowered the price of Claude 3.5 Haiku to $0.80 per million input tokens and $4 per million output tokens. 

Moreover, Dario Amodei revealed a few weeks ago that there are plans to release the flagship Opus 3.5 Soon. As per reports, Anthropic has apparently finished training Claude 3.5 Opus, and ‘it performed well, with it scaling appropriately’. 

The report also said that Anthropic did not release it because it used Claude 3.5 Opus to generate synthetic data and for reward modelling to enhance the capabilities of 3.5 Sonnet. 

It rings a bell, doesn’t it? Amodei also said, “Scaling is continuing. There will definitely be more powerful models coming from us than the models that exist today. 

“Or if there aren’t, we’ve deeply failed as a company,” he added. 

But is Haiku 3.5 a tad bit too late? Google’s latest Gemini 2.0 Flash is a small model that offers tremendous performance, even better than the flagship Gemini 1.5 Pro. Forget Haiku; the model offers close performance with Claude 3.5 Sonnet on most benchmarks. 

The new Gemini 2.0 Flash looks like a monster of a model.

Close to Claude 3.5 Sonnet on most benchmarks they show, and even better on MATH.

But if it's priced like 1.5 Flash, it's essentially free in comparison.

Not to mention that 2.0 Pro is coming… pic.twitter.com/kxNh6CJZWB

— Matt Shumer (@mattshumer_) December 11, 2024

Moreover, it also offers multimodal features, including images, video, and audio, as well as multimodal outputs such as natively generated images combined with text and steerable text-to-speech (TTS) multilingual audio. 

Picture of Supreeth Koundinya

Supreeth Koundinya

Supreeth is an engineering graduate who is curious about the world of artificial intelligence and loves to write stories on how it is solving problems and shaping the future of humanity.

Association of Data Scientists

GenAI Corporate Training Programs

India's Biggest Developers Summit

February 5 – 7, 2025 | Nimhans Convention Center, Bangalore

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

February 5 – 7, 2025 | Nimhans Convention Center, Bangalore

Rising 2025 | DE&I in Tech & AI

Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru

Data Engineering Summit 2025

May, 2025 | 📍 Bangalore, India

MachineCon GCC Summit 2025

June 2025 | 583 Park Avenue, New York

September, 2025 | 📍Bangalore, India

MachineCon GCC Summit 2025

The Most Powerful GCC Summit of the year

discord icon

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Read Entire Article