On Friday, Anthropic announced Claude 3.5 Sonnet, the first member of its Claude 3.5 family of large language models (LLMs). The release comes just three months after the AI company introduced the Claude 3 family of artificial intelligence (AI) models. According to the company, the latest model can outperform Claude 3 Opus, the most capable model of the previous generation. The chatbot is also getting a new Artifacts feature that shows a sandbox-style preview when it is asked to generate code snippets, website designs and more.
Claude 3.5 Sonnet details
According to the company’s press release, Claude 3.5 Sonnet is more powerful and more cost-effective than Claude 3 Opus. In Anthropic’s naming convention, Haiku is the smallest model, Sonnet sits in the middle, and Opus is the flagship model. The AI company is also expected to release the 3.5 Haiku and 3.5 Opus models in the coming months.
Claude 3.5 Sonnet comes with a context window of 200,000 tokens and costs $3 (roughly Rs. 250) per million input tokens and $15 (roughly Rs. 1,254) per million output tokens. The company claims that the latest model runs twice as fast as Claude 3 Opus. It scored 88.7 percent in the 5-shot setting (the model is shown five worked examples in the prompt before each question, with no additional training) on the Massive Multitask Language Understanding (MMLU) benchmark and 92.0 percent in the 0-shot setting on the HumanEval coding benchmark.
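To make the pricing concrete, here is a minimal Python sketch that estimates the cost of a single request from the per-million-token rates quoted above. The function names and the example token counts are hypothetical and chosen only for illustration; actual billing may differ, and the context check is a simplification that looks at input tokens only.

```python
# Figures as quoted in the article (USD per million tokens).
INPUT_USD_PER_MILLION = 3.00
OUTPUT_USD_PER_MILLION = 15.00
CONTEXT_WINDOW_TOKENS = 200_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars.

    Hypothetical helper: it only applies the two quoted rates and
    ignores any discounts, taxes or other billing details.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


def prompt_fits_context(input_tokens: int) -> bool:
    """Simplified check: does the prompt alone fit the quoted window?"""
    return 0 <= input_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Example: a prompt that fills the whole window plus a 1,000-token reply.
    cost = estimate_cost_usd(200_000, 1_000)
    print(f"Estimated cost: ${cost:.3f}")
```

Under these assumptions, a prompt that fills the entire 200,000-token window costs about $0.60 in input tokens, so output tokens, at five times the input rate, dominate the bill only for long responses.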
Anthropic claimed that Claude 3.5 Sonnet can independently write, edit and execute code. It also comes with reasoning and problem-solving capabilities that allow it to perform complex tasks such as code translation.
In addition to the text and coding upgrades, the new AI model also gets improved vision capabilities. The AI company claimed that Claude 3.5 Sonnet can accurately transcribe text from imperfect images, graphics or illustrations. The chatbot also gains the new Artifacts feature, a sandbox-style preview window that opens alongside the conversation when the AI is asked to generate content such as code snippets, text documents, or website designs. Users can view, edit and build on what Claude generates.
In terms of safety, Anthropic stated that Claude 3.5 Sonnet remains at the ASL-2 (AI Safety Level 2) standard. According to Anthropic’s Responsible Scaling Policy, ASL-2 refers to systems that show early signs of dangerous capabilities (such as providing instructions for the creation of biological weapons), but where the information is not yet considered useful, either because it is unreliable or because it is already publicly available.
In addition to the safety rating, Anthropic also said it engaged outside experts to test and improve the safety mechanisms in Claude 3.5 Sonnet. The AI model was also submitted to the UK Artificial Intelligence Safety Institute (UK AISI) for evaluation before it was deployed.
Currently, Claude 3.5 Sonnet is available for free on Claude.ai and the Claude iOS app. Subscribers to the Claude Pro and Claude Team plans can access it with higher rate limits. It is also available through the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI.