Anthropic Introduces Claude 3.5 Sonnet with Visual PDF Analysis for Images, Charts, and Graphs under 100 Pages

5 months ago 40
  • Last updated November 2, 2024
  • In AI News

Days after Claude 3.5 Sonnet received a major update, Anthropic dropped another useful feature called Visual PDF. 

Claude has introduced a new feature preview that can read all kinds of visuals inside a PDF that is less than 100 pages. This now makes it easy to upload a document, retrieve the complete context, and digest information from PDFs, especially research papers and technical documents that contain charts and graphs, among other images and visuals. 

Visual PDFs are an experimental feature in the Feature Previews available on Claude 3.5 Sonnet. 

Claude can now view images within a PDF, in addition to text.

This helps Claude 3.5 Sonnet more accurately understand complex documents, such as those laden with charts or graphics.

Enable the feature preview: https://t.co/bJ8BjBT6zG. pic.twitter.com/VNSf547ptT

— Anthropic (@AnthropicAI) November 1, 2024

The good news doesn’t end there. The company has also increased the document size limit from 10MB to 30 MB. A user on X was quick to point out the change, and Claude now lets you upload a maximum of five images or documents, with a size limit of 30 MB each. 

“Up until today, when you attached a PDF in Claude.AI, we would use a text extraction service to grab the text and send that to Claude in the prompt. Now, Claude can actually see the PDF visually alongside the text”, said Alex Albert, Head of Claude Relations at Anthropic, in a post on X.

You can access the feature from the pop-up banner on the home page. Once you select the Visual PDFs in the Feature Preview tab and turn it on, it will be made available for future conversations. Moreover, Anthropic has also announced that it supports adding PDFs as input in an API request. 

Anthropic Is on a Roll

Only a few days before, Anthropic released Computer Use which caused quite a storm in the AI ecosystem. Recently, they also announced a partnership with GitHub, which included Claude 3.5 Sonnet inside GitHub Coilot. 

A few days ago, it was also announced that Claude can now execute and run JavaScript code. It’s called the Analysis Tool, and it can also generate data visualisations after it writes and executes the code. Apart from Visual PDFs and Analysis Tool, Claude also provides a feature called LaTex rendering to generate mathematical equations from a user’s input.

At this point, it is well established that Claude’s 3.5 Sonnet is the best AI model for running code. OpenAI’s latest GPT o1 isn’t there yet, and even with its latest offering, Canvas, it still fails to keep up with Claude’s abilities. 

Picture of Supreeth Koundinya

Supreeth Koundinya

Supreeth is an engineering graduate who is curious about the world of artificial intelligence and loves to write stories on how it is solving problems and shaping the future of humanity.

8th Nov 2024
Meet 30+ top CDOs, ITDMs & AI Leaders.

Upcoming Large format Conference

India's Biggest Developers Summit

February 5 – 7, 2025 | Nimhans Convention Center, Bangalore

Download the easiest way to
stay informed

AI is Killing Remote Work 

Siddharth Jindal

“People wanted remote jobs and then got replaced by global talent that works twice as hard at half the cost. And now AI is also coming for those jobs.”

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Rising 2025 | DE&I in Tech & AI Summit

Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

February 5 – 7, 2025 | Nimhans Convention Center, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

September 25-27, 2024 | 📍Bangalore, India

25 July 2025 | 583 Park Avenue, New York

discord icon

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

World's Biggest Media & Analyst firm specializing in AI

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024

Read Entire Article