Google’s New AI Model Outperforms DeepSeek-V3, OpenAI’s o3-mini

1 month ago 17
  • Published on March 12, 2025
  • In AI News

Gemma 3 27B ranks highly on Chatbot Arena, requiring only a single GPU despite others needing up to 32.

Gemma 2 2B

Google on Wednesday announced Gemma 3, the next iteration in the Gemma family of open-weight models. It is a successor to the Gemma 2 model released last year. 

The small model comes in a range of parameter sizes – 1B, 4B, 12B and 27B. The model also supports a longer context window of 128K tokens. It can analyse videos, images, and text, supports 35 languages out of the box, and provides pre-trained support for 140 languages. 

In the Chatbot Arena, Gemma 3 27B outperformed DeepSeek-V3, OpenAI’s o3-mini and Meta’s Llama 3-405B model. Models in Chatbot Arena are evaluated against each other through side-by-side evaluations by humans. 

Moreover, Gemma 3 27B scored 67.5% and 42.4 across standard benchmarks like MMLU-Pro, GPQA Diamond, respectively. The model performs well compared to other small models in the competition. 

Claude 3.5 Haiku scored 63% on the MMLU-Pro benchmark and 41% on GPQA Diamond, while OpenAI’s GPT-4o Mini achieved 65% and 43% on the same tests, respectively. Meta’s Llama 3.3 70B outperformed both, with 71% in MMLU-Pro and 50% in GPQA Diamond, making it the strongest contender among these models.

However, Gemma-3’s key superpower seems to be efficient compute usage. Google said that Gemma 327B achieved the scores with a single NVIDIA H100 GPU, whereas other models necessitated up to 32 GPUs. 

Source: Google

The company also revealed that the architecture of the model was modified to reduce the KV-cache memory, which tends to increase with longer context. 

Google has published a detailed technical report outlining the techniques used to build the model, its performance and other specifications. Gemma 3 can be accessed via various methods. Google is offering the model on the web using the Google AI Studio, via the default chatbot or the API, and it is also available on the Google GenAI SDK

Besides, the model can be downloaded for local deployment on Hugging Face, Ollama, and Kaggle.

Along with Gemma 3, Google has also launched ShieldGemma 2, a 4B parameter image safety checker built on Gemma 3’s foundation. This provides safety labels for harmful images which involve dangerous, sexually explicit, and violent content. 

Picture of Supreeth Koundinya

Supreeth Koundinya

Supreeth is an engineering graduate who is curious about the world of artificial intelligence and loves to write stories on how it is solving problems and shaping the future of humanity.

Association of Data Scientists

GenAI Corporate Training Programs

India's Biggest Women in Tech Summit

March 20 and 21, 2025 | 📍 NIMHANS Convention Center, Bengaluru

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Rising 2025 Women in Tech & AI

March 20 - 21, 2025 | 📍 NIMHANS Convention Center, Bengaluru

AI Startups Conference.April 25, 2025 | 📍 Hotel Radisson Blu, Bengaluru, India

Data Engineering Summit 2025

May 15 - 16, 2025 | 📍 Hotel Radisson Blu, Bengaluru

MachineCon GCC Summit 2025

June 20 to 22, 2025 | 📍 ITC Grand, Goa

Sep 17 to 19, 2025 | 📍KTPO, Whitefield, Bengaluru, India

India's Biggest Developers Summit Feb, 2025 | 📍Nimhans Convention Center, Bengaluru

Read Entire Article