OpenAI’s o3 Wins Gold at International Olympiad in Informatics 2024

2 months ago 26
  • Published on February 13, 2025
  • In AI News

The research highlights that more training and test-time compute improves model performance, nearing top human levels.

Illustration by Nikhil Kumar

New research from OpenAI highlights the results of their reasoning models (o-series models) and how LLMs have evolved from amateur competitive programmers to competing with the world’s best. 

OpenAI’s latest AI model, o3, earned an impressive 2724 rating on CodeForces, placing it in the 99.8th percentile. It also secured a gold medal-level score at the 2024 International Olympiad in Informatics (IOI).

According to the research, o3 outperforms the o1-ioi model, which is specifically fine-tuned for IOI. This proves that reinforcement learning is more effective than hand-crafted approaches.

At IOI 2024, o3 competed under standard conditions and crossed the gold medal threshold. On CodeForces, it ranked among the top 200 programmers globally, competing with elite human coders.

“General-purpose reasoning capabilities developed through reinforcement learning are now outperforming carefully hand-crafted, domain-specific solutions,” said Ethan Mollick, associate professor at The Wharton School. “Rather than building specialised systems for specific tasks, large, general-purpose models can achieve superior results through better reasoning abilities.”

The research is part of OpenAI’s ongoing efforts to assess its models’ performance in competitive programming and broader software engineering. 

Anthropic, the company behind the Claude model series, also released a report on Monday which highlighted AI’s influence on the workplace.

The findings revealed that approximately 36% of all occupations incorporate AI for at least a quarter of their tasks. Moreover, 57% of AI applications enhance human capabilities, while 43% focus on automation. However, only 4% of occupations rely on AI for at least 75% of their tasks.

The study identified software development and technical writing as the primary areas where AI is utilised. In contrast, AI plays a minimal role in tasks that involve physical interaction with the environment.

Picture of Aditi Suresh

Aditi Suresh

I hold a degree in political science, and am interested in how AI and online culture intersect. I can be reached at [email protected]

Association of Data Scientists

GenAI Corporate Training Programs

India's Biggest Women in Tech Summit

Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Rising 2025 | DE&I in Tech & AI

Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru

AI Startups Conference.
April 25, 2025 | 📍 Hotel Radisson Blue, Bangalore, India

Data Engineering Summit 2025

15-16 May, 2025 | 📍 Taj Yeshwantpur, Bengaluru, India

MachineCon GCC Summit 2025

19-20th June 2025 | 📍 ITC Grand, Goa

17-19 September, 2025 | 📍KTPO, Whitefield, Bangalore, India

India's Biggest Developers Summit Nimhans Convention Center, Bangalore

discord icon

Our Discord Community for AI Ecosystem.

Read Entire Article