OpenAI’s experimental model achieved gold at the International Math Olympiad

OpenAI has achieved “gold medal-level performance” at the International Math Olympiad, notching another important milestone for AI’s fast-paced growth. Alexander Wei, a research scientist at OpenAI working on LLMs and reasoning, posted on X that an experimental research model delivered on this “longstanding grand challenge in AI.”

According to Wei, an unreleased model from OpenAI was able to solve five out of six problems at one of the world’s longest-standing and prestigious math competitions, earning 35 out of 42 points total. The International Math Olympiad (IMO) sees countries send up to six students to solve extremely difficult algebra and pre-calculus problems. These exercises are seemingly simple but usually require some creativity to score the highest marks on each problem. For this year’s competition, only 67 of the 630 total contestants received gold medals, or roughly 10 percent.

AI is often tasked with tackling complex datasets and repetitive actions, but it usually falls short when it comes to solving problems that require more creativity or complex decision-making. However, with the latest IMO competition, OpenAI says its model was able to handle complicated math problems with human-like reasoning.

“By doing so, we’ve obtained a model that can craft intricate, watertight arguments at the level of human mathematicians,” Wei wrote on X. Wei and Sam Altman, CEO of OpenAI, both added that the company doesn’t expect to release anything with this level of math capability for several months. That means the upcoming GPT-5 will likely be an improvement from its predecessor, but it won’t feature that same impressive capability to compete in the IMO.

Source link

Read the News

Subscribe

Follow us

Read the News

Subscribe

Follow us

OpenAI’s experimental model achieved gold at the International Math Olympiad

Latest

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

IO Interactive splits with MindsEye developer and ends Hitman collab

Pixel Weather tweaks widget, new icons come to Google Clock 8.6

Kalshi’s legal troubles pile up, as Arizona files first ever criminal charges over ‘illegal gambling business’

Newsletter

Don't miss

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

IO Interactive splits with MindsEye developer and ends Hitman collab

Pixel Weather tweaks widget, new icons come to Google Clock 8.6

Kalshi’s legal troubles pile up, as Arizona files first ever criminal charges over ‘illegal gambling business’

Subnautica 2 might finally be entering early access in May

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

IO Interactive splits with MindsEye developer and ends Hitman collab

Pixel Weather tweaks widget, new icons come to Google Clock 8.6

LEAVE A REPLY Cancel reply

About us

Most recent

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

IO Interactive splits with MindsEye developer and ends Hitman collab

Pixel Weather tweaks widget, new icons come to Google Clock 8.6

Kalshi’s legal troubles pile up, as Arizona files first ever criminal charges over ‘illegal gambling business’

Most popular

Mistral bets on ‘build-your-own AI’ as it takes on OpenAI, Anthropic in the enterprise

IO Interactive splits with MindsEye developer and ends Hitman collab

Pixel Weather tweaks widget, new icons come to Google Clock 8.6

Kalshi’s legal troubles pile up, as Arizona files first ever criminal charges over ‘illegal gambling business’

Subscribe