⬤ OpenAI's GPT-5.2 Pro just claimed the number one spot on FrontierMath's public leaderboard, correctly solving 14 out of 48 problems for a 29.2% accuracy rate. This benchmark is widely recognized as one of the hardest tests for advanced mathematical reasoning in AI, making GPT-5.2 Pro's lead particularly impressive. The model outperformed every other system tested on the same challenging problem set.
⬤ Coming in second was Gemini 3 Pro Preview at 18.8% accuracy (9 correct answers out of 48). Other GPT-5.2 variants followed close behind: the "xhigh" configuration hit 16.7%, while both the "high" and "medium" versions tied at 14.6%. The earlier GPT-5.1 (high) and GPT-5 (high) models each scored 12.5%. What stands out is that even the best-performing model solved fewer than a third of the problems, a reminder of how brutally difficult FrontierMath is and how much headroom remains in AI math capability.
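For readers who want to check the figures, accuracy here is simply correct answers divided by the 48 problems in the public set. A minimal Python sketch is below; note that only the 14 and 9 correct-answer counts are stated explicitly, so the remaining counts are inferred from the reported percentages rather than taken from the leaderboard itself:

```python
# Accuracy = correct answers / 48 public FrontierMath problems.
# Counts for GPT-5.2 Pro (14) and Gemini 3 Pro Preview (9) are reported;
# the rest are inferred from the published percentages (assumption).
TOTAL_PROBLEMS = 48

scores = {
    "GPT-5.2 Pro": 14,
    "Gemini 3 Pro Preview": 9,
    "GPT-5.2 (xhigh)": 8,    # inferred from 16.7%
    "GPT-5.2 (high)": 7,     # inferred from 14.6%
    "GPT-5.2 (medium)": 7,   # inferred from 14.6%
    "GPT-5.1 (high)": 6,     # inferred from 12.5%
    "GPT-5 (high)": 6,       # inferred from 12.5%
}

for model, correct in scores.items():
    accuracy = correct / TOTAL_PROBLEMS
    print(f"{model:22s} {correct:2d}/{TOTAL_PROBLEMS}  {accuracy:.1%}")
```

Running this reproduces the leaderboard percentages (29.2%, 18.8%, 16.7%, 14.6%, 12.5%) to one decimal place.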
⬤ The leaderboard also shows strong competition across the AI industry. Google DeepMind's Gemini 2.5 Deep Think scored 10.4%, while various GPT-5 family models ranged from 6.3% to 12.5%. Gemini 2.5 Flash and Gemini 3 Flash landed near the bottom at 4.2% each. Every score comes with an uncertainty range due to the small 48-question sample size. FrontierMath is classified as a Tier 4 benchmark, meaning it focuses on complex, high-level reasoning rather than straightforward computational tasks.
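The leaderboard doesn't say exactly how those uncertainty ranges are computed, but with only 48 questions they are necessarily wide. As a hedged illustration, assuming a standard 95% Wilson score interval for a binomial proportion (which may not be the exact method used), the top score of 14/48 carries an uncertainty of roughly ±12 percentage points:

```python
import math

def wilson_interval(correct: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.96 for ~95%)."""
    p = correct / total
    denom = 1 + z**2 / total
    center = (p + z**2 / (2 * total)) / denom
    half_width = (z * math.sqrt(p * (1 - p) / total + z**2 / (4 * total**2))) / denom
    return center - half_width, center + half_width

# GPT-5.2 Pro: 14 correct out of 48 public FrontierMath problems.
low, high = wilson_interval(14, 48)
print(f"29.2% point estimate, 95% interval roughly {low:.1%} to {high:.1%}")
# With n = 48 the interval spans roughly 18% to 43%, wide enough that
# neighboring models' ranges can overlap -- which is why the small
# sample size matters when comparing scores.
```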
⬤ This leaderboard update matters because it shows real, measurable progress on genuinely difficult math problems—not just cherry-picked examples or easy prompts. GPT-5.2 Pro's strong performance signals meaningful advances in mathematical reasoning, which could shake up the competitive landscape and raise the bar for what we expect from next-generation AI systems.
Saad Ullah