frontiermath
Apr 22
OpenAI's Math Test Controversy: A Benchmarking Brouhaha
Jan 26
OpenAI's Secret Sauce: Behind the Record-Breaking Math Benchmark
Jan 20
OpenAI Unleashes PhD-Level AI 'Super-Agents' - Game Changer or Overhyped Dream?
Jan 20
OpenAI's FrontierMath Fiasco: Unpacking the Controversy
Jan 20
OpenAI's Secret Support of FrontierMath Stirs Up Controversy in AI Community
Jan 20
The Evolution of Evaluating LLMs: From Traditional to FrontierMath & Beyond
Dec 27