data contamination


AI Credibility, AI Ethics, AI Evaluation, Benchmarking, Epoch AI

OpenAI's FrontierMath Fiasco: Unpacking the Controversy

OpenAI is under fire for its involvement with the FrontierMath benchmark, which has sparked fierce debate about data transparency and ethics in AI evaluation. OpenAI funded the project, and its access to sensitive test data has raised concerns about potential bias and conflicts of interest. The community is debating whether OpenAI's claimed 25% success rate was truly clean or affected by data contamination. The controversy highlights broader issues of accountability and the need for independent AI evaluation.

Jan 20
