OpenAI Introduces GDPval Benchmark to Compare AI with Human Professionals
OpenAI has launched a new benchmark, GDPval, designed to measure how well AI models perform against human experts across major industries. The test is part of OpenAI’s long-term mission to track progress toward artificial general intelligence (AGI), which would enable AI to handle economically valuable work at a human or higher level. According to the…
