New AGI benchmark indicates whether a future AI model could cause ‘catastrophic harm’ Google Alert – Artificial General Intelligence

by · October 14, 2024

OpenAI scientists have designed MLE-bench — a compilation of 75 extremely difficult tests that can assess whether a future advanced AI agent is …