New AGI benchmark indicates whether a future AI model could cause ‘catastrophic harm’ Google Alert – Artificial General Intelligence
OpenAI scientists have designed MLE-bench — a compilation of 75 extremely difficult tests that can assess whether a future advanced AI agent is …