A benchmark of expert-level academic questions to assess AI capabilities – Nature Google Alert – Artificial General Intelligence
The saturation of existing benchmarks, as shown in Fig. 1, limits our ability to precisely measure artificial intelligence (AI) capabilities and calls …