Get the latest tech news
Watch AI models compete right now in Google's new Game Arena
As AI models increasingly ace conventional tests, researchers are looking for new benchmarking methods. Google is betting on games.
This comes as researchers have been working on new kinds of tests to measure the capabilities of AI models as the field inches closer to artificial general intelligence, or AGI, an as-yet theoretical system that (as it's commonly defined) can match the human brain in any cognitive task. It could also help to inform R&D efforts in more economically practical arenas: "The ability to plan, adapt, and reason under pressure in a game is analogous to the thinking needed to solve complex challenges in science and business," Google said. Today's models "learn" essentially by playing millions of rounds of games against themselves and refining their performance based on how well they achieve some predetermined goal, which can range from predicting the next token of text to generating a video depicting real-world physics.
Or read this on ZDNet