Reading List

How Anthropic, OpenAI, and Google are testing AI models by having them play Pokémon Blue on Twitch to track a model's ability to reason and make decisions (Isabelle Bousquette/Wall Street Journal) from Techmeme RSS feed.