Reading List

Google introduces FACTS Grounding benchmark for evaluating the factuality of LLMs, and announces a leaderboard that ranks Gemini 2.0 Flash Experimental on top (Google DeepMind) from Techmeme RSS feed.