Reading List
Google DeepMind's mechanistic interpretability team details why it shifted from fully reverse-engineering neural nets to a focus on "pragmatic interpretability" (AI Alignment Forum) from Techmeme RSS feed.
Google DeepMind's mechanistic interpretability team details why it shifted from fully reverse-engineering neural nets to a focus on "pragmatic interpretability" (AI Alignment Forum)
AI Alignment Forum:
Google DeepMind's mechanistic interpretability team details why it shifted from fully reverse-engineering neural nets to a focus on “pragmatic interpretability” — Executive Summary — The Google DeepMind mechanistic interpretability team has made a strategic pivot over the past year …