Announcement_8

Paper accepted at the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) Mechanistic Interpretability Workshop.