A downloadable project.
We created a simple web app that lets users generate some standard mechanistic interpretability plots (based on Stefan’s explainer) for arbitrary prompts. By Stefan Heimersheim and Jonathan Ng. If the page shows an error ("Oh no"), pull o...