Link to attention score experiments with Whisper and our analysis so far when the model hallucinates (provided in report too): https://github.com/erees1/alignment-jam/blob/main/Whisper_Attention.ipynb
jplhughes
2
Posts
A member registered Nov 10, 2022
Recent community posts
Interpreting Catastrophic Failure Modes in OpenAI’s Whisper jam comments · Posted in Interpreting Catastrophic Failure Modes in OpenAI’s Whisper jam comments
Here's the link to reproduce the logit lens experiments with Whisper (we didn't have time to put it in our write up): https://github.com/McHughes288/alignment-jam/blob/main/logit_lens_whisper.ipynb