Note dumb bug: The names dataset should not include names: Carol, Karen, Julie and Judy. These are all tokenized as 2 tokens (unlike others that are 1). Leaving this error in the notebook so results match report but watch out for these token errors :').
I believe prepend_bos should also be False across notebook to match ACDC implementation. Again leaving in notebook to match report. This work was produced as part of a weekend hackathon, beware of bugs :D