I got the data from here:
https://www.english-corpora.org/coca/
It's the Corpus of Contemporary American English (COCA), which is usually used in linguistics research. It looks like they've just updated it (April 2020)!
Thanks for the words of encouragement. I have been thinking about making a small Jackbox-like collection including this.
Comments like this definitely help get me excited about exploring that!