So I have taken the first step toward my cheater detector: a simple text comparator!
Here is the source code:
https://gist.github.com/Voidsay/d84a64fc94e79a31c1a005da1245c2cc
It will be able to detect copy-pasted comments, as well as comments that are slightly misspelled or slightly varied.
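To illustrate the general idea (this is a rough sketch, not the code from the gist), a comparator like this can be approximated in Python with the standard library's `difflib`. The function name `similarity` and the case/whitespace normalization are assumptions made for this example.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the normalized texts are identical."""
    # Assumed normalization: ignore case and collapse runs of whitespace,
    # so trivial edits do not hide a copy-pasted comment.
    norm_a = " ".join(a.lower().split())
    norm_b = " ".join(b.lower().split())
    return SequenceMatcher(None, norm_a, norm_b).ratio()
```

An exact copy scores 1.0, a comment with a small typo stays close to 1.0, and unrelated text scores much lower.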
Next I will figure out how to go through all of the participants' comments and compare them to each other to calculate a final similarity score for each participant.
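One possible way to do this, sketched under some assumptions: comments are grouped per participant in a dictionary, and a participant's score is the average similarity over every pair of their own comments. The names `participant_scores` and `comments_by_user`, the use of `difflib`, and the choice of averaging are all hypothetical; comparing against other participants' comments or taking the maximum would be equally reasonable variants.

```python
from difflib import SequenceMatcher
from itertools import combinations
from statistics import mean


def similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] between two comments, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def participant_scores(comments_by_user: dict[str, list[str]]) -> dict[str, float]:
    """Average pairwise similarity of each participant's own comments."""
    scores = {}
    for user, comments in comments_by_user.items():
        # Every unordered pair of this participant's comments.
        pairs = list(combinations(comments, 2))
        # Assumption: a participant with fewer than two comments scores 0.0.
        scores[user] = mean(similarity(a, b) for a, b in pairs) if pairs else 0.0
    return scores


if __name__ == "__main__":
    example = {
        "alice": ["nice post", "nice post", "nice post!"],
        "bob": ["hello there", "what time is it"],
    }
    for user, score in participant_scores(example).items():
        print(f"{user}: {score:.2f}")
```

With this kind of score, someone who keeps posting near-identical comments ends up close to 1.0, while a normal commenter should land noticeably lower.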
Hopefully I will then be able to see a clear difference between normal commenters and dirty dirty cheaters.