
What are you doing in this dead thread?

There is no problem with the scraping itself. My C# script works perfectly fine and the database holds all the data well. The problem is the analysis. I hoped the Jaccard index would be enough and would create a bump somewhere in the lower scores of the bell curve, indicating an anomaly, aka the cheaters. Unfortunately that didn't happen at all. In fact, some confirmed "cheaters" scored close to the middle.
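For anyone wondering what a Jaccard-based uniqueness score might look like, here is a rough sketch. It is in Python rather than C# and is not the actual script; the word-level tokenization, the "1 minus highest similarity to any other comment" scoring, and the names `tokens`, `jaccard` and `uniqueness_scores` are all assumptions made for illustration.

```python
import re


def tokens(comment: str) -> set[str]:
    """Split a comment into a set of lowercase word tokens (assumed tokenization)."""
    return set(re.findall(r"[a-z0-9']+", comment.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard index: size of the intersection divided by size of the union."""
    if not a and not b:
        return 1.0  # treat two empty comments as identical
    return len(a & b) / len(a | b)


def uniqueness_scores(comments: list[str]) -> list[float]:
    """Score each comment as 1 minus its highest Jaccard similarity to any other comment.

    This scoring rule is a hypothetical choice, not necessarily what the original script did.
    """
    token_sets = [tokens(c) for c in comments]
    scores = []
    for i, current in enumerate(token_sets):
        best = max(
            (jaccard(current, other) for j, other in enumerate(token_sets) if j != i),
            default=0.0,  # a lone comment has nothing to be similar to
        )
        scores.append(1.0 - best)
    return scores
```

A score near 0 means the comment is almost a word-for-word copy of another one, and a score near 1 means it shares almost no words with anything else. Since this only compares sets of words, short generic comments from honest users can land anywhere on the scale, which may be part of why the confirmed "cheaters" did not stand out.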

I kind of expected the algorithm to fail, since it is the barest of bones. All I got from it was a little scraping and database experience, as well as a somewhat inaccurate comment uniqueness rating. There is another thread about this, though.