Skip to main content

On Sale: GamesAssetsToolsTabletopComics
Indie game storeFree gamesFun gamesHorror games
Game developmentAssetsComics
SalesBundles
Jobs
TagsGame Engines

​​I rated YOU COMMENTS! See the results here

A topic by Voidsay created Aug 17, 2020 Views: 362 Replies: 16
Viewing posts 1 to 8
Submitted(+4)

Over the last week I have been programming an API that retrieves your comments under every submitted "Brackeys Jam" game and calculates a "Comment Uniqueness Index" shortened to "CUI". The CUI is a value between 0 and 100.000 with 0 meaning that every comment you have made is a copy of every other comment and 100.000 meaning that you have never used the same word twice.

So without further to do here are my findings:

Bell curve:

Ratings TOP 10:

Name Game ID CUI Number of comments*
1 OgelGames 722621 90536 43
2 basaOnly 720648 90187 19
3 Pix1lDev 723136 89994 40
4 Haredo 724910 89690 48
5 sohwathismelvin 723687 89666 115
6 Jason Flores 723454 89515 29
7 ldd 718085 89478 21
8 Mr. Minticuz 724863 89453 30
9 Tipu 722753 89394 30
10 NoelOskar 724990 89269 28

In total 751 participants fulfilled the minimal requirement (at least 5 comments under ratings) for a score. Because of post limitation in the community tab I couldn't put all the score here. If you wish to see the full list you'll need to visit my github and open the "CUI_Score.pdf" .


*note that your comments under your own submission aren't counted here, since a reasonable assumption could be made that you would write a lot of "thank yous" that in turn would count against your score.

Some questions you might have:

If you can't find yourself in the ratings this can be because of a number of things. 

  1. you didn't meat the minimum requirement of 5 comments under the rating page of other participants.
  2. this data set was taken on Sunday the 16th, so it might not include some comments that you have written since then.
  3. some of your comments got eaten by the database Craken (he doesn't like special characters) and I didn't bother adding them manually. I am a lazy void after all.
  4. your name contained special characters. In this case you are still in my database but with a jumbled name.

Note that a low score doesn't automatically mean that you are a bot or a bad rater. Some low scores can be explained by the low minimal requirement of 5 comments. Short and generic comments ("I liked the game", "really good implementation of the theme") as well as an excessive usage of common words ("a", "the", "I"...) can lead to a low score. The algorithm is super simple and doesn't necessarily reflect the contents correctly.

About the process:

I have written an API in C# that collected all games and authors from the Breackeys Jam 2020.2 submissions page and saves them in a MySql database. After that it goes thru the list of submitted games and collects all comments made on the rating page of each game.

This data set is then fed into a Jaccard Index calculation function, that compares every comment that a participant made under other peoples games with every other such comment, including itself. This causes some inaccuracy, which I accept (because I am a lazy void). The average is calculated from these indexes. Finlay I subtracted the calculated value from one and multiply it by 100.000. The reason behind this is  to make the scores more impressive. Displaying the CUI as a float from 0 to 1, would have been basically the same.

If you have other questions regarding the data and how it was acquired, a mistake you have spotted or simply want to offer a suggestion you can write me a comment!

If you're interested in the data set I used to preform your own analysis you can request the MySql database or an excel file (I am currently searching for a good place to upload it)

Submitted

I think alot of the cheaters comments got eaten because they usually include smiley faces or emojis in their comments since the lower you go the less comments there are in the list. 

Submitted(+1)

Emojis as well as pictures and links shouldn't pose any issue. A thing that I did notice however is that some games have several people listed as authors and I am pretty sure that I didn't copy both. I assume that these second/third authors cause db problems, but first I will have to find some examples to do more testing.

Would be cool to have a final score after the rating period!

Submitted

You had written comments with special characters would get eaten so something like this    :)   would be eaten by the database Kraken  ?

Submitted(+1)

I actually thought about it a little more.

I actually believe that those eaten comments where caused ether by the fact that the database is set to Swedish Latin by default (idk, why that is and I am not sure if I changed it before or after collecting all the author names)
or by the fact that my comments table requires an author name as a foreign key (I suspect I missed some people that where able to comment since they technically participated, but where the second or third author of the game, so they where not in the primary key table... more database lingo... You get the point I am still bad at getting data and storing it)

I just like blaming the Kraken for my misfortunes and bugs.

Submitted

damn i am like top 5 

Submitted

Apparently writing more than 100 comments didn't hurt your score. I am actually not sure how the number of comments affect the final score, but I should take another look at this once my head is a little more clear.

Congrats by the way!

PS how accurate would you evaluate the result? Did you actually write completely different comments or is there something wrong with my program?

Submitted

I wouldn’t say that all my comments is entirely different. I do point out whats wrong and whats right. At the same time i do say “good job” or “great work” or “congrats” at the end of most comments. 
did you take those comments for a post i made on the forum? Because most of them i wrote “rated and commented” (i posted comments on their page instead). 

Great job btw! Efforts. 

PS. Phewww yr code didnt catch me XD

Submitted

Forum posts where ignored. Only posts under submissions.

Since I saw a couple rated and commented posts on your timeline, I was a little curious about your big score and how this type of comment didn't effect it. But as I said forum posts and other comments where ignored, so it makes sense.

Submitted

wow. I’m so glad I made it to the TOP 10 in position #7. I personally find what you’ve done to be so interesting and it drew a smile on my face.

I tried to point out, in as much detail as I could, what I liked and didn’t like about each game, and although I got a couple of generic comments on my game, I am very thankful for the people that left very precise and actionable feedback.

My Top 7 result here feels great. Thanks for validating those of us who took some time to play each and every game that we could.

Submitted

Congrats!

Originally I wanted to do something different with the data set, but decided that I am always way too negative with my intentions, so I thought it would be nice to do something positive for a change. (Never mind the algorithm being too crude to show the results I wanted, so here we are)

I will probably recompile all the data at the end of the rating period to show off the final result.

Submitted

This is pretty cool and I'm just posting to get it back to the top so more people see it

Deleted post
Submitted

Could you elaborate on the term score? I don't exactly understand what you mean by that. If you mean the CUI in order it is in the file just at the very bottom.

I sort of want to try to calculate the information content of each comment and calculate the average for each author, since I remembered that that was a thing in I.T. class.

Deleted post
Submitted (1 edit)

Wow really fascinating! 

I’m in the top 100 though😅

Submitted

Amazing! 

Thank you for taking the time to do something about this.