15 entries were submitted between 2023-01-20 16:00:00 and 2023-01-23 03:15:00. 52 ratings were given to 15 entries (100.0%) between 2023-01-23 03:15:00 and 2023-01-25 14:00:00. The average number of ratings per game was 3.5 and the median was .
Criteria | Rank | Score* | Raw Score |
Judge's choice | #1 | n/a | n/a |
Reproducibility | #1 | 4.400 | 4.400 |
Mechanistic interpretability | #2 | 4.400 | 4.400 |
Novelty | #3 | 4.200 | 4.200 |
Generality | #11 | 2.800 | 2.800 |
ML Safety | #11 | 2.800 | 2.800 |
Criteria | Rank | Score* | Raw Score |
Mechanistic interpretability | #1 | 4.571 | 4.571 |
Judge's choice | #2 | n/a | n/a |
Generality | #4 | 3.286 | 3.286 |
ML Safety | #4 | 3.429 | 3.429 |
Reproducibility | #4 | 4.143 | 4.143 |
Novelty | #8 | 3.000 | 3.000 |
Criteria | Rank | Score* | Raw Score |
Judge's choice | #3 | n/a | n/a |
Reproducibility | #3 | 4.250 | 4.250 |
Mechanistic interpretability | #5 | 4.250 | 4.250 |
Generality | #5 | 3.000 | 3.000 |
Novelty | #8 | 3.000 | 3.000 |
ML Safety | #12 | 2.750 | 2.750 |
Criteria | Rank | Score* | Raw Score |
Judge's choice | #4 | n/a | n/a |
Novelty | #4 | 3.750 | 3.750 |
Reproducibility | #5 | 4.000 | 4.000 |
Generality | #5 | 3.000 | 3.000 |
ML Safety | #6 | 3.250 | 3.250 |
Mechanistic interpretability | #9 | 3.750 | 3.750 |