25 entries were submitted between 2022-11-11 17:00:00 and 2022-11-13 13:15:00. 217 ratings were given to 24 entries (96.0%) between 2022-11-13 13:15:00 and 2022-11-13 16:30:00. The average number of ratings per entry was 8.7 and the median was .
Criteria | Rank | Score* | Raw Score |
Interpretability | #1 | 4.222 | 4.222 |
Reproducibility | #1 | 4.556 | 4.556 |
Judge's choice | #2 | n/a | n/a |
Generality | #10 | 2.778 | 2.778 |
ML Safety | #10 | 2.778 | 2.778 |
Novelty | #11 | 2.889 | 2.889 |
Criteria | Rank | Score* | Raw Score |
Reproducibility | #2 | 4.364 | 4.364 |
Interpretability | #4 | 3.636 | 3.636 |
ML Safety | #6 | 3.000 | 3.000 |
Novelty | #8 | 3.273 | 3.273 |
Generality | #14 | 2.545 | 2.545 |
Criteria | Rank | Score* | Raw Score |
Reproducibility | #3 | 4.300 | 4.300 |
ML Safety | #9 | 2.900 | 2.900 |
Generality | #16 | 2.500 | 2.500 |
Novelty | #16 | 2.700 | 2.700 |
Interpretability | #20 | 2.200 | 2.200 |
Criteria | Rank | Score* | Raw Score |
Reproducibility | #4 | 4.243 | 4.500 |
Generality | #9 | 2.828 | 3.000 |
Interpretability | #11 | 3.182 | 3.375 |
Novelty | #14 | 2.711 | 2.875 |
ML Safety | #16 | 2.239 | 2.375 |
Criteria | Rank | Score* | Raw Score |
ML Safety | #3 | 3.214 | 3.214 |
Judge's choice | #4 | n/a | n/a |
Reproducibility | #5 | 4.214 | 4.214 |
Novelty | #13 | 2.857 | 2.857 |
Interpretability | #14 | 2.929 | 2.929 |
Generality | #20 | 2.286 | 2.286 |
Criteria | Rank | Score* | Raw Score |
Judge's choice | #3 | n/a | n/a |
Generality | #5 | 3.064 | 3.250 |
Reproducibility | #6 | 4.125 | 4.375 |
Novelty | #6 | 3.300 | 3.500 |
Interpretability | #6 | 3.536 | 3.750 |
ML Safety | #8 | 2.946 | 3.125 |
Criteria | Rank | Score* | Raw Score |
Novelty | #5 | 3.364 | 3.364 |
Reproducibility | #7 | 4.091 | 4.091 |
Generality | #14 | 2.545 | 2.545 |
ML Safety | #15 | 2.273 | 2.273 |
Interpretability | #16 | 2.727 | 2.727 |
Criteria | Rank | Score* | Raw Score |
ML Safety | #1 | 3.778 | 3.778 |
Interpretability | #1 | 4.222 | 4.222 |
Generality | #1 | 3.444 | 3.444 |
Novelty | #2 | 3.778 | 3.778 |
Reproducibility | #8 | 3.889 | 3.889 |
Criteria | Rank | Score* | Raw Score |
Novelty | #4 | 3.545 | 3.545 |
Generality | #6 | 3.000 | 3.000 |
Reproducibility | #9 | 3.818 | 3.818 |
Interpretability | #13 | 3.000 | 3.000 |
ML Safety | #13 | 2.545 | 2.545 |
Criteria | Rank | Score* | Raw Score |
Generality | #1 | 3.444 | 3.444 |
ML Safety | #4 | 3.111 | 3.111 |
Novelty | #10 | 3.111 | 3.111 |
Interpretability | #10 | 3.222 | 3.222 |
Reproducibility | #10 | 3.778 | 3.778 |
Criteria | Rank | Score* | Raw Score |
Novelty | #1 | 3.889 | 3.889 |
ML Safety | #2 | 3.222 | 3.222 |
Generality | #4 | 3.222 | 3.222 |
Interpretability | #5 | 3.556 | 3.556 |
Reproducibility | #11 | 3.667 | 3.667 |
Criteria | Rank | Score* | Raw Score |
Interpretability | #8 | 3.300 | 3.500 |
Reproducibility | #12 | 3.653 | 3.875 |
Generality | #13 | 2.593 | 2.750 |
ML Safety | #14 | 2.475 | 2.625 |
Novelty | #17 | 2.593 | 2.750 |
Criteria | Rank | Score* | Raw Score |
Reproducibility | #13 | 3.536 | 3.750 |
ML Safety | #20 | 2.003 | 2.125 |
Generality | #21 | 2.239 | 2.375 |
Interpretability | #22 | 2.121 | 2.250 |
Novelty | #23 | 1.768 | 1.875 |
Criteria | Rank | Score* | Raw Score |
Interpretability | #11 | 3.182 | 3.375 |
Reproducibility | #13 | 3.536 | 3.750 |
Novelty | #14 | 2.711 | 2.875 |
Generality | #17 | 2.475 | 2.625 |
ML Safety | #22 | 1.650 | 1.750 |
Criteria | Rank | Score* | Raw Score |
ML Safety | #11 | 2.593 | 2.750 |
Generality | #11 | 2.711 | 2.875 |
Reproducibility | #13 | 3.536 | 3.750 |
Interpretability | #17 | 2.593 | 2.750 |
Novelty | #22 | 2.239 | 2.375 |
Criteria | Rank | Score* | Raw Score |
Reproducibility | #16 | 3.402 | 3.857 |
Interpretability | #21 | 2.142 | 2.429 |
Generality | #23 | 2.016 | 2.286 |
ML Safety | #23 | 1.638 | 1.857 |
Novelty | #24 | 1.638 | 1.857 |
Criteria | Rank | Score* | Raw Score |
Judge's choice | #1 | n/a | n/a |
Generality | #1 | 3.444 | 3.444 |
Interpretability | #3 | 3.778 | 3.778 |
ML Safety | #4 | 3.111 | 3.111 |
Novelty | #9 | 3.222 | 3.222 |
Reproducibility | #17 | 3.222 | 3.222 |
Criteria | Rank | Score* | Raw Score |
Reproducibility | #17 | 3.222 | 3.222 |
Generality | #18 | 2.444 | 2.444 |
Novelty | #18 | 2.556 | 2.556 |
ML Safety | #21 | 1.889 | 1.889 |
Interpretability | #23 | 2.000 | 2.000 |
Criteria | Rank | Score* | Raw Score |
Novelty | #6 | 3.300 | 3.500 |
Generality | #7 | 2.946 | 3.125 |
Interpretability | #8 | 3.300 | 3.500 |
ML Safety | #11 | 2.593 | 2.750 |
Reproducibility | #19 | 3.182 | 3.375 |
Criteria | Rank | Score* | Raw Score |
Generality | #8 | 2.898 | 3.286 |
Interpretability | #15 | 2.772 | 3.143 |
ML Safety | #18 | 2.142 | 2.429 |
Novelty | #19 | 2.520 | 2.857 |
Reproducibility | #20 | 3.024 | 3.429 |
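
In most tables above, Score* equals Raw Score, but in some it is lower by a fixed ratio (for example 4.243 against 4.500, or 3.402 against 3.857). The results text does not say how Score* is derived. The sketch below is one reading that fits those rows, under two assumptions not stated in the source: that the raw score is scaled by the square root of the entry's rating count over the median rating count (capped at 1), and that the median was 9 ratings. The function name, the rating counts, and `ASSUMED_MEDIAN` are hypothetical and used only for illustration.

```python
import math


def adjusted_score(raw_score: float, num_ratings: int, median_ratings: float) -> float:
    """Scale a raw score down when an entry has fewer ratings than the median.

    Assumed rule (not stated in the results text):
        adjusted = raw * sqrt(min(num_ratings, median) / median)
    """
    weight = math.sqrt(min(num_ratings, median_ratings) / median_ratings)
    return raw_score * weight


# Assumption: the median is missing from the summary line; 9 is the value
# that happens to reproduce the rows below.
ASSUMED_MEDIAN = 9

# Raw averages written as (sum of ratings) / (assumed number of ratings).
print(round(adjusted_score(36 / 8, 8, ASSUMED_MEDIAN), 3))  # 4.243 (raw 4.500)
print(round(adjusted_score(27 / 7, 7, ASSUMED_MEDIAN), 3))  # 3.402 (raw 3.857)
print(round(adjusted_score(38 / 9, 9, ASSUMED_MEDIAN), 3))  # 4.222 (raw 4.222, unchanged)
```

Under this reading, an entry rated by at least the median number of people keeps its raw score, which would explain why Score* and Raw Score are identical in most tables.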