Skip to main content

Indie game storeFree gamesFun gamesHorror games
Game developmentAssetsComics
SalesBundles
Jobs
TagsGame Engines

Results

25 entries were submitted between 2022-11-11 17:00:00 and 2022-11-13 13:15:00. 217 ratings were given to 24 entries (96.0%) between 2022-11-13 13:15:00 and 2022-11-13 16:30:00. The average number of ratings per game was 8.7 and the median was .

Backup Transformer Heads are Robust to Ablation Distribution

by satojk

Ranked 1st in Reproducibility with 9 ratings (Score: 4.556)

View submission page

CriteriaRankScore*Raw Score
Interpretability#14.2224.222
Reproducibility#14.5564.556
Judge's choice#2n/an/a
Generality#102.7782.778
ML Safety#102.7782.778
Novelty#112.8892.889

War is 15% conflic, 15% DragonMagazine

by Giles

Ranked 2nd in Reproducibility with 11 ratings (Score: 4.364)

View submission page

CriteriaRankScore*Raw Score
Reproducibility#24.3644.364
Interpretability#43.6363.636
ML Safety#63.0003.000
Novelty#83.2733.273
Generality#142.5452.545

Optimising image patches to change RL-agent behaviour

by robertsc

Ranked 3rd in Reproducibility with 10 ratings (Score: 4.300)

View submission page

CriteriaRankScore*Raw Score
Reproducibility#34.3004.300
ML Safety#92.9002.900
Generality#162.5002.500
Novelty#162.7002.700
Interpretability#202.2002.200
CriteriaRankScore*Raw Score
Reproducibility#44.2434.500
Generality#92.8283.000
Interpretability#113.1823.375
Novelty#142.7112.875
ML Safety#162.2392.375

Probing Conceptual Knowledge on Solved Games

by mentaleap

Ranked 5th in Reproducibility with 14 ratings (Score: 4.214)

View submission page

CriteriaRankScore*Raw Score
ML Safety#33.2143.214
Judge's choice#4n/an/a
Reproducibility#54.2144.214
Novelty#132.8572.857
Interpretability#142.9292.929
Generality#202.2862.286

Model editing hazards at the example of ROME

by jas-ho, JuliaPersson, goodheart_points

Ranked 6th in Reproducibility with 8 ratings (Score: 4.125)

View submission page

CriteriaRankScore*Raw Score
Judge's choice#3n/an/a
Generality#53.0643.250
Reproducibility#64.1254.375
Novelty#63.3003.500
Interpretability#63.5363.750
ML Safety#82.9463.125

Regularly Oversimplifying Neural Networks

by Botahamec, Nicholas Kross

Ranked 7th in Reproducibility with 11 ratings (Score: 4.091)

View submission page

CriteriaRankScore*Raw Score
Novelty#53.3643.364
Reproducibility#74.0914.091
Generality#142.5452.545
ML Safety#152.2732.273
Interpretability#162.7272.727
CriteriaRankScore*Raw Score
ML Safety#13.7783.778
Interpretability#14.2224.222
Generality#13.4443.444
Novelty#23.7783.778
Reproducibility#83.8893.889
CriteriaRankScore*Raw Score
Novelty#43.5453.545
Generality#63.0003.000
Reproducibility#93.8183.818
Interpretability#133.0003.000
ML Safety#132.5452.545

Top-Down Interpretability Through Eigenspectra

by jhoogland

Ranked 10th in Reproducibility with 9 ratings (Score: 3.778)

View submission page

CriteriaRankScore*Raw Score
Generality#13.4443.444
ML Safety#43.1113.111
Novelty#103.1113.111
Interpretability#103.2223.222
Reproducibility#103.7783.778
No image :(

Interpreting Catastrophic Failure Modes in OpenAI’s Whisper

by Lawrencium103

Ranked 11th in Reproducibility with 9 ratings (Score: 3.667)

View submission page

CriteriaRankScore*Raw Score
Novelty#13.8893.889
ML Safety#23.2223.222
Generality#43.2223.222
Interpretability#53.5563.556
Reproducibility#113.6673.667
CriteriaRankScore*Raw Score
Interpretability#83.3003.500
Reproducibility#123.6533.875
Generality#132.5932.750
ML Safety#142.4752.625
Novelty#172.5932.750
CriteriaRankScore*Raw Score
Reproducibility#133.5363.750
ML Safety#202.0032.125
Generality#212.2392.375
Interpretability#222.1212.250
Novelty#231.7681.875
No image :(
CriteriaRankScore*Raw Score
Interpretability#113.1823.375
Reproducibility#133.5363.750
Novelty#142.7112.875
Generality#172.4752.625
ML Safety#221.6501.750

Mechanisms of Causal Reasoning

by Jacy Reese Anthis

Ranked 13th in Reproducibility with 8 ratings (Score: 3.536)

View submission page

CriteriaRankScore*Raw Score
ML Safety#112.5932.750
Generality#112.7112.875
Reproducibility#133.5363.750
Interpretability#172.5932.750
Novelty#222.2392.375
No image :(

Observing and Validating Induction heads in SOLU-8l-old

by poppingtonic

Ranked 16th in Reproducibility with 7 ratings (Score: 3.402)

View submission page

CriteriaRankScore*Raw Score
Reproducibility#163.4023.857
Interpretability#212.1422.429
Generality#232.0162.286
ML Safety#231.6381.857
Novelty#241.6381.857
No image :(
CriteriaRankScore*Raw Score
Judge's choice#1n/an/a
Generality#13.4443.444
Interpretability#33.7783.778
ML Safety#43.1113.111
Novelty#93.2223.222
Reproducibility#173.2223.222
No image :(

Finding unusual neuron sets by activation vector distance

by Gurkenglasius

Ranked 17th in Reproducibility with 9 ratings (Score: 3.222)

View submission page

CriteriaRankScore*Raw Score
Reproducibility#173.2223.222
Generality#182.4442.444
Novelty#182.5562.556
ML Safety#211.8891.889
Interpretability#232.0002.000
No image :(

Interpretability at a glance

by carlhenrikrolf, koriavinash1, HH10

Ranked 19th in Reproducibility with 8 ratings (Score: 3.182)

View submission page

CriteriaRankScore*Raw Score
Novelty#63.3003.500
Generality#72.9463.125
Interpretability#83.3003.500
ML Safety#112.5932.750
Reproducibility#193.1823.375

Interpretability Hackathon: Sparsity Lens

by astOwOlfo

Ranked 20th in Reproducibility with 7 ratings (Score: 3.024)

View submission page

CriteriaRankScore*Raw Score
Generality#82.8983.286
Interpretability#152.7723.143
ML Safety#182.1422.429
Novelty#192.5202.857
Reproducibility#203.0243.429