Please check the GitHub link for the latest version of the README: https://github.com/crsegerie/trojan-gpt-benchmark. Among other things, we used a very recent paper whose method allows mixing fine-tuned trojan weights in order to combine 2 bac...
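Since the abstract is cut off, here is a minimal sketch of the kind of weight mixing it describes, assuming two trojaned checkpoints fine-tuned from the same base model. The file names and the 50/50 interpolation are illustrative assumptions, not the repository's actual procedure.

import torch

def mix_state_dicts(sd_a, sd_b, alpha=0.5):
    # Linearly interpolate two state dicts that share keys and shapes.
    return {k: alpha * sd_a[k] + (1 - alpha) * sd_b[k] for k in sd_a}

# Hypothetical checkpoint names; each carries one backdoor.
sd_a = torch.load("trojan_a.pt", map_location="cpu")
sd_b = torch.load("trojan_b.pt", map_location="cpu")
torch.save(mix_state_dicts(sd_a, sd_b), "trojan_mixed.pt")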
The fundamental goal of the current research is to measure the precise amount of perturbation (noise) that must be added to a given image in order for the network to misclassify it. The estimation would be achieved via the metho...
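The abstract is truncated before it names the estimation method. As one common baseline (not necessarily the one the authors use), the minimal perturbation can be upper-bounded by sweeping the step size of a single FGSM attack until the prediction flips; a sketch under that assumption, for a single image:

import torch
import torch.nn.functional as F

def min_fgsm_epsilon(model, x, label, eps_grid):
    # x: (1, C, H, W) image in [0, 1]; label: (1,) tensor of the true class.
    x = x.clone().requires_grad_(True)
    F.cross_entropy(model(x), label).backward()
    direction = x.grad.sign()  # FGSM ascent direction
    with torch.no_grad():
        for eps in sorted(eps_grid):
            x_adv = (x + eps * direction).clamp(0, 1)
            if model(x_adv).argmax(dim=1).item() != label.item():
                return eps  # smallest tested epsilon that flips the prediction
    return None  # nothing in the grid caused a misclassification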
Artificial intelligence is rapidly developing new capabilities and can correctly interpret the context of grammatical structures and sentences. However, the problem of balancing parentheses remains relevant and unsolved in the domain of...
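For reference, the parenthesis-balancing task itself has a simple classical solution, the stack-based check below; the question the abstract raises is whether language models can reproduce it reliably.

def is_balanced(s: str) -> bool:
    # Classical stack-based bracket matching.
    pairs = {")": "(", "]": "[", "}": "{"}
    stack = []
    for ch in s:
        if ch in "([{":
            stack.append(ch)
        elif ch in pairs and (not stack or stack.pop() != pairs[ch]):
            return False
    return not stack

assert is_balanced("([]{})") and not is_balanced("([)]")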
Large language models (LLMs) build up models of the world and of tasks, leading to impressive performance on many benchmarks. But how robust are these models against bad data? Motivated by an example where an actively learning LLM is be...
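The abstract is cut off before describing the setup. A minimal, purely illustrative way to model "bad data" reaching a model that learns from a stream is to corrupt a fraction of incoming labels, as sketched below; the function names and flip rate are hypothetical, not the project's actual experiment.

import random

def poison_stream(stream, flip_rate=0.1, num_classes=2, seed=0):
    # Yield (x, y) pairs with roughly flip_rate of the labels corrupted.
    rng = random.Random(seed)
    for x, y in stream:
        if rng.random() < flip_rate:
            y = rng.randrange(num_classes)  # corrupted label
        yield x, y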
We can expect large language models to be deployed in numerous contexts that call for a friendly, natural-language interface. To augment their functionality, these systems will likely need to interface with other AI or software...
Based on the paper "Discovering Latent Knowledge in Language Models Without Supervision", this project discusses how well the proposed method applies to the concept of ambiguity. To do that, we tested the Contrast-Consistent Search (CCS) method on...
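For context, the CCS objective from Burns et al. trains a probe so that a statement and its negation receive consistent, confident probabilities. A minimal sketch of that published loss (the probe network and data pipeline are omitted):

import torch

def ccs_loss(p_pos, p_neg):
    # p_pos, p_neg: probe outputs in (0, 1) for a statement and its negation.
    consistency = (p_pos - (1 - p_neg)) ** 2       # p(x+) + p(x-) should be ~1
    confidence = torch.minimum(p_pos, p_neg) ** 2  # discourage the degenerate 0.5 answer
    return (consistency + confidence).mean()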
Language models generally show improved performance on a variety of tasks as their size increases. However, there is a class of problems for which an increase in model size results in worse performance; these are known as inverse scaling problems...