Play project
“Let's think step by step” reduces hindsight-neglect's itch.io pageWho else worked on this with you?
Theresa Thoraldson, Chris Lons, Richard Annilo, Victor levoso
Leave a comment
Log in with itch.io to leave a comment.
Who else worked on this with you?
Theresa Thoraldson, Chris Lons, Richard Annilo, Victor levoso
Log in with itch.io to leave a comment.
Comments
Winner of the fourth prize!! Congratulations!
General: Based off of another project, very neat. Proposes a clean solution to a pretty serious problem. I like the next steps.
Fazl: Worth running the same prompts on different datasets from the inverse scaling challenge.
Alignment: Creates an easy solution to a clearly defined problem and might generalize well beyond this. Does not “solve” cognition for the AI but increases its alignment drastically. Prompt engineers trained by model, since there’s big shifts based on the prompt.
AI Psychology: “Let’s think step by step” works in larger models. Maybe it is a general solution for things. Maybe it is a general alignment solution to instigate system 2 thinking. Escapes biasing prompts. Very limited actual understanding. Diverges from prompt game.
Novelty: Have not seen this simple prompt before.
Generality: Yes, accepted by the Inverse Scaling Prize team as well.
Reproducibility: A code base but needs manual annotation afterwards because of code limitations. 4 extra things: Rick-rolling YouTube links, ASCII art bias, only larger models can explain jokes, moral uncertainty is person-dependent. Awesome stuff!
I want to note that we weren't the first to come up with "let's think step by step",https://arxiv.org/abs/2205.11916 this paper might be the origin but I wouldn't be surprised if a lot of people discovered it independently before, it's not that hard to notice if you play around with models.
It's not special either, other prompts that mean the same also seem to work.
What's novel is that as far as I know nobody had tried to use it to reduce biases or in the inverse scaling price datasets.
(And in general the effects of using let's think step by step style prompts seems pretty unexplored so far in general).
Link to repository: https://github.com/tthoraldson/LLM_Hackathon