this isn't a bad little implementation you've packaged up here. it's probably the fastest AND least resource-hungry LLM i've ever run, though it does forget rules, insert nonsensical details, and repeat itself. it was quite easy to edit the starting prompts you provided into anything at all i wanted.
the only thing really missing is a "redo output" command.