Regarding the sound I think I more or less understand now... Just square wave and only one channel.
The pixel part is a big relief. If pixels need to snap (such that a sprite in a fraction position) the programming would be nightmare.