Recently I have been trying to make a 32b computer and I have some issues with the ALU
When building a multiplier circuit I have discovered a lot of lag. The reason I have tried to make it into hardware is because it can be efficient as all hell, Im talking O(1) efficient.
But if I use an algorithm I can reduce the lag by a lot but it comes with a price. It takes O(4N+4) operation which is relatively.
Im stumped can you guys help me out