Ahh, great to have been able to iterate on it that much :D . It shows.
Yeah that'd be a nice solution too. It makes 'sense' but more feedback when interacting would really help solidify the idea faster.
With the pitch you could possibly have the same end point but different start point based on the item. For example (don't know your values) to steal a small painting you need to reduce 5 to 0, so the pitch starts offset by multiplier of 5, changing until 0. And the larger statues need to be reduced from 15 to 0, so they're offset by 15 (that'd naturally sound heavier also) slowly coming down to an offset of 0. Then you can immediately hear the difference, while having the same final pitch become familiar and anticipated.