Help optimizing simple decompressor

I’ve managed to fix my code and pushed it to my fork:

On my machine, it improves the running time of the cbf-bench-binary from 2.284s to 0.288s, so almost 8x. It also reduces memory residency from 534MB to 24MB which is about 22x less and suggests it runs in constant additional memory.

2 Likes