I heard that in this algorithm the memory characteristics are important. Probably slow memory and one channel does not allow to fully open the CPU potential in this algorithm. I wonder what the developer will say in this regard
Dual-channel mode of operative memory is desirable. Ideally, it would be better for each core to have its own memory bar.