The processing of one 512-bit block is performed in 66 clock cycles and the bit-rate achieved is 7.75Mbps / MHz on the input of the SHA256 core.
Its important to note performance/SHA-256 IP is not the issue. The issue is getting everything else together in a cost effective manner.
-SZ