Post
Topic
Board Development & Technical Discussion
Re: tcatm's 4-way SSE2 for Linux 32/64-bit 0.3.9 rc2
by
Vasiliev
on 16/08/2010, 03:17:07 UTC
I propose to compile sha256.cpp with -O3 -march=amdfamk10 (will work on 32bit and 64bit) as only CPUs supporting this instruction set (AMD Phenom, Intel i5 and newer) benefit from -4way and it'll improve performance by ~9%.
GCC 4.3.3 doesn't support -march=amdfamk10.  I get:
sha256.cpp:1: error: bad value (amdfamk10) for -march= switch
try -march=amdfam10