That looks about right. i7z helps you visualise if the 'correct' cores are unloaded. You don't need to split into two processes - affining the process to the cores works fine.
I get ~630h/s from one node of 2 e5-2650l v1s. Not very power efficient, but with rising prices....