Reference OEM 7970, Catalyst 13.3 beta, with the SDK that came with it. CGMiner 2.11.4, Diablo kernel, worksize 256, default vectors (1), intensity 11, 2 GPU threads. I didn't need to specify any other options aside from the pool and the GPU clock speeds: 1185 MHz core, 300 MHz memory.
713.2 MHash/sec
Running with 1 GPU thread gave almost exactly 700 MHash/sec. I have not tried all the combinations yet.
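For anyone wanting to try the same settings, here is a rough sketch of how they would map onto a cgminer command line. The pool URL, worker name, and password are placeholders I made up, and the flag names are the ones I remember from the cgminer 2.x README, so check them against `cgminer --help` for your build. The script only prints the command so you can look it over before running it.

```shell
#!/bin/sh
# Placeholders (assumptions, not from the post): swap in your own pool details.
POOL_URL="http://pool.example.com:8332"
POOL_USER="worker"
POOL_PASS="password"

# Settings from the post: Diablo kernel, worksize 256, vectors 1, intensity 11,
# 2 GPU threads, 1185 MHz core, 300 MHz memory.
# Flag names assumed from the cgminer 2.x README; verify with `cgminer --help`.
CGMINER_CMD="cgminer -o $POOL_URL -u $POOL_USER -p $POOL_PASS \
--kernel diablo --worksize 256 --vectors 1 --intensity 11 --gpu-threads 2 \
--gpu-engine 1185 --gpu-memclock 300"

# Print the command for review instead of launching the miner.
echo "$CGMINER_CMD"
```

Since vectors 1 is described above as the default, `--vectors 1` can probably be dropped. Changing `--gpu-threads 2` to `--gpu-threads 1` gives the single-thread setup for comparison.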
Can you post your card and cgminer command line please?