Is there a way to use the cpu+gpu together?
Pretty sure that is what this miner does. Its just not well optimized yet. Give it a few months, now that the source is public, other people will make small changes and updates, and eventually it will be quite fast. There is no reason to believe that parallelization of prime sieving wouldn't speed things up, it's just very difficult to implement, and takes time.