You know what would be killer?
An ASIC for scrypt mining.
The catch: scrypt requires a fixed amount of RAM per thread (this is one of its major features). So to run many scrypt hashes in parallel, you'd also need tons of RAM. (Right now, Litecoin is tuned so that each thread needs only ~128 KB, but even that meaningfully limits how many hashes you could compute in parallel.)
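
To put rough numbers on that, here's a quick back-of-the-envelope sketch in Python. It assumes Litecoin's scrypt parameters are N=1024, r=1 and that the scratchpad takes 128 * r * N bytes per hash, which works out to the ~128 KB figure above. The function names and the 32 MiB of on-chip memory are made up purely for illustration; they don't describe any real chip.

```python
def scrypt_scratchpad_bytes(n: int, r: int) -> int:
    """Size of scrypt's main scratchpad for one hash: 128 * r * N bytes."""
    return 128 * r * n


def max_parallel_hashes(ram_bytes: int, n: int, r: int) -> int:
    """How many hashes fit in RAM at once if each one keeps a full scratchpad."""
    return ram_bytes // scrypt_scratchpad_bytes(n, r)


# Assumed Litecoin parameters (N=1024, r=1).
LITECOIN_N = 1024
LITECOIN_R = 1

per_hash = scrypt_scratchpad_bytes(LITECOIN_N, LITECOIN_R)
print(per_hash // 1024, "KiB per hash")  # 128 KiB per hash

# Hypothetical chip with 32 MiB of fast on-die memory (illustrative only).
on_chip_ram = 32 * 1024 * 1024
print(max_parallel_hashes(on_chip_ram, LITECOIN_N, LITECOIN_R), "hashes in parallel")  # 256
```

So the number of parallel hashes scales with how much fast memory you can put on the chip, not just with how many hashing cores you can fit.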