Not aiming to be argumentative, because I never studied it deeply, but wasn't there a strong analysis showing that a large GPU optimization (and by extension FPGA/ASIC) exists for your Cuckoo algorithm? I never saw a response from you on that.
The Cuckoo Cycle webpage lists several performance claims. Which of the claims does the strong analysis refute? And why haven't you claimed the corresponding large bounty?