Yes and there is also a CUDA intrinsic that search for the number of starting zero __ffsll which could be used to speed up the checking of public key.
CPU profiles of the last release:
(using compressed address)

(using uncompressed address)

EDIT: FindKey include, mainly, lookup table, ecc arithmetic (ModAdd and ModSub)
ModMulK1 is the SecpK1 modular mult, there are 2 ModMulK1 signatures for this method.
Added the 2 profiles for compressed and uncompressed.