A more important question would be, how are you going to move those addresses back to host memory? CUDA has a copy-memory-between-device-and-host function, but I am not so sure about OpenCL - at any rate, calling such a function for each address would be very slow, and I advise you to fill up a block of device memory with address bytes before copying it to the host RAM.