Using NNs to crack cryptographic functions is pointless. NNs can capture only simple dependencies.
I expect the number of weights needed to predict even one bit with a better-than-negligible probability to be on the order of 2^128.
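To give a feel for that order of magnitude, here is a rough back-of-the-envelope calculation. The 4 bytes per weight (32-bit floats) is my own assumption for illustration, not part of the estimate itself.

```python
# Rough scale of a network with 2^128 weights.
weights = 2 ** 128           # the estimate discussed above
bytes_per_weight = 4         # assumption: each weight is a 32-bit float
total_bytes = weights * bytes_per_weight

print(f"{weights:.3e} weights, {total_bytes:.3e} bytes of storage")
```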
NNs can capture very hard dependencies; it depends on the type and number of hidden layers.
https://www.sciencedirect.com/science/article/pii/S0895717707000362
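As a small illustration of why hidden layers matter, here is a hand-wired sketch (not taken from the linked paper) of a network with one hidden layer of threshold units that computes the parity of its input bits, a function no single-layer perceptron can represent. The names `step` and `parity_net` are made up for this example, and the weights are set by hand rather than learned. It only shows what a hidden layer can represent; it says nothing about whether suitable weights could be found for a cryptographic function.

```python
from itertools import product

def step(z):
    """Threshold activation: 1 if z >= 0, else 0."""
    return 1 if z >= 0 else 0

def parity_net(bits):
    """Hand-wired net: n inputs -> n hidden threshold units -> 1 output."""
    n = len(bits)
    s = sum(bits)  # every hidden unit uses weight 1 on every input
    # Hidden unit k (k = 1..n) has bias -k, so it fires exactly when
    # at least k of the inputs are set.
    hidden = [step(s - k) for k in range(1, n + 1)]
    # Output weights alternate +1, -1, +1, ... so the weighted sum is 1
    # when an odd number of inputs are set and 0 otherwise.
    total = sum(h if k % 2 == 1 else -h
                for k, h in enumerate(hidden, start=1))
    return step(total - 0.5)

# Check every 4-bit input against the true parity.
for b in product([0, 1], repeat=4):
    assert parity_net(list(b)) == sum(b) % 2
```

With the hidden layer removed, the output would be a single threshold on a weighted sum of the inputs, which cannot separate odd from even bit counts for n >= 2.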