To come back to yours, I think the error is in the first sentence: "the difference between successive values of Z(i) = log(P(i)) are independent random variables with probability distributions that are symmetric about zero"
Small proof: under log brownian (with no drift) the important basic concept is that the best expectation of price in the future is the current value of the price.
OK, it seems that I was using a definition of "log Brownian" that is not the standard one used in finance.
Indeed I was assuming that the increments D(i) = Z(i+1)-Z(i) = log(P(i+1)/P(i)) were normal variables with
zero mean, so that Z(i) would be a Brownian variable as it is usually defined in other areas - with no trend. (Note that I am a prof of
computer science, not economics!)
However, as you point out, by that definition the expected price E(P(i0+n)) would grow exponentially with n; which does not make sense in the trading context, where the "efficient market hypothesis" demands E(P(i0+n)) = P(i0). (Or does it? See below.)
We agree at least that in a log-Brownian model the increments D(i) = Z(i+1)-Z(i) = log(P(i+1)/P(i)) should be assumed to be independent random variables, yes?
The standard way to achieve E(P(i0+n)) = P(i0), in finance, seems to be: assume that the increments D(i) are Gaussian variables with slightly negative mean, mu = -sigma^2/2. That is, one assumes a slight negative trend in the log-price Z(i) so that the broadening of the log-normal distribution of P(i0+n) as n increases preserves the mean P(i0). Is that correct?
That assumption satisfies the "efficient market hypothesis", but implies (as you pointed out) that the price is slightly more likely to go down than to go up at each step. Then Prob(P(i0+n) < P(i0)) increases with with the stride n. Which seems weird too.
We can get rid of this weidness by assuming a probability distribution for D(i) such that E(D(i)) = 0, E(exp(D(i))) = 1. It seems that these two conditions cannot be obtained with a Gaussian distribution, except in the limit when sigma → 0. However, they can be achieved with other distributions that are symmetric about zero
, especially if they have fatter tails than the Gaussian. And, indeed, the most obvious deficiency of the log-Brownian model seems to be that, in real data, the distribution of the increments is not Gaussian.
If my math is correct, the distribution of the n-step increments too would satisfy both conditions: E(Z(i0+n)) = Z(i0), and E(P(i0+n)) = P(i0). Moreover the distribution of Z(i0+n), being the convolution of n symmetric distributions, would be symmetric about Z(i0), implying that Prob(Z(i0+n) < Z(i0)) = 1/2, and hence Prob(P(i0+n) < P(i0)) = 1/2.
With these assumptions, even though the distribution of P(i0+n)/P(i0) approaches a log-normal distribution as n increases (by the central limit theorem), it remains sufficiently "log-abnormal" to satisfy those conditions (which a true log-normal distribution cannot achieve, it seems).
Perhaps you can tell me what would be a convenient
"fat-tailed" symmetric distribution to assume for the increments D(i) that would satisfy both conditions. (Perhaps a mixture of Gaussians with zero mean whose variance has log-normal distribution? I would have a justification for that choice...)
Finally, about the "efficient market hypothesis": shouldn't it say that E(P(i0 + n)) = P(i0)*Q^n, where Q > 1 is the typical ROI factor of a generic investment per time step? That is, if E(P(i0+n)) = X, then P(i0) should be
less than X, otherwise other investments would be more profitable.
With that modification to the "efficient market hypothesis", the distribution of D(i) must satisfy E(exp(D(i)) = Q, not 1; and one can achieve that even with a
zero-mean Gaussian if desired. In that case one would have a legitimate log-Browninan model (with Gaussian increments) such that that E(Z(i0+n)) = Z(i0), E(P(i0+n)) = P(i0)*Q^n, and Prob(P(i0+n) < P(i0)) = 1/2. Does this make sense?
EDIT: not sure whether the distribution must/may have fatter tails than a Gaussian. Too sleepy to think now...
- "We agree at least that in a log-Brownian model the increments D(i) = Z(i+1)-Z(i) = log(P(i+1)/P(i)) should be assumed to be independent random variables, yes? " yes
- "Then Prob(P(i0+n) < P(i0)) increases with with the stride n. Which seems weird too." At first maybe, but this makes sense as the expectation of the price in the future being the current price, and the price being limited on the downside to 0, the pdf (probability density function) of the price in the lower than current area is higher to compensate for the fat tail of the density. Think that if current price is 100, there is a small but non zero probability that it will be 1million in 1 year. To keep expectation in 1 year being 100 and since we can't have negative prices, it means there is a large weight on small positive values, and therefore the probability of price being lower must be higher. I don't think there is any way to avoid that, one must just get convinced that it is not weird. And it isn't, if you look at historical prices of financial time series, this is a true historical fact.
- "Finally, about the "efficient market hypothesis": shouldn't it say that E(P(i0 + n)) = P(i0)*Q^n, where Q > 1 is the typical ROI factor of a generic investment per time step?" your Q is usually the risk less interest rate, which we assumed here to be 0 when we said no trend in the price