Brownian Motion and the Riemann Zeta Function

Intriguingly, various constructions related to Brownian motion result in quantities with moments described by the Riemann zeta function. These distributions appear in integral representations used to extend the zeta function to the entire complex plane, as described in an earlier post. Now, I look at how they also arise from processes constructed from Brownian motion such as Brownian bridges, excursions and meanders.

Recall the definition of the Riemann zeta function as an infinite series

$\displaystyle \zeta(s)=1+2^{-s}+3^{-s}+4^{-s}+\cdots$

which converges for complex argument s with real part greater than one. This has a unique extension to an analytic function on the complex plane outside of a simple pole at s = 1.

Often, it is more convenient to use the Riemann xi function which can be defined as zeta multiplied by a prefactor involving the gamma function,

$\displaystyle \xi(s)=\frac12s(s-1)\pi^{-s/2}\Gamma(s/2)\zeta(s).$

This is an entire function on the complex plane satisfying the functional equation ξ(1 - s) = ξ(s).

It turns out that ξ describes the moments of a probability distribution, according to which a random variable X is positive with moments

$\displaystyle {\mathbb E}[X^s]=2\xi(s),$

(1)

which is well-defined for all complex s. In the post titled The Riemann Zeta Function and Probability Distributions, I denoted this distribution by Ψ, which is a little arbitrary but was the symbol used for its probability density. A related distribution on the positive reals, which we will denote by Φ, is given by the moments

$\displaystyle {\mathbb E}[X^s]=\frac{1-2^{1-s}}{s-1}2\xi(s)$

(2)

which, again, is defined for all complex s.

As standard, complex powers of a positive real x are defined by x^s = e^slogx, so (1,2) are equivalent to the moment generating functions of logX, which uniquely determines the distributions. The probability densities and cumulative distribution functions can be given, although I will not do that here since they are already explicitly written out in the earlier post. I will write X ∼ Φ or X ∼ Ψ to mean that random variable X has the respective distribution. As we previously explained, these are closely connected:

If X ∼ Ψ and, independently, Y is uniform on [1, 2], then X/Y ∼ Φ.
If X, Y ∼ Φ are independent then √X² + Y² ∼ Ψ.

The purpose of this post is to describe some constructions involving Brownian bridges, excursions and meanders which naturally involve the Φ and Ψ distributions.

Theorem 1 The following have distribution Φ:

√2/πZ where Z = sup_t|B_t| is the absolute maximum of a standard Brownian bridge B.

Z/√2π where Z = sup_tB_t is the maximum of a Brownian meander B.

√2πZ where Z is the sample standard deviation of a Brownian bridge B,

$\displaystyle Z=\left(\int_0^1(B_t-\bar B)^2\,dt\right)^{\frac12}$

with sample mean B̅ = ∫₀¹B_t dt.

√π/2Z where Z is the pathwise Euclidean norm of a 2-dimensional Brownian bridge B = (B¹, B²),

$\displaystyle Z=\left(\int_0^1\lVert B_t\rVert^2\,dt\right)^{\frac12}$

√τπ/2 where τ = inf{t ≥ 0: ‖B_t‖= 1} is the first time at which the norm of a 3-dimensional standard Brownian motion B = (B¹, B², B³) hits 1.

The Kolmogorov distribution is, by definition, the absolute maximum of a Brownian bridge. So, the first statement of theorem 1 is saying that Φ is just the Kolmogorov distribution scaled by the constant factor √2/π. Moving on to Ψ;

Theorem 2 The following have distribution Ψ:

√2/πZ where Z = sup_tB_t – inf_tB_t is the range of a standard Brownian bridge B.

√2/πZ where Z = sup_tB_t is the maximum of a (normalized) Brownian excursion B.

√π/2Z where Z is the pathwise Euclidean norm of a 4-dimensional Brownian bridge B = (B¹, B², B³, B⁴),

$\displaystyle Z=\left(\int_0^1\lVert B_t\rVert^2\,dt\right)^{\frac12}.$

Continue reading “Brownian Motion and the Riemann Zeta Function” →

The Minimum and Maximum of Brownian motion

If X is standard Brownian motion, what is the distribution of its absolute maximum |X|_t^∗ = sup_s ≤ t|X_s| over a time interval [0, t]? Previously, I looked at how the reflection principle can be used to determine that the maximum X_t^∗ = sup_s ≤ tX_s has the same distribution as |X_t|. This is not the same thing as the maximum of the absolute value though, which is a more difficult quantity to describe. As a first step, |X|_t^∗ is clearly at least as large as X_t^∗ from which it follows that it stochastically dominates |X_t|.

I would like to go further and precisely describe the distribution of |X|_t^∗. What is the probability that it exceeds a fixed positive level a? For this to occur, the suprema of both X and –X must exceed a. Denoting the minimum and maximum by

$\displaystyle \begin{aligned} &X_t^m=\inf_{s\le t}X_s,\\ &X_t^M=\sup_{s\le t}X_s, \end{aligned}$

then |X|_t^∗ is the maximum of X_t^M and –X_t^m. I have switched notation a little here, and am using X^M to denote what was previously written as X^∗. This is just to use similar notation for both the minimum and maximum. Using inclusion-exclusion, the probability that the absolute maximum is greater than a level a is,

$\displaystyle \begin{aligned} {\mathbb P}(\lvert X\rvert_t^* > a)={} & {\mathbb P}(X_t^M > a)+{\mathbb P}(X_t^m < -a)\\ & -{\mathbb P}(X_t^M > a{\rm\ and\ }X_t^m < -a). \end{aligned}$

As X_t^M has the same distribution as |X_t| and, by symmetry, so does –X^m, we obtain

$\displaystyle {\mathbb P}(\lvert X\rvert_t^* > a)=4{\mathbb P}(X_t > a)-{\mathbb P}(X_t^M > a{\rm\ and\ }X_t^m < -a).$

This hasn’t really answered the question. All we have done is to re-express the probability in terms of both the minimum and maximum being beyond a level. For large values of a it does, however, give a good approximation. The probability of the Brownian motion reaching a large positive value a and then dropping to the large negative value –a will be vanishingly small, so the final term in the identity above can be neglected. This gives an asymptotic approximation as a tends to infinity,

$\displaystyle \begin{aligned} {\mathbb P}(\lvert X\rvert_t^* > a) &\sim 4{\mathbb P}(X_t > a)\\ &\sim\sqrt{\frac{8t}{\pi a^2}}e^{-\frac{a^2}{2t}}. \end{aligned}$

(1)

The last expression here is just using the fact that X_t is centered Gaussian with variance t and applying a standard approximation for the cumulative normal distribution function.

For small values of a, approximation (1) does not work well at all. We know that the left-hand-side should tend to 1, whereas 4ℙ(X_t > a) will tend to 2, and the final expression diverges. In fact, it can be shown that

$\displaystyle {\mathbb P}(\lvert X\rvert_t^* < a)\sim\frac{4}{\pi}e^{-\frac{t\pi^2}{8a^2}}$

(2)

as a → 0. I gave a direct proof in this math.stackexchange answer. In this post, I will look at how we can compute joint distributions of the minimum, maximum and terminal value of Brownian motion, from which limits such as (2) will follow. Continue reading “The Minimum and Maximum of Brownian motion” →

The Brownian Drawdown Process

The drawdown of a stochastic process is the amount that it has dropped since it last hit its maximum value so far. For process X with running maximum X^∗_t = sup_s ≤ tX_s, the drawdown is thus X^∗_t – X_t, which is a nonnegative process. This is as in figure 1 below.

Brownian motion drawdown — Figure 1: Brownian motion and its drawdown process

The previous post used the reflection principle to show that the maximum of a Brownian motion has the same distribution as its terminal absolute value. That is, X^∗_t and |X_t| are identically distributed.

For a process X started from zero, its maximum and drawdown can be written as X^∗_t – X₀ and X^∗_t – X_t. Reversing the process in time across the interval [0, t] will exchange these values. So, reversing in time and translating so that it still starts from zero will exchange the maximum value and the drawdown. Specifically, write

$\displaystyle Y_s = X_{t-s} - X_t$

for time index 0 ≤ s ≤ t. The maximum of Y is equal to the drawdown of X,

$\displaystyle Y^*_t = X^*_t-X_t.$

If X is standard Brownian motion then so is Y, since the independent normal increments property for Y follows from that of X. As already stated, the maximum Y^∗_t = X^∗_t – X_t has the same distribution as the absolute value |Y_t|= |X_t|. So, the drawdown has the same distribution as the absolute value at each time.

Lemma 1 If X is standard Brownian motion, then X^∗_t – X_t has the same distribution as |X_t| at each time t ≥ 0.

Continue reading “The Brownian Drawdown Process” →

The Maximum of Brownian Motion and the Reflection Principle

The distribution of a standard Brownian motion X at a positive time t is, by definition, centered normal with variance t. What can we say about its maximum value up until the time? This is X^∗_t = sup_s ≤ tX_s, and is clearly nonnegative and at least as big as X_t. To be more precise, consider the probability that the maximum is greater than a fixed positive value a. Such problems will be familiar to anyone who has looked at pricing of financial derivatives such as barrier options, where the payoff of a trade depends on whether the maximum or minimum of an asset price has crossed a specified barrier level.

This can be computed with the aid of a symmetry argument commonly referred to as the reflection principle. The idea is that, if we reflect the Brownian motion when it first hits a level, then the resulting process is also a Brownian motion. The first time at which X hits level a is τ = inf{t ≥ 0: X_t ≥ a}, which is a stopping time. Reflecting the process about this level at all times after τ gives a new process

Reflected Brownian motion — Figure 1: Reflecting Brownian motion when it hits level a.

Continue reading “The Maximum of Brownian Motion and the Reflection Principle” →

Brownian Meanders

Having previously looked at Brownian bridges and excursions, I now turn to a third kind of process which can be constructed either as a conditioned Brownian motion or by extracting a segment from Brownian motion sample paths. Specifically, the Brownian meander, which is a Brownian motion conditioned to be positive over a unit time interval. Since this requires conditioning on a zero probability event, care must be taken. Instead, it is cleaner to start with an alternative definition by appropriately scaling a segment of a Brownian motion.

For a fixed positive times T, consider the last time σ before T at which a Brownian motion X is equal to zero,

$\displaystyle \sigma=\sup\left\{t\le T\colon X_t=0\right\}.$

(1)

On interval [σ, T], the path of X will start from 0 and then be either strictly positive or strictly negative, and we may as well restrict to the positive case by taking absolute values. Scaling invariance says that c^-1/2X_ct is itself a standard Brownian motion for any positive constant c. So, scaling the path of X on [σ, 1] to the unit interval defines a process

$\displaystyle B_t=(T-\sigma)^{-1/2}\lvert X_{\sigma+t(T-\sigma)}\rvert.$

(2)

over 0 ≤ t ≤ 1; This starts from zero and is strictly positive at all other times.

Brownian meander construction — Figure 2: Constructing a Brownian meander

Scaling invariance shows that the law of the process B does not depend on the choice of fixed time T The only remaining ambiguity is in the choice of the fixed time T.

Lemma 1 The distribution of B defined by (2) does not depend on the choice of the time T > 0.

Proof: Consider any other fixed positive time T̃, and use the construction above with T̃, σ̃, B̃ in place of T, σ, B respectively. We need to show that B̃ and B have the same distribution. Using the scaling factor S = T̃/T, then X′_t = S^-1/2X_tS is a standard Brownian motion. Also, σ′= σ̃/S is the last time before T at which X′ is zero. So,

$\displaystyle \tilde B_t=(T-\sigma')^{-1/2}\lvert X'_{\sigma'+t(T-\sigma')}\rvert$

has the same distribution as B. ⬜

This leads to the definition used here for Brownian meanders.

Definition 2 A continuous process {B_t}_{t ∈ [0, 1]} is a Brownian meander if and only it has the same distribution as (2) for a standard Brownian motion X and fixed time T > 0.

In fact, there are various alternative — but equivalent — ways in which Brownian excursions can be defined and constructed.

As a scaled segment of a Brownian motion before a time T and after it last hits 0. This is definition 2.
As a Brownian motion conditioned on being positive. See theorem 4 below.
As a segment of a Brownian excursion. See lemma 5.
As the path of a standard Brownian motion starting from its minimum, in either the forwards or backwards direction. See theorem 6.
As a Markov process with specified transition probabilities. See theorem 9 below.
As a solution to an SDE. See theorem 12 below.

Continue reading “Brownian Meanders” →

Brownian Excursions

A normalized Brownian excursion is a nonnegative real-valued process with time ranging over the unit interval, and is equal to zero at the start and end time points. It can be constructed from a standard Brownian motion by conditioning on being nonnegative and equal to zero at the end time. We do have to be careful with this definition, since it involves conditioning on a zero probability event. Alternatively, as the name suggests, Brownian excursions can be understood as the excursions of a Brownian motion X away from zero. By continuity, the set of times at which X is nonzero will be open and, hence, can be written as the union of a collection of disjoint (and stochastic) intervals (σ, τ).

In fact, Brownian motion can be reconstructed by simply joining all of its excursions back together. These are independent processes and identically distributed up to scaling. Because of this, understanding the Brownian excursion process can be very useful in the study of Brownian motion. However, there will by infinitely many excursions over finite time periods, so the procedure of joining them together requires some work. This falls under the umbrella of ‘excursion theory’, which is outside the scope of the current post. Here, I will concentrate on the properties of individual excursions.

In order to select a single interval, start by fixing a time T > 0. As X_T is almost surely nonzero, T will be contained inside one such interval (σ, τ). Explicitly,

$\displaystyle \begin{aligned} &\sigma=\sup\left\{t\le T\colon X_t=0\right\},\\ &\tau=\inf\left\{t\ge T\colon X_t=0\right\}, \end{aligned}$

(1)

so that σ < T < τ < ∞ almost surely. The path of X across such an interval is t ↦ X_σ + t for time t in the range [0, τ - σ]. As it can be either nonnegative or nonpositive, we restrict to the nonnegative case by taking the absolute value. By invariance, S^-1/2X_tS is also a standard Brownian motion, for each fixed S > 0. Using a stochastic factor S = τ – σ, the width of the excursion is normalised to obtain a continuous process {B_t}_{t ∈ [0, 1]} given by

$\displaystyle B_t=(\tau-\sigma)^{-1/2}\lvert X_{\sigma+t(\tau-\sigma)}\rvert.$

(2)

By construction, this is strictly positive over 0 < t < 1 and equal to zero at the endpoints t ∈ {0, 1}.

Figure 2: Constructing a Brownian excursion

The only remaining ambiguity is in the choice of the fixed time T.

Lemma 1 The distribution of B defined by (2) does not depend on the choice of the time T > 0.

Proof: This follows from scaling invariance of Brownian motion. Consider any other fixed positive time T̃, and use the construction above with T̃, σ̃, τ̃, B̃ in place of T, σ, τ, B respectively. We need to show that B̃ and B have the same distribution. Using the scaling factor S = T̃/T, then X′_t = S^-1/2X_tS is a standard Brownian motion. Also, σ′= σ̃/S and τ′= τ̃/S are random times given in the same way as σ and τ, but with the Brownian motion X′ in place of X in (1). So,

$\displaystyle \tilde B_t=(\tau^\prime-\sigma^\prime)^{-1/2}\lvert X^\prime_{\sigma^\prime+t(\tau^\prime-\sigma^\prime)}\rvert$

has the same distribution as B. ⬜

This leads to the definition used here for Brownian excursions.

Definition 2 A continuous process {B_t}_{t ∈ [0, 1]} is a Brownian excursion if and only it has the same distribution as (2) for a standard Brownian motion X and time T > 0.

In fact, there are various alternative — but equivalent — ways in which Brownian excursions can be defined and constructed.

As a normalized excursion away from zero of a Brownian motion. This is definition 2.
As a normalized excursion away from zero of a Brownian bridge. This is theorem 6.
As a Brownian bridge conditioned on being nonnegative. See theorem 9 below.
As the sample path of a Brownian bridge, translated so that it has minimum value zero at time 0. This is a very interesting and useful method of directly computing excursion sample paths from those of a Brownian bridge. See theorem 12 below, sometimes known as the Vervaat transform.
As a Markov process with specified transition probabilities. See theorem 15 below.
As a transformation of Bessel process paths, see theorem 16 below.
As a Bessel bridge of order 3. This can be represented either as a Bessel process conditioned on hitting zero at time 1., or as the vector norm of a 3-dimensional Brownian bridge. See lemma 17 below.
As a solution to a stochastic differential equation. See theorem 18 below.

Continue reading “Brownian Excursions” →

Martingale Marginals

[This post originates from twitter threads posted on 9 Sep 21 and 2 Feb 22.]

A while ago, I proved the result: A continuous strong Markov martingale is uniquely determined by its marginal distributions. This is discussed in a recent paper by Beiglböck, Pammer, and Schachermayer.

link.springer.com
From Bachelier to Dupire via optimal transport

Time for a thread discussing some of these ideas, which have been studied for decades in both stochastic calculus and mathematical finance. Continue reading “Martingale Marginals” →

The Collatz Conjecture

[This post originates from a twitter thread on the Collatz conjecture.]

Time for some #RandomNumberTheory. The Collatz Conjecture is a famous unsolved mathematical problem which also goes by various other names, such as the ‘3n+1’ conjecture. Since it was introduced by Lothar Collatz in 1937, no-one has been able to prove it either true or false.

Continue reading “The Collatz Conjecture” →

Brownian Bridge Fourier Expansions

Figure 1: Sine series approximations to a Brownian bridge

Brownian bridges were described in a previous post, along with various different methods by which they can be constructed. Since a Brownian bridge on an interval ${[0,T]}$ is continuous and equal to zero at both endpoints, we can consider extending to the entire real line by partitioning the real numbers into intervals of length T and replicating the path of the process across each of these. This will result in continuous and periodic sample paths, suggesting another method of representing Brownian bridges. That is, by Fourier expansion. As we will see, the Fourier coefficients turn out to be independent normal random variables, giving a useful alternative method of constructing a Brownian bridge.

There are actually a couple of distinct Fourier expansions that can be used, which depends on precisely how we consider extending the sample paths to the real line. A particularly simple result is given by the sine series, which I describe first. This is shown for an example Brownian bridge sample path in figure 1 above, which plots the sequence of approximations formed by truncating the series after a small number of terms. This tends uniformly to the sample path, although it is quite slow to converge as should be expected when approximating such a rough path by smooth functions. Also plotted, is the series after the first 100 terms, by which time the approximation is quite close to the target. For simplicity, I only consider standard Brownian bridges, which are defined on the unit interval ${[0,1]}$ . This does not reduce the generality, since bridges on an interval ${[0,T]}$ can be expressed as scaled versions of standard Brownian bridges.

Theorem 1 A standard Brownian bridge B can be decomposed as

$\displaystyle B_t=\sum_{n=1}^\infty\frac{\sqrt2Z_n}{\pi n}\sin(\pi nt)$ (1)

over ${0\le t\le1}$ , where ${Z_1,Z_2,\ldots}$ is an IID sequence of standard normals. This series converges uniformly in t, both with probability one and in the ${L^p}$ norm for all ${1\le p < \infty}$ .

Continue reading “Brownian Bridge Fourier Expansions” →

Brownian Bridges

A Brownian bridge can be defined as standard Brownian motion conditioned on hitting zero at a fixed future time T, or as any continuous process with the same distribution as this. Rather than conditioning, a slightly easier approach is to subtract a linear term from the Brownian motion, chosen such that the resulting process hits zero at the time T. This is equivalent, but has the added benefit of being independent of the original Brownian motion at all later times.

Lemma 1 Let X be a standard Brownian motion and ${T > 0}$ be a fixed time. Then, the process

$\displaystyle B_t = X_t - \frac tTX_T$ (1)

over ${0\le t\le T}$ is independent from ${\{X_t\}_{t\ge T}}$ .

Proof: As the processes are joint normal, it is sufficient that there is zero covariance between them. So, for times ${s\le T\le t}$ , we just need to show that ${{\mathbb E}[B_sX_t]}$ is zero. Using the covariance structure ${{\mathbb E}[X_sX_t]=s\wedge t}$ we obtain,

$\displaystyle {\mathbb E}[B_sX_t]={\mathbb E}[X_sX_t]-\frac sT{\mathbb E}[X_TX_t]=s-\frac sTT=0$

as required. ⬜

This leads us to the definition of a Brownian bridge.

Definition 2 A continuous process ${\{B_t\}_{t\in[0,T]}}$ is a Brownian bridge on the interval ${[0,T]}$ if and only it has the same distribution as ${X_t-\frac tTX_T}$ for a standard Brownian motion X.

In case that ${T=1}$ , then B is called a standard Brownian bridge.

There are actually many different ways in which Brownian bridges can be defined, which all lead to the same result.

As a Brownian motion minus a linear term so that it hits zero at T. This is definition 2.
As a Brownian motion X scaled as ${tT^{-1/2}X_{T/t-1}}$ . See lemma 9 below.
As a joint normal process with prescribed covariances. See lemma 7 below.
As a Brownian motion conditioned on hitting zero at T. See lemma 14 below.
As a Brownian motion restricted to the times before it last hits zero before a fixed positive time T, and rescaled to fit a fixed time interval. See lemma 15 below.
As a Markov process. See lemma 13 below.
As a solution to a stochastic differential equation with drift term forcing it to hit zero at T. See lemma 18 below.

There are other constructions beyond these, such as in terms of limits of random walks, although I will not cover those in this post. Continue reading “Brownian Bridges” →

	Anonymous on Feller Processes
	SilverBladeII on Cadlag Modifications
	Anonymous on Spitzer’s Formula
	Anonymous on Spitzer’s Formula
	Anonymous on Brownian Bridges
	Anonymous on Spitzer’s Formula
	Anonymous on Spitzer’s Formula
	Anonymous on Spitzer’s Formula
	Anonymous on Spitzer’s Formula
	Anonymous on Spitzer’s Formula