Ito’s lemma is one of the most important and useful results in the theory of stochastic calculus. This is a stochastic generalization of the chain rule, or change of variables formula, and differs from the classical deterministic formulas by the presence of a quadratic variation term. One drawback which can limit the applicability of Ito’s lemma in some situations, is that it only applies for twice continuously differentiable functions. However, the quadratic variation term can alternatively be expressed using local times, which relaxes the differentiability requirement. This generalization of Ito’s lemma was derived by Tanaka and Meyer, and applies to one dimensional semimartingales.
The local time of a stochastic process X at a fixed level x can be written, very informally, as an integral of a Dirac delta function with respect to the continuous part of the quadratic variation ,
(1) |
This was explained in an earlier post. As the Dirac delta is only a distribution, and not a true function, equation (1) is not really a well-defined mathematical expression. However, as we saw, with some manipulation a valid expression can be obtained which defines the local time whenever X is a semimartingale.
Going in a slightly different direction, we can try multiplying (1) by a bounded measurable function and integrating over x. Commuting the order of integration on the right hand side, and applying the defining property of the delta function, that
is equal to
, gives
(2) |
By eliminating the delta function, the right hand side has been transformed into a well-defined expression. In fact, it is now the left side of the identity that is a problem, since the local time was only defined up to probability one at each level x. Ignoring this issue for the moment, recall the version of Ito’s lemma for general non-continuous semimartingales,
(3) |
where . Equation (2) allows us to express this quadratic variation term using local times,
The benefit of this form is that, even though it still uses the second derivative of , it is only really necessary for this to exist in a weaker, measure theoretic, sense. Suppose that
is convex, or a linear combination of convex functions. Then, its right-hand derivative
exists, and is itself of locally finite variation. Hence, the Stieltjes integral
exists. The infinitesimal
is alternatively written
and, in the twice continuously differentiable case, equals
. Then,
(4) |
Using this expression in (3) gives the Ito-Tanaka-Meyer formula.
The derivation above is clearly far from being rigorous. For one thing, we started with the informal identity (1), which did not even have a well-defined meaning. For another, the local time is only defined up to almost sure equivalence. That is, only up to probability one. However, (4) involves the value simultaneously at all real x. The arbitrary choice of
on an uncountable collection of zero probability events, one for each x, could affect the value of the integral. So, we still do not have a well-defined expression. In fact, it is not even clear if
is measurable in x. This is the old problem of choosing good versions of stochastic processes except, now, we are concerned with the path as the level x varies, rather than the time index t.
Before giving a rigorous statement of the Ito-Tanaka-Meyer formula, we first need a jointly measurable version of the local times. As usual, we work with respect to a filtered probability space .
Lemma 1 Let X be a semimartingale and
. Then, the local times
have a version which is jointly measurable. That is,
is continuous and increasing in t and
-measurable.
So long as a jointly measurable version of the local times is chosen, the Ito-Tanaka-Meyer formula holds.
Theorem 2 (Ito-Tanaka-Meyer) Let X be a semimartingale,
be convex, and
be a jointly measurable version of the local times. Then,
(5) almost surely, for each
.
This result clearly extends to any which is a linear combination of convex functions. In the earlier post defining local times, we already showed that the final summation in (5) is almost surely finite and, hence, the integral
is also finite. In fact, it is equal to the term A in lemma 5 of that post. We can always choose a version of
equal to zero over
. It is also usually possible to choose
to be almost surely bounded as x varies, which would explain why the integral is finite, but this is not always possible.
The proofs of lemma 1 and theorem 2 will be given further down. For now, we look at some immediate consequences, starting with the following more rigorous version of identity (2). Note that this was used in the informal derivation of the Ito-Tanaka-Meyer formula above. The more rigorous argument is in the opposite direction, using theorem 2 to prove (2).
Theorem 3 Let X be a semimartingale and
be a jointly measurable version of the local times. Then,
(6) almost surely, for all
and measurable
which is either nonnegative or bounded.
Proof: Implicit in equation (6) is the statement that is almost surely Lebesgue integrable w.r.t. x, when
is bounded. This is equivalent to
being almost surely finite, which is implied by (5) with
.
First, consider nonnegative continuous . Then, we can define convex and twice continuously differentiable
with
. For example, take
. Comparing Ito’s formula (3) with (5) immediately gives
almost surely.
We have shown that (6) holds for all nonnegative and continuous . Furthermore, if
is a sequence of bounded nonnegative measurable functions satisfying (6) and, if
increases to a limit
then, by monotone convergence,
also satisfies (6). So, the result follows from the monotone class theorem. ⬜
Applying theorem 3 for the special case expresses the continuous quadratic variation as an integral over the local times.
Corollary 4 Let X be a semimartingale and
be a jointly measurable version of the local times. Then,
almost surely, for each
.
Proof Of The Ito-Tanaka-Meyer Formula
The proof of the Ito-Tanaka-Meyer formula is not really difficult. In fact, for the most part, it is straightforward. However, there are some technical obstacles to be overcome, so I will give a brief outline of the idea before diving into the details.
Recall the definition of local times,
(7) |
The terms in this expression correspond one-to-one with the terms in the target equality (5).
Let be a convex function. For the moment, I assume that it has a bounded derivative and that
as x tends to minus infinity, which simplifies things a bit, and will generalize to arbitrary convex functions afterwards. Then, integrate both sides of (7) w.r.t.
. We will show that each of the terms of (7) has a jointly measurable version, which is integrable over x, and the integral is equal to the corresponding term of (5), which will imply that (7) holds for the given choice of
.
The left hand side and the first term on the right are jointly measurable and the integral is easily evaluated,
The stochastic Fubini theorem can be applied to the second term on the right. This states that it has a jointly measurable version, which is cadlag in t and integrable over x, and we can commute the order of integration,
This equals the second term on the right of (5).
The third term on the right hand side of (7) is just the local time. It will automatically have a version which is cadlag in t, jointly measurable, and integrable over x, so long as each of the other terms do, by linearity. Then, the third term on the right of (5) is explicitly equal to the integral of this over x, so we have nothing to prove here.
Now, look at the final term of (7), which accounts for the jumps of X. As it stands, the sum is over the uncountable set of times . To fix this issue, choose a countable sequence,
, of stopping times with disjoint graphs, whose union almost surely contain the jump times of X with probability one. That this is always possible was shown was shown earlier in my notes. Then, the term can be rewritten as
for all x, almost surely. The summand here is nonnegative, so we can integrate with respect to and apply the standard Fubini theorem to obtain,
almost surely. This completes the proof of theorem 2 for the current choice of .
Note that if the function is linear, of the form for constants a and b, then (5) is straightforward. As
it reduces to
which is immediate.
Now consider the case where for
, for some constant K. As it is linear over
, we can write
for constants a and b, and convex function which is equal to zero over
. By the argument above, (5) holds with
replaced by either
or
and, by linearity, it holds for
.
Finally, consider arbitrary convex . Fixing positive real
, define the function
which is convex and satisfies for
and
otherwise. So, by the argument above, (5) holds for
replaced by
. Also, on the event that
, each of the terms of (2) is unchanged under replacing
by
. Hence, (5) holds on this event. Letting K increase to infinity completes the proof of theorem 2.
The argument above not only proved the Ito-Tanaka-Meyer formula, it also established the existence of local times which are jointly measurable as stated in lemma 1, and cadlag in t. To complete the proof of lemma 1, it only remains to show that the local times can simultaneously be chosen to be continuous and increasing in t. Consider the set,
For general processes, this need not be measurable. In our case, we already know that is cadlag, implying that to be continuous and increasing in t is equivalent to it being locally uniformly continuous and increasing for t restricted to the rational numbers. As the rationals are countable, this gives an
-measurable set. So,
gives a jointly measurable version of the local times which is also continuous and increasing in t.