Probability and measure theory relies on the concept of measurable sets. On the real numbers ℝ, in particular, there are several different sigma-algebras which are commonly used, and a set is said to be measurable if it lies in the one under consideration. Probabilities and measures are only defined for events lying in a specific sigma-algebra, so it is essential to know if sets are measurable. Fortunately, most simply constructed events will indeed be measurable, but this is not always the case. In fact, once we start working with more complex setups, such as continuous-time stochastic processes observed at random times, non-measurable sets occur more commonly than might be expected. To avoid such issues, it is usual to enlarge the underlying sigma-algebra defining a probability space as much as possible.
The Borel sets form the smallest sigma-algebra containing the open sets or, equivalently, containing all intervals. This is denoted as , which I will also shorten to
. An explicit construction of a non-Borel set was given by Lusin in 1927. Every irrational real number can be expressed uniquely as a continued fraction
where a0 is an integer and ai are positive integers for i ≥ 1. Lusin considered the set of irrationals whose continued fraction coefficients contain a subsequence ak1, ak2, … such that each term is a divisor of the subsequent term.
Other examples can be given along similar lines to Lusin’s. Every real number has a binary expansion
where a0 is an integer and ai is in {0, 1} for each i ≥ 1. Consider the set of reals having a binary expansion for which there is an infinite sequence of positive integers k1, k2, …, each term strictly dividing the next, such that aki = 1. I will give proofs that these examples are non-Borel in this post.
There is a general method of enlarging sigma-algebras known as the completion. Consider a measure μ defined on a measurable space consisting of sigma-algebra
on set X. The completion
consists of all subsets S ⊆ X which can be sandwiched between sets in
whose difference has zero measure. That is, A ⊆ S ⊆ B for
with μ(B ∖ A) = 0. It can be shown that
is a sigma-algebra containing
, and μ uniquely extends to a measure on this by taking μ(S) = μ(A) = μ(B) for S, A, B as above.
The Lebesgue measure λ is uniquely defined on the Borel sets by specifiying its value on intervals as λ((a, b)) = b – a. The completion is the Lebesgue sigma-algebra, which I will denote by
. Usually, when saying that a subset of the reals is measurable without further qualification, it is understood to mean that it is in
. The non-Borel set constructed by Lusin can be shown to be measurable (in fact, its complement has zero Lebesgue measure).
While the Lebesgue measure extends uniquely to , this is not true for for all measures defined on the Borel sigma-algebra. In particular, it will not be true for singular measures, which assign a positive value to some Lebesgue-null sets. An example is the uniform probability measure (or, Haar measure) on the Cantor middle-thirds set C. This has zero Lebesgue measure, so every subset of C is in
, but the uniform measure on C cannot be extended uniquely to all subsets. For this reason, universal completions are often used. For a measurable space
, the universal completion
consists of the subsets of X which lie in the completion of
with respect to every possible sigma-finite measure.
The intersection is taken over all sigma-finite measures μ on . It is enough to take the intersection over finite or, even, probability measures, since every sigma-finite measure is equivalent to such. The universal completion is a bit tricky to understand in an explicit way but, by construction, all sigma-finite measures on a sigma-algebra extend uniquely to its universal completion. It can be shown that Lusin’s example of a non-Borel set does lie in the universal completion
.
Finally, the power set consisting of all subsets of a set X is a sigma-algebra. For uncountable sets such as the reals, this is often too large to be of use and measures cannot be extended in any unique way. However, we have four common sigma-algebras of the real numbers,
| (1) |
In this post, I show that each of these inclusions is strict. That is, there are subsets of the reals which are not Lebesgue measurable, there are Lebesgue sets which are not in the universal completion , and there are sets in
which are not Borel. Lusin’s construction is an example of the latter. The strictness of the other two inclusions does depend crucially on the axiom of choice so, unlike Lusin’s set, the examples demonstrating that these are strict are not explicit. Continue reading “Non-Measurable Sets”
