We continue the investigation of representing probability spaces as states on algebras. Whereas, previously, I focused attention on the commutative case and on classical probabilities, in the current post I will look at non-commutative quantum probability.
Quantum theory is concerned with computing probabilities of outcomes of measurements of a physical system, as conducted by an observer. The standard approach is to start with a Hilbert space , which is used to represent the states of the system. This is a vector space over the complex numbers, together with an inner product
. By definition, this is linear in one argument and anti-linear in the other,
for and
. Positive definiteness is required, so that
for
. I am using the physicists’ convention, where the inner product is linear in the second argument and anti-linear in the first. Furthermore, physicists often use the bra–ket notation
, which can be split up into the `bra’
and `ket’
considered as elements of the dual space of
and of
respectively. For a linear operator
, the expression
is often expressed as
in the physicists’ language. By the Hilbert space definition,
is complete with respect to the norm
.
The space of bounded linear operators will be denoted by
, with the adjoint of
denoted by
, so that
This makes into a *-algebra. Quantum mechanics involves states, observables, and transformations in the following way. I use
to denote the unit circle
.
- A (pure) state is represented by a vector
normalised so that
. Any other vector
with
represents the same physical state. If
are states then, if a system is in pure state
, the probability of it being in state
is
. I will also use non-normalised states in what follows, for which
need not equal 1, with the understanding that it should be normalised to
before performing calculations.
- A measurement may only be able to determine whether the system is in a closed subspace
. If the system is in state
, then the probability of being found in the given subspace of states is
, where
represents orthogonal projection onto the subspace. If the measurement is `perfect’, so that it does not interfere with the system in any other way, then a positive measurement result leaves the system in the state
.
- The complete set of possible distinct results of a measurement can be represented by a set of pairwise orthogonal and closed subspaces
such that
is dense in
. Letting
represent orthogonal projection onto subspace
, then orthogonality is equivalent to
for
. Completeness of the set of outcomes is equivalent to
. If the system is in pure state
then, the probability of the measurement giving outcome
is
, after which it will be in state
.
- Observables are represented by self-adjoint operators. I consider bounded observables, which are represented by operators
satisfying self-adjointness
. Many of the observables encountered in practice, such as position, momentum, energy, etc., are unbounded. For simplicity, I ignore this fact and will just consider the bounded case. Suppose that an observable
has a complete set of (normalised) eigenvectors
, with distinct eigenvalues
. As
is self-adjoint, the eigenvalues are real, and the eigenvectors will be orthogonal. A measurement of
determines which of the eigenstates
the system is in, and determines the value of
as the associated eigenvalue. If the system starts in state
, then
has probability
of having value
. Hence, the expected value is,
(1) This can be generalised to the degenerate case where the eigenvalues are not distinct by, instead, considering the orthogonal projections
onto the eigenspaces, so that
. Further generalisation to continuous spectra can be achieved using the spectral theorem, although (1) still holds for the expectation.
Note that orthogonal projection onto a closed subspace is a special type of observable which takes only the values 0 and 1, in which case (1) gives the probability of the system being measured to be in the subspace.
- Transformations and symmetries of the state space are represented by unitary operators
. By definition, these satisfy
and take a state
to the new state
. Note that, if
, then
represents the same transformation as
. Examples include space translations, rotations, and time evolution. Continuous transformation groups
,
, satisfying
are associated with observables by
. For example, space translations correspond to momentum, rotations correspond to angular momentum, and time evolution corresponds to the total energy of the system. Some discrete transformations, such as time reversal, are represented by anti-unitary operators.
- Given two physical systems represented by Hilbert spaces
and
, the composite system is represented by the tensor product
. An operator
on the space
is represented, in the composite system by, by
. Similarly, an operator
on
is represented by
in the composite system.
Of course, for a full physical model of a system, we would need to relate actual physical measurements to specific observable operators, and describe the time evolution of the system. That is not the aim of this post, however.
The existence of states such that
means that probabilities are unavoidable in quantum theory. If a system is prepared in the pure state
, then measured to determine if it is in state
, an affirmative result occurs with probability
. The non-squared quantity
is called a probability amplitude. This is not a physically observable quantity, although it is used to explain phenomena such as interference observed in the double-slit experiment, as probability amplitudes can add constructively or cancel due to phase differences.
I also note that, as with classical probability theory, states on quantum systems are subjective. Observers may correctly assign different states to the same system, due to differing levels of knowledge. Whenever a measurement is made, an observer will update their state in accordance with the rules above, whereas a separate observer, who is not aware of the measurement result, will not update their state in the same way.
Mixed States
We can combine classical probability with pure quantum states, to obtain what are known as mixed states. Given a probability measure on the set of states
with
, we consider the mixture of quantum and classical probability, where
gives the probability of the physical system being in any measurable set of pure states. If a specific outcome of an experiment on a system in a pure state
occurs with probability
, then its probability of occurring in the mixed state is
. For a quantum observable
, its expected value in a pure state is given by (1). Hence, in the mixed state described by
, its expected value is
(2) |
This expectation operator, , is a linear map which is
- self-adjoint:
.
- positive:
.
- normalized:
.
In our language established in the previous post, is a state on the *-algebra
. A vitally important fact about mixed states is that all physically observable consequences are incorporated in the expectation operator. Any property of the state that cannot be expressed in terms of
is not physical. I explain below how the effects of measurements and transformations on a mixed state can be described using the expectation operator alone.
As outlined above, an outcome of a measurement is represented by an orthogonal projection . The probability of the outcome represented by
is given, in the mixed state, by
. Then, assuming that the measurement is `perfect’, if it gives an affirmative result, the system will be described by a new mixed state
given as,
(3) |
Note that this bears a significant similarity to the formula for classical conditional probabilities. The set of possible distinct results of a measurement is represented by a set of orthogonal projections satisfying . This ensures that
, which is clearly necessary to interpret
as probabilities.
Transformations of the system were described above by the action of unitary operators on pure states. However, the action associated to a unitary operator can be expressed just as easily on a mixed state, and takes
to a new expectation operator
,
Anti-unitary operators on the state can also be described simply in terms of their effect on the expectation operator , and are of the form
for anti-unitary .
I now note that certain mixed states with distinctly different descriptions can have the same expectation operator so, in fact, represent the same physical state. Consider a Hilbert space with finite dimension and let
and
be two orthonormal bases. Also, let
be the uniform distribution on the space of unit vectors in
. We have three distinct mixed states, (i) each pure state
occurs with probability
, (ii) each pure state
occurs with probability
and, (iii) the distribution of pure states is given by
. The fact that the trace of a square matrix is independent of the basis chosen shows that these mixed states all have the same expectation operator and, hence, represent equivalent descriptions of the system,
This demonstrates that, once we mix classical probabilities with quantum states, they can never be disentangled. The classical probability distribution is not something that can be observed.
An alternative method of describing mixed states is with trace class operators. For a classical probability measure on the space of pure states, we can form the self-adjoint and nonnegative operator,
I am using the physicsts’ bra–ket notation. Alternatively, maps each vector
to
This fully describes the physical state, since the expectation operator (2) can be written as
In particular, using gives
. Conversely, any nonnegative
with unit trace describes a mixed state. This is because
has a complete set of orthonormal eigenvectors
, with corresponding eigenvalues
satisfying
. Then,
so represents a system with probability
of being in the pure state
.
I now consider how mixed states can arise. First, they can occur in the same way that probabilities enter into classical physics — the probability measure describes the ignorance that we have regarding the `true’ state of the system. Given a set
of pure states of the system, we do not know whether the state of the system in question lies in
, so only assign a probability
. However, there are other ways that mixed states can occur due to the inherently probabilistic nature of quantum theory.
Consider a system in pure state , and suppose that we perform a measurement whose possible outcomes are represented by orthogonal projections
. A measurement result of `n’ occurs with probability
. This outcome leaves the system in the pure state
. Suppose that we perform the measurement but have not, as yet, observed the result. Then, the system will be in a mixed state, where the pure state
occurs with probability
. More generally, if prior to the measurement, the system is described by an expectation operator
then, after the measurement but before observing the result, the system will be described by the new expectation
(4) |
Another method by which mixed states arise, is due to interactions with an external physical system. Suppose that the system is represented with a Hilbert space , and the external system is described by
. The composite system is represented by the tensor product
. An observable
is described by
in the composite system. For a state of the composite system of the form
with
, the expectation is
which is just the same as for the pure state . However, more general pure states for the combined system are of the form
, and it is always possible to choose
to be an orthonormal basis for
. Normalisation of the state means that
. Then, the expectation is
This describes a mixed state for which the system is in each of the pure states with probability
. Any such state, in which more than one of the
is nonzero, is called entangled. Entangled states can arise from non-entangled ones of the form
due to unitary evolution in the presence of interactions between the systems. This process, by which pure states evolve to mixed states by interaction with the external environment is described by quantum decoherence.
The Algebra Of Observables
As described above, a quantum system is represented by a Hilbert space . A state on this system, whether pure or mixed, is described by a linear map
which is self-adjoint, positive and normalized so that
. These properties are expressed using the property of
as a *-algebra, and does not explicitly require that it is a set of bounded operators on
. More generally, the observables could generate a strict sub-algebra
, and the state can be regarded as a map
.
A set of (self-adjoint) observables are simultaneously measurable if they commute,
. In the Hilbert space representation, this means that they can be simultaneously diagonalised (at least, if they have a discrete spectrum). Hence, it is possible in theory to simultaneously measure their values. As we have previously seen, commutativity is also sufficient to imply a uniquely joint probability distribution for
. That is, there is a unique probability measure
on
satisfying
(5) |
for all polynomials . On the other hand, if the observables do not commute, then the left hand side of (5) is not even defined, as polynomials can only be evaluated at commuting values of the arguments
. In fact, there may not even be any probability measure on
which is consistent with the distributions of the commuting subsets of
, as shown by Bell’s inequality.
Any single possible outcome of a measurement on a quantum system is described by an orthogonal projection . This can also be described using the *-algebra properties: an element
is an orthogonal projection if and only if
or, equivalently, if
is self-adjoint and
. I will denote the collection of orthogonal (or, self-adjoint) projections in
by
. Two projections,
are mutually orthogonal iff
. Furthermore, we can define a partial order on
by
iff
or, equivalently, if
. Roughly speaking, we can think of projections as akin to events, or measurable sets, in classical probability theory, and
is the probability of the event. Two events
are simultaneously measurable if they commute, with
and
representing, respectively, their intersection and union. The complement of an event
is
, and the inequality
represents containment of
in
.
Given two systems represented by Hilbert spaces and
, the composite system is represented by their tensor product
. For operators
and
, we form the operator
on
by
More generally, we can form operators in by taking finite sums
of
and
. For finite dimensional Hilbert spaces, all operators in
can be expressed in this form, so that
is the tensor product. More generally, will be the closure of the tensor product in the weak (or strong) operator topology. This suggests that, if we represent the two systems by *-algebras
and
, then the composite system is represented by their tensor product
, or some completion of this. There are natural homomorphisms
The images of any element of and any element of
commute, showing that
is a kind of `commutative product’ of the two algebras. Furthermore, for any states
and
, there is a unique state
satisfying
This is a kind of independent product of the states, similar to the product measure in classical probability. Entangled states on are precisely those which cannot be expressed in this form.
The use of the tensor product to represent the composite of quantum subsystems does have a very important physical consequence. Suppose that we have two systems represented by *-algebras and
. I will identify each of these with its image in the *-algebra
representing the composite system. Suppose, also, that we have a state
, and two observers, one for each of the two subsystems, and that the second observer makes a measurement on the second system. The possible measurement outcomes are described by a sequence of mutually orthogonal projections
satisfying
. If the outcome is `n’, then this observer updates her state to be
, in accordance with (3). Now, the two subsystems need not be physically interacting in any way, and could well be at completely different places in the the universe. The first observer may not be in any communication with the second one, or be observing the second subsystem at all. We can compute the impact of the measurement on the state of the first subsystem, as seen by the first observer. This is updated according to (4). Making use of the fact that
and
commute with each other, the updated state for any
is,
The measurement has no effect on the first system, which is as it should be. The alternative would inevitably lead to contradictions, or to systems being continually impacted by some spooky action at a distance due to measurements or transformations on other systems with which they may have become entangled at some point in the past.