Terse Postulative Introduction to Quantum Mechanics

Written 15 Oct 2022 by Michael Huang



This is essentially a brief postulative reformulation of the theoretical content of quantum mechanics; it covers most of the theory in Griffith’s Chapter 1 to 4.

Content

  1. Hilbert Space
  2. Generalized Statistics Interpretation
  3. Schrodinger’s Equation
  4. How to apply Quantum Mechanics

Hilbert Space

Vectors can be infinite dimensional

Vectors are usually represented by a finite dimensional tuple (a1,a2,a3) (a_1, a_2, a_3) . It is more general to represent a vector with an integer valued function ff such that f(1)=a1f(1)=a_1, f(2)=a2f(2)=a_2, f(3)=a3f(3)=a_3. In other words, the function ff takes as input an index and output the value corresponding to that index for the vector.

Thus, vectors can be generalized to infinite dimensions (a1,a2,a3,) (a_1,a_2,a_3,\cdots) by extending the domain of ff to all natural numbers N \mathbb{N}. We can further generalize infinite dimensional vectors by making the number of dimensions uncountable–we can allow an index to the vector to be any real number (or even complex number). Thus, any function ff could be perceived as a representation of a vector whereby the value corresponding to the xthx^\text{th} index is f(x)f(x) . We will henceforth use f |f\rangle to denote a vector that is represented by the function ff.

Inner product

The inner product of two countable vectors a |a\rangle, b |b\rangle is represented by,

ab=iaibi.\langle a | b\rangle = \sum_i a_i^\star b_i.

This weird ab \langle a | b\rangle notation is called the bracket notation. It should be conceived of as an operator a \langle a| operating upon a vector b |b\rangle whereby the operation is performing inner product between a |a\rangle and b |b\rangle.

For uncountable vectors f |f\rangle and g |g\rangle, the inner product is defined by,

fg=f(x)g(x)dx.\langle f | g\rangle = \int f(x)^\star g(x) dx.

The norm of a vector f |f\rangle is ff \langle f | f\rangle. Notice that the norm is always a positive number.

A discrete set of vectors a1,a2,,an {|a_1\rangle,|a_2\rangle,\cdots,|a_n\rangle} is orthonormal if aiaj=δij \langle a_i | a_j\rangle = \delta_{ij}, where δij \delta_{ij} is a shorthand notation that basically means the value is 11 if i=ji=j, otherwise the value is 00.

A continuous set of vectors xxR {|x\rangle \mid x\in \mathbb{R}} is orthonormal if xx=δ(xx) \langle x’ | x\rangle = \delta(x-x’) , where δ \delta is the Dirac delta function. Basically, δ(xx)=0 \delta(x-x’)=0 if xxx\not=x’ and satisfies the property that xδ(xx)dx=1 \int_x \delta(x-x’) dx = 1 (when x=xx=x’, δ(xx) \delta(x-x’) is technically infinite/undefined).

Hilbert space definition

In physics, Hilbert space is an infinite dimensional vectorspace over the scalar field of complex numbers. If that sentence confused you, I recommend that you take a look at linear algebra. Hilbert space further satisfies the property that the norm of any vector is finite.

An infinite dimensional vectorspace requires an infinite set of basis. Let xxR {|x\rangle \mid x\in \mathbb{R}} be a set of orthnormal basis. To clarify, there’s a unique vector x |x\rangle corresponding to each real scalar xx. Any vector ψ |\psi \rangle can be represented as a linear combination of x |x\rangle. Clearly, xψ \langle x| \psi\rangle picks out the component of x |x\rangle in ψ |\psi \rangle. Therefore, we can rewrite,

ψ=dxxxψ.|\psi \rangle = \int dx |x\rangle \langle x| \psi \rangle.

For this reason, dxxx \int dx |x\rangle \langle x| as a whole is called a unit operator.

Operator, eigenstates, and eigenvalues

It may appear that all this discussion of vectors in Hilbert space is irrelevant to physics. The reason we spent some time treating the subject of a Hilbert space is because the physical state of particle can be represented as a vector in the Hilbert space. We will use S |S\rangle to denote a Hilbert space vector that represents the physical state of a particle.

To obtain some observables (position, momentum, energy, etc) from this physical state, we need to operate on the state S |S\rangle using linear operators. Each physical observable corresponds to a linear operator.

The position operator is denoted by x^ \hat{x}, the momentum p^ \hat{p}, and the Hamiltonian/energy H^ \hat{H}. So what does the operator do? Can we write it out? Nope. Operator is an abstract thing like a vector. It is possible to write out the representation of a vector on a basis, but the vector is the vector itself. Similarly, it is possible to explain what an operator do to the different components of a representation of a vector in a certain basis, it is not possible to abstractly define what an operator does.

These operators all have eigenstates and corresponding eigenvalues. The eigenstates of x^ \hat{x} are defined to be the set of vectors xxR {|x\rangle \mid x\in \mathbb{R}} and the corresponding eigenvalue for an eigenstate x |x\rangle is the scalar xx. The eigenstates of x^ \hat{x} form an orthonormal basis.

Similarly, the eigenstates of p^ \hat{p} are defined to be the set ppR {|p\rangle \mid p \in \mathbb{R}} where each eigenstate p |p\rangle has an eigenvalue pp.

For any operator that corresponds to a physical observable, it turns out that its eigenstates necessarily form an orthonormal basis of the Hilbert space. This interesting result is because of the fact that operators that correspond to physical observables are Hermitian, which you may take as a postulate.

So, at this point, we have,

S=dxxΨ(x)=dppΦ(p)|S\rangle = \int dx |x\rangle \Psi(x) \\ = \int dp |p\rangle \Phi(p)

Reversedly,

Ψ(x)=xSΦ(x)=pS\Psi(x) = \langle x|S\rangle \\ \Phi(x) = \langle p|S\rangle

Relationship between position and momentum operator

These variables are what represent the vector S |S\rangle in different basis. Our next enterprise is to relate them to each other by relating the basis x |x\rangle, p |p\rangle. The relation between the operators and basis is significant because it turns out that the law of quantum physics (Schrodinger’s Equation) is written in terms of the Hamiltonian operator. Thus, an operator is quite useless if we cannot relate it to the Hamiltonian operator. Further, it turns out that we need to first establish a connection between x^ \hat{x} and p^ \hat{p} in order to properly introduce the Hamiltonian operator.

To relate the operators, we need two more postulates. First, we take as postulate the position momentum operator relation,

xp^ψ=ixxψ.\langle x|\hat{p}|\psi \rangle = -i\hbar \frac{\partial}{\partial x} \langle x|\psi\rangle.

It should be noted that most people would start with the postulate xppx=i \langle x|p\rangle - \langle p | x\rangle = i\hbar, but we will not since Griffith doesn’t.

Let me explain this postulate more intuitively. It is essentially saying that the p^ \hat{p} operator, when projected along x \langle x|, is essentially taking the derivative of the x |x\rangle component of ψ |\psi\rangle.

xp^ψ=ixxψxp^p=ixxppxp=ixxp.\langle x|\hat{p}|\psi \rangle = -i\hbar \frac{\partial}{\partial x} \langle x|\psi\rangle \\ \langle x|\hat{p}|p \rangle = -i\hbar \frac{\partial}{\partial x} \langle x|p\rangle \\ p \langle x|p \rangle = -i\hbar \frac{\partial}{\partial x} \langle x|p\rangle.

This can be solved as a differential equation, viewing xp \langle x|p\rangle as a function of xx, giving,

xp=Aeipx/.\langle x | p\rangle = Ae^{ipx/\hbar}.

p |p\rangle is orthonormal. This fact may help us determine AA.

pp=(dxxxp)(dxxxp)=(dxxxp)(dxxxp)=dxdxxxxpxp=dxdxδ(xx)xpxp=dxxpxp=A2dxxeix(pp)/=A22πδ(pp).\langle p|p'\rangle = \left( \int dx |x\rangle \langle x| p\rangle \right)^\dagger \left( \int dx |x\rangle \langle x|p'\rangle \right) \\ = \left( \int dx |x\rangle^\dagger \langle x| p\rangle^\star \right) \left( \int dx |x\rangle \langle x|p'\rangle \right) \\ = \iint dx dx' \langle x | x'\rangle \langle x| p\rangle^\star \langle x'|p'\rangle \\ = \iint dx dx' \delta(x-x') \langle x| p\rangle^\star \langle x'|p'\rangle \\ = \int dx \langle x|p\rangle^\star \langle x|p'\rangle \\ = |A|^2\int dx \langle x e^{ix (p'-p)/\hbar } \\ = |A|^2 2\pi \hbar \delta(p-p').

The last step is due to the fourier decomposition of δ \delta function. Thus, we may let A=12πA = \frac{1}{\sqrt{2\pi \hbar}}.

We have therefore derived that,

xp=12πeipx/ and px=12πeipx/.\langle x | p \rangle = \frac{1}{\sqrt{2\pi \hbar}}e^{ipx/\hbar} \text{ and }\\ \langle p | x \rangle = \frac{1}{\sqrt{2\pi \hbar}}e^{-ipx/\hbar}.

We may henceforth take these as fundamental theorems.

Now, given the relation between these two basis, we may represent one basis in terms of another, which will aid us in transforming between basis.

x=dppxp\|x \rangle = \int dp \langle p | x \rangle | p \rangle \\

A vector S |S\rangle may thus be represented in two manners,

S=dppΦ(p)=dxxΨ(x)=dxdppxpΨ(x)=(dppdxpxΨ(x)).|S\rangle = \int dp |p \rangle \Phi(p) = \int dx |x \rangle \Psi(x)\\ = \iint dx dp \langle p | x \rangle | p \rangle \Psi(x)\\ = \int \left(dp |p \rangle \int dx \langle p | x \rangle \Psi(x) \right). \\

Thus, the equivalence of,

Φ(p)=dxpxΨ(x)=dx12πeipx/Ψ(x)\Phi(p) = \int dx \langle p | x \rangle \Psi(x) \\ = \int dx \frac{1}{\sqrt{2\pi \hbar}} e^{-ipx /\hbar} \Psi(x)

can be established in an intuitive fashion. The symmetric reverse is,

Ψ(x)=dpxpΦ(p)\Psi(x) = \int dp \langle x | p \rangle \Phi(p)

We have now derived the relation between x {|x\rangle} and p {|p\rangle}. We also obtained from that the relation between Ψ \Psi and Φ \Phi, the two representations of a wavefunction S |S\rangle in the position basis and the momentum basis. Now, time is ripe to introduce the Hamiltonian operator.

Other operators

Notice that any classical observable QQ can be represented as a combination of xx and pp via a function Q(x,p)Q(x,p) . We take as the third postulate that the operator corresponding to QQ results from the same combination of x^ \hat{x} and p^ \hat{p}. In other words, Q^=Q(x^,p^) \hat{Q} = Q(\hat{x}, \hat{p}) . Thus, following this postulate, the Hamiltonian operator is defined as,

H^=p^22m+V(x).\hat{H} = \frac{\hat{p}^2}{2m} + V(x).

Hamiltonian operator is most commonly expressed in the x {|x\rangle} basis, so let’s do that,

xH^ψ=12mxp^2ψ+V(x)xψ=(22m2x2+V)xψ\langle x| \hat{H} |\psi \rangle = \frac{1}{2m}\langle x|\hat{p}^2 |\psi\rangle + V(x) \langle x |\psi\rangle \\ = \left(-\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V \right)\langle x|\psi\rangle

You would recognize this as the familiar form of the Hamiltonian operator if you have exposure to quantum mechanics before. An interesting insight can be obtained if we let ψ |\psi\rangle equal to the eigenstate of the Hamiltonian operator. Let the eigenstate be n |n\rangle and the eigenvalue be EnE_n, the equation becomes,

xH^n=(22m2x2+V)xnEnxn=(22m2x2+V)xn\langle x|\hat{H}|n\rangle = \left(-\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V \right)\langle x|n\rangle \\ E_n \langle x|n\rangle = \left(-\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V \right)\langle x|n\rangle

This can be perceived as a differential equation, whereby xn \langle x|n\rangle is a function of xx. This differential equation is known as the time-independent Schrodinger’s Equation. It’s given a fancy name although it follows trivially from the definition of the Hamiltonian operator. Solving this equation gives us xn \langle x|n\rangle which finally connects all three basis x {|x\rangle}, p {|p\rangle} and n {|n\rangle}.

Generalized Statistics Interpretation

So now we know S |S\rangle is a vector that corresponds to a physical state, and that x^S \hat{x} |S\rangle somehow tells us something about the positional information of a particle at physical state S |S\rangle, how do we exactly know the position of the particle?

The generalized statistics interpretation is what connects the mathematics with physics. Let x {|x\rangle} be an orthonormal basis that are the eigenstates of x^ \hat{x} with corresponding eigenvalues x {x}. We know that,

S=dxxΨ(x)|S\rangle = \int dx |x\rangle \Psi(x)

where Ψ(x)=xS \Psi(x) = \langle x|S\rangle is just a scalar valued function. Then, the generalized statistics interpretation states that the probability density of measuring the eigenvalue xx is Ψ(x)2 \Psi(x)^2. In other words, the only values we can possibly measure are the eigenvalues of x^ \hat{x}; the probability of measuring each eigenvalue is the square of the component of that eigenvalue’s eigenstate in the state the particle is in. The general statistics interpretation is the fourth postulate of quantum mechanics.

From this definition, we can readily derive that the expected observed position x \langle x\rangle satisfies,

x=xxS2dx=xΨΨdx=dxxSxx^S=dxSxxx^S=S(dxxx)x^S=Sx^S\langle x \rangle = \int x |\langle x| S\rangle|^2 dx = \int x \Psi^\star \Psi dx \\ = \int dx \langle x|S\rangle^\star \langle x|\hat{x}|S\rangle \\ = \int dx \langle S|x\rangle \langle x|\hat{x}|S\rangle \\ = \langle S| \left(\int dx |x\rangle \langle x|\right)\hat{x}|S\rangle \\ = \langle S| \hat{x}|S\rangle

By the same reasoning, any general observable Q^ \hat{Q} has expected value SQ^S \langle S| \hat{Q} |S\rangle.

Schrodinger Equation

The first section of this article is on the mathematics of quantum mechanics. The second section is on the connection between the math to physics. This final one is about physics. Physics is all about predicting the future state of a particle/system based on its past. And that’s what Schrodinger’s Equation does–you can take it as the fifth postulate. It’s deceptively simple:

iSt=H^Si\hbar\frac{\partial |S\rangle}{\partial t} = \hat{H} |S\rangle

We may also write it in terms of the x {|x\rangle} basis, which is what most introductory textbooks do,

xiSt=xH^SixSt=(22m2x2+V)xSiΨ(x)t=(22m2x2+V)Ψ(x)\langle x| i\hbar \frac{\partial |S\rangle}{\partial t} = \langle x|\hat{H}|S\rangle \\ i\hbar \frac{\partial \langle x|S\rangle}{\partial t} = \left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} +V\right) \langle x | S\rangle \\ i\hbar \frac{\partial \Psi(x)}{\partial t} = \left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} +V\right) \Psi(x)

How to apply Quantum Mechanics

The great thing of a postulative introduction is that it is very terse and gives somewhat of a big picture without glossing over too much details. But a problem with it is that you can totally read the first 3 parts of this article and walk away with no clue at all about how to apply quantum mechanics. That is what this final section addresses–for those petty engineers who care about applications.

Usually, in a quantum mechanics problem (as in any other physics problem), you are given an initial state S(0) |S(0)\rangle and you want to find the state at time tt, S(t) |S(t)\rangle.

Or more realistically, you are given an initial state’s representation in some basis. Usually it is the position basis. So, in most cases, you start with Ψ(x,0) \Psi(x,0) . You with to derive Ψ(x,t) \Psi(x,t) or Φ(p,t) \Phi(p,t) or En(t)E_n(t) . It doesn’t matter. You can always convert between them based on what we learned in the first part of the article.

The magic formula is always the same:

Part 1

Find the eigenstates and eigenvalues of the Hamiltonian operator Hˉ \bar{H} in the basis in which the initial state S(0) |S(0)\rangle is given. We need to find it each time because it is dependent on the potential function VV. This is done by solving the so-called time-independent Schrodinger’s equation,

Enxn=(22m2x2+V)xnE_n\langle x|n\rangle = \left(-\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V \right)\langle x|n\rangle

because xn \langle x|n\rangle gives us a representation of eigenstate n |n\rangle in the x {|x\rangle} basis. Note that if you are given Φ(x,0) \Phi(x,0) instead of Ψ(x,0) \Psi(x,0) , you should solve for pn \langle p|n\rangle instead.

Part 2

Represent the original state S(0) |S(0)\rangle in terms of the Hamiltonian eigenstates n |n\rangle.

S(0)=ncn(0)n|S(0)\rangle = \sum_n c_n(0) |n\rangle \\

So, we essentially want to transform the position basis wavefunction representation Ψ(x,0) \Psi(x,0) to the Hamiltonian basis.

S(0)=dxxΨ(x,0)=dxΨ(x,0)nnxn=nndxxnΨ(x,0)|S(0)\rangle = \int dx |x\rangle \Psi(x,0) \\ = \int dx \Psi(x,0) \sum_n \langle n|x\rangle |n\rangle \\ = \sum_n |n\rangle \int dx \langle x|n\rangle^\star\Psi(x,0)

Clearly,

cn(0)=dxxnΨ(x,0)c_n(0) = \int dx \langle x|n\rangle^\star\Psi(x,0)

Part 3

Plug S(0) |S(0)\rangle back into the Schrodinger’s Equation.

itncnn=H^ncnninncnt=ncnEnni\hbar\frac{\partial}{\partial t}\sum_n c_n |n\rangle = \hat{H} \sum_n c_n |n\rangle \\ i\hbar \sum_n |n\rangle\frac{\partial c_n}{\partial t} = \sum_n c_n E_n |n\rangle

Because n {|n\rangle} are linearly independent, it must be that each component of n |n\rangle must equal for the overall equality to hold,

icnt=cnEncn=cn(0)eEntii\hbar\frac{\partial c_n}{\partial t} = c_n E_n \\ c_n = c_n(0) e^{\frac{E_nt}{i\hbar}}

Now, cnc_n is found, thus S |S\rangle is found in terms of energy basis. It can be translated into other basis easily.