The Path Integral

Published on
24 min read––– views

If this seems very mysterious, you are not alone. Understanding what is going on here is in some sense equivalent to understanding Quantum Mechanics. I do not understand Quantum Mechanics. Feynman admitted that he never understood Quantum Mechanics. It may be true that nobody can understand Quantum Mechanics in the usual meaning of the word "understand."1

Intro

Continuing the quantum inquisition, this post aims to explain/derive the Feynman's path integral and describe its utility.

Dx  eis[x]/\int Dx \; e^{is[x]/\hbar}

This expression is key to understanding several areas of applied and theoretical mathematics including statistics, field theory, physics, and –notably for us/me/this series– quantum mechanics.

Whereas a standard, definite integral of the form A=abf(x)dxA = \int_a^b f(x)dx can be understood to compute the area under some curve on the range [a,b][a, b] by adding up infinitely many infinitesimal rectangular slices of the curve with area ×w\ell\times w where =f(x)\ell = f(x) and w=dxw = dx:

and summing up these areas in the limit as we take the width dxdx to be infinitesimally small, we get the continuous area under the curve. The path integral does something else all together: it allows us to compute quantities such as transition probabilities perturbatively – that is to say, multiple (possibly infinite) areas under several possibilities of curves, rather than single curve.

In our quantum context, the path integral allows us to sum over paths between points:

If we assign to each path a complex number given by some function ϕ:XC\phi : X \rightarrow \mathbb C, where XX is the topological space of paths between points a,bR2a, b \in \mathbb R^2.2 ϕ(x)\phi(x) can be thought of as a unit vector which we can visualize on the complex plane:

Feynman's path integral finds purchase for us in quantum mechanics by allowing us to measure quantum states. Consider a quantum particle at some place and time: (x0,t0)(x_0, t_0). If we wait for some amount of time to elapse and measure the particle again, we must consider all possible trajectories the particle could have taken to reach its destination at (xf,tfx_f, t_f), and our measurement will be the quantum probability of observing the particle at a given location (or, more commonly, a band or window of locations).

To compute this probability, we must sum over all paths:

ϕΦϕ(x)=Φ(x)  Dx\sum_{\phi \in \Phi} \phi(x) = \int \Phi(x)\; Dx

Specifically, the probability of observing the particle at (xf,tf)(x_f, t_f) will be proportional to the modulus of this integral as established by Feynman when he was 23:3

p[q=(xf,tf)]Φϕ(x)  Dx2p[q = (x_f, t_f)] \propto \Big| \int_{\Phi} \phi(x)\; Dx \Big|^2

Classical Kinematics

Whereas a classical trajectory can be described with kinematics equations with position as a function of time, e.g. throwing a ball:

The expression describing this function can be straightforwardly derived from F=maF=ma, with a=ga=g being a downwards gravitational force:

F=md2xdt2F = m\frac{d^2x}{dt^2}

If we know the state xx of the ball/particle at initial time tit_i, we can precisely compute its state at some future time x(tf)x(t_f). Crucially, this is not true in quantum mechanics where we're limited to computing a probability of observation of some state in the future.

Double Slit Experiment

Enter the famous Double Slit experiment performed by Thomas Young in 1804,4 but more illustratively for electrons in particular by George Thompson,5 Davisson and Germer in 1927.6 Referring back to the epigraph, this mysterious experiment demonstrates that when slits are placed in the path of an electron, they behave not like particles but instead like waves, causing interference patterns on the measurement backdrop (unless they're explicitly observed). When firing classical particles through the slits at the backdrop, we would expect to see a distribution of measured impacts following a Gaussian like so:

If a quantum particle moved along a unique, definite curve like a baseball, then we could say for certain whether it passes through the left or the right slot. As noted by Heisenberg, only if we observe the path of the electron does it solidify into existence. Prior to/in the absence of observation, the electron behaves strictly probabilistically (or, more or less imprecisely: quantum-ly), perturbing all possible paths through all slits.

As such, after passing many particles through the slits towards the backdrop, we'd expect to see a normal distribution of particle impacts around the center of the backstop. But instead, we see an interference pattern:

Here, the peaks of dense particle clusters and valleys devoid of particle presence. This experiment indicates that quantum particles are somehow probing both slits and "cancelling" themselves out in at the valleys. Note that it is not the case that multiple particles are interfering with one another which can be proved by emitting a single electron at a time, repeatedly.

And this, like the cotton gin was to the civil war, is actually right where we get the path integral.

If we add a third slit, this now allows for quantum trajectories through that slit as well:

And we can inductively generalize this approach, scaling the number and placement of slits horizontally and vertically till we have a mesh of infinitely many many slits in infinitely many bars lying between the location of particle emission aa and our backdrop bb:

In the limit, we find that the particle probes every possible path from aba \rightarrow b. Thus, we have a physical motivation for the path integral, but what exactly is it doing quantitatively?

Path Function

Recall that we said that ϕ(x)\phi(x) is some function which maps a path to a complex number. This is accomplished via:

ϕ(x)=eis[x]/\phi(x) = e^{is[x]/\hbar}

where s[x]s[x] describes the action of the path, and \hbar is Planck's constant.7 ss can be physically defined as the kinetic energy of the particle EE at any moment minus its potential energy UU integrated over the trajectory of the given path:

s[x]=titf(KU)dt,sRs[x] = \int_{t_i}^{t_f}(K -U)dt, \quad s \in\mathbb R

On the complex plane, this quantity will be a vector with angle s[x]s[x]. The amplitude for a particle to propagate from xixfx_i \rightarrow x_f is given by the product of the path integral with some constant AA:

Kfi=AϕΦeis[x]/    p(xf)Kfi2\begin{aligned} K_{fi} &= A\sum_{\phi \in \Phi} e^{is[x]/\hbar} \\ &\implies p(x_f) \propto|K_{fi}|^2 \end{aligned}

and, lastly, when measuring quantum particles, we're really measuring windows rather than individual points (since p[xf=x]=0p[x_f = x] = 0 in the limit), so we multiply the complex amplitude by an infinitesimal width:

    p(xf)Kfi2dxf\implies p(x_f) \propto|K_{f_i}|^2dx_f

Computing the Path Integral

If we try to actually compute a path integral, we'll quickly be anti-chuffed because it is impossible to recover a discrete list of infinitely many paths. Recall that an ordinary integral just sums the slices under some curve in the limit:

For path integrals, ϕ\phi is a functional which assigns a scalar to a curve. For illustrative purposes, we can discretize our toy curve, and consider what happens if we split a trajectory up into NN slices:

Our trajectory is a function which assigns a point in space xix_i for each discrete time step tit_i. Connecting these points, we get a zig-zag shape which approximates our curve. Note that we can move each xix_i up or down in order to approximate any path. So, to sum over all trajectories, we just sum over all positions that each xix_i can take for all xx in our trajectory f(t)f(t), so we end up integrating over the entire range of values in the x,tx, t plane which is what's meant by the capital infinitesimal DxDx:8

xixfDx=dx1dx2dxN1=dx1dx2dxN1\begin{aligned} \int_{x_i}^{x_f} Dx &= \int_{-\infty}^{\infty}dx_1\int_{-\infty}^{\infty}dx_2 \cdots \int_{-\infty}^{\infty}dx_{N-1} \\ &= \int_{-\infty}^{\infty}dx_1dx_2\cdots dx_{N-1} \end{aligned}

Recall we also include an AA term for getting a sensible factor in the limit because of course we're taking the limit of NN intermediate points in our discrete approximation of our path in order to recover a high fidelity, continuous path:

xixfϕ  Dx=AlimNϕ  dx1dx2dxN1\begin{aligned} \int_{x_i}^{x_f} \phi \; Dx &= A \lim_{N \rightarrow \infty} \int_{-\infty}^{\infty}\phi \; dx_1dx_2\cdots dx_{N-1} \end{aligned}

Example

Let's work an example on a free particle. "Free" meaning no forces are acting on it s.t. F=0F=0. For a classical particle like a baseball, then F=ma=0F = ma = 0 would imply a straight line trajectory from xixfx_i \rightarrow x_f, but for a quantum particle, we must compute the action:

s[x]=ab(KU)dt=ab(12mv2U(x(t)))dt=ab(12m(dxdt)20)dt\begin{aligned} s[x] &= \int_a^b (K - U)dt \\ &= \int_a^b \Big(\frac{1}{2}mv^2 - U(x(t))\Big)dt \\ &= \int_a^b \Big( \frac{1}{2}m\Big(\frac{dx}{dt}\Big)^2 - 0\Big)dt \\ \end{aligned}

So we end up just raising ee to the power of ii times the kinetic portion of the action divided by Planck's constant for all points. By breaking up the path into discrete time slices, approximating the curve with each piece of the trajectory we're approximating being a straight line, we get:

s[xij]=12m(dxdt)2dt=12m(xjxiΔt)2dt=Δt12m(xjxiΔt)2=m2Δt(xjxi)2\begin{aligned} s[x_{ij}] &= \int \frac{1}{2}m\Big(\frac{dx}{dt}\Big)^2 dt \\ &= \int \frac{1}{2}m\Big(\frac{x_j - x_i}{\Delta t}\Big)^2 dt \\ &= \Delta t \frac{1}{2}m\Big(\frac{x_j - x_i}{\Delta t}\Big)^2 \\ &= \frac{m}{2\Delta t} (x_j - x_i)^2 \end{aligned}

And we repeat this process for all segments i[1,N1],ji+1i \in [1, N-1], j \leftarrow i+1 to compute s[x]s[x]:

s[x]=m2Δt(x1xi)2+m2Δt(x2x1)2+m2Δt(xfxN1)2\begin{aligned} s[x] &= \frac{m}{2\Delta t} (x_1 - x_i)^2 + \frac{m}{2\Delta t} (x_2 - x_1)^2 + \cdots \frac{m}{2\Delta t} (x_f - x_{N-1})^2 \end{aligned}

and, raising ee to the power of is[x]\frac{is[x]}{\hbar} gives us the weight of this path:

xixfeis[x]/  Dx=limNeim2Δt((x1xi)2+(x2x1)2++(xfxN1)2)dx1dx2dxN1\int_{x_i}^{x_f} e^{is[x]/\hbar} \; Dx = \lim_{N \rightarrow \infty}\int_{-\infty}^{\infty} e^{\frac{i}{\hbar}\frac{m}{2\Delta t}\Big((x_1 - x_i)^2 + (x_2 - x_1)^2 + \cdots +(x_f - x_{N-1})^2 \Big)} dx_1dx_2\cdots dx_{N-1}

Note that the beastly right hand side has the form of a bunch of Gaussians:

eiax2dx=πia    eis[x]/  Dx=limNA(2πiΔtm)Nm2πi(tfti)ei12m(xfxi)2tfti\begin{aligned} \int_{-\infty}^{\infty} e^{iax^2}dx &= \sqrt{\frac{\pi i}{a}} \\ \\ \implies \int_{-\infty}^{\infty} e^{is[x]/\hbar}\; Dx &=\lim_{N\rightarrow \infty} A \Bigg(\sqrt{\frac{2\pi i \hbar \Delta t}{m}} \Bigg)^N \sqrt{\frac{m}{2\pi i\hbar(t_f - t_i)}}e^{\frac{i}{\hbar}\frac{1}{2}m\frac{(x_f - x_i)^2}{t_f-t_i}} \end{aligned}

and also that only one term depends on NN, so in order recover our path integral formula, we know that AA must be the inverse of this term:

A=(m2πiΔt)NA = \Bigg(\sqrt{\frac{m}{2\pi i \hbar \Delta t}} \Bigg)^N

All together the expression for the path integral of a free particle is:

eis[x]/  Dx=limN(m2πiΔt)N(2πiΔtm)Nm2πi(tfti)ei12m(xfxi)2tfti=m2πi(tfti)ei12m(xfxi)2tfti\begin{aligned} \int_{-\infty}^{\infty} e^{is[x]/\hbar}\; Dx &= \lim_{N\rightarrow \infty} \Bigg(\sqrt{\frac{m}{2\pi i \hbar \Delta t}} \Bigg)^N \Bigg(\sqrt{\frac{2\pi i \hbar \Delta t}{m}} \Bigg)^N \sqrt{\frac{m}{2\pi i\hbar(t_f - t_i)}}e^{\frac{i}{\hbar}\frac{1}{2}m\frac{(x_f - x_i)^2}{t_f-t_i}} \\ \\ &= \sqrt{\frac{m}{2\pi i\hbar(t_f - t_i)}}e^{\frac{i}{\hbar}\frac{1}{2}m\frac{(x_f - x_i)^2}{t_f-t_i}} \end{aligned}

This is the quantum mechanical amplitude KK for a free particle to propagate from from position xixfx_i \rightarrow x_f:

Kfi=m2πi(tfti)ei12m(xfxi)2tftiK_{fi} = \sqrt{\frac{m}{2\pi i\hbar(t_f - t_i)}}e^{\frac{i}{\hbar}\frac{1}{2}m\frac{(x_f - x_i)^2}{t_f-t_i}}

Physically, this corresponds to the probability of finding the particle at position xfx_f, or rather, in an infinitesimally thin window of width dxfdx_f centered about xfx_f. To interpret the above amplitude as a probability, we take the absolute value squared:

p(xf)=Kfi2dxfp(x_f) = |K_{fi}|^2dx_f

In other words, the square of KK tells us the probability density of finding the particle at any given point:

p(xf)dxf=Kfi2\frac{p(x_f)}{dx_f} = |K_{fi}|^2

Taking the modulus squared of our KfiK_{fi} term, we can note that it is comprised of two terms, the square root term and the exponential:

Kfi=m2πi(tfti)ei12m(xfxi)2tftiK_{fi} = \sqrt{\frac{m}{2\pi i\hbar(t_f - t_i)}}e^{\frac{i}{\hbar}\frac{1}{2}m\frac{(x_f - x_i)^2}{t_f-t_i}}

Note that the square of ebi=1  bR|e^{bi}| = 1 \; \forall b \in \mathbb R, so the 2nd term will just be 1, regardless of the mass of the particle, position, or time interval. The square of the modulus of first term will just remove the square root, so we get:

p(xf)dxf=Kfi2=m2πi(tfti)\begin{aligned} \frac{p(x_f)}{dx_f} = |K_{fi}|^2 = \frac{m}{2\pi i\hbar(t_f - t_i)} \end{aligned}

Curiously, this doesn't actually depend on the position at all! This implies that there is a uniform likelihood that the particle can be found anywhere in space... Let's sanity check if/how this makes sense. This stems from the Uncertainty Principle which indirectly states that the more narrowly we restrict a particles position in space, the broader the range of possible velocities it can have:

H(v)1H(x)H(v) \propto \frac{1}{H(x)}

And furthermore, since we assumed that the particle has a definite initial position of xix_i, we've effectively unbounded the range of possible initial velocities it might have, therefore it could feasibly shoot off into any other point in space with equal probability when observed later. This is why, in practice, we proscribe windows of initial conditions describing our particle.

Footnotes

Footnotes

  1. The Feynman Double Slit. University of Toronto.

  2. and probably R3\mathbb R^3 too

  3. Feynman, Richard P. "Space-Time Approach to Non-Relativistic Quantum Mechanics." Cornell University, 1948.

  4. Young, Thomas. "The Bakerian lecture. Experiments and calculation relative to physical optics". Philosophical Transactions of the Royal Society of London, 1804. Vol. 94, pp. 1–16. 10.1098/rstl.1804.0001.

  5. Davisson, C. J.; Germer, L. H. "Reflection of Electrons by a Crystal of Nickel." Proceedings of the National Academy of Sciences of the United States of America, 1928. Vol. 14, Iss. 4, pp. 317–322. 1928PNAS...14..317D.

  6. Thomson, G. P.; Reid, A. "Diffraction of Cathode Rays by a Thin Film." Nature, 1927. Vol. 119  Iss. 3,007: 890. 10.1038/119890a0.

  7. 1900, spectral radiance per unit frequency of a body for frequency ν\nu at absolute temperature TT given by Bν(ν,T)dν=2hν3c21ehνkBT1dνB_\nu(\nu, T)d\nu = \frac{2h\nu^3}{c^2}\frac{1}{e^{\frac{h\nu}{k_BT}}-1}d\nu where kBk_B is the Boltzmann constant,9 \hbar is Planck's constant.

  8. This term is called a "measure" and it's actually not well defined, but we'll suspend our disbelief for this installment of ELI25

  9. kB=R/NAk_B = R/N_A from pV=nRTpV = nRT, NAN_A = Avagadro's constant