Huygens' Principle |
In 1678 the great Dutch physicist Christian Huygens (1629-1695) wrote a treatise called Traite de la Lumiere on the wave theory of light, and in this work he stated that the wavefront of a propagating wave of light at any instant conforms to the envelope of spherical wavelets emanating from every point on the wavefront at the prior instant (with the understanding that the wavelets have the same speed as the overall wave). An illustration of this idea, now known as Huygens' Principle, is shown below. |
From this simple principle Huygens was able to derive the laws of reflection and refraction, but the principle is deficient in that it fails to account for the directionality of the wave propagation in time, i.e., it doesn't explain why the wavefront at time t + Dt in the above figure is the upper rather than the lower envelope of the secondary wavelets. Why does an expanding spherical wave continue to expand outward from its source, rather than re-converging inward back toward the source? Also, the principle originally stated by Huygens does not account for diffraction. Subsequently, Augustin Fresnel (1788-1827) elaborated on Huygens' Principle by stating that the amplitude of the wave at any given point equals the superposition of the amplitudes of all the secondary wavelets at that point (with the understanding that the wavelets have the same frequency as the original wave). The Huygens-Fresnel Principle is adequate to account for a wide range of optical phenomena, and it was later shown by Gustav Kirchoff (1824-1887) how this principle can be deduced from Maxwell's equations. Nevertheless (and despite statements to the contrary in the literature), it does not actually resolve the question about "backward" propagation of waves, because Maxwell's equations themselves theoretically allow for advanced as well as retarded potentials. It's customary to simply discount the advanced waves as "unrealistic", and to treat the retarded wave as if it was the unique solution, although there have occasionally been interesting proposals, such as the Feynman-Wheeler theory, that make use of both solutions. |
Incidentally, as an undergraduate, Feynman gave a seminar on this "new idea" at Princeton. Among the several "monster minds" (as Feynman called them) in attendance was Einstein, to whom the idea was not so new, because 30 years earlier Einstein had debated the significance of the advanced potentials with Walther Ritz. |
In any case, the Huygens-Fresnel Principle has been very useful and influential in the field of optics, although there is a wide range of opinion as to its scientific merit. Many people regard it as a truly inspired insight, and a fore-runner of modern quantum electro-dynamics, whereas others dismiss it as nothing more than a naive guess that sometimes happens to work. For example, in his excellent "Principles of Electrodynamics", Melvin Schwartz wrote |
Huygens' principle tells us to consider each point on a wavefront as a new source of radiation and add the "radiation" from all of the new "sources" together. Physically this makes no sense at all. Light does not emit light; only accelerating charges emit light. Thus we will begin by throwing out Huygens' principle completely; later we will see that it actually does give the right answer for the wrong reasons. |
Whether we have now actually found the true "reason" for the behavior of light is debatable, and ultimately every theory is based on some fundamental principle(s), but it's interesting how widely the opinions on various principles differ. (I'm reminded of the history of Fermat's Principle, and of Planck's reverence for the Principle of Least Action.) |
Setting aside these weighty considerations, it's interesting to review the mathematical content of Huygens' original principle. The usual wave equation for a scalar field y(x1,x2,..,xn,t) in n space and 1 time dimension is |
(We've chosen units of time and space so that the wave velocity is 1.) If we consider a spherically symmetrical wave we have y = y(r,t) where |
For future reference, notice that |
for every index j from 1 to n. It follows that |
Returning to the basic wave equation, and assuming y is strictly a function of r and t, we have the following partial derivatives with respect to each of the space variables: |
Since partial differentiation is commutative, the second factor in the last term of the right- hand equation can be written as |
Now, since |
the preceding mixed partial is simply |
Substituting back into the expression for the second partial derivative of y with respect to xj, we have |
Summing all these partials for j = 1 to n gives |
Hence the spherically symmetrical wave equation in n spatial dimensions can be written as |
Now suppose we define a new scalar field by the relation f(r,t) = rk y(r,t) where k is some fixed constant. The partial derivative of this scalar field with respect to r are |
Notice that if we set k = (n-1)/2, and if we divide through this second partial by rk, we have |
This is nearly the same as the left-hand side of the spherically symmetrical wave equation, except for the last term. Hence we can write the wave equation in the form |
Furthermore, we can multiply through by rk = r(n-1)/2 to put this in the equivalent form |
If n equals 1, meaning that we have just a single space dimension, then r = x1 and f = y, so we expect the second term on the left hand side to vanish identically, as indeed it does, leaving us with just the original one-dimensional wave equation, with the well-known general solution |
for arbitrary functions f and g. However, we might not have anticipated that the second term in the transformed wave equation also vanishes if n equals 3, i.e., in the case of three spatial dimensions. In this case the spherically symmetrical wave equation once again reduces to a one-dimensional wave equation, although in the modified wave function f = ry. Hence the general solution in three space dimensions is |
The fact that this solution is divided by r signifies that the magnitude of the wave tends to drop as r increases (unlike the one-dimensional case, in which a wave would theoretical propagate forever with undiminished strength). Focusing on just the "retarded" component of the wave, f(r-t)/r, the fact that the time parameter t appears only in the difference r-t implies that the (attenuated) wave propagates in time with a phase velocity of precisely 1, because for any fixed phase b we have r-t = b and so dr/dt for this phase point is 1. Consequently if f is a single pulse, it will propagate outward in a spherical shell at precisely the speed 1, i.e., on the light cone. Conversely, it can be shown that the wave function at any point in space and time is fully determined by the values and derivatives of that function on the past light cone of the point. Any wave equation for which this is true (i.e., for which disturbances propagate at a single precise speed) is said to satisfy Huygens' Principle. The connection with Huygens' original statement about secondary wavelets is that each wavelet - with the same speed as the original wave - represents a tiny light cone at that point, and Huygens' principle asserts that light is confined to those light cones. |
It's worth noting that in the above derivation we were able to reduce the polar wave equation to a simple one-dimensional equation by taking advantage of the fact that an unwanted term vanished when the number of space dimensions is n = 3 (or n = 1). For the case of two dimensional space this doesn't work (nor would it work with four space dimensions). We can still solve the wave equation, but the solution is not just a simple spherical wave propagating with unit velocity. Instead, we find that there are effectively infinitely many velocities, in the sense that a single pulse disturbance at the origin will propagate outward on infinitely many "light cones" (and sub-cones) with speeds ranging from the maximum down to zero. Hence if we lived in a universe with two spatial dimensions (instead of three), an observer at a fixed location from the origin of a single pulse would "see" an initial flash but then the disturbance "afterglow" would persist, becoming less and less intense, but continuing forever, as slower and slower subsidiary branches arrive. (It's interesting to compare and contrast this "afterglow" with the cosmic microwave background radiation that we actually do observe in our 3+1 dimensional universe.) |
It turns out that Huygens' Principle applies only with one time dimension and n = 3, 5, 7.., or any odd number of space dimensions, but not for any even number of space dimensions. (The case n = 1 is degenerate, because a pulse has only one path to take.) To see why, let's return to the general spherically symmetrical wave equation in n space dimensions |
and consider a solution of the form y(r,t) = f(r)g(t). (Naturally not all solutions are separable in this way, but since the wave equation is linear, we can construct more general solutions by summing a sufficient number of solutions of the separable form f(r)g(t).) Inserting this into the wave equation and expanding the derivatives by the product rule gives |
Dividing through by fg gives |
Notice that the left hand side is strictly a function of r, and the right hand side is strictly a function of t. Since r and t are independent variables, the left and right sides must both equal a constant, which we will denote by k. Hence we have two separate ordinary differential equations |
The right hand equation shows that g(t) satisfies the simple harmonic equation, whose solutions include functions of the form sin(ct) and cos(ct) where . The left hand equation can be re-written in the form |
If we multiplied this through by r, it would be in the form of what is called Bessel's equation, named after Friedrich Wilhelm Bessel (1784-1846), the German astronomer who (incidentally) was the first person to determine the distance to a star (other than the Sun). In 1838 he determined the distance to the star called "61 Cygni" based on the parallax as viewed from the Earth at six-month intervals. Bessel functions are solutions of a standard Bessel equation, just as the ordinary trigonometric functions, sine and cosine, are solutions of the differential equation y" + y = 0. |
To solve the above equation we can assume a series solution of the form |
for some integer q (which may be positive, negative, or zero) such that c0 is non-zero. The derivatives of this function are |
Substituting these into the differential equation, and collecting terms by powers of r, we get |
The coefficient of each power of r must vanish, and since c0 is non-zero, the expression for the first coefficient implies q(q - 2 + n) = 0. This is called the indicial equation, because it determines the acceptable value(s) of q. In this case we must have either q = 0 or else q = 2 - n. If q = 0 then the coefficient of rq equals c1(n-1), so either n = 1 or else c1 = 0. On the other hand, if q = 2 - n, then the coefficient of rq equals c1(3-n), so either n = 3 or else again c1 = 0. We've already seen that the original differential equation has a particularly simple analytical solution when n (the number of space dimensions) equals either 1 or 3, so we need not consider them here. For all other value of n, we must have c1 = 0. (Of course, even with n = 1 or 3, we are free to set c1 = 0.) |
Now, examining the coefficients of the higher powers of r, we see that in general the coefficient of rq+m is of the form |
Inserting q = 0, setting the overall coefficient to zero, and solving for cm+1 gives |
for m = 0, 1, 2,... Since c0 is, by definition, the first non-zero coefficient, it follows that c-1 is zero, and therefore c1 = 0. Moreover, applying the above formula recursively, we see that all the cj coefficients for odd indices j must vanish. On the other hand, the coefficients with even indices are given recursively by |
and so on. Notice that if n = 1 the denominators are 1 2, 3 4, 5 6, ..., etc., so the general non-zero coefficient of the solution can be written simply as |
giving the solution |
Hence if we normalize k to -1, the solution is simply f(r) = c0 cos(r). Recall that g(t) has solutions of the form cos(ct) and sin(ct), and we can create a solution given by the sum of two separable solutions, so, for example, one solution is |
Similarly if n = 3 the denominators of the recursive formulas are 2 3, 4 5, 6 7, ..., etc., so the general non-zero coefficient is |
giving the solution |
In this case if we normalize k = -1 the solution is f(r) = c0 sin(r)/r. Combining this with suitable solutions for g(t) as in the case of n=1, we can arrive at overall solutions such as y(r,t) = cos(r-ct)/r. This shows (again) that the cases of 1 and 3 spatial dimensions lead to especially simple solutions. |
In general, for arbitrary positive integer n, the coefficient c2j is of the form |
Notice that for n = 1 the factors in the square brackets are consecutive odd integers, and they can be interleaved between the consecutive even integers to give a pure factorial product. Likewise for n = 3 the odd and even factors can be interleaved to give a pure factorial product. For higher odd integers we can interleave the factors in the same way, although there will be a fixed number of leading even factors and the same number of trailing odd factors that don't overlap. For example, with n=7 the coefficient c12 is as shown below, after re-arranging the six even and six odd factors in the denominator |
Taking advantage of this interleaving, we can express the general coefficient (for sufficiently large j) with odd n > 3 in the form |
For any fixed n, the first factor on the right is just a constant, and the second factor is just one over a polynomial of degree (n-3)/2 in the index j. Therefore, after some number of terms, the series solution goes over to a simple factorial form with a polynomial divisor. It can be shown that the resulting function f(r) is such that Huygens' Principle is satisfied, so this implies that the principle is satisfied for any odd number of space dimensions. |
In contrast, if the number of space dimensions is even, we do not have interleaving of the factors in the denominator of the coefficients. In this case we can only re-write (1) in the form |
For example, in the case n = 2 (i.e., two spatial dimensions) we have the coefficients |
This gives the function |
Taking k = -1 again, a plot of this function is shown below. |
This is the Bessel function of order zero, often denoted as J0. |
It would be interesting to work out the connections between Huygens' Principle and the zeta function (whose value can only be given in simple closed form for even arguments) and the Bernoulli numbers (which are non-zero only for even indices). It's also interesting to note the analogy between Huygens' spherical wavelets centered on the boundary of the wave front and the technique of analytic continuation, by which we expand the boundary of an analytic region by means of disks of convergence centered on or near the boundary of the existing analytic region. |
Paul Dirac (1902-1984) gave an interesting general argument for a much stronger version of Huygens' Principle in the context of quantum mechanics. In his "Principles of Quantum Mechanics" he noted that a measurement of a component of the instantaneous velocity of a free electron must give the value c, which implies that electrons (and massive particles in general) always propagate along null intervals, i.e., on the local light cone. At first this may seem to contradict the fact that we observe massive objects to move at speeds much less than the speed of light, but Dirac points out that observed velocities are always average velocities over appreciable time intervals, whereas the equations of motion of the particle show that its velocity oscillates between +c and -c in such a way that the mean value agrees with the average value. He argues that this must be the case in any relativistic theory that incorporates the uncertainty principle, because in order to measure the velocity of a particle we must measure its position at two different times, and then divide the change in position by the elapsed time. To approximate as closely as popssible to the instantaneous velocity, the time interval must go to zero, which implies that the position measurements must approach infinite precision. However, according to the uncertainty principle, the extreme precision of the position measurement implies an approach to infinite indeterminancy in the momentum, which means that almost all values of momentum - from zero to infinity - become equally probable. Hence the momentum is almost certainly infinite, which corresponds to a speed of c. This is obviously a very general argument, and applies to all massive particles (not just fermions). |