1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 From Conservation of Energy to the Principle of Least Action: A Story Line Jozef Hanca) Technical University, Vysokoskolska 4, 042 00 Kosice, Slovakia Edwin F. Taylorb) Massachusetts Institute of Technology, Cambridge, MA 02139 ABSTRACT A schematic story line outlines an introduction to Newtonian mechanics that starts by employing the conservation of energy to predict the motion of a particle in a one-dimensional potential. Incorporating constraints and constants of the motion into the energy expression allows analysis of some more complicated systems. A heuristic transition embeds kinetic and potential energy in the still more powerful principle of least action and Lagrange's equations. NOTES TO EDITORS AND REFEREES: 1. Preprints of some papers referenced here but not yet published and some papers published elsewhere are available at the website http://www.eftaylor.com/leastaction.html 2. Comments and suggestions can conveniently be indexed by page and line number in this pdf file. I. INTRODUCTION It is sometimes claimed that nature has a "purpose", in that it seeks to take the path that produces the minimum action. . . . In fact [according to Feynman's version of quantum mechanics] nature does exactly the opposite. It takes every path, treating them all on [an] equal footing. We simply end up seeing the path with the stationary action, due to the way the quantum mechanical phases add. It would be a harsh requirement, indeed, to demand that nature make a "global" decision (that is, to compare paths that are separated by large distances), and to choose the one with the smallest action. Instead, we see that everything takes place on a "local” scale. Nearby phases simply add and everything works out automatically. — David Morin 1 2 Here is a nine-sentence history of Newtonian mechanics: In the second half of the 1600s 3 37 38 39 40 41 42 43 Newton proposed his three laws of motion, including what has become F = ma. In the mid1700s Leonhard Euler devised and applied one version of the principle of least action using mostly geometrical methods. On 12 August 1755 the 19-year-old Ludovico de la Grange Tournier of Turin (whom we call Joseph Louis Lagrange) sent Euler a letter with a mathematical attachment that streamlined Euler's methods into algebraic form. "[A]fter seeing Lagrange's work Euler dropped his own method, espoused that of Lagrange, and 4 renamed the subject the calculus of variations." Lagrange, in his path-breaking 1788 Analytical 44 Mechanics, introduced what we call the Lagrangian function and Lagrange's equations of 45 motion. Almost half a century later (1834-35) William Rowan Hamilton developed 5 6 1 7 8 1 Hamilton's principle, to which Landau and Lifshitz and Feynman more recently reassigned 2 the name principle of least action. Between 1840 and 1860 the conservation of energy was 3 established in all its generality. 9 10 11 In 1918 Emma Noether proved relations between 12 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 symmetries and conserved quantities. Finally, in 1949 Richard Feynman devised the formulation of quantum mechanics that not only explicitly underpins the principle of least action but also demarcates the limits of validity of Newtonian mechanics. 42 43 44 45 applications of energy and its environmental consequences is a matter of choice. In addition to expressions for kinetic and potential energy of a particle, this introduction should include angular velocity, moment of inertia, and the kinetic energy of a rigid body rotating about a fixed or parallel-moving axis. 13 We have proposed that the principle of least action and Lagrange's equations become the basis of introductory Newtonian mechanics. Three recent articles in this Journal demonstrate how to use elementary calculus to derive from the principle of least action (1) Newton's laws 14 15 16 of motion, (2) Lagrange's equations, and (3) examples of Noether's theorem. How are these concepts and methods to be introduced to students? In the present paper we suggest a reversal of the historical order: Begin with conservation of energy and graduate to the principle of least action followed by Lagrange's equations. We start by using the conservation of energy to analyze particle motion in a one-dimensional potential. Much of the power of the principle of least action and its logical offspring Lagrange's equations results from the fact that they are based on energy, a scalar. When we start with the conservation of energy, we not only preview more advanced concepts and procedures but also invoke some of their power. For example, expressions for energy conforming to constraints automatically eliminate corresponding forces of constraint from the analysis of motion. Applying a limited version of Noether's theorem expressed in terms of energy identifies constants of the motion. Using constraints and constants of the motion permits us to reduce to one coordinate the description of some important multi-dimensional systems, whose motion can then be solved completely using the conservation of energy. Equilibrium and statics also derive from the conservation of energy. Prerequisites for the proposed introduction include elementary trigonometry, polar coordinates, introductory differential calculus and partial derivatives, and the idea of the integral as a sum of increments. In preparation for later formalism, we symbolize the time derivative with a dot over the variable. The story line presented in this article omits most details and is offered for discussion, correction, and elaboration. The authors have no present intent to write an introductory text on Newtonian mechanics that follows the story line of this article. II. ENERGY CONSERVATION Begin with a discursive description of the forms of energy and the law of global conservation of energy. Introductory treatments of the conservation of energy and conversion among its 17, 18, 19 How far one goes into engineering various forms are available in the literature. 20, 21 2 1 2 3 4 5 6 7 8 9 10 11 12 13 14 III. ONE-DIMENSIONAL MOTION: ANALYTIC SOLUTIONS Narrow the focus to conservation of mechanical energy of a particle moving in a onedimensional potential. The complete description of such one-dimensional motion follows from the conservation of energy. Unfortunately, an explicit function of position vs time can be derived for only a small fraction of such systems. In other cases one is encouraged to guess the analytic solution, as illustrated in the following example. Any proposed analytic solution is easily checked by substitution into the energy equation. Heuristic guesses are assisted by the fact that the first time derivative of position, not the second, appears in the energy conservation equation. The following example introduces the potential energy diagram, a central tool in our story line and important in later study of advanced mechanics, quantum mechanics, general relativity, and many other subjects. Harmonic oscillator Energy Total energy of particle A B C D E F Kinetic energy Potential energy Position 15 16 17 18 19 20 21 22 23 24 25 26 27 Figure 1. The potential energy diagram is central to our treatment. Sample student exercise: For the value of the energy shown, describe the motion of the particle qualitatively but in detail. Compare the magnitudes and directions of each of the following quantities that describe the particle motion at different positions marked A through F: velocity, force, acceleration. In order to describe the motion, what initial condition(s) must be specified in addition to an initial starting point, and are these conditions required for all starting points? How do answers to these questions change for different values of the total energy? Qualitative analysis of the parabolic potential energy diagram (or indeed any diagram with motion bounded near a single potential energy minimum) tells us that the motion will be periodic. For the parabolic potential the conservation of energy is written 3 1 2 3 4 5 6 7 8 9 10 11 12 13 1 1 E = mx˙ 2 + kx 2 2 2 (1) Rearrange Eq. (1) to read: m 2 k 2 x x˙ = 1− 2E 2E (2) Recalling the periodicity of motion, we are reminded of trigonometric identity cos2 θ = 1− sin 2 θ (3) Set θ = ωt and equate corresponding terms in Eqs. (2) and (3), choosing the positive root, which yields: 2E 1/ 2 x = sin ωt (4) k 14 15 16 17 Take the time derivative of this expression and compare its square with the left side of Eq. 22 (2). The result tells us that 18 k 1/ 2 ω = m 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 (5) The fact that ω does not depend on the amplitude of the displacement function (4) means that the period is the same whatever the value of the total energy E. The simple harmonic oscillator has wide application because a large number of potential energy curves can be approximated as parabolas near their minima. IV. ACCELERATION AND FORCE In the absence of dissipation, force can be defined in terms of energy. In the simple harmonic oscillator example, take the time derivative of both sides of (1) (for constant energy E and mass m). 0 = mx˙x˙˙ + kxx˙ (6) or k x m 34 ˙x˙ = − 35 36 37 38 39 Define force as m times the acceleration. Then the simple harmonic force is –kx. More generally, take the time derivative of both sides of the conservation of energy equation for a particle moving in a one-dimensional potential: 40 0 = m˙xx˙˙ + (7) dU ( x ) dU ( x ) dx dU ( x ) = m˙xx˙˙ + = m˙xx˙˙ + x˙ dt dx dt dx 4 (8) 1 2 3 from which 4 m˙x˙ ≡ F = − 5 6 7 8 9 10 11 12 13 14 15 16 17 18 dU ( x ) dx (9) Thus for motion without dissipation force is defined in terms of energy. V. NON-INTEGRABLE MOTIONS IN ONE DIMENSION We cannot solve most one-dimensional motions in closed form, as we did for the simple harmonic oscillator. Our strategy will be to ask the student to begin every solution with a qualitatively analysis using the potential energy diagram (as exemplified in the caption of Fig. 1), then to complete the exact solution employing an interactive computer display (Fig. 2). Such a computer display, yet to be programmed but easily specified, numerically integrates the particle motion in a given potential. On this display the student sets up initial conditions by dragging the horizontal energy line up or down and the position dot left or right, both in the classically permitted region (lower panel of Fig. 2). The computer then moves the dot back and forth along the E-line at a rate proportional to that at which the particle will move, while simultaneously drawing the worldline (upper panel of Fig. 2). 5 time Position E Energy U Position 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 Figure 2. Interactive computer display of the worldline derived numerically from the initial energy and the potential energy function. VI. REDUCTION TO ONE COORDINATE The analysis of one-dimensional motion using the conservation of energy is powerful but limited. In some important cases we can use constraints and constants of the motion to reduce the description of a wider class of motions to one coordinate. To these we apply our standard procedure: qualitative analysis using the diagram of the (effective) potential energy followed by interactive computer solution. Here are some examples. (a) Object rolling without slipping A marble with mass m, radius r, and moment of inertia Imarble rolls without slipping along a curved ramp that lies in the xy plane. A uniform gravitational acceleration g acts in the negative y-direction. The non-slip constraint tells us that v = rω . Conservation of energy leads to the expression: 6 2 1 v 1 1 1 1 E = mv 2 + Imarbleω 2 + mgy = mv 2 + Imarble + mgy r 2 2 2 2 2 I 1 1 = m + marble v + mgy = Mv 2 + mgy 2 r 2 2 2 3 where 4 M =m+ 5 6 7 8 9 10 11 12 13 14 15 16 17 Imarble r2 (10) (11) Constraints are used twice in this example: explicitly in rolling without slipping and implicitly in the relation between height y and displacement along the curve. (b) Projectile motion For projectile motion in a vertical plane subject to a vertical uniform vertical gravitational field, the total energy is 1 1 (12) E = mx˙ 2 + my˙ 2 + mgy 2 2 The potential energy is not a function of x; therefore, as shown below, x-momentum px is a constant of the motion. The energy equation becomes: 1 p2 E = m˙y 2 + x + mgy 2m 2 (13) 18 19 20 21 In Newtonian mechanics the zero-point of energy is arbitrary, so we can reduce the analysis to one dimension by making the substitution: 22 E'= E − 23 24 25 26 27 28 29 30 31 In analyzing projectile motion, we have already applied a limited version of the powerful 23 Noether's theorem. The special version of Noether's theorem used here, expressed in terms of energy, says that when the total energy E is not an explicit function of an independent 24 coordinate, x for example, then the function ∂E ∂x˙ is a constant of the motion. In some important cases we can use the resulting constants of the motion, together with constraints, to describe some two- and even three-dimensional motions with just one coordinate. Projectile motion, above, is one example. Here is another. px2 1 2 = m˙y + mgy 2m 2 (14) 7 1 (c) Marble, ramp, and turntable m x θ φ 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 Figure 3. System of marble, ramp, and turntable. [NOTE: This figure will be redrawn professionally after any referee suggestions.] A marble of mass m and moment of inertia Imarble rolls without slipping along a slot on an inclined ramp fixed rigidly to a turntable which rotates freely so that its angular velocity is not necessarily constant (Fig. 3). The moment of inertia of the combined turntable and ramp is Irot. Assuming that the marble stays on the ramp, find its position x as a function of time. Start with a qualitative analysis, assuming the marble initially moves outward along the ramp. This outward rolling will slow down the rotation of the turntable, perhaps leading the marble to reverse direction. One possibility is oscillatory motion of the marble along the ramp; another possibility is an equilibrium location of the marble on the ramp with simple circular motion of the marble around the axis of the turntable. Now the analysis. The square of the velocity of the marble is: v 2 = x˙ 2 + φ˙ 2 x 2 cos2 θ (15) Conservation of energy yields the equation 1 1 1 Mx˙ 2 + Irot φ˙ 2 + Mx 2φ˙ 2 cos2 θ + mgx sin θ 2 2 2 23 E= 24 25 26 27 28 29 Here M indicates the marble's mass augmented by the energy effects of its rotation, Eq. (11). The expression (16) is not an explicit function of the angle of rotation φ. Therefore our version of Noether's theorem tells us that ∂E ∂φ˙ is a constant of the motion, which we recognize as the total angular momentum J: 30 (16) ∂E = ( Irot + Mx 2 cos2 θ )φ˙ = J ˙ ∂φ (17) 31 8 1 2 Substitute for φ˙ from Eq. (17) into (16): 3 E= 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 1 J2 1 M˙x 2 + + mgx sin θ = M˙x 2 + U eff ( x ) 2 2 2 2 2( Irot + Mx cos θ ) (18) Values of E and J are determined by initial conditions. The effective potential energy U eff ( x ) can have a minimum as a function of x. If the marble starts at rest at the position of this minimum, it will move in a circle. The general motion of this system is difficult to analyze using F = ma. (d) Motion in a central potential Analyze satellite motion in a central inverse-square gravitational field. Choose coordinates r and φ in the plane of the orbit. Then the expression for total energy is: 1 GMm 1 GMm E = mv 2 − = m( r˙ 2 + r 2φ˙ 2 ) − 2 r 2 r (19) The angle φ does not appear in this equation. Therefore we expect that a constant of the motion is given by ∂E ∂φ˙ , which is the angular momentum J: 19 ∂E = mr 2φ˙ = J ˙ ∂φ 20 21 22 Substitute the resulting expression for φ˙ in Eq. (20) into the energy equation (19). 23 1 J2 GMm 1 2 E = m˙r 2 + − = m˙r + U eff ( r) 2 2mr 2 r 2 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 (20) (21) The student employs the plot of effective potential energy to carry out the usual qualitative analysis of radial motion followed by computer integration. Emphasize the distinction between bound orbits, unbound orbits, and the intermediate limiting case. With additional use of Eq. (20), the computer can plot the trajectory in the plane. VII. EQUILIBRIUM AND STATICS 25 Friction is an important feature of motion in everyday life, but it is complicated to analyze. 26 Frictional forces are often ignored in advanced mechanics texts. The tendency of a system toward increased entropy and the statistical mechanics and thermodynamics that account for this tendency are not part of our story line here. Nevertheless, in common experience motion usually slows down and stops. Our use of potential energy diagrams makes straightforward the formulation of "stopping" as a tendency of systems to reach equilibrium at a local minimum of the potential energy. (In principle the system could come to equilibrium perched at a maximum of the potential energy, or even at a zero-slope inflection point.) In the absence of dissipative processes, equilibrium is really an application of conservation of energy; a system released from rest in a configuration at a minimal (or stationary) value of the potential energy will remain at rest. (Proof: An infinitesimal displacement results in zero 9 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 change in potential energy. Because of energy conservation, the change in kinetic energy must also remain zero. Since the system is initially at rest, the zero change in kinetic energy forbids displacement.) Here, as usual, Feynman is ahead of us. Figure 4 shows an example from his treatment of 27 statics. The problem is to find the value of the weight W that keeps the structure at rest. Using units of feet, inches, and pounds, Feynman balances the decrease in potential energy when the weight W drops 4 inches with increases in potential energy for the corresponding 2-inch rise of the 60 pound weight plus the 1-inch rise in the 100 pound weight. He demands that the net potential energy change of the system be zero, yielding Figure 4. Feynman's example of the principle of virtual work (−4 inch)W + (2 inch)(60 pounds) + (1 inch)(100 pounds) = 0 (22) or W = 55 pounds. This method, known formally as the principle of virtual work, is simply an application of the conservation of energy. There are many examples of the principle of virtual work, including a mass hanging on a spring, the lever, hydrostatic balance, uniform chain suspended at both ends, vertically 28 hanging "slinky," and a ball perched on top of a large sphere, not to mention every static engineering and natural structure in the universe. VIII. FREE PARTICLE. PRINCIPLE OF LEAST AVERAGE KINETIC ENERGY The conservation of energy is limited in predicting the motion of mechanical systems. Now we begin the transition to the more powerful principle of least action by looking at the time average of the kinetic energy of a free particle, a particle that moves in a region of zero potential energy or potential energy that has the same value everywhere in the region. In this and the following sections we seek a principle that is more fundamental than the conservation of energy. We will test any fundamental principle we uncover by demanding that, at least, it lead back to the conservation of energy. For this reason the trial worldlines we explore hereafter do not necessarily satisfy conservation of energy. 10 C fixed ttotal ttotal – t 2 t B 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 A fixed d 2d x Figure 5. Varying the time of the middle event to determine the path of minimum time-averaged kinetic energy. The solid line in Fig. 5 shows the trial worldline of a particle that moves freely between fixed events A and C. We know that a free particle travels with constant velocity, that is, along a straight worldline. Verify that this is the actual motion by examining a trial worldline for which the central event B in the worldline is displaced in time (and therefore kinetic energy is not the same on legs 1 and 2). Whether the potential energy is zero or uniform in this region, its contribution to the time average energy does not change with the time-displacement of event B, so we ignore the potential energy. Let K represent the kinetic energy. Then the time average of the kinetic energy along the two segments 1 and 2 of the worldline is given by the expression. K1t1 + K 2 t 2 1 1 2 1 = mv1 t1 + mv 22 t 2 t total 2 2 t total 16 K average = 17 18 19 20 Multiply through by the fixed total time and expand the velocity expressions using the notation displayed in Fig. 5: 21 (23) 1 d2 1 d2 K averaget total = m 2 t + m ( ttotal − t ) 2 t 2 ( t total − t ) 2 (24) 1 d2 1 d2 = m + m 2 t 2 (t total − t ) 22 11 1 2 3 Find the minimum value of the average kinetic energy by taking the derivative with respect to the time t of the central event and setting the result equal to zero: 4 dK average 1 d2 1 d2 1 2 1 2 t = − m + m mv 2 = 0 total 2 = − mv1 + 2 2 t 2 ( t total − t ) 2 2 dt 5 6 7 The right-hand equation leads to the equality: 8 1 2 1 2 mv 2 = mv1 2 2 (25) (26) 9 10 11 12 13 14 15 16 As expected, when the time-average kinetic energy has a minimum value the kinetic energy for a free particle is the same on both segments; the worldline is straight between the fixed initial and final events A and C. In summary, we have illustrated the fact that for the special case of a particle moving in a region of zero (or uniform) potential energy, kinetic energy is conserved if we demand that the time average of the kinetic energy has a minimum value. The general expression for this average is: 17 K average = 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 1 t total ∫ Kdt (27) In an introductory text one might insert at this point a sidebar on Fermat's principle of least time for the propagation of light rays, which predicts every central feature of ray optics. Now we are ready to develop a similar law that predicts every central feature of mechanics. IX. PRINCIPLE OF LEAST ACTION Section VIII examined the principle of least average kinetic energy, which predicts the motion of a particle moving in a region of zero (or uniform) potential energy. Along the way we mentioned Fermat's principle of least time that describes ray optics. Now we move on to the more general principle which includes the principle of least average kinetic energy as a special case. Can we find some quantity whose time average value is a minimum when both kinetic energy and potential energy are functions of position? Let K represent kinetic energy and U represent potential energy. One might guess (incorrectly) that the time averaged total K + U would have a minimum value between fixed initial and final events. (Remember: In calculating the value of this average we do not assume the conservation of total energy, but rather demand that energy conservation emerge as one result of our analysis.) It turns out that the time averaged difference K – U between kinetic and potential energy has a minimum value for the path taken by the particle. How can this be? To sort this out, think of a baseball pitched in a uniform gravitational field, with the two events of pitch and catch fixed in both location and time (Fig. 6). How will the ball move between these two fixed events? 12 y E (K − U) avg minimum D C pitch catch B x A 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 (K + U )avg minimum? Figure 6. Alternative trial trajectories of a pitched baseball. To begin with, forget about averages and ask why the baseball does not simply move steadily along the straight horizontal trajectory B in the figure. Moving from pitch to catch with constant kinetic energy and constant potential energy certainly satisfies conservation of energy. But the straight horizontal trajectory is excluded because of the importance of the averaged potential energy. To see why the horizontal trajectory will not occur, go back to time averages and try the (incorrect) assumption that the trajectory taken by the baseball is the one that has a minimum value of the time average of the sum K + U. That trajectory would fall and then rise, like curve A in Fig. 6. Here is why: By moving downward the ball decreases the average value of the potential energy U (while increasing K a little because of the longer path). This is clearly not what occurs when a baseball is pitched! As an alternative hypothesis, try the assumption that the trajectory taken by the baseball is the one that leads to a minimum value of the time average of the difference K – U. Which of the trajectories C, D, or E can satisfy this criterion? By following a trajectory such as C that rises and then falls, the baseball increases the time average value of its potential energy U. Since U enters the average of K – U with a minus sign, a higher trajectory decreases the overall contribution of the potential energy to the average (while increasing kinetic energy somewhat). A minimum value of the time average of K – U is achieved along some trajectory such as D in Fig. 6. For a still higher trajectory E the average value of the potential energy U increases further, but the kinetic energy (proportional to the square of the speed) increases even faster, because the baseball must complete in the same time the longer path between the fixed events of pitch and catch. For higher and higher trajectories the average kinetic energy increases without limit, leading to no maximum value. The actual path, indicated schematically by D in Fig. 6, represents a compromise that makes the time-average value of K 13 1 2 3 4 5 6 7 8 29 – U along the path a minimum. We will see that total energy is automatically conserved along this path of minimum time average of the quantity (K – U) . Check this result analytically in the simplest case we can imagine (Fig. 7). In a uniform vertical gravitational field a marble of mass m and momentum of inertia Imarble rolls from one horizontal surface to another via a smooth ramp so narrow that we neglect its width. In this system the potential energy changes just once, halfway between initial and final positions. 1 2 v1 A z1 z2 B x d 9 10 11 12 13 C d Figure 7. Motion across a potential energy step. The worldline of the particle will be bent, corresponding to the reduced speed after the marble mounts the ramp, as shown in Fig. 8. 14 1 2 C fixed ttotal ttotal – t 2 B t 1 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 d A fixed 2d x Figure 8. Broken worldline of a marble rolling across steps connected by a narrow ramp. The region on the right is shaded to represent the higher potential energy of the marble on the second step We demand that the total travel time from position A to position C have a fixed value ttotal and check whether minimizing the time average of the difference (K – U) leads to the conservation of energy. time average = 1 t total [(K − U ) t + (K − U ) t ] 1 1 (28) 2 2 Define a new symbol S, called the action: Action ≡ S ≡ time average × t total = (K − U )1 t1 + (K − U ) 2 t 2 (29) Use the symbols in Fig. 8 to write Eq. (29) in the form 1 1 S = mv12 t1 − mgz1t1 + mv 22 t 2 − mgz2 t 2 2 2 2 md md 2 = − mgz1t + − mgz 2 (t total − t ) 2t 2(t total − t ) (30) 15 1 2 3 4 (For simplicity we neglect the marble's kinetic energy of rotation.) Demand that the value of the action be a minimum with respect to the choice of the intermediate time t: 5 dS md 2 md 2 = − 2 − mgz1 + 2 + mgz 2 = 0 dt 2t 2(t total − t ) 6 7 8 This result can be written: 9 10 11 12 13 (31) 1 1 dS = − mv12 + mgz1 + mv 22 + mgz2 = 0 2 2 dt (32) A simple rearrangement shows that Eq. (32) represents the conservation of energy. Here energy conservation has been derived from the more fundamental principle of least action. 3 2 z1 1 d 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 z3 z2 4 z4 d d d Figure 9. A more complicated potential energy diagram, leading in the limit to a potential energy curve that varies smoothly with position The action Eq. (29) can be generalized for multiple steps connected by smooth, narrow transitions, such as the one shown in Fig. 9. The argument leading to conservation of Eq. (32) then applies to every adjacent pair of steps in the potential energy diagram. The corresponding generalization of the action is: S = (K − U )1 t1 + (K − U ) 2 t 2 + (K − U ) 3 t 3 + (K − U ) 4 t 4 (33) The computer can hunt for and find the minimum value of S directly by varying the values of the intermediate times, as shown schematically in Fig. 10. 16 t E Fixed D C B A Fixed 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 26 27 28 29 30 1 4 x Figure 10. Varying the times at the potential energy steps to find a worldline of least action. The computer program temporarily fixes the end events of a two-segment section, say events B and D, then varies the middle event C to find the minimum value of the total S, then varies D while C is kept fixed, and so on. (End-events A and E always remain fixed.) The computer cycles through this process repeatedly until the value of S does not change further because this value has reached a minimum for the worldline as a whole. The resulting worldline approximates the one taken by the particle. A continuous potential energy curve is the limiting case of that shown in Fig. 9 as the number of steps increases without limit while the time along each step becomes an increment dt. For the resulting potential energy curve the general expression for the action S is: Action ≡ S ≡ ∫ L dt = ∫ (K − U ) dt entire path 16 17 18 19 20 21 22 23 24 25 3 2 (34) entire path Here L (= K – U for the cases we treat) is called the Lagrangian. The principle of least action says that the value of the action S is a minimum for the actual motion of the particle, a condition that leads not only to conservation of energy but also to a unique specification of the worldline. A more general form of the principle of least action also predicts the motion of a particle in more than one spatial dimension as well as the time development of systems containing many particles. The principle of least action can even predict the motion of some systems in 30 which energy is not conserved. With generalizations of the Lagrangian L, the principle of 31 least action describes relativistic motion, governs many non-mechanical systems, and can be used in the derivations of Maxwell's equations, Schroedinger's wave equation, the diffusion equation, geodesic worldlines in general relativity, and electric currents in steadystate circuits, among many other applications. 17 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 The present section has not provided a proof of the principle of least action in Newtonian mechanics. A fundamental proof rests on nonrelativistic quantum mechanics, for instance 32 that outlined by Tyc which uses the deBroglie relation to show that the phase change of a quantum wave along any worldline is equal to S h where S is the classical action along that 33 worldline. Starting with this result, Feynman and Hibbs show that his "sum over all paths" description of quantum motion reduces seamlessly to the Newtonian principle of least action as the mass of particles increases. Once students have mastered the principle of least action, it is easy to motivate the introduction of nonrelativistic quantum mechanics: Quantum mechanics simply assumes that the electron actually explores all the possible worldlines considered in finding the Newtonian worldline of least action. X. LAGRANGE'S EQUATIONS Lagrange's equations are conventionally derived from the principle of least action using the 34 calculus of variations. These derivations analyze the worldline as a whole. However, the action is a summation; if the value of the sum is minimum along the entire worldline, then the contribution along each incremental segment of the worldline must also be a minimum. This simplifying insight, due originally to Euler, allows a derivation of Lagrange's equations 35 using elementary calculus. directly from F = ma. The appendix shows a derivation of Lagrange's equation ACKNOWLEDGMENT The authors would like to acknowledge useful comments by Don S. Lemons. APPENDIX: DERIVATION OF LAGRANGE'S EQUATIONS DIRECTLY FROM F=ma Both F = ma and Lagrange's equations can be derived from the more fundamental principle of 36 least action. Here we move laterally from one derived formulation to the other, showing that Lagrange's equation leads to F = ma for particle motion in one dimension, as adapted 37 from a similar derivation by Zeldovich and Myskis. Using the definition of force in Eq. (9), we write Newton's second law as: dU = mx˙˙ dx 33 F =− 34 35 36 Assume that m is constant. Rearrange and recast Eq. (A1) to read: (A1) 37 d(−U ) d d m˙x 2 d(−U ) d − (m˙x ) = − = 0 dt dt d˙x 2 dx dx 38 39 40 41 In the right-hand Eq. (A2) add, under the derivative signs, specifically chosen terms whose partial derivatives are equal to zero: (A2) 18 1 d ∂ m˙x 2 ∂ m˙x 2 − U ( x ) − − U ( x ) = 0 ∂x 2 dt ∂x˙ 2 2 3 4 Set L = K – U, leading to the Lagrange equation in one dimension: 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 (A3) ∂L d ∂L =0 − ∂x dt ∂x˙ (A4) This sequence of steps can be reversed to show that Lagrange's equation leads to Newton's second law of motion F = ma. FOOTNOTES AND REFERENCES a) email: [email protected] b) email: [email protected] 1 David Morin, "The Lagrangian Method," Chapter 5 of a draft honors introductory mechanics text. Available at http://www.courses.fas.harvard.edu/~phys16/handouts/textbook/ch5.pdf (no final slash). 2 We use the phrase Newtonian mechanics to mean non-relativistic, non-quantum mechanics. The alternative phrase classical mechanics also applies to relativity, both special and general, and is thus inappropriate in the context of the present paper. 3 Sir Isaac Newton's Mathematical Principles of Natural Philosophy and his System of the World, Tr. Andrew Motte and Florian Cajori, University of California Press, 1946, page 13. 4 Herman H. Goldstine, A History of the Calculus of Variations from the 17th through the 19th Century, Springer-Verlag, 1980, page 110. 5 J. L. Lagrange, Analytic Mechanics, Victor N. Vagliente and Auguste Boissonnade Tr. (Kluwer Academic Publishers, 2001). 6 William Rowan Hamilton, "On a general method in dynamics, by which the study of the motions of all free systems of attracting or repelling points is reduced to the search and differentiation of one central Relation or characteristic Function," Philosophical Transactions of the Royal Society, part II for 1834, pp. 247-308; "Second essay on a general method in dynamics," Philosophical Transactions of the Royal Society, part I for 1835, pp. 95-144. Both papers are available at http://www.emis.de/classics/Hamilton/ 19 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 7 Landau and E. M. Lifshitz, Mechanics, Course of Theoretical Physics (Butterworth-Heinemann, London, 1976), 3rd ed., Vol. 1, Chap. 1. Their renaming first occurred in the original 1957 Russian edition. 8 Richard P. Feynman, Robert B. Leighton and Matthew Sands, The Feynman Lectures on Physics (Addison-Wesley, 1964), Vol. II, p. 19-8. 9 The technically correct term is principle of stationary action. See I. M. Gelfand and S. V. Fomin, Calculus of Variations, Richard A. Silverman, Tr., (1963, Dover Publications 2000) Chapter 7, Sec. 36.2. However, we have a strong preference for the conventional term principle of least action for reasons not central to the argument of the present paper. 10 Thomas H. Kuhn, "Energy Conservation as an Example of Simultaneous Discovery?" in The Conservation of Energy and the Principle of Least Action, I. Bernard Cohen, ed, Arno Press, 1981. 11 E. Noether, "Invariante Variationsprobleme," Goett. Nachr. 1918, pp. 235-257. 12 Richard P. Feynman, Nobel Lecture, December 11, 1965, from Nobel Lectures, Physics 19631970 (Elsevier). 13 Edwin F. Taylor, "A Call to Action," Am. J. Phys., 71 (5), May 2003, 423-425. 14 Jozef Hanc, Slavomir Tuleja, Martina Hancova, "Simple derivation of Newtonian mechanics from the principle of least action," Am. J. Phys., 71 (4) April 2003, 385-391. 15 Jozef Hanc, Edwin F. Taylor, and Slavomir Tuleja, "Deriving Lagrange's Equations Using Elementary Calculus," accepted for publication in Am. J. Phys.. 16 Jozef Hanc, Slavomir Tuleja, Martina Hancova, "Symmetries and Conservations Laws: Consequences of Noether's Theorem," accepted for publication in Am. J. Phys. 17 Reference 8, Vol. 1, Chapter 4. Feynman's parable about Dennis the Menace and his indestructible blocks is classic. The final statement "There are no blocks" emphasizes that energy is not a thing, cannot be pointed at, and typically cannot even be localized; it is the property of a system. 18 Benjamin Crowell: Conservation Laws (Light and Matter, Fullerton, CA, 1998-2002). Available at http://www.lightandmatter.com 19 Alan Van Heuvelen and Xueli Zou, "Multiple representations of work-energy processes," Am. J. Phys. 69 (2), 2001, pp 184-194 20 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 20 Roger A. Hinrich and Merlin Kleinbach, Energy: Its Use and the Environment (Harcourt, Inc. 2002). 21 Gordon J. Aubrecht, Energy, 2nd ed. Edition (Prentice-Hall, 1995). 22 Student exercise: Show that B cos ωt and C cos ωt + D sin ωt are also solutions. Use the occasion to discuss the importance of relative phase. 23 Benjamin Crowell bases an entire introductory treatment on Noether's theorem: Discover Physics (Light and Matter, Fullerton, CA, 1998-2002). Available at http://www.lightandmatter.com 24 Noether's theorem says that when the Lagrangian of a system (L = K – U for our simple cases) is not a function of an independent coordinate, x for example, then the function ∂L d˙x is a constant of the motion. This statement can also be expressed in terms of Hamiltonian dynamics. See for example Cornelius Lanczos, The Variational Principles of Mechanics, 4th ed. (Dover Publications, 1970), Chap VI, Sec. 9, statement above Eq. (69.1). In all the mechanical systems that we consider here, total energy is conserved and thus automatically does not contain time explicitly. The expression for kinetic energy K is always quadratic in the velocities, while the potential energy U is independent of velocities. These conditions are sufficient for the Hamiltonian of the system to be the total energy. Then our limited version of Noether's theorem is identical to the statement in Lanczos referenced above. However, we have prepared for students a simpler, deeper, and more intuitive argument that justifies our modified Noether theorem. 25 Ruth Chabay and Bruce Sherwood, Matter & Interactions (John Wiley, 2002) Vol. 1, pp 188194 and 244-248. Page 248 lists background references in Am. J. Phys. 26 Of course on the molecular level there is no friction. Of the current advanced mechanics texts listed in our references, only Landau and Lifshitz, Ref. 7, mentions sliding friction even in passing (p. 122); concerning motion in a dissipative medium they say (p. 74) " . . . there are no equations of motion in the mechanical sense. Thus the problem of the motion of a body in a medium is not one of mechanics." Their conclusion would surprise Isaac Newton. 27 Reference 8,Vol. 1, page 4-5. 28 Don S. Lemons, Perfect Form (Princeton University Press, 1997), Chapter 4. 29 Slavomir Tuleja, Edwin F. Taylor, Java software: Principle of Least Action Interactive, 2003, available at the website http://www.eftaylor.com/leastaction.html. 30 Herbert Goldstein, Charles Poole, and John Safko, Classical Mechanics, 3rd ed.(AddisonWesley, Reading, MA 2002), Section2.7. 21 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 31 Reference 8, Vol. II, p. 19-8 32 Tomas Tyc, "The de Broglie hypothesis leading to path integrals," Eur. J. Phys. 17 (May 1996), 156-157. Version available at the URL given in reference 29. 33 R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals (McGraw-Hill, 1965), p. 29. 34 Reference 7, Chap. 1; Reference 30, Sect. 2.3; D. Gerald Jay Sussman and Jack Wisdom, Structure and Interpretation of Classical Mechanics (MIT, Cambridge, 2001), Chap. 1. 35 Reference 15 36 References 14 and 15. 37 Ya. B. Zeldovich and A. D. Myskis, Elements of Applied Mathematics, George Yankovsky Tr. (MIR Publishers Moscow,1976) pages 499-500 22
© Copyright 2025 Paperzz