#jsDisabledContent { display:none; } My Account | Register | Help

Laplace–Runge–Lenz vector

Article Id: WHEBN0000719460
Reproduction Date:

 Title: Laplace–Runge–Lenz vector Author: World Heritage Encyclopedia Language: English Subject: Collection: Publisher: World Heritage Encyclopedia Publication Date:

Laplace–Runge–Lenz vector

In classical mechanics, the Laplace–Runge–Lenz vector (or simply the LRL vector) is a vector used chiefly to describe the shape and orientation of the orbit of one astronomical body around another, such as a planet revolving around a star. For two bodies interacting by Newtonian gravity, the LRL vector is a constant of motion, meaning that it is the same no matter where it is calculated on the orbit;[1] equivalently, the LRL vector is said to be conserved. More generally, the LRL vector is conserved in all problems in which two bodies interact by a central force that varies as the inverse square of the distance between them; such problems are called Kepler problems.[2]

The hydrogen atom is a Kepler problem, since it comprises two charged particles interacting by Coulomb's law of electrostatics, another inverse square central force. The LRL vector was essential in the first quantum mechanical derivation of the spectrum of the hydrogen atom,[3] before the development of the Schrödinger equation. However, this approach is rarely used today.

In classical and quantum mechanics, conserved quantities generally correspond to a symmetry of the system. The conservation of the LRL vector corresponds to an unusual symmetry; the Kepler problem is mathematically equivalent to a particle moving freely on the surface of a four-dimensional (hyper-)sphere,[4] so that the whole problem is symmetric under certain rotations of the four-dimensional space.[5] This higher symmetry results from two properties of the Kepler problem: the velocity vector always moves in a perfect circle and, for a given total energy, all such velocity circles intersect each other in the same two points.[6]

The Laplace–Runge–Lenz vector is named after Pierre-Simon de Laplace, Carl Runge and Wilhelm Lenz. It is also known as the Laplace vector, the Runge–Lenz vector and the Lenz vector. Ironically, none of those scientists discovered it. The LRL vector has been re-discovered several times[7] and is also equivalent to the dimensionless eccentricity vector of celestial mechanics.[8] Various generalizations of the LRL vector have been defined, which incorporate the effects of special relativity, electromagnetic fields and even different types of central forces.

Contents

• Context 1
• History of rediscovery 2
• Mathematical definition 3
• Derivation of the Kepler orbits 4
• Circular momentum hodographs 5
• Constants of motion and superintegrability 6
• Evolution under perturbed potentials 7
• Poisson brackets 8
• Quantum mechanics of the hydrogen atom 9
• Conservation and symmetry 10
• Rotational symmetry in four dimensions 11
• Generalizations to other potentials and relativity 12
• Proofs that the Laplace–Runge–Lenz vector is conserved in Kepler problems 13
• Direct proof of conservation 13.1
• Hamilton–Jacobi equation in parabolic coordinates 13.2
• Noether's theorem 13.3
• Lie transformation 13.4
• Alternative scalings, symbols and formulations 14
• References 16

Context

A single particle moving under any conservative central force has at least four constants of motion, the total energy E and the three Cartesian components of the angular momentum vector L with respect to the origin. The particle's orbit is confined to a plane defined by the particle's initial momentum p (or, equivalently, its velocity v) and the vector r between the particle and the center of force (see Figure 1, below).

As defined below (see Mathematical definition), the Laplace–Runge–Lenz vector (LRL vector) A always lies in the plane of motion for any central force. However, A is constant only for an inverse-square central force.[1] For most central forces, however, this vector A is not constant, but changes in both length and direction; if the central force is approximately an inverse-square law, the vector A is approximately constant in length, but slowly rotates its direction. A generalized conserved LRL vector \mathcal{A} can be defined for all central forces, but this generalized vector is a complicated function of position, and usually not expressible in closed form.[9][10]

The plane of motion is perpendicular to the angular momentum vector L, which is constant; this may be expressed mathematically by the vector dot product equation r·L = 0; likewise, since A lies in that plane, A·L = 0.

The LRL vector differs from other conserved quantities in the following property. Whereas for typical conserved quantities, there is a corresponding cyclic coordinate in the three-dimensional Lagrangian of the system, there does not exist such a coordinate for the LRL vector. Thus, the conservation of the LRL vector must be derived directly, e.g., by the method of Poisson brackets, as described below. Conserved quantities of this kind are called "dynamic", in contrast to the usual "geometric" conservation laws, e.g., that of the angular momentum.

History of rediscovery

The LRL vector A is a constant of motion of the important Kepler problem, and is useful in describing astronomical orbits, such as the motion of the planets. Nevertheless, it has never been well known among physicists, possibly because it is less intuitive than momentum and angular momentum. Consequently, it has been rediscovered independently several times over the last three centuries.[7]

Jakob Hermann was the first to show that A is conserved for a special case of the inverse-square central force,[11] and worked out its connection to the eccentricity of the orbital ellipse. Hermann's work was generalized to its modern form by Johann Bernoulli in 1710.[12] At the end of the century, Pierre-Simon de Laplace rediscovered the conservation of A, deriving it analytically, rather than geometrically.[13] In the middle of the nineteenth century, William Rowan Hamilton derived the equivalent eccentricity vector defined below,[8] using it to show that the momentum vector p moves on a circle for motion under an inverse-square central force (Figure 3).[6]

At the beginning of the twentieth century, Josiah Willard Gibbs derived the same vector by vector analysis.[14] Gibbs' derivation was used as an example by Carle Runge in a popular German textbook on vectors,[15] which was referenced by Wilhelm Lenz in his paper on the (old) quantum mechanical treatment of the hydrogen atom.[16] In 1926, the vector was used by Wolfgang Pauli to derive the spectrum of hydrogen using modern quantum mechanics, but not the Schrödinger equation;[3] after Pauli's publication, it became known mainly as the Runge–Lenz vector.

Mathematical definition

For a single particle acted on by an inverse-square central force described by the equation \mathbf{F}(r)=\frac{-k}{r^{2}}\mathbf{\hat{r}}, the LRL vector A is defined mathematically by the formula[1]

Figure 1: The LRL vector A (shown in red) at four points (labeled 1, 2, 3 and 4) on the elliptical orbit of a bound point particle moving under an inverse-square central force. The center of attraction is shown as a small black circle from which the position vectors (likewise black) emanate. The angular momentum vector L is perpendicular to the orbit. The coplanar vectors p×L and (mk/r)r are shown in blue and green, respectively; these variables are defined below. The vector A is constant in direction and magnitude.
 \mathbf{A} = \mathbf{p} \times \mathbf{L} - m k \mathbf{\hat{r}}

where

• m is the mass of the point particle moving under the central force,
• p is its momentum vector,
• L = r × p is its angular momentum vector,
• k is a parameter that describes strength of the central force,
• r is the position vector of the particle (Figure 1), and
• \mathbf{\hat{r}}\!\, is the corresponding unit vector, i.e., \mathbf{\hat{r}} = \frac{\mathbf{r}}{r} where r is the magnitude of r.

Since the assumed force is conservative, the total energy E is a constant of motion,

E = \frac{p^{2}}{2m} - \frac{k}{r} = \frac{1}{2} mv^{2} - \frac{k}{r} ~.

Furthermore, the assumed force is a central force, and thus the angular momentum vector L is also conserved and defines the plane in which the particle travels. The LRL vector A is perpendicular to the angular momentum vector L because both p × L and r are perpendicular to L. It follows that A lies in the plane of the orbit.

This definition of the LRL vector A pertains to a single point particle of mass m moving under the action of a fixed force. However, the same definition may be extended to two-body problems such as Kepler's problem, by taking m as the reduced mass of the two bodies and r as the vector between the two bodies.

A variety of alternative formulations for the same constant of motion may also be used. The most common is to scale by mk to define the eccentricity vector

\mathbf{e} = \frac{\mathbf{A}}{m k} = \frac{1}{m k}(\mathbf{p} \times \mathbf{L}) - \mathbf{\hat{r}} ~ .

Derivation of the Kepler orbits

Figure 2: Simplified version of Figure 1, defining the angle θ between A and r at one point of the orbit.

The shape and orientation of the Kepler problem orbits can be determined from the LRL vector as follows.[1] Taking the dot product of A with the position vector r gives the equation

\mathbf{A} \cdot \mathbf{r} = Ar \cos\theta = \mathbf{r} \cdot \left( \mathbf{p} \times \mathbf{L} \right) - mkr

where θ is the angle between r and A (Figure 2). Permuting the scalar triple product

\mathbf{r} \cdot\left(\mathbf{p}\times \mathbf{L}\right) = \left(\mathbf{r} \times \mathbf{p}\right)\cdot\mathbf{L} = \mathbf{L}\cdot\mathbf{L}=L^2

and rearranging yields the defining formula for a conic section, provided that A is a constant, which is the case for the inverse square force law,

 \frac{1}{r} = \frac{mk}{L^{2}} \left( 1 + \frac{A}{mk} \cos\theta \right)

of eccentricity e,

e = \frac{A}{mk} = \frac{\left|\mathbf{A}\right|}{m k}

and latus rectum

\left| 2\ell \right| = \frac{2L^{2}}{mk} ~.

The major semiaxis a of the conic section may be defined using the latus rectum and the eccentricity

a \left( 1 \pm e^{2} \right) = \ell = \frac{L^{2}}{mk} ~,

where the minus sign pertains to ellipses and the plus sign to hyperbolae.

Taking the dot product of A with itself yields an equation involving the energy E,

A^2= m^2 k^2 + 2 m E L^2 \, ,

which may be rewritten in terms of the eccentricity,

e^{2} - 1= \frac{2L^{2}}{mk^{2}}E ~.

Thus, if the energy E is negative (bound orbits), the eccentricity is less than one and the orbit is an ellipse. Conversely, if the energy is positive (unbound orbits, also called "scattered orbits"), the eccentricity is greater than one and the orbit is a hyperbola. Finally, if the energy is exactly zero, the eccentricity is one and the orbit is a parabola. In all cases, the direction of A lies along the symmetry axis of the conic section and points from the center of force toward the periapsis, the point of closest approach.

Circular momentum hodographs

Figure 3: The momentum vector p (shown in blue) moves on a circle as the particle moves on an ellipse. The four labeled points correspond to those in Figure 1. The circle is centered on the y-axis at position A/L (shown in magenta), with radius mk/L (shown in green). The angle η determines the eccentricity e of the elliptical orbit (cos η = e). By the inscribed angle theorem for circles, η is also the angle between any point on the circle and the two points of intersection with the px axis, pxp0.

The conservation of the LRL vector A and angular momentum vector L is useful in showing that the momentum vector p moves on a circle under an inverse-square central force.[6][7]

Taking the dot product of

mk ~\hat{\mathbf{r}} = \mathbf{p} \times \mathbf{L} - \mathbf{A}

with itself yields

(mk)^2= A^2+ p^2 L^{2} + 2 \mathbf{L} \cdot (\mathbf{p} \times \mathbf{A}) ~.

Further choosing L along the z-axis, and the major semiaxis as the x-axis, yields the locus equation for p,

 p_{x}^{2} + \left(p_{y} - A/L \right)^{2} = \left( mk/L \right)^{2}
.

In other words, the momentum vector p is confined to a circle of radius mk/L = L/ℓ centered on (0, A/L). The eccentricity e corresponds to the cosine of the angle η shown in Figure 3.

In the degenerate limit of circular orbits, and thus vanishing A, the circle centers at the origin (0,0). For brevity, it is also useful to introduce the variable p_{0} = \sqrt{2m\left| E \right|}. This circular hodograph is useful in illustrating the symmetry of the Kepler problem.

Constants of motion and superintegrability

The seven scalar quantities E, A and L (being vectors, the latter two contribute three conserved quantities each) are related by two equations, A · L = 0 and A2 = m2k2+ 2 mEL2, giving five independent constants of motion. (Since the magnitude of A, hence the eccentricity e of the orbit, can be determined from the total angular momentum L and the energy E, only the direction of A is conserved independently; moreover, since A must be perpendicular to L, it contributes only one additional conserved quantity.)

This is consistent with the six initial conditions (the particle's initial position and velocity vectors, each with three components) that specify the orbit of the particle, since the initial time is not determined by a constant of motion. The resulting 1-dimensional orbit in 6-dimensional phase space is thus completely specified.

A mechanical system with d degrees of freedom can have at most 2d − 1 constants of motion, since there are 2d initial conditions and the initial time cannot be determined by a constant of motion. A system with more than d constants of motion is called superintegrable and a system with 2d − 1 constants is called maximally superintegrable.[17] Since the solution of the Hamilton–Jacobi equation in one coordinate system can yield only d constants of motion, superintegrable systems must be separable in more than one coordinate system.[18] The Kepler problem is maximally superintegrable, since it has three degrees of freedom (d=3) and five independent constant of motion; its Hamilton–Jacobi equation is separable in both spherical coordinates and parabolic coordinates,[19] as described below.

Maximally superintegrable systems follow closed, one-dimensional orbits in phase space, since the orbit is the intersection of the phase-space isosurfaces of their constants of motion. Consequently, the orbits are perpendicular to all gradients of all these independent isosurfaces, five in this specific problem, and hence are determined by the generalized cross products of all of these gradients. As a result, all superintegrable systems are automatically describable by Nambu mechanics,[20] alternatively, and equivalently, to Hamiltonian mechanics.

Maximally superintegrable systems can be quantized using commutation relations, as illustrated below.[21] Nevertheless, equivalently, they are also quantized in the Nambu framework, such as this classical Kepler problem into the quantum hydrogen atom.[22]

Evolution under perturbed potentials

Figure 5: Gradually precessing elliptical orbit, with an eccentricity e = 0.667. Such precession arises in the Kepler problem if the attractive central force deviates slightly from an inverse-square law. The rate of precession can be calculated using the formulae in the text.

The Laplace–Runge–Lenz vector A is conserved only for a perfect inverse-square central force. In most practical problems such as planetary motion, however, the interaction potential energy between two bodies is not exactly an inverse square law, but may include an additional central force, a so-called perturbation described by a potential energy h(r). In such cases, the LRL vector rotates slowly in the plane of the orbit, corresponding to a slow apsidal precession of the orbit.

By assumption, the perturbing potential h(r) is a conservative central force, which implies that the total energy E and angular momentum vector L are conserved. Thus, the motion still lies in a plane perpendicular to L and the magnitude A is conserved, from the equation A2 = m2k2 + 2mEL2. The perturbation potential h(r) may be any sort of function, but should be significantly weaker than the main inverse-square force between the two bodies.

The rate at which the LRL vector rotates provides information about the perturbing potential h(r). Using canonical perturbation theory and action-angle coordinates, it is straightforward to show[1] that A rotates at a rate of,

\begin{align} \frac{\partial}{\partial L} \langle h(r) \rangle & = \displaystyle \frac{\partial}{\partial L} \left\{ \frac{1}{T} \int_0^T h(r) \, dt \right\} \\[1em] & = \displaystyle\frac{\partial}{\partial L} \left\{ \frac{m}{L^{2}} \int_0^{2\pi} r^2 h(r) \, d\theta \right\} ~, \end{align}

where T is the orbital period, and the identity L dt = m r2  was used to convert the time integral into an angular integral (Figure 5). The expression in angular brackets, h(r)〉, represents the perturbing potential, but averaged over one full period; that is, averaged over one full passage of the body around its orbit. Mathematically, this time average corresponds to the following quantity in curly braces. This averaging helps to suppress fluctuations in the rate of rotation.

This approach was used to help verify Einstein's theory of general relativity, which adds a small effective inverse-cubic perturbation to the normal Newtonian gravitational potential,[23]

h(r) = \frac{kL^{2}}{m^{2}c^{2}} \left( \frac{1}{r^{3}} \right) ~.

Inserting this function into the integral and using the equation

\frac{1}{r} = \frac{mk}{L^{2}} \left( 1 + \frac{A}{mk} \cos\theta \right)

to express r in terms of θ, the precession rate of the periapsis caused by this non-Newtonian perturbation is calculated to be[23]

\frac{6\pi k^{2}}{TL^{2}c^{2}} ~,

which closely matches the observed anomalous precession of Mercury[24] and binary pulsars.[25] This agreement with experiment is strong evidence for general relativity.[26][27]

Poisson brackets

The algebraic structure of the problem is, as explained in later sections, SO(4)/ℤ2 ~ SO(3) × SO(3).[5] The three components Li of the angular momentum vector L have the Poisson brackets[1]

\left\{ L_{i}, L_{j}\right\} = \sum_{s=1}^{3} \epsilon_{ijs} L_{s} ~,

where i=1,2,3 and ϵijs is the fully antisymmetric tensor, i.e., the Levi-Civita symbol; the summation index s is used here to avoid confusion with the force parameter k defined above. The Poisson brackets will be extended to quantum mechanical commutation relations in the next section and Lie brackets in a following section.

As noted below, a scaled Laplace–Runge–Lenz vector D may be defined with the same units as angular momentum by dividing A by p_0= \sqrt{2m|E|}. The Poisson brackets of D with the angular momentum vector L can then be written in a similar form[5][28]

\left\{ D_{i}, L_{j}\right\} = \sum_{s=1}^{3} \epsilon_{ijs} D_{s} ~.

The Poisson brackets of D with itself depend on the sign of E, i.e., on whether the total energy E is negative (producing closed, elliptical orbits under an inverse-square central force) or positive (producing open, hyperbolic orbits under an inverse-square central force). For negative energies – i.e., for bound systems – the Poisson brackets are

\left\{ D_{i}, D_{j}\right\} = \sum_{s=1}^{3} \epsilon_{ijs} L_{s} ~;

whereas, for positive energy, the Poisson brackets have the opposite sign,

\left\{ D_{i}, D_{j}\right\} = -\sum_{s=1}^{3} \epsilon_{ijs} L_{s} ~.

The Casimir invariants for negative energies are

C_{1} = \mathbf{D} \cdot \mathbf{D} + \mathbf{L} \cdot \mathbf{L} = \frac{mk^{2}}{2\left|E\right|}
C_{2} = \mathbf{D} \cdot \mathbf{L} = 0,

and have vanishing Poisson brackets with all components of D and L,

\left\{ C_{1}, L_{i} \right\} = \left\{ C_{1}, D_{i} \right\} = \left\{ C_{2}, L_{i} \right\} = \left\{ C_{2}, D_{i} \right\} = 0 ~.

C2 is trivially zero, since the two vectors are always perpendicular.

However, the other invariant, C1, is non-trivial and depends only on m, k and E. Upon canonical quantization, this invariant allows the energy levels of hydrogen-like atoms to be derived using only quantum mechanical canonical commutation relations, instead of the conventional solution of the Schrödinger equation.

Quantum mechanics of the hydrogen atom

Figure 6: Energy levels of the hydrogen atom as predicted from the commutation relations of angular momentum and Laplace–Runge–Lenz vector operators; these energy levels have been verified experimentally.

Poisson brackets provide a simple guide for quantizing most classical systems: the commutation relation of two quantum mechanical operators is specified by the Poisson bracket of the corresponding classical variables, multiplied by .[29]

By carrying out this quantization and calculating the eigenvalues of the C1 Casimir operator for the Kepler problem, Wolfgang Pauli was able to derive the energy levels of hydrogen-like atoms (Figure 6) and, thus, their atomic emission spectrum.[3] This elegant 1926 derivation was obtained before the development of the Schrödinger equation.[30]

A subtlety of the quantum mechanical operator for the LRL vector A is that the momentum and angular momentum operators do not commute; hence, the quantum operator cross product of p and L must be defined carefully.[28] Typically, the operators for the Cartesian components As are defined using a symmetrized (Hermitian) product,

A_{s} = - m k \hat{r}_{s} + \frac{1}{2} \sum_{i=1}^{3} \sum_{j=1}^{3} \epsilon_{sij} \left( p_{i} l_{j} + l_{j} p_{i} \right) ,

from which the corresponding additional ladder operators for L can be defined,

J_{0} = A_{3} \,
J_{\pm 1} = \mp \frac{1}{\sqrt{2}} \left( A_{1} \pm i A_{2} \right) ~.

These further connect different eigenstates of L2, so different spin multiplets, among themselves.

A normalized first Casimir invariant operator, quantum analog of the above, can likewise be defined,

C_{1} = - \frac{m k^{2}}{2 \hbar^{2}} H^{-1} - I ~,

where H−1 is the inverse of the Hamiltonian energy operator, and I is the identity operator.

Applying these ladder operators to the eigenstates |ℓmn〉 of the total angular momentum, azimuthal angular momentum and energy operators, the eigenvalues of the first Casimir operator, C1, are seen to be quantized, n2 − 1. Importantly, by dint of the vanishing of C2, they are independent of the ℓ and m quantum numbers, making the energy levels degenerate.[28]

Hence, the energy levels are given by

E_{n} = - \frac{m k^{2}}{2\hbar^{2} n^{2}} ~ ,

which coincides with the Rydberg formula for hydrogen-like atoms (Figure 6). The additional symmetry operators A have connected the different ℓ multiplets among themselves, for a given energy (and C1), dictating n2 states at each level. In effect, they have enlarged the angular momentum group SO(3) to SO(4)/ℤ2 ~ SO(3) × SO(3).

Conservation and symmetry

The conservation of the LRL vector corresponds to a subtle symmetry of the system. In classical mechanics, symmetries are continuous operations that map one orbit onto another without changing the energy of the system; in quantum mechanics, symmetries are continuous operations that "mix" electronic orbitals of the same energy, i.e., degenerate energy levels. A conserved quantity is usually associated with such symmetries.[1] For example, every central force is symmetric under the rotation group SO(3), leading to the conservation of angular momentum L. Classically, an overall rotation of the system does not affect the energy of an orbit; quantum mechanically, rotations mix the spherical harmonics of the same quantum number l without changing the energy.

Figure 7: The family of circular momentum hodographs for a given energy E. All the circles pass through the same two points \pm p_{0} = \pm \sqrt{2m\left| E \right|} on the px-axis (cf. Figure 3). This family of hodographs corresponds to one family of Apollonian circles, and the σ isosurfaces of bipolar coordinates.

The symmetry for the inverse-square central force is higher and more subtle. The peculiar symmetry of the Kepler problem results in the conservation of both the angular momentum vector L and the LRL vector A (as defined above) and, quantum mechanically, ensures that the energy levels of hydrogen do not depend on the angular momentum quantum numbers l and m. The symmetry is more subtle, however, because the symmetry operation must take place in a higher-dimensional space; such symmetries are often called "hidden symmetries".[31]

Classically, the higher symmetry of the Kepler problem allows for continuous alterations of the orbits that preserve energy but not angular momentum; expressed another way, orbits of the same energy but different angular momentum (eccentricity) can be transformed continuously into one another. Quantum mechanically, this corresponds to mixing orbitals that differ in the l and m quantum numbers, such as the s (l=0) and p (l=1) atomic orbitals. Such mixing cannot be done with ordinary three-dimensional translations or rotations, but is equivalent to a rotation in a higher dimension.

For negative energies −– i.e., for bound systems −– the higher symmetry group is SO(4), which preserves the length of four-dimensional vectors

\left| \mathbf{e} \right|^{2} = e_{1}^{2} + e_{2}^{2} + e_{3}^{2} + e_{4}^{2} .

In 1935, Vladimir Fock showed that the quantum mechanical bound Kepler problem is equivalent to the problem of a free particle confined to a three-dimensional unit sphere in four-dimensional space.[4] Specifically, Fock showed that the Schrödinger wavefunction in the momentum space for the Kepler problem was the stereographic projection of the spherical harmonics on the sphere. Rotation of the sphere and reprojection results in a continuous mapping of the elliptical orbits without changing the energy; quantum mechanically, this corresponds to a mixing of all orbitals of the same energy quantum number n. Valentine Bargmann noted subsequently that the Poisson brackets for the angular momentum vector L and the scaled LRL vector D formed the Lie algebra for SO(4).[5] Simply put, the six quantities D and L correspond to the six conserved angular momenta in four dimensions, associated with the six possible simple rotations in that space (there are six ways of choosing two axes from four). This conclusion does not imply that our universe is a three-dimensional sphere; it merely means that this particular physics problem (the two-body problem for inverse-square central forces) is mathematically equivalent to a free particle on a three-dimensional sphere.

For positive energies – i.e., for unbound, "scattered" systems – the higher symmetry group is SO(3,1), which preserves the Minkowski length of 4-vectors

ds^{2} = e_{1}^{2} + e_{2}^{2} + e_{3}^{2} - e_{4}^{2} .

Both the negative- and positive-energy cases were considered by Fock[4] and Bargmann[5] and have been reviewed encyclopedically by Bander and Itzykson.[32][33]

The orbits of central-force systems – and those of the Kepler problem in particular – are also symmetric under reflection. Therefore, the SO(3), SO(4) and SO(3,1) groups cited above are not the full symmetry groups of their orbits; the full groups are O(3), O(4) and O(3,1), respectively. Nevertheless, only the connected subgroups, SO(3), SO(4) and SO(3,1), are needed to demonstrate the conservation of the angular momentum and LRL vectors; the reflection symmetry is irrelevant for conservation, which may be derived from the Lie algebra of the group.

Rotational symmetry in four dimensions

Figure 8: The momentum hodographs of Figure 7 correspond to stereographic projections of great circles on the three-dimensional η unit sphere. All of the great circles intersect the ηx axis, which is perpendicular to the page; the projection is from the North pole (the w unit vector) to the ηxy plane, as shown here for the magenta hodograph by the dashed black lines. The great circle at a latitude α corresponds to an eccentricity e = sin α. The colors of the great circles shown here correspond to their matching hodographs in Figure 7.

The connection between the Kepler problem and four-dimensional rotational symmetry SO(4) can be readily visualized.[32][34][35] Let the four-dimensional Cartesian coordinates be denoted (w, x, y, z) where (x, y, z) represent the Cartesian coordinates of the normal position vector r. The three-dimensional momentum vector p is associated with a four-dimensional vector \boldsymbol\eta on a three-dimensional unit sphere

\begin{align} \boldsymbol\eta & = \displaystyle \frac{p^2 - p_0^2}{p^2 + p_0^2} \mathbf{\hat{w}} + \frac{2 p_0}{p^2 + p_0^2} \mathbf{p} \\[1em] & = \displaystyle \frac{mk - r p_0^2}{mk} \mathbf{\hat{w}} + \frac{rp_0}{mk} \mathbf{p} \end{align}

where \mathbf{\hat{w}} is the unit vector along the new w-axis. The transformation mapping p to η can be uniquely inverted; for example, the x-component of the momentum equals

p_x = p_0 \frac{\eta_x}{1 - \eta_w}

and similarly for py and pz. In other words, the three-dimensional vector p is a stereographic projection of the four-dimensional \boldsymbol\eta vector, scaled by p0 (Figure 8).

Without loss of generality, we may eliminate the normal rotational symmetry by choosing the Cartesian coordinates such that the z-axis is aligned with the angular momentum vector L and the momentum hodographs are aligned as they are in Figure 7, with the centers of the circles on the y-axis. Since the motion is planar, and p and L are perpendicular, pz = ηz = 0 and attention may be restricted to the three-dimensional vector \boldsymbol\eta = (ηw, ηx, ηy). The family of Apollonian circles of momentum hodographs (Figure 7) correspond to a family of great circles on the three-dimensional \boldsymbol\eta sphere, all of which intersect the ηx-axis at the two foci ηx = ±1, corresponding to the momentum hodograph foci at px = ±p0. These great circles are related by a simple rotation about the ηx-axis (Figure 8). This rotational symmetry transforms all the orbits of the same energy into one another; however, such a rotation is orthogonal to the usual three-dimensional rotations, since it transforms the fourth dimension ηw. This higher symmetry is characteristic of the Kepler problem and corresponds to the conservation of the LRL vector.

An elegant action-angle variables solution for the Kepler problem can be obtained by eliminating the redundant four-dimensional coordinates \boldsymbol\eta in favor of elliptic cylindrical coordinates (χ, ψ, φ)[36]

\eta_{w} = \mathrm{cn}\, \chi \ \mathrm{cn}\, \psi
\eta_{x} = \mathrm{sn}\, \chi \ \mathrm{dn}\, \psi \ \cos \phi
\eta_{y} = \mathrm{sn}\, \chi \ \mathrm{dn}\, \psi \ \sin \phi
\eta_{z} = \mathrm{dn}\, \chi \ \mathrm{sn}\, \psi

where sn, cn and dn are Jacobi's elliptic functions.

Generalizations to other potentials and relativity

The Laplace–Runge–Lenz vector can also be generalized to identify conserved quantities that apply to other situations.

In the presence of a uniform electric field E, the generalized Laplace–Runge–Lenz vector \mathcal{A} is[19][37]

\mathcal{A} = \mathbf{A} + \frac{mq}{2} \left[ \left( \mathbf{r} \times \mathbf{E} \right) \times \mathbf{r} \right] ,

where q is the charge of the orbiting particle. Although \mathcal{A} is not conserved, it gives rise to a conserved quantity, namely \mathcal{A} \cdot \mathbf{E}.

Further generalizing the Laplace–Runge–Lenz vector to other potentials and special relativity, the most general form can be written as[9]

\mathcal{A} = \left( \frac{\partial \xi}{\partial u} \right) \left(\mathbf{p} \times \mathbf{L}\right) + \left[ \xi - u \left( \frac{\partial \xi}{\partial u} \right)\right] L^{2} \mathbf{\hat{r}}

where u = 1/r (cf. Bertrand's theorem) and ξ = cos θ, with the angle θ defined by

\theta = L \int^{u} \frac{du}{\sqrt{m^{2} c^{2} \left(\gamma^{2} - 1 \right) - L^{2} u^{2}}}

and γ is the Lorentz factor. As before, we may obtain a conserved binormal vector B by taking the cross product with the conserved angular momentum vector

\mathcal{B} = \mathbf{L} \times \mathcal{A} .

These two vectors may likewise be combined into a conserved dyadic tensor W,

\mathcal{W} = \alpha \mathcal{A} \otimes \mathcal{A} + \beta \, \mathcal{B} \otimes \mathcal{B}

In illustration, the LRL vector for a non-relativistic, isotropic harmonic oscillator can be calculated.[9] Since the force is central,

\mathbf{F}(r)= -k \mathbf{r} ,

the angular momentum vector is conserved and the motion lies in a plane.

The conserved dyadic tensor can be written in a simple form

\mathcal{W} = \frac{1}{2m} \mathbf{p} \otimes \mathbf{p} + \frac{k}{2} \, \mathbf{r} \otimes \mathbf{r} ~,

although it should be noted that p and r are not necessarily perpendicular.

The corresponding Runge–Lenz vector is more complicated,

\mathcal{A} = \frac{1}{\sqrt{mr^{2}\omega_{0} A - mr^{2}E + L^{2}}} \left\{ \left( \mathbf{p} \times \mathbf{L} \right) + \left(mr\omega_{0} A - mrE \right) \mathbf{\hat{r}} \right\} ,

where \omega_{0} = \sqrt{\frac{k}{m}} is the natural oscillation frequency and A=(E^{2}-\omega^{2}L^{2})^{1/2}/\omega.

Proofs that the Laplace–Runge–Lenz vector is conserved in Kepler problems

The following are arguments showing that the LRL vector is conserved under central forces that obey an inverse-square law.

Direct proof of conservation

A central force \mathbf{F} acting on the particle is

\mathbf{F} = \frac{d\mathbf{p}}{dt} = f(r) \frac{\mathbf{r}}{r} = f(r) \mathbf{\hat{r}}

for some function f(r) of the radius r. Since the angular momentum \mathbf{L} = \mathbf{r} \times \mathbf{p} is conserved under central forces, \frac{d}{dt}\mathbf{L} = 0 and

\frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) = \frac{d\mathbf{p}}{dt} \times \mathbf{L} = f(r) \mathbf{\hat{r}} \times \left( \mathbf{r} \times m \frac{d\mathbf{r}}{dt} \right) = f(r) \frac{m}{r} \left[ \mathbf{r} \left(\mathbf{r} \cdot \frac{d\mathbf{r}}{dt} \right) - r^{2} \frac{d\mathbf{r}}{dt} \right]

where the momentum \mathbf{p} = m \frac{d\mathbf{r}}{dt} and where the triple cross product has been simplified using Lagrange's formula

\mathbf{r} \times \left( \mathbf{r} \times \frac{d\mathbf{r}}{dt} \right) = \mathbf{r} \left(\mathbf{r} \cdot \frac{d\mathbf{r}}{dt} \right) - r^{2} \frac{d\mathbf{r}}{dt}

The identity

\frac{d}{dt} \left( \mathbf{r} \cdot \mathbf{r} \right) = 2 \mathbf{r} \cdot \frac{d\mathbf{r}}{dt} = \frac{d}{dt} \left( r^{2} \right) = 2r\frac{dr}{dt}

yields the equation

\frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) = -m f(r) r^{2} \left[ \frac{1}{r} \frac{d\mathbf{r}}{dt} - \frac{\mathbf{r}}{r^{2}} \frac{dr}{dt}\right] = -m f(r) r^{2} \frac{d}{dt} \left( \frac{\mathbf{r}}{r}\right)

For the special case of an inverse-square central force f(r)=\frac{-k}{r^{2}}, this equals

\frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) = m k \frac{d}{dt} \left( \frac{\mathbf{r}}{r}\right) = \frac{d}{dt} \left( mk\mathbf{\hat{r}} \right)

Therefore, A is conserved for inverse-square central forces

\frac{d}{dt} \mathbf{A} = \frac{d}{dt} \left( \mathbf{p} \times \mathbf{L} \right) - \frac{d}{dt} \left( mk\mathbf{\hat{r}} \right) = \mathbf{0}

A shorter proof is obtained by using the relation of angular momentum to angular velocity, \mathbf{L} = m r^2 \boldsymbol{\omega}, which holds for a particle traveling in a plane perpendicular to \mathbf{L}. Specifying to inverse-square central forces, the time derivative of \mathbf{p} \times \mathbf{L} is

\frac{d}{dt} \mathbf{p} \times \mathbf{L} = \left( \frac{-k}{r^2} \mathbf{\hat{r}} \right) \times \left(m r^2 \boldsymbol{\omega}\right) = m k \, \boldsymbol{\omega} \times \mathbf{\hat{r}} = m k \,\frac{d}{dt}\mathbf{\hat{r}}

where the last equality holds because a unit vector can only change by rotation, and \boldsymbol{\omega}\times\mathbf{\hat{r}} is the orbital velocity of the rotating vector. Thus, A is seen to be a difference of two vectors with equal time derivatives.

As described below, this LRL vector A is a special case of a general conserved vector \mathcal{A} that can be defined for all central forces.[9][10] However, since most central forces do not produce closed orbits (see Bertrand's theorem), the analogous vector \mathcal{A} rarely has a simple definition and is generally a multivalued function of the angle θ between r and \mathcal{A}.

Hamilton–Jacobi equation in parabolic coordinates

The constancy of the LRL vector can also be derived from the Hamilton–Jacobi equation in parabolic coordinates (ξ, η), which are defined by the equations

\xi = r + x \,
\eta = r - x \,

where r represents the radius in the plane of the orbit

r = \sqrt{x^{2} + y^{2}}

The inversion of these coordinates is

x = \frac{1}{2} \left( \xi - \eta \right)
y = \sqrt{\xi\eta}

Separation of the Hamilton–Jacobi equation in these coordinates yields the two equivalent equations[19][38]

2\xi p_{\xi}^{2} - mk - mE\xi = -\Gamma
2\eta p_{\eta}^{2} - mk - mE\eta = \Gamma

where Γ is a constant of motion. Subtraction and re-expression in terms of the Cartesian momenta px and py shows that Γ is equivalent to the LRL vector

\Gamma = p_{y} \left( x p_{y} - y p_{x} \right) - mk\frac{x}{r} = A_{x}

Noether's theorem

The connection between the rotational symmetry described above and the conservation of the LRL vector can be made quantitative by way of Noether's theorem. This theorem, which is used for finding constants of motion, states that any infinitesimal variation of the generalized coordinates of a physical system

\delta q_{i} = \epsilon g_{i}(\mathbf{q}, \mathbf{\dot{q}}, t)

that causes the Lagrangian to vary to first order by a total time derivative

\delta L = \epsilon \frac{d}{dt} G(\mathbf{q}, t)

corresponds to a conserved quantity Γ

\Gamma = -G + \sum_{i} g_{i} \left( \frac{\partial L}{\partial \dot{q}_{i}}\right)

In particular, the conserved LRL vector component As corresponds to the variation in the coordinates[39]

\delta x_{i} = \frac{\epsilon}{2} \left[ 2 p_{i} x_{s} - x_{i} p_{s} - \delta_{is} \left( \mathbf{r} \cdot \mathbf{p} \right) \right]

where i equals 1, 2 and 3, with xi and pi being the ith components of the position and momentum vectors r and p, respectively; as usual, δis represents the Kronecker delta. The resulting first-order change in the Lagrangian is

\delta L = \epsilon mk\frac{d}{dt} \left( \frac{x_{s}}{r} \right)

Substitution into the general formula for the conserved quantity Γ yields the conserved component As of the LRL vector,

A_{s} = \left[ p^{2} x_{s} - p_{s} \ \left(\mathbf{r} \cdot \mathbf{p}\right) \right] - mk \left( \frac{x_{s}}{r} \right) = \left[ \mathbf{p} \times \left( \mathbf{r} \times \mathbf{p} \right) \right]_{s} - mk \left( \frac{x_{s}}{r} \right)

Lie transformation

Figure 9: The Lie transformation from which the conservation of the LRL vector A is derived. As the scaling parameter λ varies, the energy and angular momentum changes, but the eccentricity e and the magnitude and direction of A do not.

The Noether theorem derivation of the conservation of the LRL vector A is elegant, but has one drawback: the coordinate variation δxi involves not only the position r, but also the momentum p or, equivalently, the velocity v.[40] This drawback may be eliminated by instead deriving the conservation of A using an approach pioneered by Sophus Lie.[41][42] Specifically, one may define a Lie transformation[31] in which the coordinates r and the time t are scaled by different powers of a parameter λ (Figure 9),

t \rightarrow \lambda^{3}t , \qquad \mathbf{r} \rightarrow \lambda^{2}\mathbf{r} , \qquad\mathbf{p} \rightarrow \frac{1}{\lambda}\mathbf{p}~.

This transformation changes the total angular momentum L and energy E,

L \rightarrow \lambda L, \qquad E \rightarrow \frac{1}{\lambda^{2}} E ~,

but preserves their product EL2. Therefore, the eccentricity e and the magnitude A are preserved, as may be seen from the 2Aequation for

A^2 = m^2 k^2 e^{2} = m^2 k^2 + 2 m E L^2

The direction of A is preserved as well, since the semiaxes are not altered by a global scaling. This transformation also preserves Kepler's third law, namely, that the semiaxis a and the period T form a constant T2/a3.

Alternative scalings, symbols and formulations

Unlike the momentum and angular momentum vectors p and L, there is no universally accepted definition of the Laplace–Runge–Lenz vector; several different scaling factors and symbols are used in the scientific literature. The most common definition is given above, but another common alternative is to divide by the constant mk to obtain a dimensionless conserved eccentricity vector

\mathbf{e} = \frac{1}{mk} \left(\mathbf{p} \times \mathbf{L} \right) - \mathbf{\hat{r}} = \frac{m}{k} \left(\mathbf{v} \times \left( \mathbf{r} \times \mathbf{v} \right) \right) - \mathbf{\hat{r}}

where v is the velocity vector. This scaled vector e has the same direction as A and its magnitude equals the eccentricity of the orbit. Other scaled versions are also possible, e.g., by dividing A by m alone

\mathbf{M} = \mathbf{v} \times \mathbf{L} - k\mathbf{\hat{r}}

or by p0

\mathbf{D} = \frac{\mathbf{A}}{p_{0}} = \frac{1}{\sqrt{2m\left| E \right|}} \left\{ \mathbf{p} \times \mathbf{L} - m k \mathbf{\hat{r}} \right\}

which has the same units as the angular momentum vector L. In rare cases, the sign of the LRL vector may be reversed, i.e., scaled by −1. Other common symbols for the LRL vector include a, R, F, J and V. However, the choice of scaling and symbol for the LRL vector do not affect its conservation.

Figure 4: The angular momentum vector L, the LRL vector A and Hamilton's vector, the binormal B, are mutually perpendicular; A and B point along the major and minor axes, respectively, of an elliptical orbit of the Kepler problem.

An alternative conserved vector is the binormal vector B studied by William Rowan Hamilton[8]

\mathbf{B} = \mathbf{p} - \left(\frac{mk}{L^{2}r} \right) \ \left( \mathbf{L} \times \mathbf{r} \right)

which is conserved and points along the minor semiaxis of the ellipse; the LRL vector A = B × L is the cross product of B and L (Figure 4).

The vector B is denoted as "binormal" since it is perpendicular to both A and L. Similar to the LRL vector itself, the binormal vector can be defined with different scalings and symbols.

The two conserved vectors, A and B can be combined to form a conserved dyadic tensor W,[9]

\mathbf{W} = \alpha \mathbf{A} \otimes \mathbf{A} + \beta \, \mathbf{B} \otimes \mathbf{B} ~.

where α and β are arbitrary scaling constants and \otimes represents the tensor product (which is not related to the vector cross product, despite their similar symbol). Written in explicit components, this equation reads

W_{ij} = \alpha A_{i} A_{j} + \beta B_{i} B_{j} \, .

Being perpendicular to each another, the vectors A and B can be viewed as the principal axes of the conserved tensor W, i.e., its scaled eigenvectors. W is perpendicular to L

\mathbf{L} \cdot \mathbf{W} = \alpha \left( \mathbf{L} \cdot \mathbf{A} \right) \mathbf{A} + \beta \left( \mathbf{L} \cdot \mathbf{B} \right) \mathbf{B} = 0 ~,

since A and B are both perpendicular to L as well, LA = LB = 0. For clarification, this equation reads, in explicit components,

\left( \mathbf{L} \cdot \mathbf{W} \right)_{j} = \alpha \left( \sum_{i=1}^{3} L_{i} A_{i} \right) A_{j} + \beta \left( \sum_{i=1}^{3} L_{i} B_{i} \right) B_{j} = 0 ~.

References

1. ^ a b c d e f g
2. ^
3. ^ a b c
4. ^ a b c
5. ^ a b c d e
6. ^ a b c
7. ^ a b c
8. ^ a b c
9. ^ a b c d e
10. ^ a b
11. ^
12. ^
13. ^
14. ^
15. ^
16. ^
17. ^
18. ^
19. ^ a b c
20. ^
21. ^
22. ^
23. ^ a b
24. ^
25. ^
26. ^
27. ^
28. ^ a b c
29. ^
30. ^
31. ^ a b
32. ^ a b
33. ^
34. ^
35. ^
36. ^
37. ^
38. ^
39. ^
40. ^
41. ^
42. ^