spotlights

Conference Calendar
High Scientific Council

ENS High Scientific Council
Nuclear Issues

ENS YG Reporter

ENS YG Reporter
Sign up as a Young Generation Reporter

RRFM 2010

RRFM 2010
21-25 March 2010 in Marrakech, Morocco

ENC 2010

ENC 2010
30 May-2 June 2010 in Barcelona, Spain

e-news
Glossary

2005 is physics year

2005 marks the centenary of the publication by Albert Einstein of what became one of the most celebrated texts in the history of science: "Zur elektrodynamik bewegter Körper" (On the electrodynamics of moving bodies). This 30-page paper, published in 1905 in the "Annalen der Physik", lays the foundations of the theory of special relativity. It was one among several other papers submitted the same year, making 1905 Einstein's annus mirabilis. Amongst the other papers published by Einstein the same year is a short article titled "Ist die Trägheit eines Körpers von seinem Energieeinhalt abhängig?" (Is the inertia of a body dependent on its energy content?). In this paper is derived a relation that has become familiar to everybody under the form E = mc2. It asserts the equivalence of energy and matter and, in particular, states that matter can be transformed into energy. This possibility found its first application forty years later with the fission of heavy nuclei, unfortunately in an atom bomb. The very forces that had been unleashed for destruction however could also be harnessed for peaceful uses. This was commemorated in 2003, including by this web site, with the fiftieth anniversary of the "Atoms for Peace" initiative launched by President Eisenhower. The objective was to put the same equation to work in controlled fission chain reactions to generate electricity. In the coming decades of this century, the same equation might once again preside over the success of fusion and so provide an inexhaustible source of energy for mankind1.

While concentrating here on Einstein’s famous papers, we think it fair to also acknowledge the contributions of several other distinguished scientists such as Hendrik Lorentz, Jules-Henri Poincaré and Max Planck. They greatly contributed to the extraordinary development of the physical sciences, including relativity theory, at that time. (This observation should not be construed however as casting doubt on the value of Einstein’s overall contribution to science; click here for more on this topic).

To participate in this year's celebrations and for the reasons explained above, the European Nuclear Society wishes to focus on

,

the most famous equation of all times. It has become part of everybody's culture, but few remember what is behind it. We have therefore gathered here three proofs for the benefit of those who would like to fill this gap. The arguments have been kept as short as possible. They assume however that the reader is acquainted with the principles of classical mechanics and the basics of special relativity.

The standard derivation

The derivation of E = mc2 usually found in modern textbooks is based on the following formula, obtained when applying the theory of special relativity to the dynamics of a particle:

where

m0 is the mass of the particle when it is at rest

 

c is the speed of light

 

v is the speed of the particle

 

E is the total energy of the particle at speed v.

One readily observes that when the speed of the particle is set to zero, its energy does not vanish. It takes a rest value E0 that is precisely equal to m0c2. The difference E – E0 is equal to

(1)

which reduces to for values of v that are very small compared to c. Since is the kinetic energy K (in the classical sense of the term) of a particle of mass m0 moving at speed v , one can write:

E = E0 + K

The total energy E of the particle is the sum of its rest energy E0 and of its kinetic energy K. E0 so emerges as the energy the particle possesses simply as a result of having a rest mass m0.

Einstein's original derivation

The above derivation is not the one initially presented by Albert Einstein. His original derivation, as outlined in his 1905 paper, is summarised below.
In the paper titled "Is the inertia of a body dependent on its energy content?", Albert Einstein takes as starting point a formula established in his main paper on special relativity (On the electrodynamics of moving bodies).

Let E be the energy of a system of plane light waves measured in a co-ordinate system in which the light source is at rest. The light rays are emitted in a direction making an anglewith the x-axis. The said formula gives the energy E* of the same light

source when measured in a co-ordinate system moving at uniform speed v along the x-axis of the "rest" co-ordinate system:

(The notations have been adapted: in particular, in the original paper, the symbol for the speed of light was V)
A. Einstein then proceeds with the following thought experiment. He considers a body characterised by a quantity of energy E0 in the rest co-ordinate system and by a quantity of energy H0in the moving co-ordinate system. This body starts now emitting a plane light wave of energy L/2 in a direction making an angle with the x-axis and simultaneously another light wave of equal energy in the opposite direction. This body remains at rest in the rest co-ordinate system. Albert Einstein calls the energy of the body after the light emission respectively E1 and H1. The principle of relativity stipulates that the laws of physics must be the same in both co-ordinate systems since ones operates a uniform translation with respect to the other. One can therefore write:

By subtracting these two equalities, one obtains:

(2)

Einstein then notes that the quantities E and H represent the energy of the same body expressed in two co-ordinate systems in relative motion with respect to each other, one of these being the co-ordinate system in which the body is at rest. The difference H – E can therefore differentiate itself from the kinetic energy of the body (with respect to the co-ordinate system in which it is moving) only by an additive constant C. This constant depends only on the arbitrary additive constants used to define E and H. One can therefore write:

which then leads to rewriting (2) as

The expression between brackets, already encountered above in equation (1), reduces to when v is small with respect to c. The said expression becomes in such case:

From which it follows immediately that, should a body emit a quantity of energy L in the form of radiation, its mass will be reduced by a quantity L/c2. Einstein concludes his note by stating that the actual type of energy emitted is unimportant and that his theory could be tested by measuring the change of mass of substances for which the mass-energy conversion rate is high, e.g. radium salts.

Note of the editor: Einstein's famous equation does not appear explicitly in his paper. Furthermore, if it had been made explicit, the context of its derivation would have naturally led the author to write it under the form

Such formulation is preferable to the one under which it entered history. Contrary to the latter, the former shows clearly that

  • only variations of mass and energy are to be considered;

  • it is the finiteness of the speed of light c that is responsible for the equivalence between mass and energy. Should c be infinite, as assumed in Newtonian dynamics, any change in energy would result in a zero change of mass. The formula E = mc2 is not as clear in this respect, since it could lead one to think that E would become infinite for an infinite c.


A non relativistic derivation by A. Einstein

That E = mc2 can be proved without having recourse to the theory of the relativity is perhaps not so well known. Albert Einstein did provide such a derivation based on the fact that radiation exerts a pressure. The simplest way of demonstrating this fact would be to use the relation linking the energy E of a particle of mass m0 to its momentum p = mv:

(3)

It would then suffice to note that the mass of light being equal to zero, the above equation yields p = E/c when m0 is set to zero, which is the relation we need as starting point. The momentum p is here the momentum transferred to an absorbing surface by a short flash of light; it is equal to E/c, where E is the energy of the light flash. As noted by physicist Max Born, another Nobel Prize winner, "Exactly the same pressure is experienced by a body which emits light, just as a gun experience a recoil when a shot is fired".

But this derivation of p = E/c will not do in the present context! Equation (3) also yields E = m0c2 when one sets p to zero, as would be the case of a particle at rest. Obviously, we cannot use as starting point a relation that already implies what we want to demonstrate. This is why it is absolutely indispensable to use in the present case the demonstration based on Maxwell's electromagnetics theory of the electromagnetic field. It must be added that this demonstration was confirmed experimentally as early as 1890 (see Max Born for further details – reference given below).

We reproduce now the announced derivation of E = m0c2, as recounted by Max Born in his book Einstein’s Theory of Relativity (Dover Publications, Inc., New York, 1962). The text below is found in Chapter VI: Einstein’s Special Principle of Relativity, Section 8: The Inertia of Energy (pages 283-286).

[Let us] "imagine a long tube at whose ends are two bodies A and B which are exactly equal and are composed of the same material and which, according to ordinary ideas, have the same mass (Fig. 1). But the body A is to have an excess of energy E over that of B, say in the form of heat, and there is to be an arrangement (concave mirror or something similar) by which this energy E can be sent in the form of radiation to B. Let the spatial extent of this flash of light be small compared with the length l of the tube (Fig. 1).

Then A experiences the recoil E/c. If this were transferred to the whole tube of mass M this would acquire a velocity v given by the momentum equation

Mv = E/c.

Fig. 1 A tube with two equal bodies, A and B, at its ends. A carries an energy E which is sent from A to B in the form of a light flash with velocity c; the recoil produces a velocity v of the tube. When E is absorbed by B, the tube is at rest again, but displaced by a distance x.

Now the transfer of momentum to the tube does not happen instantaneously; for if the tube were rigid the forces would propagate with a velocity larger than that of light. In fact, the propagation of the recoil through the tube from A to B is due to the elastic forces in the wall of the tube which are much slower than light. One has therefore to regard the process as consisting of two separate parts: (1) the emission from A, and (2) the absorption at B, and then to consider their effect on the tube, independent of one another, at a moment of time so late that not only has the elastic movement excited by the impacts expanded over the whole tube but also all elastic vibrations have died out and only the displacements of the whole tube are left over. In order to obtain the total effect one has to add the two displacements due to the impacts at A and B because elastic waves (of small amplitude) superpose undistortedly.

1) The recoil at A transfers a movement to the tube in such a way that the late instant t1 when all vibrations have disappeared its velocity is v and its displacement
x1 = vt1

2) When the light is absorbed at B the tube receives a movement of which at the instant t1 only a resultant velocity in the opposite direction –v, is left over; the corresponding displacement is
x2 = -v(t1 -t),

if t is the time light needs to travel from A to B; for the impact on B happens the time interval t later. The sum of the two displacements is
x = x1 + x2 = vt,

the same as if the tube were rigid2. If we substitute here and we obtain for the displacement of the tube

.

Now the bodies A and B may be exchanged (this may be done without using external influences). Let us suppose that two men are situated in the tube, who put A in the place of B, and B in the place of A, and then themselves return to the original positions. According to ordinary mechanics the tube as a whole must suffer no displacement, for changes of position can be effected only by external forces.

If this exchange were to be carried out, everything in the interior of the tube would be as at the beginning, the energy E would again be at the same place as before, and the distribution of mass would be exactly the same. But the whole tube would be displaced a distance x with respect to its initial position by the light impulse. This, of course, contradicts all the fundamental canons of mechanics. We could repeat the process and thus impart any arbitrary change of position to the system without applying external forces. This is, however, an impossibility. The only escape from the difficulty is to assume that when the bodies A and B are exchanged, these two bodies are not mechanically equivalent but that B has a mass greater by m than A in consequence of its excess of energy E. Then the symmetry during the exchange is not maintained, and the mass m is displaced from right to left by a distance l. At the same time the whole tube is displaced a distance x in the reverse direction. This distance is determined by the circumstance that the process occurs without the intervention of external influences. The total momentum, consisting of that of the tube M x/t and that of the transported mass –m l/t, is thus zero. Then

Mx - ml = 0,

from which if follows that

x = ml / M.

Now this displacement must exactly counterbalance that produced by the light impulse, hence we must have

.

This allow us to calculate m and we get

.

This is the amount of inertial mass that must be ascribed to the energy E in order that the principle of mechanics which states that no changes of position can occur without the action of external forces remains valid.

Since every form of energy is finally transformable into radiation by some process or other, this law must be universally valid. Thus we have a great unification in our knowledge of the material world: Matter is the widest meaning of the word (including light and other forms of pure energy, in the language of classical physics) has two fundamental qualities: inertia, measured by its mass, and the capability of performing work, measured by its energy. These two are strictly proportional to one another. Wherever electric and magnetic fields or other effects lead to intense accumulations of energy, they are accompanied by inertia. Electrons and atoms are examples of enormous concentrations of energy." (end of quotation)

Derivation of p = E/c based on Maxwell’s theory – The pressure of radiation

Let us consider a particle of charge q, initially at rest, submitted to the electromagnetic field of travelling plane waves moving along the z-axis. These waves are characterised by an electric field E aligned with the x-axis and a magnetic field B aligned with the y-axis. Furthermore, it can be shown that, for plane waves and in the MKSA system of units,

|E| = c|B| (a).

The said particle will be subjected to the Lorentz force

If we call ex, ey, ez the unit vectors along the three axes of reference, we have:

E = Exex, B = Byey, v = vxex + vyey + vzez and (a) becomes

Ex = cBy

while the expression of the Lorentz force becomes:


F = q(Exex + vxByez vzByex).

Let us now compute the average <F> of force F over one cycle. Since the field E is a sine function of time, its average over one cycle is equal to zero. Now we note that during the first cycles, the particle has not had time to gather much speed in the z direction. We can therefore assume that vz is almost constant, in which case the average value of the third term is proportional to the average of By. But By is also a sine function of time and its average over one cycle is also equal to zero. The average of force F is therefore equal to the average of the second term in the expression (b). Furthermore, according to Newton's third law, F is equal to dp/dt the rate of change of momentum of the body on which F is applied. One can therefore write:

<dp/dt> = q<vxBy>ez

We now compute the average <dW/dt> of the work W done unit of time by the wave on the particle. Since dW/dt = F.v, we have which reduces to dW/dt = qvxEx and, since Ex = c By, we also have

<dW/dt> = qc<vxBy>


If we compare the last two formulas, we see immediately that

<dp/dt> = ez<dW/dt>/c

This last expression indicates that, in a period of time during which the plane waves impart a quantity of energy W to the particle, they also impart it a increase of momentum p equal to W/c.

In defence of Einstein

This year’s celebrations have unfortunately given rise to a peripheral controversy regarding Albert Einstein’s actual merit as a scientist. It is a regrettable feature of our times that there will always some people who find it appropriate to try and spoil celebrations and make us believe that established values are unfounded. Building on the fact that Einstein did not give any credit to his predecessors in the above-mentioned 1905 papers, they have thought it fit to push the argument further. They have painted Einstein as a pla-gia-rist3 who was not the first to propose the famous mass energy relationship, who derived it incorrectly and even who was not the actual author of the theory of general relativity.

It is true that Einstein did not quote any reference in his 1905 papers. But even a cursory review of his scientific career makes it abundantly clear that he made first class contributions to physics. The anteriority question was taken due care of when he was awarded the Nobel Prize in Physics 1921 for “his services to Theoretical Physics, and especially for his discovery of the law of the photoelectric effect" and not, as one would have expected, for the relativity theory. Albert Einstein was a respected member of the scientific community from 1905 until his death in 1955. It is essential to observe at this stage that those lambasting Einstein today have no more supporting evidence for doing so than was available 50 or 80 years ago. If anything, there is less evidence available today since all the protagonists have long passed away. Unless one assumes that it is possible to fool everybody during fifty years, one must come to the logical conclusion that Albert Einstein’ recognition by his peers was not unfounded. Only speculation of dubious scientific value coming a century after the facts can contradict the first-hand experience of Einstein’s contemporaries. To make this last statement clear, one need only observe that it would be equally easy to make a similar case against Isaac Newton:

  • he did not find out the inverse square relationship governing attraction force (it was his colleague Robert Hooke, or at least the latter claimed he did),

  • one can claim that he was not the first to invent differential calculus and give Leibniz priority in this field,

  • one could claim that he was not entitled to carry out the operations relating to his “fluxion” calculus (bishop Berkeley actually did object and the matter was not resolved until the middle of the 19th century thanks to Weierstrass) and, to cap it all,

  • he was solitary, suspicious and bad tempered.

This single example should make it clear that the type of malicious criticism this year’s celebrations have unfortunately prompted reflects more on the mediocrity of their authors than on the individuals they are trying to disparage. Furthermore, they insult Einstein’s contemporaries by implying that they were too stupid to understand what was going on although they had access to first-hand evidence.

1) For further details on fission and fusion as energy generating reactions, see “Binding energy” in the Glossary of nuclear terms on this web site.

2) Einstein’s first derivation (1905) supposed the tube to be rigid. Later (1907) he himself criticized the concept of a rigid body in the theory of relativity. Our [Max Born's] derivation is a simplified version of a consideration by E. Feenberg.

3) hyphenation added to prevent search engines from this listing this web page on searches associating Albert Einstein with the p-word

Reference: Berkeley physics course – volume 3, chapter 7, pp 362-364 (the derivation is provided in CGS units).