"Reflections
of this type made it clear to me as long ago as shortly after 1900, i.e.,
shortly after Planck's trailblazing work, that neither mechanics nor
electrodynamics could (except in limiting cases) claim exact validity. Gradually
I despaired of the possibility of discovering the true laws by means of
constructive efforts based on known facts. The longer and the more desperately I
tried, the more I came to the conviction that only the discovery of a universal
formal principle could lead us to assured results... How, then, could such a
universal principle be found?"
Einstein,
A.
Special relativity (SR) (also known as the special theory of
relativity or STR) is the physical theory of
measurement in inertial frames of reference
proposed in 1905 by Albert Einstein (after the considerable and
independent contributions of Hendrik Lorentz, Henri Poincaré and others) in the paper "On the Electrodynamics of
Moving Bodies". It
generalizes Galileo's principle of relativity—that all
uniform motion is relative, and
that there is no absolute and well-defined state of rest (no privileged reference
frames)—from mechanics to all
the laws of physics, including both the laws of
mechanics and of electrodynamics, whatever they may be.
Special
relativity incorporates the principle that the speed of light is the same for all inertial observers regardless of the state
of motion of the source.
This theory has a wide range of consequences which have been experimentally
verified, including
counter-intuitive ones such as length contraction, time dilation and relativity of simultaneity,
contradicting the classical notion that the duration of the time interval
between two events is equal for all observers. (On the other hand, it introduces
the space-time interval, which is
invariant.) Combined with other laws of physics, the two postulates of special
relativity predict the equivalence of matter and energy, as expressed in the mass-energy
equivalence formula E = mc2, where c is the
speed of light in a
vacuum. The
predictions of special relativity agree well with Newtonian mechanics in their
common realm of applicability, specifically in experiments in which all
velocities are small compared with the speed of light. Special relativity
reveals that c is not just the velocity of a certain phenomenon—namely
the propagation of electromagnetic radiation (light)—but
rather a fundamental feature of the way space and time are unified as spacetime. One of the consequences of
the theory is that it is impossible for any particle that has rest mass to be
accelerated to the speed of light.
The theory is termed "special" because it applies the principle
of relativity only to inertial reference frames, i.e.
frames of reference in uniform relative motion with respect to each other.Einstein developed
general
relativity to apply the principle more generally, that is, to any frame so
as to handle general coordinate transformations, and that
theory includes the effects of gravity. From the theory of general relativity it
follows that special relativity will still apply locally (i.e., to first
order), and hence to any
relativistic situation where gravity is not a significant factor. Inertial
frames should be identified with non-rotating Cartesian coordinate systems
constructed around any free
falling trajectory as a
time axis.
The derivation of special relativity depends not only on these two explicit
postulates, but also on several tacit assumptions (made in almost all theories of
physics), including the isotropy
and homogeneity of space and the independence
of measuring rods and clocks from their past history.
Following Einstein's original presentation of special relativity in 1905,
many different sets of postulates have been proposed in various alternative
derivations.However, the most
common set of postulates remains those employed by Einstein in his original
paper. A more mathematical statement of the Principle of Relativity made later
by Einstein, which introduces the concept of simplicity not mentioned above
is:
Henri Poincaré
provided the mathematical framework for relativity theory by proving that Lorentz transformations are a subset of
his Poincaré
group of symmetry transformations. Einstein later derived these
transformations from his axioms.
Many of Einstein's papers present derivations of the Lorentz transformation
based upon these two principles.
Einstein consistently based the derivation of Lorentz invariance (the
essential core of special relativity) on just the two basic principles of
relativity and light-speed invariance. He wrote:
Thus many modern treatments of special relativity base it on the single
postulate of universal Lorentz covariance, or, equivalently, on the single
postulate of Minkowski spacetime.
From the principle of relativity alone without assuming the constancy of the
speed of light, i.e. using the isotropy of space and the symmetry implied by the
principle of special relativity, one can show that the space-time
transformations between inertial frames are either Euclidean, Galilean, or
Lorentzian. In the Lorentzian case, one can then obtain relativistic interval
conservation and a certain finite limiting speed. Experiments suggest that this
speed is the speed of light in vacuum.
The constancy of the speed of light was motivated by Maxwell's theory of
electromagnetism and the lack of evidence for the luminiferous
ether but not, contrary to widespread belief, the null result of the Michelson–Morley
experiment.
However the null
result of the Michelson–Morley experiment helped the notion of the constancy of
the speed of light gain widespread and rapid acceptance.
In addition to the papers referenced above—which give derivations of the
Lorentz transformation and describe the foundations of special
relativity—Einstein also wrote at least four papers giving heuristic arguments
for the equivalence (and transmutability) of mass and energy, for E =
mc2.
Mass-energy equivalence is a consequence of special relativity. The energy
and momentum, which are separate in Newtonian mechanics, form a four-vector in relativity, and
this relates the time component (the energy) to the space components (the
momentum) in a nontrivial way. For an object at rest, the energy-momentum
four-vector is (E, 0, 0, 0): it has a time component which is the energy,
and three space components which are zero. By changing frames with a Lorentz
transformation in the x direction with a small value of the velocity v, the
energy momentum four-vector becomes (E, Ev/c2,
0, 0). The momentum is equal to the energy divided by c2 times
the velocity. So the newtonian mass of an object, which is the ratio of the
momentum to the velocity for slow velocities, is equal to
E/c2.
The energy and momentum are properties of matter, and it is impossible to
deduce that they form a four-vector just from the two basic postulates of
special relativity by themselves, because these don't talk about matter, they
only talk about space and time. The derivation therefore requires some
additional physical reasoning. In his 1905 paper, Einstein used the additional
principles that Newtonian mechanics should hold for slow velocities, so that
there is one energy scalar and one three-vector momentum at slow velocities, and
that the conservation law for energy and momentum is exactly true in relativity.
Furthermore, he assumed that the energy/momentum of light transforms like the
energy/momentum of massless particles, which was known to be true from Maxwell's
equations. The first
of Einstein's papers on this subject was "Does the Inertia of a Body Depend upon
its Energy Content?" in 1905. Although
Einstein's argument in this paper is nearly universally accepted by physicists
as correct, even self-evident, many authors over the years have suggested that
it is wrong. Other
authors suggest that the argument was merely inconclusive because it relied on
some implicit assumptions.
Einstein acknowledged the controversy over his derivation in his 1907 survey
paper on special relativity. There he notes that it is problematic to rely on
Maxwell's equations for the heuristic mass-energy argument. The argument in his
1905 paper can be carried out with the emission of any massless particles, but
the Maxwell equations are implicitly used to make it obvious that the emission
of light in particular can be achieved only by doing work. To emit
electromagnetic waves, all you have to do is shake a charged particle, and this
is clearly doing work, so that the emission is of energy.
The principle of relativity, which states
that there is no preferred inertial reference frame, dates back
to Galileo, and was
incorporated into Newtonian Physics. However, in the late 19th
century, the existence of electromagnetic waves led physicists
to suggest that the universe was filled with a substance known as "aether", which
would act as the medium through which these waves, or vibrations travelled. The
aether was thought to constitute an absolute reference frame against which
speeds could be measured, and could be considered fixed and motionless. Aether
supposedly had some wonderful properties: it was sufficiently elastic that it
could support electromagnetic waves, and those waves could interact with matter,
yet it offered no resistance to bodies passing through it. The results of
various experiments, including the Michelson-Morley experiment,
indicated that the Earth was always 'stationary' relative to the
aether–something that was difficult to explain, since the Earth is in orbit
around the Sun. Einstein's solution was to discard the notion of an aether and
an absolute state of rest. Special relativity is formulated so as to not assume
that any particular frame of reference is special; rather, in relativity, any
reference frame moving with uniform motion will observe the same laws of
physics. In particular, the speed of light in a vacuum is always measured to be
c, even when measured by multiple systems that are moving at different
(but constant) velocities.
Einstein has said that all of the consequences of special relativity can be
derived from examination of the Lorentz
transformations.
These transformations, and hence special relativity, lead to different
physical predictions than Newtonian mechanics when relative velocities become
comparable to the speed of light. The speed of light is so much larger than
anything humans encounter that some of the effects predicted by relativity are
initially counter-intuitive:
- Time dilation
– the time lapse between two events is not invariant from one observer to
another, but is dependent on the relative speeds of the observers' reference
frames (e.g., the twin
paradox which concerns a twin who flies off in a spaceship traveling near
the speed of light and returns to discover that his or her twin sibling has aged
much more).
- Relativity of simultaneity – two
events happening in two different locations that occur simultaneously in the
reference frame of one inertial observer, may occur non-simultaneously in the
reference frame of another inertial observer (lack of absolute
simultaneity).
- Lorentz contraction – the dimensions
(e.g., length) of an object as measured by one observer may be smaller than the
results of measurements of the same object made by another observer (e.g., the
ladder paradox
involves a long ladder traveling near the speed of light and being contained
within a smaller garage).
- Composition of velocities –
velocities (and speeds) do not simply 'add', for example if a rocket is moving
at 2⁄3 the speed of
light relative to an observer, and the rocket fires a missile at 2⁄3 of the speed of
light relative to the rocket, the missile does not exceed the speed of light
relative to the observer. (In this example, the observer would see the missile
travel with a speed of 12⁄13 the speed of
light.)
- Inertia and momentum – as
an object's speed approaches the speed of light from an observer's point of
view, its mass appears to increase thereby making it more and more difficult to
accelerate it from within the observer's frame of reference.
- Equivalence of mass and energy, E = mc2 –
The energy content of an object at rest with mass m equals
mc2. Conservation of energy implies that in any reaction a
decrease of the sum of the masses of particles must be accompanied by an
increase in kinetic energies of the particles after the reaction. Similarly, the
mass of an object can be increased by taking in kinetic energies.
Relativity theory depends on "reference frames". The term reference frame
as used here is an observational perspective in space at rest, or in uniform
motion, from which a position can be measured along 3 spatial axes. In addition,
a reference frame has the ability to determine measurements of the time of
events using a 'clock' (any reference device with uniform periodicity).
An event is an occurrence that can be assigned a single unique time and
location in space relative to a reference frame: it is a "point" in space-time. Since the speed of light
is constant in relativity in each and every reference frame, pulses of light can
be used to unambiguously measure distances and refer back the times that events
occurred to the clock, even though light takes time to reach the clock after the
event has transpired.
For example, the explosion of a firecracker may be considered to be an
"event". We can completely specify an event by its four space-time coordinates:
The time of occurrence and its 3-dimensional spatial location define a reference
point. Let's call this reference frame S.
In relativity theory we often want to calculate the position of a point from
a different reference point.
Suppose we have a second reference frame S′, whose spatial axes and
clock exactly coincide with that of S at time zero, but it is moving at a
constant velocity v with respect to S along the x-axis.
Since there is no absolute reference frame in relativity theory, a concept of
'moving' doesn't strictly exist, as everything is always moving with respect to
some other reference frame. Instead, any two frames that move at the same speed
in the same direction are said to be comoving. Therefore S and
S′ are not comoving.
Let's define the event to have space-time coordinates
(t,x,y,z) in system S and
(t′,x′,y′,z′) in S′. Then the Lorentz
transformation specifies that these coordinates are related in the following
way:
-
where
-
is the Lorentz
factor and c is the speed of light in a vacuum.
The y and z coordinates are unaffected; only the x and
t axes transformed. These Lorentz transformations form a one-parameter
group of linear mappings, that parameter being called rapidity.
A quantity invariant under Lorentz
transformations is known as a Lorentz scalar.
The Lorentz transformation given above is for the particular case in which
the velocity v of S′ with respect to S is parallel to the
x-axis. We now give the Lorentz transformation in the general case.
Suppose the velocity of S′ with respect to S is v. Denote
the space-time coordinates of an event in S by (t,r)
(instead of (t,x,y,z)). Then the coordinates
(t′,r′) of this event in S′ are given by:
-
where vT denotes the transpose of v, α(v) = 1/γ(v),
and P(v) denotes the projection onto the direction of
v.
From the first equation of the Lorentz transformation in terms of coordinate
differences
-
it is clear that two events that are simultaneous in frame S
(satisfying Δt = 0), are not
necessarily simultaneous in another inertial frame S′ (satisfying Δt′ = 0). Only if these events are
colocal in frame S (satisfying Δx = 0), will they be simultaneous in
another frame S′.
Writing the Lorentz transformation and its inverse in terms of coordinate
differences we get
-
and
-
Suppose we have a clock at rest in the
unprimed system S. Two consecutive ticks of this clock are then characterized by
Δx = 0. If we want to know the relation
between the times between these ticks as measured in both systems, we can use
the first equation and find:
-
for events satisfying
This shows that the time Δt' between the
two ticks as seen in the 'moving' frame S' is larger than the time Δt between these ticks as measured in the rest frame
of the clock. This phenomenon is called time dilation. Time dilation explains a number of
physical phenomena; for example, the decay rate of muons produced by cosmic rays impinging on the Earth's
atmosphere.
Similarly, suppose we have a measuring rod at rest in the unprimed system. In
this system, the length of this rod is written as Δx. If we want to find the length of this rod as
measured in the 'moving' system S', we must make sure to measure the distances
x' to the end points of the rod simultaneously
in the primed frame S'. In other words, the measurement is characterized by
Δt' = 0, which we can combine with the
fourth equation to find the relation between the lengths Δx and Δx':
-
for events satisfying
This shows that the length Δx' of the rod
as measured in the 'moving' frame S' is shorter than the length Δx in its own rest frame. This phenomenon is called
length
contraction or Lorentz contraction.
These effects are not merely appearances; they are explicitly related to our
way of measuring time intervals between events which occur at the same
place in a given coordinate system (called "co-local" events). These time
intervals will be different in another coordinate system moving with
respect to the first, unless the events are also simultaneous. Similarly, these
effects also relate to our measured distances between separated but simultaneous
events in a given coordinate system of choice. If these events are not co-local,
but are separated by distance (space), they will not occur at the same
spatial distance from each other when seen from another moving coordinate
system. However, the space-time interval will be the same for
all observers. The underlying reality remains the same. Only our perspective
changes.
In diagram 2 the interval AB is 'time-like'; i.e., there is a frame of
reference in which events A and B occur at the same location in space, separated
only by occurring at different times. If A precedes B in that frame, then A
precedes B in all frames. It is hypothetically possible for matter (or
information) to travel from A to B, so there can be a causal relationship (with
A the cause and B the effect).
The interval AC in the diagram is 'space-like'; i.e., there is a frame
of reference in which events A and C occur simultaneously, separated only in
space. However there are also frames in which A precedes C (as shown) and frames
in which C precedes A. If it were possible for a cause-and-effect relationship
to exist between events A and C, then paradoxes of causality would result. For
example, if A was the cause, and C the effect, then there would be frames of
reference in which the effect preceded the cause. Although this in itself won't
give rise to a paradox, one can show
that faster than light signals can be sent back into one's own past. A causal
paradox can then be constructed by sending the signal if and only if no signal
was received previously.
Therefore, if causality is to
be preserved, one of the consequences of special relativity is that no
information signal or material object can travel faster than light
in a vacuum. However, some things can still move faster than light. For example,
the location where the beam of a search light hits the bottom of a cloud can
move faster than light when the search light is turned rapidly.
Even without considerations of causality, there are other strong reasons why
faster-than-light travel is forbidden by special relativity. For example, if a
constant force is applied to an object for a limitless amount of time, then
integrating F = dp/dt gives a momentum that grows without
bound, but this is simply because
approaches infinity as v
approaches c. To an observer who is not accelerating, it appears as
though the object's inertia is increasing, so as to produce a smaller
acceleration in response to the same force. This behavior is in fact observed in
particle accelerators.
If the observer in S sees an object moving along the x
axis at velocity w, then the observer in the S' system, a
frame of reference moving at velocity v in the x direction
with respect to S, will see the object moving with velocity
w' where
-
This equation can be derived from the space and time transformations
above.
-
Notice that if the object were moving at the speed of light in the
S system (i.e. w = c), then
it would also be moving at the speed of light in the S' system. Also,
if both w and v are small with respect to the speed of
light, we will recover the intuitive Galilean transformation of velocities: .
The usual example given is that of a train (call it system K) travelling due east with a velocity v with respect to the tracks (system K'). A child inside the train throws a baseball due
east with a velocity u with respect to the
train. In classical physics, an observer at rest on the tracks will measure the
velocity of the baseball as v + u.
In special relativity, this is no longer true. Instead, an observer on the
tracks will measure the velocity of the baseball as .
If u and v
are small compared to c, then the above
expression approaches the classical sum v +
u.
In the more general case, the baseball is not necessarily travelling in the
same direction as the train. To obtain the general formula for Einstein velocity
addition, suppose an observer at rest in system K measures the velocity of an object as .
Let K' be an inertial system such that the
relative velocity of K to K' is ,
where
and
are now vectors in R3. An observer
at rest in K' will then measure the velocity
of the object as
-
where
and
are the components of
parallel and perpendicular, respectively, to ,
and .
Einstein velocity addition is commutative only when
and
are parallel. In fact,
-
,where gyr is the mathematical abstraction of Thomas precession
into an operator called Thomas gyration and given by
-
for all w.
The gyr operator forms the foundation of gyrovector spaces.
Einstein's addition of colinear velocites is consistent with the Fizeau experiment
which determined the speed of light in a fluid moving parallel to the light, but
no experiment has ever tested the formula for the general case of non-parallel
velocities.
In addition to modifying notions of space and time, special relativity forces
one to reconsider the concepts of mass, momentum, and energy, all of which are important constructs in Newtonian mechanics. Special relativity
shows, in fact, that these concepts are all different aspects of the same
physical quantity in much the same way that it shows space and time to be
interrelated.
There are a couple of (equivalent) ways to define momentum and energy in SR.
One method uses conservation laws. If these laws are to remain
valid in SR they must be true in every possible reference frame. However, if one
does some simple thought experiments using the Newtonian
definitions of momentum and energy, one sees that these quantities are not
conserved in SR. One can rescue the idea of conservation by making some small
modifications to the definitions to account for relativistic velocities. It is
these new definitions which are taken as the correct ones for momentum and
energy in SR.
The energy and momentum of an object with invariant mass m (also called rest
mass in the case of a single particle), moving with velocity v with respect to a given frame of
reference, are given by
-
respectively, where γ (the Lorentz factor) is given by
-
The quantity γm is often called the relativistic mass of the
object in the given frame of reference, although recently
this concept is falling into disuse, and Lev B.
Okun suggested that "this terminology [...] has no rational justification
today", and should no longer be taught.Other physicists,
including Wolfgang
Rindler and T. R.
Sandin, have argued that relativistic mass is a useful concept and there is
little reason to stop using it. See Mass
in special relativity for more information on this debate. Some authors use
the symbol m to refer to relativistic mass, and the symbol
m0 to refer to rest mass.
The energy and momentum of an object with invariant mass m are related
by the formulas
-
-
The first is referred to as the relativistic energy-momentum equation.
While the energy E and the momentum p depend on the frame of
reference in which they are measured, the quantity E2 −
(pc)2 is invariant, being equal to the squared invariant mass
of the object (up to the multiplicative
constant c4).
It should be noted that the invariant mass of a system
-
is greater than the sum of the rest masses of the particles it is
composed of (unless they are all stationary with respect to the center of mass of the
system, and hence to each other). The sum of rest masses is not even always
conserved in closed
systems, since rest mass may be converted to particles which individually
have no mass, such as photons. Invariant mass, however, is conserved and
invariant for all observers, so long as the system remains closed. This is due
to the fact that even massless particles contribute invariant mass to systems,
as also does the kinetic energy of particles. Thus, even under transformations
of rest mass to photons or kinetic energy, the invariant mass of a system which
contains these energies still reflects the invariant mass associated with
them.
For massless
particles, m is zero. The relativistic energy-momentum equation still
holds, however, and by substituting m with 0, the relation E =
pc is obtained; when substituted into
Ev = c2p, it gives v = c: massless
particles (such as photons) always travel at the speed of light.
A particle which has no rest mass (for example, a photon) can nevertheless
contribute to the total invariant mass of a system, since some or all of its
momentum is cancelled by another particle, causing a contribution to the
system's invariant mass due to the photon's energy. For single photons this does
not happen, since the energy and momentum terms exactly cancel.
Looking at the above formula for invariant mass of a system, one sees that,
when a single massive object is at rest (v = 0, p = 0), there is a
non-zero mass remaining: mrest =
E/c2. The corresponding energy, which is also the total
energy when a single particle is at rest, is referred to as "rest energy". In
systems of particles which are seen from a moving inertial frame, total energy
increases and so does momentum. However, for single particles the rest mass
remains constant, and for systems of particles the invariant mass remain
constant, because in both cases, the energy and momentum increases subtract from
each other, and cancel. Thus, the invariant mass of systems of particles is a
calculated constant for all observers, as is the rest mass of single
particles.
For systems of particles, the energy-momentum equation requires summing the
momentum vectors of the particles:
-
The inertial frame in which the momenta of all particles sums to zero is
called the center of momentum frame. In this
special frame, the relativistic energy-momentum equation has ,
and thus gives the invariant mass of the system as merely the total energy of
all parts of the system, divided by c2
-
This is the invariant mass of any system which is measured in a frame where
it has zero total momentum, such as a bottle of hot gas on a scale. In such a
system, the mass which the scale weighs is the invariant mass, and it depends on
the total energy of the system. It is thus more than the sum of the rest masses
of the molecules, but also includes all the totaled energies in the system as
well. Like energy and momentum, the invariant mass of closed systems cannot be
changed so long as the system is closed (no mass or energy allowed in or out),
because the total relativistic energy of the system remains constant so long as
nothing can enter or leave it.
An increase in the energy of such a system which is caused by translating the
system to an inertial frame which is not the center of momentum frame, causes an
increase in energy and momentum without an increase in invariant mass. E
= mc2, however, applies only to closed systems in their
center-of-momentum frame where momentum sums to zero.
Taking this formula at face value, we see that in relativity, mass is
simply another form of energy. In 1927 Einstein remarked about special
relativity, "Under this theory mass is not an unalterable magnitude, but a
magnitude dependent on (and, indeed, identical with) the amount of energy."
Einstein was not referring to closed (isolated) systems in this remark,
however. For, even in his 1905 paper, which first derived the relationship
between mass and energy, Einstein showed that the energy of an object had to be
increased for its invariant mass (rest mass) to increase. In such cases, the
system is not closed (in Einstein's thought experiment, for example, a mass
gives off two photons, which are lost).
In a closed system (i.e., in the sense of a totally isolated system) the
total energy, the total momentum, and hence the total invariant mass are
conserved. Einstein's formula for change in mass translates to its simplest ΔE =
Δmc2 form, however, only in non-closed systems in which energy is
allowed to escape (for example, as heat and light), and thus invariant mass is
reduced. Einstein's equation shows that such systems must lose mass, in
accordance with the above formula, in proportion to the energy they lose to the
surroundings. Conversely, if one can measure the differences in mass between a
system before it undergoes a reaction which releases heat and light, and the
system after the reaction when heat and light have escaped, one can estimate the
amount of energy which escapes the system. In both nuclear and chemical
reactions, such energy represents the difference in binding energies of
electrons in atoms (for chemistry) or between nucleons in nuclei (in atomic
reactions). In both cases, the mass difference between reactants and (cooled)
products measures the mass of heat and light which will escape the reaction, and
thus (using the equation) give the equivalent energy of heat and light which may
be emitted if the reaction proceeds.
In chemistry, the mass differences associated with the emitted energy are
around one-billionth of the molecular mass. However, in
nuclear reactions the energies are so large that they are associated with mass
differences, which can be estimated in advance, if the products and reactants
have been weighed (atoms can be weighed indirectly by using atomic masses, which
are always the same for each nuclide).
Thus, Einstein's formula becomes important when one has measured the masses of
different atomic nuclei. By looking at the difference in masses, one can predict
which nuclei have stored energy that can be released by certain nuclear reactions,
providing important information which was useful in the development of nuclear
energy and, consequently, the nuclear bomb. Historically, for example, Lise Meitner was able to use
the mass differences in nuclei to estimate that there was enough energy
available to make nuclear fission a favorable process. The implications of this
special form of Einstein's formula have thus made it one of the most famous
equations in all of science.
Because the E = mc2 equation applies only to
isolated systems in their center of momentum frame, it has been
popularly misunderstood to mean that mass may be converted to energy,
after which the mass disappears. However, popular explanations of the
equation as applied to systems include open systems for which heat and light are
allowed to escape, when they otherwise would have contributed to the mass (invariant mass) of the
system.
Historically, confusion about mass being "converted" to energy has been aided
by confusion between mass and "matter",
where matter is defined as fermion
particles. In such a definition, electromagnetic radiation and kinetic energy
(or heat) are not considered "matter." In some situations, matter may indeed be
converted to non-matter forms of energy (see above), but in all these
situations, the matter and non-matter forms of energy still retain their
original mass.
For closed/isolated systems, mass never disappears in the center of momentum
frame, because energy cannot disappear. Instead, this equation, in context,
means only that when any energy is added to, or escapes from, a system in the
center-of-momentum frame, the system will be measured as having gained or lost
mass, in proportion to energy added or removed. Thus, in theory, if an atomic
bomb were placed in a box strong enough to hold its blast, and detonated upon a
scale, the mass of this closed system would not change, and the scale would not
move. Only when a transparent "window" was opened in the super-strong
plasma-filled box, and light and heat were allowed to escape in a beam, and the
bomb components to cool, would the system lose the mass associated with the
energy of the blast. In a 21 kiloton bomb, for example, about a gram of light
and heat is created. If this heat and light were allowed to escape, the remains
of the bomb would lose a gram of mass, as it cooled. In this thought-experiment,
the light and heat carry away the gram of mass, and would therefore deposit this
gram of mass in the objects that absorb them.[35]
In special relativity, Newton's second law does not hold in its form F
= ma, but it does if it is expressed as
-
where p is the momentum as defined above ()
and "m" is the invariant
mass. Thus, the force is given by
-
Carrying out the derivatives gives
-
which, taking into account the identity ,
can also be expressed as
-
If the acceleration is separated into the part
parallel to the velocity and the part perpendicular to it, one gets
-
-
-
-
Consequently in some old texts, γ3m is referred to
as the longitudinal mass, and γm is referred to as the
transverse mass, which is the same as the relativistic mass.
See mass in special relativity.
The Work-energy Theorem says
the change in
kinetic energy is equal to the work done on the body, that is
-
-
If in the initial state the body was at rest (γ0 = 1) and
in the final state it has speed v (γ1 = γ), the
kinetic energy is K = (γ −
1)mc2, a result that can be directly obtained by
subtracting the rest energy mc2 from the total relativistic
energy γmc2.
Here, we see how to write the equations of special relativity in a manifestly
Lorentz
covariant form. The position of an event in spacetime is given by a contravariant four vector whose components
are:
-
where x1 = x and x2 = y and x3 = z as usual. We define x0 = ct so that the time
coordinate has the same dimension of distance as the other spatial dimensions;
in accordance with the general principle that space and time are treated
equally, so far as possible. Superscripts are contravariant indices in this section rather than exponents
except when they indicate a square. Subscripts are covariant indices which also range
from zero to three as with the spacetime gradient of a field φ:
-
Having recognised the four-dimensional nature of spacetime, we are driven to
employ the Minkowski metric, η, given in components (valid in any inertial reference frame) as:
-
which is equal to its reciprocal, ηαβ,
in those frames.
Then we recognize that coordinate transformations between inertial reference
frames are given by the Lorentz transformation tensor Λ. For the special case of motion along the
x-axis, we have:
-
which is simply the matrix of a boost (like a rotation) between the x
and ct coordinates. Where μ' indicates the row and ν indicates the
column. Also, β and γ are defined as:
-
More generally, a transformation from one inertial frame (ignoring
translations for simplicity) to another must satisfy:
-
where there is an implied summation of
and
from 0 to 3 on the right-hand side in accordance with the Einstein summation
convention. The Poincaré group is the most general group of
transformations which preserves the Minkowski metric and
this is the physical symmetry underlying special relativity.
All proper physical quantities are given by tensors. So to transform from one
frame to another, we use the well-known tensor transformation law
-
Where
is the reciprocal matrix of .
To see how this is useful, we transform the position of an event from an
unprimed coordinate system S to a primed system S', we
calculate
-
which is the Lorentz transformation given above. All tensors transform by the
same rule.
The squared length of the differential of the position four-vector
constructed using
-
is an invariant. Being invariant means that it takes the same value in all
inertial frames, because it is a scalar (0 rank tensor), and so no Λ appears in
its trivial transformation. Notice that when the line element
is negative that
is the differential of proper
time, while when
is positive,
is differential of the proper distance.
The primary value of expressing the equations of physics in a tensor form is
that they are then manifestly invariant under the Poincaré group, so that we do
not have to do a special and tedious calculation to check that fact. Also in
constructing such equations we often find that equations previously thought to
be unrelated are, in fact, closely connected being part of the same tensor
equation.
Recognising other physical quantities as tensors also simplifies their
transformation laws. First note that the velocity four-vector Uμ is
given by
-
Recognising this, we can turn the awkward looking law about composition of
velocities into a simple statement about transforming the velocity four-vector
of one particle from one frame to another. Uμ also has an
invariant form:
-
So all velocity four-vectors have a magnitude of c. This is an
expression of the fact that there is no such thing as being at coordinate rest
in relativity: at the least, you are always moving forward through time. The acceleration 4-vector
is given by .
Given this, differentiating the above equation by τ produces
-
So in relativity, the acceleration four-vector and the velocity four-vector
are orthogonal.
The momentum and energy combine into a covariant 4-vector:
-
where m is the invariant mass.
The invariant magnitude of the momentum 4-vector is:
-
We can work out what this invariant is by first arguing that, since it is a
scalar, it doesn't matter which reference frame we calculate it, and then by
transforming to a frame where the total momentum is zero.
-
We see that the rest energy is an independent invariant. A rest energy can be
calculated even for particles and systems in motion, by translating to a frame
in which momentum is zero.
The rest energy is related to the mass according to the celebrated equation
discussed above:
-
Note that the mass of systems measured in their center of momentum frame
(where total momentum is zero) is given by the total energy of the system in
this frame. It may not be equal to the sum of individual system masses measured
in other frames.
To use Newton's third law of motion,
both forces must be defined as the rate of change of momentum with respect to
the same time coordinate. That is, it requires the 3D force defined above.
Unfortunately, there is no tensor in 4D which contains the components of the 3D
force vector among its components.
If a particle is not traveling at c, one can transform the 3D force
from the particle's co-moving reference frame into the observer's reference
frame. This yields a 4-vector called the four-force. It is the rate of change of the above
energy momentum four-vector
with respect to proper time. The covariant version of the four-force is:
-
where
is the proper time.
In the rest frame of the object, the time component of the four force is zero
unless the "invariant
mass" of the object is changing (this requires a non-closed system in which
energy/mass is being directly added or removed from the object) in which case it
is the negative of that rate of change of mass, times c. In general,
though, the components of the four force are not equal to the components of the
three-force, because the three force is defined by the rate of change of
momentum with respect to coordinate time, i.e.
while the four force is defined by the rate of change of momentum with respect
to proper time, i.e. .
In a continuous medium, the 3D density of force combines with the
density of power to form a covariant 4-vector. The spatial part is the
result of dividing the force on a small cell (in 3-space) by the volume of that
cell. The time component is −1/c times the power transferred to that cell
divided by the volume of the cell. This will be used below in the section on
electromagnetism.
Theoretical investigation in classical electromagnetism led to
the discovery of wave propagation. Equations generalizing the electromagnetic
effects found that finite propagation-speed of the E and B fields required
certain behaviors on charged particles. The general study of moving charges
forms the Liénard–Wiechert
potential, which is a step towards special relativity.
The Lorentz transformation of the electric field of a moving charge into a
non-moving observer's reference frame results in the appearance of a
mathematical term commonly called the magnetic field. Conversely, the magnetic
field generated by a moving charge disappears and becomes a purely
electrostatic field in a comoving frame of reference. Maxwell's
equations are thus simply an empirical fit to special relativistic effects
in a classical model of the Universe. As electric and magnetic fields are
reference frame dependent and thus intertwined, one speaks of
electromagnetic fields. Special relativity provides the transformation
rules for how an electromagnetic field in one inertial frame appears in another
inertial frame.
Maxwell's
equations in the 3D form are already consistent with the physical content of
special relativity. But we must rewrite them to make them manifestly
invariant.
The charge density
and current density
are unified into the current-charge 4-vector:
-
The law of charge conservation, ,
becomes:
-
The electric field
and the magnetic
induction
are now unified into the (rank 2 antisymmetric covariant) electromagnetic field tensor:
-
The density, ,
of the Lorentz force,
,
exerted on matter by the electromagnetic field becomes:
-
Faraday's law of induction, ,
and Gauss's law for magnetism, ,
combine to form:
-
Although there appear to be 64 equations here, it actually reduces to just
four independent equations. Using the antisymmetry of the electromagnetic field
one can either reduce to an identity (0=0) or render redundant all the equations
except for those with λ,μ,ν = either 1,2,3 or 2,3,0 or 3,0,1 or 0,1,2.
The electric displacement
and the magnetic field
are now unified into the (rank 2 antisymmetric contravariant) electromagnetic
displacement tensor:
-
Ampère's law, ,
and Gauss's law, ,
combine to form:
-
In a vacuum, the constitutive equations are:
-
Antisymmetry reduces these 16 equations to just six independent equations.
Because it is usual to define
by
-
the constitutive equations may, in a vacuum, be combined with Ampère's
law etc. to get:
-
The energy
density of the electromagnetic field combines with Poynting vector and the
Maxwell
stress tensor to form the 4D electromagnetic stress-energy
tensor. It is the flux (density) of the momentum 4-vector and as a rank 2
mixed tensor it is:
-
where
is the Kronecker
delta. When upper index is lowered with η, it becomes symmetric and is part
of the source of the gravitational field.
The conservation of linear momentum and energy by the electromagnetic field
is expressed by:
-
where
is again the density of the Lorentz force. This equation can be deduced from
the equations above (with considerable effort).
Special relativity is accurate only when the absolute value of the gravitational potential is much less
than c2. In a strong gravitational field, one must use general
relativity. General relativity becomes special relativity at the limit of
weak field. At very small scales, such as at the Planck length and below, quantum effects must be
taken into consideration resulting in quantum gravity. However, at macroscopic scales
and in the absence of strong gravitational fields, special relativity is
experimentally tested to extremely high degree of accuracy
(10−20)
and thus accepted
by the physics community. Experimental results which appear to contradict it are
not reproducible and are thus widely believed to be due to experimental
errors.
Special relativity is mathematically self-consistent, and it is an organic
part of all modern physical theories, most notably quantum field
theory, string
theory, and general relativity (in the limiting case of negligible
gravitational fields).
Newtonian mechanics mathematically follows from special relativity at small
velocities (compared to the speed of light) — thus Newtonian mechanics can be
considered as a special relativity of slow moving bodies. See Status of special relativity for a
more detailed discussion.
Several experiments predating Einstein's 1905 paper are now interpreted as
evidence for relativity. (Of these, Einstein was only aware of the Fizeau
experiment before 1905.)
Copyright(c) 2007
- 2012. All rights reserved.
|