Derivation of the Lorentz transformation without the use of Einstein's second postulate Andrei Galiautdinov Department of Physics and Astronomy,University of Georgia,Athens,GA 30602,USA (Dated:January 3,2017) Derivation of the Lorentz transformation without the use of Einstein's Second Postulate is pro- vided along the lines of Ignatowsky,Terletskii,and others.This is a write-up of the lecture first delivered in PHYS 4202 E&M class during the Spring semester of 2014 at the University of Georgia The main motivation for pursuing this approach was to develop a better understanding of why the faster-than-light neutrino controversy (OPERA experiment,2011)was much ado about nothing. Special relativity as a theory of space and time Capsule All physical phenomena take place in space and time. The theory of space and time(in the absence of gravity) is called the Special Theory of Relativity. We do not get bogged down with the philosophical Floating Ball problems related to the concepts of space and time.We simply acknowledge the fact that in physics the notions of space and time are regarded as basic and cannot be reduced to something more elementary or fundamental. We therefore stick to pragmatic operational definitions: Touch-sensitive Time is what clocks measure.Space is what measuring surface rods measure. n FIG.1:(Color online.)A floating-ball inertial detector.After In order to study and make conclusions about the prop- [ erties of space and time we need an observer.A natural choice is an observer who moves freely (the one who is free from any external influences).An observer is not a Definition of clock synchronization single person sitting at the origin of a rectangular coor- dinate grid.Rather,it is a bunch of friends(call it Team K)equipped with identical clocks distributed through- It is pretty clear how to measure distances:the team out the grid who record the events happening at their simply uses its rectangular grid of rods. respective locations. It is also clear how to measure time intervals at a par- ticular location:the team member situated at that loca- How do we know that this bunch of friends is free tion simply looks at his respective clock.What's not so from any external influences?We look around and make clear,however,is how the team measures time intervals sure that nothing is pulling or pushing on any member between events that are spacially separated. of the bunch;no strings,no springs,no ropes are at- A confusion about measuring this kind of time inter- tached to them.An even better way is to use a collection vals was going on for two hundred years or so,until one of"floating-ball detectors"(Fig.1)distributed through- day Einstein said:"We need the notion of synchronized out the grid [1].When detector balls are released,they clocks!Clock synchronization must be operationally de- should remain at rest inside their respective capsules.If fined." any ball touches the touch-sensitive surface of the cap- The idea that clock synchronization and,consequently, sule,the frame is not inertial. the notion of simultaneity of spacially separated events, has to be defined (and not assumed apriori)is the single In the reference frame associated with a freely moving most important idea of Einstein's,the heart of special observer (our rectangular coordinate grid),Galileo's Law relativity.Einstein proposed to use light pulses.The of Inertia is satisfied:a point mass,itself free from any procedure then went like this: external influences,moves with constant velocity.To be In frame K,consider two identical clocks equipped able to say what "constant velocity"really means,and with light detectors,sitting some distance apart,at A thus to verify the law of inertia,we need to be able to and B.Consider another clock equipped with a light measure distances and time intervals between events hap- emitter at location C which is half way between A and pening at different grid locations. B(we can verify that C is indeed half way between A and
Derivation of the Lorentz transformation without the use of Einstein’s second postulate Andrei Galiautdinov Department of Physics and Astronomy, University of Georgia, Athens, GA 30602, USA (Dated: January 3, 2017) Derivation of the Lorentz transformation without the use of Einstein’s Second Postulate is provided along the lines of Ignatowsky, Terletskii, and others. This is a write-up of the lecture first delivered in PHYS 4202 E&M class during the Spring semester of 2014 at the University of Georgia. The main motivation for pursuing this approach was to develop a better understanding of why the faster-than-light neutrino controversy (OPERA experiment, 2011) was much ado about nothing. Special relativity as a theory of space and time All physical phenomena take place in space and time. The theory of space and time (in the absence of gravity) is called the Special Theory of Relativity. We do not get bogged down with the philosophical problems related to the concepts of space and time. We simply acknowledge the fact that in physics the notions of space and time are regarded as basic and cannot be reduced to something more elementary or fundamental. We therefore stick to pragmatic operational definitions: Time is what clocks measure. Space is what measuring rods measure. In order to study and make conclusions about the properties of space and time we need an observer. A natural choice is an observer who moves freely (the one who is free from any external influences). An observer is not a single person sitting at the origin of a rectangular coordinate grid. Rather, it is a bunch of friends (call it Team K) equipped with identical clocks distributed throughout the grid who record the events happening at their respective locations. How do we know that this bunch of friends is free from any external influences? We look around and make sure that nothing is pulling or pushing on any member of the bunch; no strings, no springs, no ropes are attached to them. An even better way is to use a collection of “floating-ball detectors” (Fig. 1) distributed throughout the grid [1]. When detector balls are released, they should remain at rest inside their respective capsules. If any ball touches the touch-sensitive surface of the capsule, the frame is not inertial. In the reference frame associated with a freely moving observer (our rectangular coordinate grid), Galileo’s Law of Inertia is satisfied: a point mass, itself free from any external influences, moves with constant velocity. To be able to say what “constant velocity” really means, and thus to verify the law of inertia, we need to be able to measure distances and time intervals between events happening at different grid locations. Floating Ball Capsule Touch-sensitive surface FIG. 1: (Color online.) A floating-ball inertial detector. After [1]. Definition of clock synchronization It is pretty clear how to measure distances: the team simply uses its rectangular grid of rods. It is also clear how to measure time intervals at a particular location: the team member situated at that location simply looks at his respective clock. What’s not so clear, however, is how the team measures time intervals between events that are spacially separated. A confusion about measuring this kind of time intervals was going on for two hundred years or so, until one day Einstein said: “We need the notion of synchronized clocks! Clock synchronization must be operationally de- fined.” The idea that clock synchronization and, consequently, the notion of simultaneity of spacially separated events, has to be defined (and not assumed apriori) is the single most important idea of Einstein’s, the heart of special relativity. Einstein proposed to use light pulses. The procedure then went like this: In frame K, consider two identical clocks equipped with light detectors, sitting some distance apart, at A and B. Consider another clock equipped with a light emitter at location C which is half way between A and B (we can verify that C is indeed half way between A and arXiv:1701.00270v1 [physics.class-ph] 1 Jan 2017
2 B with the help of the grid of rods that had already been Einstein constructed his theory of relativity on the ba- put in place when we constructed our frame K).Then, sis of(1)The Principle of Relativity (laws of nature are at some instant,emit two pulses from C in opposite di- the same in all inertial reference frames).and (2)The rections,and let those pulses arrive at A and B.If the Postulate of the Constancy of the Speed of Light (the clocks at A and B show same time when the pulses arrive speed of light measured by any inertial observer is in- then the clocks there are synchronized,by definition. dependent of the state of motion of the emitting body). The light pulses used in the synchronization procedure NOTE:This is not the same as saying that the speed can be replaced with two identical balls initially sitting at of light emitted and measured in K is the same as the C and connected by a compressed spring.The spring is speed of light emitted and measured in K'.This latter released(say,the thread holding the spring is cut in the type of constancy of the speed of light is already implied middle),the balls fly off in opposite directions towards A by the principle of relativity. and B,respectively. Here we want to stick to mechanics and push the How do we know that the balls are identical?Because derivation of the coordinate transformation as far as Team K made them in accordance with a specific manu- possible without the use of the highly counterintuitive facturing procedure. Einstein's Second Postulate.The method that achieves How do we know that all clocks at K are identical? this will be presented below and was originally due to Because Team K made all of them in accordance with a Vladimir Ignatowsky [2. specific manufacturing procedure. DISCLAIMER:I never read Ignatowsky's original How do we know that a tic-toc of any clock sitting papers,but the idea is well-known within the community, in frame K corresponds to 1 second?Because Team K often mentioned and discussed.Anyone with time to called a tic-toc of a clock made in accordance with the burn can reproduce the steps without much difficulty. manufacturing procedure "a second" The derivation below consists of 14 steps.If you can Similarly,clocks in K'are regarded as identical and reduce that number,let me know. tick-tocking at 1 second intervals because in that frame all of the clocks were made in accordance with the same manufacturing procedure. Step 1:Galileo's Law of Inertia for freely moving Now,how do we know that the manufacturing proce- particles dures in K and K'are the same?(Say,how do we know that a Swiss shop in frame k makes watches the same ..implies the linearity of the coordinate transforma- way as its counterpart in frame K?)Hmm....That's tion between K and K'(see Fig.2), an interesting question to ponder about. x'=a11(v)z+a12(v)y+a13(v)z+a14(v)t,(1) When studying spacetime from the point of view of =a21(v)z+a22()y+a23()z+a24(v)t,(2) inertial frames of reference discussed above,people dis- covered the following. 2=a31(v)z+a32(v)y+a33(v)z+a31(v)t,(3) t=a41(v)z+a42(v)y+a43()z+a44(v)t.(4) Properties of space and time: Here we assumed that the origins of the two coordinate 1.At least one inertial reference frame exists.(Geo- systems coincide,that is event(0,0,0,0)in K has coor- centric is OK for crude experiments;geliocentric is dinates (0,0,0,0)in K'. better:the frame in which microwave background radiation is uniform is probably closest to ideal). 2.Space is uniform (translations;3 parameters). P(x,y三,) 3.Space is isotropic (rotations;3 parameters). (x',y,,1门 4.Time is uniform (translation;1 parameter). 5.Space is continuous (down to ~10-18 [m]). 6.Time is continuous (down to ~10-26 [s]) 7.Space is Euclidean (apart from local distortions, which we ignore:cosmological observations put the limit at~1026 [m],the size of the visible Universe; this property is what makes rectangular grids of FIG.2:(Color online.)Two inertial reference frames (or- rods possible) thogonal grids of rods equipped with synchronized clocks)in relative motion along the z-axis. 8.Relativity Principle (boosts;3 parameters)
2 B with the help of the grid of rods that had already been put in place when we constructed our frame K). Then, at some instant, emit two pulses from C in opposite directions, and let those pulses arrive at A and B. If the clocks at A and B show same time when the pulses arrive then the clocks there are synchronized, by definition. The light pulses used in the synchronization procedure can be replaced with two identical balls initially sitting at C and connected by a compressed spring. The spring is released (say, the thread holding the spring is cut in the middle), the balls fly off in opposite directions towards A and B, respectively. How do we know that the balls are identical? Because Team K made them in accordance with a specific manufacturing procedure. How do we know that all clocks at K are identical? Because Team K made all of them in accordance with a specific manufacturing procedure. How do we know that a tic-toc of any clock sitting in frame K corresponds to 1 second? Because Team K called a tic-toc of a clock made in accordance with the manufacturing procedure “a second”. Similarly, clocks in K0 are regarded as identical and tick-tocking at 1 second intervals because in that frame all of the clocks were made in accordance with the same manufacturing procedure. Now, how do we know that the manufacturing procedures in K and K0 are the same? (Say, how do we know that a Swiss shop in frame K makes watches the same way as its counterpart in frame K0 ?) Hmm. . . . That’s an interesting question to ponder about. When studying spacetime from the point of view of inertial frames of reference discussed above, people discovered the following. Properties of space and time: 1. At least one inertial reference frame exists. (Geocentric is OK for crude experiments; geliocentric is better; the frame in which microwave background radiation is uniform is probably closest to ideal). 2. Space is uniform (translations; 3 parameters). 3. Space is isotropic (rotations; 3 parameters). 4. Time is uniform (translation; 1 parameter). 5. Space is continuous (down to ∼ 10−18 [m]). 6. Time is continuous (down to ∼ 10−26 [s]). 7. Space is Euclidean (apart from local distortions, which we ignore; cosmological observations put the limit at ∼ 1026 [m], the size of the visible Universe; this property is what makes rectangular grids of rods possible). 8. Relativity Principle (boosts; 3 parameters). Einstein constructed his theory of relativity on the basis of (1) The Principle of Relativity (laws of nature are the same in all inertial reference frames), and (2) The Postulate of the Constancy of the Speed of Light (the speed of light measured by any inertial observer is independent of the state of motion of the emitting body). [NOTE: This is not the same as saying that the speed of light emitted and measured in K is the same as the speed of light emitted and measured in K0 . This latter type of constancy of the speed of light is already implied by the principle of relativity.] Here we want to stick to mechanics and push the derivation of the coordinate transformation as far as possible without the use of the highly counterintuitive Einstein’s Second Postulate. The method that achieves this will be presented below and was originally due to Vladimir Ignatowsky [2]. [DISCLAIMER: I never read Ignatowsky’s original papers, but the idea is well-known within the community, often mentioned and discussed. Anyone with time to burn can reproduce the steps without much difficulty. The derivation below consists of 14 steps. If you can reduce that number, let me know.] Step 1: Galileo’s Law of Inertia for freely moving particles . . . implies the linearity of the coordinate transformation between K and K0 (see Fig. 2), x 0 = α11(v)x + α12(v)y + α13(v)z + α14(v)t, (1) y 0 = α21(v)x + α22(v)y + α23(v)z + α24(v)t, (2) z 0 = α31(v)x + α32(v)y + α33(v)z + α34(v)t, (3) t 0 = α41(v)x + α42(v)y + α43(v)z + α44(v)t. (4) Here we assumed that the origins of the two coordinate systems coincide, that is event (0, 0, 0, 0) in K has coordinates (0, 0, 0, 0) in K0 . K K’ (x ’ , y ’ , z ’ , t’) P (x, y, z, t) O O’ v x ’ y ’ z ’ y x z FIG. 2: (Color online.) Two inertial reference frames (orthogonal grids of rods equipped with synchronized clocks) in relative motion along the x-axis
Step 2:Isotropy and homogeneity of space and Step 5:Inversion i=-,i=-y,andi=-', homogeneity of time 动=-y ..imply that (i)x'is independent of y and z,(ii)y is independent of z,x,and t;(iii)z'is independent of x, y,and t;(iv)t'is independent of y and z,so P(民剪,,) r'=a11(v)x+a14(v)t, (5) =a22(v)y, (6) (,,, 2=a33(v)z, (7) t=a41(v)z+a44(v)t. (8) NOTE:The fact that y'and z'are independent of x and t follows from the requirement that the x'-axis (the line y'=2'=0)always coincides with the x-axis (the line y=z=0);this would not be possible if y'and 2' depended on z and t. IMPORTANT:Eq.(8)indicates that it is possible to have two spacially separated events A and B that are 1 simultaneous in frame K and,yet,non-simultaneous in frame K,that is FIG.3:(Color online.)"Inverted"frames in relative motion. △tAB=0,△xAB≠0:△tAB=a41△xAB+0.(9) ..which is just a relabeling of coordinate marks,pre- This is not as obvious as might seem:for example,be- serves the right-handedness of the coordinate systems fore Einstein it was assumed that whenever AtAB is zero, At'AB must also be zero.So keeping a41(v)in (8)is a and is physically equivalent to (inverted)frame K'mov- significant departure from classical Newtonian mechan- ing with velocity i=-v relative to (inverted)frame K ics. (see Fig.3),so that Once the standard method of clock synchronization is =a(-v)(位-vt) (18) adopted,it is,however,relatively easy to give an example of two events satisfying(9).Try that on your own! 到=k(-v), (19) 2'=k(-v)2, (20) t=6(-v):+y(-v)t, (21) Step 3:Isotropy of space or, ..also implies that y'and z'are physically equivalent, so that a22(v)=a33(v)≡k(v),and thus -x'=a(-v)(-x-t), (22) x'=a11(v)z+a14(v)t, (10) -=-k(-v)y, (23) y=k(v)y, (11) 2'=k(-v)z, (24) 2'=k(v)2, (12) =-6(-v)x+y(-v)t, (25) t=a41(v)x+a44(v)t. (13) which gives a(-v)=a(v) (26) Step 4:Motion of O'(the origin of frame K) k(-U)=k(v), (27) ..as seen from K gives xo uto',or o-vto =0. 6(-v)=-6(w), (28) On the other hand,as seen from K,o=0.For this y(-v)=Y(w) (29) to be possible,we must have 'o (x-vt),and thus I'=a(v)(x-ut), (14 Step 6:Relativity principle and isotropy of space y=k(v)4, (15) 2'=k(v)z, (16) ..tell us that the velocity of K relative to K',as mea- t'=8(v)I+y(v)t, (17) sured by K'using primed coordinates (r',t'),is equal to -v.REMINDER:the velocity of K'relative to K,as where we have re-labeled o41≡6anda44≡Y, measured by K using unprimed coordinates (z,t),is v. NOTE:The y just introduced will soon become the I justify this by considering two local observers co- celebrated gamma factor. moving with O and O,respectively,and firing identical
3 Step 2: Isotropy and homogeneity of space and homogeneity of time . . . imply that (i) x 0 is independent of y and z, (ii) y 0 is independent of z, x, and t; (iii) z 0 is independent of x, y, and t; (iv) t 0 is independent of y and z, so x 0 = α11(v)x + α14(v)t, (5) y 0 = α22(v)y, (6) z 0 = α33(v)z, (7) t 0 = α41(v)x + α44(v)t. (8) NOTE: The fact that y 0 and z 0 are independent of x and t follows from the requirement that the x 0 -axis (the line y 0 = z 0 = 0) always coincides with the x-axis (the line y = z = 0); this would not be possible if y 0 and z 0 depended on x and t. IMPORTANT: Eq. (8) indicates that it is possible to have two spacially separated events A and B that are simultaneous in frame K and, yet, non-simultaneous in frame K0 , that is ∆tAB = 0, ∆xAB 6= 0 : ∆t 0 AB = α41∆xAB 6= 0. (9) This is not as obvious as might seem: for example, before Einstein it was assumed that whenever ∆tAB is zero, ∆t 0 AB must also be zero. So keeping α41(v) in (8) is a significant departure from classical Newtonian mechanics. Once the standard method of clock synchronization is adopted, it is, however, relatively easy to give an example of two events satisfying (9). Try that on your own! Step 3: Isotropy of space . . . also implies that y 0 and z 0 are physically equivalent, so that α22(v) = α33(v) ≡ k(v), and thus x 0 = α11(v)x + α14(v)t, (10) y 0 = k(v)y, (11) z 0 = k(v)z, (12) t 0 = α41(v)x + α44(v)t. (13) Step 4: Motion of O 0 (the origin of frame K0 ) . . . as seen from K gives xO0 = vtO0 , or xO0 −vtO0 = 0. On the other hand, as seen from K0 , x 0 O0 = 0. For this to be possible, we must have x 0 ∝ (x − vt), and thus x 0 = α(v)(x − vt), (14) y 0 = k(v)y, (15) z 0 = k(v)z, (16) t 0 = δ(v)x + γ(v)t, (17) where we have re-labeled α41 ≡ δ and α44 ≡ γ. NOTE: The γ just introduced will soon become the celebrated gamma factor. Step 5: Inversion x˜ = −x, y˜ = −y, and x˜ 0 = −x 0 , y˜ 0 = −y 0 K K’ (x ’ , y ’ , z ’ , t’) P (x, y, z, t) x ’ O O’ v = − v y ’ z ’ y x z ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ FIG. 3: (Color online.) “Inverted” frames in relative motion. . . . which is just a relabeling of coordinate marks, preserves the right-handedness of the coordinate systems and is physically equivalent to (inverted) frame K˜ 0 moving with velocity ˜v = −v relative to (inverted) frame K˜ (see Fig. 3), so that x˜ 0 = α(−v)(˜x − vt), (18) y˜ 0 = k(−v)˜y, (19) z 0 = k(−v)z, (20) t 0 = δ(−v)˜x + γ(−v)t, (21) or, −x 0 = α(−v)(−x − vt), (22) −y 0 = −k(−v)y, (23) z 0 = k(−v)z, (24) t 0 = −δ(−v)x + γ(−v)t, (25) which gives α(−v) = α(v), (26) k(−v) = k(v), (27) δ(−v) = −δ(v), (28) γ(−v) = γ(v). (29) Step 6: Relativity principle and isotropy of space . . . tell us that the velocity of K relative to K0 , as measured by K0 using primed coordinates (x 0 , t0 ), is equal to −v. REMINDER: the velocity of K0 relative to K, as measured by K using unprimed coordinates (x, t), is v. I justify this by considering two local observers comoving with O and O0 , respectively, and firing identical
4 spring guns in opposite directions at the moment when and they pass each other(for a more formal approach,see [3]). If the ball shot in the +x direction by O stays next to z Y(v)(x'+vt'), (49) O'then,by the relativity principle and isotropy of space, y-y, (50) the ball shot in the-z'direction by O'should stay next z=2, to O.This means that the velocity of O relative to O'as (51) measured by K'is negative of the velocity of O'relative t=-6(v)x+y()t', (52) to O as measured by K.Thus, or,in matrix notation, x=a(-v)(x'+vt), (30) y=k(-v), (31) (53) z=k(-U)z', (32) t=6(-v)x+y(-v)t, (33) and and since -[品+目 (54) y=k(-v)y=k(-v)k(v)y=k2(v)y, (34) Step 8:The odd function 6(v) we get k()=士1. ..can be written as (35 6v)=-vf(u2)(w), (55) Choosing k(v)=+1,which corresponds to parallel rela- tive orientation of y and y'(as well as of z and 2),gives, since y(v)is even.[NOTE:The newly introduced func- for the direct transformation, tion f of v2 will turn out to be a constant!Actually,one of the goals of the remaining steps of this derivation is to I'=a(v)(z-vt); (36) show that f is a constant.It will later be identifies with y=y, (37) 1/c2.]Therefore, d=2, (38) t'=6(v)x+y(v)t, (39) 同=到同 (56) and,for the inverse transformation, and x=a()(x'+vt), (40) 同-同 (57) y=, (41) z=, (42) t=-6(v)x'+y(u)t. (43) Step 9:Lorentz transformation followed by its inverse must give the identity transformation Step 7:Motion of O(the origin of frame K) This seems physically reasonable.We have, ..as seen from K gives o 0;also,as seen from K';it gives o =a(v)(zo-uto)=-va(v)to and to 周=到目 6()o+)to=7(u)to.From this,But =y2 号=-u,which gives (58) a(v)=7(v). (44) and thus As a result, 2(1-w2f)=1, (59) I'=7(v)(I-vt), (45) =, (46) from where 2=2, (47) 1 t'=8(v)I+7(v)t, (48) Y=士 V1-v2f (60)
4 spring guns in opposite directions at the moment when they pass each other (for a more formal approach, see [3]). If the ball shot in the +x direction by O stays next to O0 then, by the relativity principle and isotropy of space, the ball shot in the −x 0 direction by O0 should stay next to O. This means that the velocity of O relative to O0 as measured by K0 is negative of the velocity of O0 relative to O as measured by K. Thus, x = α(−v)(x 0 + vt0 ), (30) y = k(−v)y 0 , (31) z = k(−v)z 0 , (32) t = δ(−v)x 0 + γ(−v)t 0 , (33) and since y = k(−v)y 0 = k(−v)k(v)y = k 2 (v)y, (34) we get k(v) = ±1. (35) Choosing k(v) = +1, which corresponds to parallel relative orientation of y and y 0 (as well as of z and z 0 ), gives, for the direct transformation, x 0 = α(v)(x − vt), (36) y 0 = y, (37) z 0 = z, (38) t 0 = δ(v)x + γ(v)t, (39) and, for the inverse transformation, x = α(v)(x 0 + vt0 ), (40) y = y 0 , (41) z = z 0 , (42) t = −δ(v)x 0 + γ(v)t 0 . (43) Step 7: Motion of O (the origin of frame K) . . . as seen from K gives xO = 0; also, as seen from K0 , it gives x 0 O = α(v)(xO − vtO) = −vα(v)tO and t 0 O = δ(v)xO +γ(v)tO = γ(v)tO. From this, x 0 O t 0 O = −v α(v) γ(v) . But x 0 O t 0 O = −v, which gives α(v) = γ(v). (44) As a result, x 0 = γ(v)(x − vt), (45) y 0 = y, (46) z 0 = z, (47) t 0 = δ(v)x + γ(v)t, (48) and x = γ(v)(x 0 + vt0 ), (49) y = y 0 , (50) z = z 0 , (51) t = −δ(v)x 0 + γ(v)t 0 , (52) or, in matrix notation, x 0 t 0 = γ(v) −vγ(v) δ(v) γ(v) x t , (53) and x t = γ(v) +vγ(v) −δ(v) γ(v) x 0 t 0 . (54) Step 8: The odd function δ(v) . . . can be written as δ(v) = −vf(v 2 )γ(v), (55) since γ(v) is even. [NOTE: The newly introduced function f of v 2 will turn out to be a constant! Actually, one of the goals of the remaining steps of this derivation is to show that f is a constant. It will later be identifies with 1/c2 .] Therefore, x 0 t 0 = γ 1 −v −vf 1 x t , (56) and x t = γ 1 v vf 1 x 0 t 0 . (57) Step 9: Lorentz transformation followed by its inverse must give the identity transformation This seems physically reasonable. We have, x 0 t 0 = γ 1 −v −vf 1 x t = γ 2 1 −v −vf 1 1 v vf 1 x 0 t 0 = γ 2 1 − v 2f 0 0 1 − v 2f x 0 t 0 , (58) and thus γ 2 (1 − v 2 f) = 1, (59) from where γ = ± 1 p 1 − v 2f . (60)
5 To preserve the parallel orientation of the x and x'axes Squaring and rearranging give we have to choose the plus sign(as can be seen by taking thev→0 limit),so that ("2= (68】 1 品) Y= (61) V1-v2f or. Thus, "=±+心 1+vv'f (69) 同-同 62) Choosing the plus sign(to make sure that u"=v when v'=0),we get and 月-,间 U+v (63) U”=1+w时 (70) where,we recall,f=f(v2). Step 12:The universal constant f cannot be negative .. Step 10:Two Lorentz transformations performed in succession is a Lorentz transformation ..because in that case the conclusions of relativistic dynamics would violate experimental observations!For This step is crucial for everything that we've been do- example,the force law. ing so far,for it shows that f is a constant,which will be identified with 1/c2,where c is Nature's limiting speed. d mvp=F, (71) We have a sequence of two transformations:from(,t) to (x',t),and then from ('t')to (x",t"), 1-哈别 同=[r where vp is the velocity of a particle,would get messed up.In particular,such law would violate the observed fact that it requires an infinite amount of work (and. thus,energy)to accelerate a material particle from rest to speeds approaching 3x 108 m/s](this argument is due [1+v'f-(v+)「x to Terletskii [4]).In fact,it would become "easier"to ac- [-(f+v'f)1+wv'f t (64) celerate the particle,the faster it is moving.Incidentally, this erperimental fact is what "replaces"Einstein's Sec- where v is the velocity of K'relative to K(as measured ond Postulate in the present derivation!Thus,Eq.(70)is in K using the (z,t)coordinates),and v'is the velocity the limit to which our (actually,Ignatowski's)derivation of K"relative to K(as measured in K'using the (x',t') can be pushed. coordinates).But this could also be written as a single REMARK:Relativistic dynamics has to be discussed transformation from (r,t)to (r",t") separately.We won't do that here,but maybe you can suggest a different reason for f not to be negative?See (65) [3]and [5]for possible approaches. with v"being the velocity of K"relative to K(as mea- sured in K using the (z,t)coordinates).This shows that Step 13:Existence of the limiting speed the(1,1)and(2,2)elements of the transformation ma- trix (64)must be equal to each other and,thus, Denoting f=f', (66) which means that f is a constant that has units of inverse f三2 (72) speed squared,[s2/m2].Wow!! we get the velocity addition formula, v+v Step 11:Velocity addition formula (for reference v”= 1+g (73) frames) If we begin with v'<c and attempt to take the limit To derive the velocity addition formula (along the x- v'→c,we'll get axis)we use Eqs.(64)and (65)to get y'y(v+v')=y"". (67) v" v+c 1+警s6 (74)
5 To preserve the parallel orientation of the x and x 0 axes we have to choose the plus sign (as can be seen by taking the v → 0 limit), so that γ = 1 p 1 − v 2f . (61) Thus, x 0 t 0 = 1 p 1 − v 2f 1 −v −vf 1 x t , (62) and x t = 1 p 1 − v 2f 1 v vf 1 x 0 t 0 , (63) where, we recall, f = f(v 2 ). Step 10: Two Lorentz transformations performed in succession is a Lorentz transformation This step is crucial for everything that we’ve been doing so far, for it shows that f is a constant, which will be identified with 1/c2 , where c is Nature’s limiting speed. We have a sequence of two transformations: from (x, t) to (x 0 , t0 ), and then from (x 0 , t0 ) to (x 00, t00), x 00 t 00 = γ 0 1 −v 0 −v 0f 0 1 x 0 t 0 = γ 0 1 −v 0 −v 0f 0 1 γ 1 −v −vf 1 x t = γ 0 γ 1 + vv0f −(v + v 0 ) −(vf + v 0f 0 ) 1 + vv0f 0 x t , (64) where v is the velocity of K0 relative to K (as measured in K using the (x, t) coordinates), and v 0 is the velocity of K00 relative to K0 (as measured in K0 using the (x 0 , t0 ) coordinates). But this could also be written as a single transformation from (x, t) to (x 00, t00), x 00 t 00 = γ 00 1 −v 00 −v 00f 00 1 x t , (65) with v 00 being the velocity of K00 relative to K (as measured in K using the (x, t) coordinates). This shows that the (1, 1) and (2, 2) elements of the transformation matrix (64) must be equal to each other and, thus, f = f 0 , (66) which means that f is a constant that has units of inverse speed squared, [s2/m2 ]. Wow!! Step 11: Velocity addition formula (for reference frames) To derive the velocity addition formula (along the xaxis) we use Eqs. (64) and (65) to get γ 0 γ(v + v 0 ) = γ 00v 00 . (67) Squaring and rearranging give (v 00) 2 = v + v 0 1 + vv0f 2 , (68) or, v 00 = ± v + v 0 1 + vv0f . (69) Choosing the plus sign (to make sure that v 00 = v when v 0 = 0), we get v 00 = v + v 0 1 + vv0f . (70) Step 12: The universal constant f cannot be negative . . . . . . because in that case the conclusions of relativistic dynamics would violate experimental observations! For example, the force law, d dt q mvp 1 − v 2 p f = F, (71) where vp is the velocity of a particle, would get messed up. In particular, such law would violate the observed fact that it requires an infinite amount of work (and, thus, energy) to accelerate a material particle from rest to speeds approaching 3×108 [m/s] (this argument is due to Terletskii [4]). In fact, it would become “easier” to accelerate the particle, the faster it is moving. Incidentally, this experimental fact is what “replaces” Einstein’s Second Postulate in the present derivation! Thus, Eq. (70) is the limit to which our (actually, Ignatowski’s) derivation can be pushed. REMARK: Relativistic dynamics has to be discussed separately. We won’t do that here, but maybe you can suggest a different reason for f not to be negative? See [3] and [5] for possible approaches. Step 13: Existence of the limiting speed Denoting f ≡ 1 c 2 , (72) we get the velocity addition formula, v 00 = v + v 0 1 + vv0 c 2 . (73) If we begin with v 0 < c and attempt to take the limit v 0 → c, we’ll get v 00 → v + c 1 + vc c 2 = c, (74)
6 which tells us that c is the limiting speed that a material photons exist.Similarly,tachyons may also exist and. object can attain.(Notice that "material"here means like photons,(a)should be created instantaneously (that "the one with which an inertial frame can be associated". is,can't be created at rest.and then accelerated),and The photons do not fall into this category,as will be (b)should not be allowed to form a "legitimate"inertial discussed shortly!) reference frame. The possibilities therefore are: What about violation of causality? 1.c=+oo (Newtonian mechanics;contradicts (71)); 2.c>0 and finite (Special Relativity); B 3.c=0(Contradics observations.) So we stick with option 2. What if an object were created to have v'>c from the start (a so-called tachyon),like in the recent superluminal tA such that tachyon's speed,vpc is greater than c as measured in K(see Fig.4),then in (76) frame K'moving with velocity v c relative to K (imagine a whole fleet of them,forming a grid which where y=1/V1-v2/c2,which shows that it is possible makes up K),the spacetime coordinates of any event to find v <c such that te-t<0;that is,in K'event would be imaginary!In order for the spacetime measure- B happens before event A.This seems to indicate that ments to give real values for (t',',y,2),the reference tachyons are impossible.However,causality is a conse- frame K'made of tachyons must be rejected. quence of the Second Law of Thermodynamics,which What about a reference frame made of photons?In is a statistical law,applicable to macroscopic systems; that case,coordinates would be infinite and should also it does not apply to processes involving individual ele- be rejected.So a fleet of photons cannot form a "le- mentary particles.As a result,the existence of tachyons gitimate"reference frame.Nevertheless,we know that cannot be so easily ruled out
6 which tells us that c is the limiting speed that a material object can attain. (Notice that “material” here means “the one with which an inertial frame can be associated”. The photons do not fall into this category, as will be discussed shortly!) The possibilities therefore are: 1. c = +∞ (Newtonian mechanics; contradicts (71)); 2. c > 0 and finite (Special Relativity); 3. c = 0 (Contradics observations.) So we stick with option 2. What if an object were created to have v 0 > c from the start (a so-called tachyon), like in the recent superluminal neutrino controversy? We’d get some strange results. For example, if we take v = c/2 and v 0 = 2c, we get v 00 = (c/2) + (2c) 1 + (c/2)(2c) c 2 = 5 4 c, (75) so in K the object would move to the right at a slower speed than relative to K0 , while K0 itself is moving to the right relative to K. Bizarre, but OK, the two speeds are measured by different observers, so maybe it’s not a big deal . . . . Step 14: Lorentz transformation in standard form However, if we consider the resulting Lorentz transformation, x 0 t 0 = 1 q 1 − v 2 c 2 1 −v − v c 2 1 x t , (76) or, t 0 = t − v c 2 x q 1 − v 2 c 2 , (77) x 0 = x − vt q 1 − v 2 c 2 , (78) y 0 = y, (79) z 0 = z, (80) we notice that in a reference frame K0 associated with hypothetical tachyons moving with v > c relative to K (imagine a whole fleet of them, forming a grid which makes up K0 ), the spacetime coordinates of any event would be imaginary! In order for the spacetime measurements to give real values for (t 0 , x0 , y0 , z0 ), the reference frame K0 made of tachyons must be rejected. What about a reference frame made of photons? In that case, coordinates would be infinite and should also be rejected. So a fleet of photons cannot form a “legitimate” reference frame. Nevertheless, we know that photons exist. Similarly, tachyons may also exist and, like photons, (a) should be created instantaneously (that is, can’t be created at rest, and then accelerated), and (b) should not be allowed to form a “legitimate” inertial reference frame. What about violation of causality? x ’ t’ t x B A t’ A t’ B tA such that tachyon’s speed, vp = xB−xA tB−tA > c, is greater than c as measured in K (see Fig. 4), then in frame K0 moving with velocity v < c relative to K we’ll have from (77) and (78), t 0 B − t 0 A = γ h (tB − tA) − v c 2 (xB − xA) i = γ 1 − v c 2 xB − xA tB − tA (tB − tA) = γ 1 − vvp c 2 (tB − tA), (81) x 0 B − x 0 A = γ [(xB − xA) − v (tB − tA)] = γ 1 − v tB − tA xB − xA (xB − xA) = γ 1 − v vp (xB − xA), (82) where γ = 1/ p 1 − v 2/c2, which shows that it is possible to find v < c such that t 0 B − t 0 A < 0; that is, in K0 event B happens before event A. This seems to indicate that tachyons are impossible. However, causality is a consequence of the Second Law of Thermodynamics, which is a statistical law, applicable to macroscopic systems; it does not apply to processes involving individual elementary particles. As a result, the existence of tachyons cannot be so easily ruled out
7 Step 15:Speed of light is the limiting speed for Eq.(85)says that t-t0 Event A Event B (=O,for simplicity) 1'A=0 FIG.6:(Color online.)Time dilation. This time a single clock belonging to K',Fig.6,passes =0 1a=0 by two different clocks in K.The corresponding two events,A and B,have r=B,and are related to each Event A Event B other by (=O,for simplicity) FIG.5:(Color online.)Length contraction and relativity of 名-4=7[ee-a-是eB-z simultaneity. =y1- U TB-TA c2tB-tA」 (tB-tA) Here we have a rod of (proper)length to =B-A>0 v- sitting at rest in frame K,see Fig.5.Its speed relative =(1-2)-t to frame K is v.The two events,A and B,represent the meetings of the two clocks at the ends of the rod with tB-iA (88) the corresponding clocks in the K frame at ta =tB.We Y have from (77)and (78), This means that upon arrival at B the moving clock will 指-么=7()e-小 read less time than the K-clock sitting at that location. (83) This phenomenon is called time dilation(moving clocks TB-TA Y(TB-IA), (84) run slower). or Acknowledgments 招-=()-, (85) I thank Todd Baker,Amara Katabarwa,and Loris TB-TA TB-TA (86) Magnani for helpful discussions. [1]T.A.Moore,Sir Ideas That Shaped Physics.Unit R:The (McGraw Hill,2003). Laws of Physics Are Frame Independent,Secon Edition [2]W.von Ignatowsky,Das Relativitatsprinzip,Archiv der
7 Step 15: Speed of light is the limiting speed for material objects Finally, returning to Eq. (74), we see that if something moves with c relative to K0 , it also moves with c relative to any other frame K00. That is: the limiting speed is the same in all inertial reference frames. And there is no mentioning of any emitter. Also, as follows from (73), c is the only speed that has this property (of being the same in all inertial frames). We know that light has this property (ala Michelson-Morley experiment), so the speed of light is the limiting speed for material objects. Since neutrinos have mass, they cannot move faster than light, and thus superluminal neutrinos are not possible. Immediate consequences of the Lorentz transformation A. Length contraction and relativity of simultaneity x ’ v x Event A Event B ( = O, for simplicity) t’ A = 0 t’ B 0 sitting at rest in frame K0 , see Fig. 5. Its speed relative to frame K is v. The two events, A and B, represent the meetings of the two clocks at the ends of the rod with the corresponding clocks in the K frame at tA = tB. We have from (77) and (78), t 0 B − t 0 A = γ − v c 2 (xB − xA), (83) x 0 B − x 0 A = γ (xB − xA), (84) or t 0 B − t 0 A = − v c 2 (x 0 B − x 0 A), (85) xB − xA = x 0 B − x 0 A γ . (86) Eq. (85) says that t 0 B −t 0 A 0 x ’ Same moving clock FIG. 6: (Color online.) Time dilation. This time a single clock belonging to K0 , Fig. 6, passes by two different clocks in K. The corresponding two events, A and B, have x 0 A = x 0 B, and are related to each other by t 0 B − t 0 A = γ h (tB − tA) − v c 2 (xB − xA) i = γ 1 − v c 2 xB − xA tB − tA (tB − tA) = γ 1 − v 2 c 2 (tB − tA) = tB − tA γ . (88) This means that upon arrival at B the moving clock will read less time than the K-clock sitting at that location. This phenomenon is called time dilation (moving clocks run slower). Acknowledgments I thank Todd Baker, Amara Katabarwa, and Loris Magnani for helpful discussions. [1] T. A. Moore, Six Ideas That Shaped Physics. Unit R: The Laws of Physics Are Frame Independent, Secon Edition (McGraw Hill, 2003). [2] W. von Ignatowsky, Das Relativit¨atsprinzip, Archiv der
8 Mathematik und Physik 17,124 (1910);ibid.18,17-40 [4]Y.P.Terletskii,Paradores in the Theory of Relativity (1911). (Plenum,New York,1968). [3]V.Berzi and V.Gorini,"Reciprocity Principle and 5]N.D.Mermin,"Relativity without light,"Am.J.Phys. the Lorentz Transformations,"J.Math.Phys.10,1518 52(2),119(1984). (1969)
8 Mathematik und Physik 17, 124 (1910); ibid. 18, 17-40 (1911). [3] V. Berzi and V. Gorini, “Reciprocity Principle and the Lorentz Transformations,” J. Math. Phys. 10, 1518 (1969). [4] Y. P. Terletskii, Paradoxes in the Theory of Relativity (Plenum, New York, 1968). [5] N. D. Mermin, “Relativity without light,” Am. J. Phys. 52 (2), 119 (1984)