Special Relativity Derivation from Magnetic Effects

Written 05 Jun 2021 by Michael Huang



We start with Maxwell’s Equations and the Principle of Relativity. From there, we derive, using only basic mathematics and some symmetry arguments, the most prominent phenomena or predictions of Special Relativity: length contraction and time dilation. We avoid the usage of Einstein’s thought experiments involving the constancy of speed of light, but instead use a single thought experiment involving magnetic effects as the basis of the derivation.

Contents

Galilean Relativity

Several principles/assumptions underlie Galilean Relativity, the relativity that Newton adopted. These principles construct the relativity that we are all familiar with: when you throw a ball in a moving train, the velocity of the ball to the ground = velocity of ball to train + velocity of train.

Because Galilean Relativity eventually fails, it’s important to describe it in terms of principles so that we can figure exactly which principles are causing its failure and thus find a way to correct them.

Principle of Relativity

Principle of Relativity states that physics laws should be all the same in every inertial frame of reference. This implies that if we perform two same experiments in two different frames of references, the same result should be observed.

Independence of space dimensions

Each space dimension x,y,zx,y,z are independent from one another. That is, their motion can be considered independently and don’t interfere with each other. In particular, if two reference frames are moving relative to each other along the xx directions. Motions/experiments along the yy should be the same to the two, due to principle of relativity. This principle comes in handy in our derivation of special relativity later.

Symmetry of Reference Frame

If two frames of references are anti-symmetric, the results of their experiments should also be anti-symmetric. In particular, consider reference frames A,BA,B where A,BA,B are moving relative to each other. From AA’s perspective, BB moves at velocity vBv_B. From BB’s perspective, AA moves at vAv_A. This symmetric property dictates that vA=vBv_A=-v_B.

Absoluteness of Length

The length of an object measured from all reference frames is the same. To define the concept of length more precisely: a length is the spatial separation between 2 simultaneous events.

Absoluteness of Time

The time interval between any 2 events is the same measured from all frames of references.

Others

There are a few other symmetric properties which are relatively intuitive to the mind and unimportant for our derivation. For example, performing an experiment in location A and location B (given their environment is the same) should yield the same result. Similarly, experiments at time A and time B should also yield the same result if the two times are basically indistinguishable.

Equation of Transformation

These principles can give rise to the equations of transformation of Galilean Relativity: x=x+vtt=tx'=x+vt\\ t'=t Where x,tx’,t’ indicates the position and time in the primed coordinate. vv is the velocity of primed reference frame.

Everything works out harmoniously. Galilean Relativity is consistent under Newton’s Laws (and thus all classical mechanics laws).

But things break down when Maxwell’s Equations and Lorentz Force Law came along, which are experimentally tested to be valid.

Maxwell’s Equations and Lorentz Force Law

δσEdA=Qσϵ0δσBdA=0δσEdl=ΦBtδσBdl=μ0I+μ0ϵ0ΦEtF=q(E+v×B)\iint_{\delta\sigma}\bold{E}\cdot \bold{dA}=\frac{Q_\sigma}{\epsilon_0} \\ \iint_{\delta\sigma}\bold{B}\cdot \bold{dA}=0 \\ \int_{\delta\sigma} \bold{E}\cdot \bold{dl}=-\frac{\partial \bold{\Phi_B}}{\partial t} \\ \int_{\delta\sigma} \bold{B}\cdot \bold{dl} = \mu_0 \bold{I} + \mu_0\epsilon_0 \frac{\partial \bold{\Phi_E}}{\partial t} \\ \bold{F}=q(\bold{E}+\bold{v} \times\bold{B})

Principle of Relativity governs that laws of electromagnetism should be invariant under transformation of coordinate system. For these laws to be invariant, we only need E+v×B\bold{E}+\bold{v}\times \bold{B} to be exert the same effect on charged particles in both frames of references. It doesn’t need E=EE=E’ or B=BB=B’. In fact, E+v×BE+v\times B doesn’t necessarily equate E+v×BE’+v’\times B’. It’s only required that the effects they produce are the same, which we will explore in greater detail later.

Failure of Galilean Relativity

Thought Experiment Setup

setup

We have 2 frames of references S1,S2S_1,S_2. A positive charge qq moves ABA\rightarrow B at velocity vv. In the wire, the positive charges moves to the right with velocity vv while the negative charges are fixed. Reference frame S2S_2 also traverses right at velocity vv. We denote the separation between each positive charge with Δx+\Delta x^+ and that of negative charges as Δx\Delta x^-. Let ρ+,ρ\rho^+,\rho^- denote the charge density of positive and negative charges. Then, ρ=eΔx\rho=\frac{e}{\Delta x}. A,B,qA,B,q are all rr away from the wire.

Contradiction

From perspective of S1S_1, Δx1+=Δx1\Delta x^+_1=\Delta x^-_1 since the wire cannot be charged. If it was charged, the charges would create a electrostatic force that pushes against the battery to make the charges space out. The current S1S_1 observes in the wire is I1=vρ1+I_1=v\rho^+_1. It generates a magnetic field at A,B,qA,B,q.

By Ampere’s Law

2πrB1=μ0I12\pi rB_1=\mu_0 I_1

B1=μ0I12πr=μ0vρ1+2πrB_1=\frac{\mu_0I_1}{2\pi r}=\frac{\mu_0v\rho^+_1}{2\pi r}

The E+v×BE+v\times B quantity for qq is vB1=μ0v2ρ1+2πrvB_1 = \frac{\mu_0v^2\rho^+_1}{2\pi r}

Now let’s shift to the perspective of S2S_2. If we assume Δx2+=Δx2\Delta x^+_2=\Delta x^-_2, then E2=0E_2=0. It’s easy to see that B2=B1=μ0vρ22πrB_2=B_1=\frac{\mu_0v\rho^-_2}{2\pi r}. The E+v×BE+v\times B value from the perspective of S2S_2 is 00!

Therefore something is wrong since the Principle of Relativity is broken: for S1S_1, qq accelerates towards the wire when it moves from ABA\rightarrow B. For S2S_2, qq doesn’t accelerate towards the wire. According to the principle of relativity and the independence of space coordinates, qq should see its momentum increase by the same amount when it moves from ABA\rightarrow B regardless of the reference frame in which we measure it.

Modifying Galilean Relativity to resolve the paradox

The flaw occurs when we assume Δx2+=Δx2\Delta x^+_2=\Delta x^-_2.

In other words, we were wrong to assume that length is absolute. In other words, it must be that Δx2+Δx1+\Delta x_2^+\ne \Delta x_1^+ and Δx2Δx1\Delta x_2^-\neq \Delta x_1^-. Length of objects cannot be the same under all reference frames! In fact, given the knowledge that qq is attracted towards the wire in S1S_1, we know that ρ2+<ρ2\rho^+_2<\rho^-_2 and Δx2+>Δx2\Delta x^+_2>\Delta x^-_2. How is this possible? Does charges undergo motion (congregate together for negative charges and spacing out for positive charges) when we shift the frame of reference? Impossible. The only plausible explanation is that length contracted for the now moving negative charges and length expanded for the now stationary positive charges.

Therefore, the first principle of Galilean Relativity that we must abandon is the absoluteness of length.

As we will see, for the math to workout correctly, we also need to abandon the absoluteness of time. You don’t have to trust me. You can see it for yourself in our derivation.

Mathematical Derivation of Length Contraction and Time Dilation

setup

Therefore, the act of boosting from S1S_1 to S2S_2 changes the state of motion of positive charges from velocity vv to stationary, and changes the state of motion of negative charges from stationary to moving with velocity vv. If this change of motion decrease Δx\Delta x^- to a factor of λ\lambda, it should symmetrically increase Δx+\Delta x^+ by a factor of 1λ\frac{1}{\lambda}.

Thus, we can write Δx1+=Δx1=Δx1\Delta x^+_1=\Delta x^-_1=\Delta x_1.

And Δx2+=Δx1λ\Delta x^+_2=\frac{\Delta x_1}{\lambda}, Δx2=λΔx1\Delta x^-_2=\lambda \Delta x_1.

We can also write ρ2+=eΔx2+=eλΔx1\rho^+_2=\frac{e}{\Delta x^+_2}=\frac{e\lambda}{\Delta x_1}

Similarly ρ2=eΔx2=eλΔx1\rho^-_2=\frac{e}{\Delta x^-_2}=\frac{e}{\lambda\Delta x_1}

By Gauss’s Law, we can compute the electric field at A,B,qA,B,q:

2πrLE2=Lρ2ϵ02\pi r L E_2 = \frac{L \rho_2}{\epsilon_0}

E2=ρ22πrϵ0E_2 = \frac{\rho_2}{2\pi r\epsilon_0}

Here ρ2=ρ2ρ2+\rho_2=\rho^-_2-\rho^+_2 and the electric field points towards the wire.

Now, as qq travels from ABA\rightarrow B, this E2E_2 accelerates qq towards the wire. It will change its momentum by Δp2=Δt2F2=Δt2qE2\Delta p_2=\Delta t_2 F_2=\Delta t_2 q E_2 =ρ2qΔt22πrϵ0=\frac{\rho_2 q \Delta t_2}{2\pi r\epsilon_0} =(ρ2ρ2+)qΔt22πrϵ0=\frac{(\rho^-_2 - \rho^+_2) q \Delta t_2}{2\pi r\epsilon_0}

Now if we return to S1S_1, we can compute Δp1\Delta p_1.

Δp1=Δt1F1=Δt1qv×B1\Delta p_1=\Delta t_1 F_1 = \Delta t_1 q v\times B_1 =μ0qv2ρ1+Δt12πr=\frac{\mu_0qv^2\rho^+_1\Delta t_1}{2\pi r}

Due to the Principle of Relativity and independence of spatial dimensions, our experiment along the y direction should yield the same result regardless of whether we are in S1S_1 or S2S_2.

Therefore, Δp1=Δp2\Delta p_1=\Delta p_2

(ρ2ρ2+)qΔt22πrϵ0=μ0qv2ρ1+Δt12πr\frac{(\rho^-_2 - \rho^+_2) q \Delta t_2}{2\pi r\epsilon_0}=\frac{\mu_0qv^2\rho^+_1\Delta t_1}{2\pi r}

So,

(ρ2ρ2+)Δt2ϵ0=μ0v2ρ1+Δt1\frac{(\rho^-_2 - \rho^+_2)\Delta t_2}{\epsilon_0}=\mu_0v^2\rho^+_1\Delta t_1

(ρ2ρ2+)Δt2=ϵ0μ0v2ρ1+Δt1(\rho^-_2 - \rho^+_2)\Delta t_2=\epsilon_0\mu_0v^2\rho^+_1\Delta t_1

Plugging in ρ2,ρ2+\rho^-_2,\rho^+_2:

(eλΔx1eλΔx1)Δt2=ϵ0μ0v2eΔx1Δt1(\frac{e}{\lambda \Delta x_1} - \frac{e\lambda}{\Delta x_1})\Delta t_2=\epsilon_0\mu_0v^2\frac{e}{\Delta x_1}\Delta t_1

(1λλ)Δt2=ϵ0μ0v2Δt1(\frac{1}{\lambda} - \lambda)\Delta t_2=\epsilon_0\mu_0v^2 \Delta t_1

Now, we recognize that ϵ0μ0=1c2\epsilon_0\mu_0=\frac{1}{c^2}

The equation becomes

(1λλ)Δt2=v2c2Δt1(\frac{1}{\lambda} - \lambda)\Delta t_2=\frac{v^2}{c^2} \Delta t_1

Now we just need to relate Δt1,Δt2,λ\Delta t_1, \Delta t_2, \lambda.

That is simple because Δt1\Delta t_1 is just the amount of time it takes for qq to traverse ABA\rightarrow B from S1S_1’s frame of reference. Therefore Δt1=L1v\Delta t_1 = \frac{L_1}{v}. Symmetrically, Δt2=L2v\Delta t_2 = \frac{L_2}{v}.

However, as we boost from S1S_1 to S2S_2, A,BA,B changes from stationary to moving with velocity vv. Therefore, similar to the negative charges in the wire, LL should shrink to a factor of λ\lambda. Therefore L2=λL1L_2=\lambda L_1.

Δt2=λL1v=λΔt1\Delta t_2 = \frac{\lambda L_1}{v}= \lambda \Delta t_1

Notice that Δt2<Δt1\Delta t_2<\Delta t_1. So we say Δt1\Delta t_1 is dilated with respect to Δt2\Delta t_2.

Now let’s substitute everything back to the original equation

(1λλ)Δt2=v2c2Δt1(\frac{1}{\lambda} - \lambda)\Delta t_2=\frac{v^2}{c^2} \Delta t_1

It becomes

(1λλ)λΔt1=v2c2Δt1(\frac{1}{\lambda} - \lambda)\lambda\Delta t_1=\frac{v^2}{c^2} \Delta t_1

So,

1λ2=v2c21- \lambda^2=\frac{v^2}{c^2}

In other words,

λ=1v2c2\lambda = \sqrt{1-\frac{v^2}{c^2}}

Conclusion

In a nutshell, throughout our efforts to resolve the paradox which breaks the Principle of Relativity, we assumed the correctness of Maxwell’s Equations and Lorentz Force Law, which are well-tested. From these principles and symmetry arguments, we concluded that the only way to resolve to paradox is the forego the absoluteness of length and time across different reference frames. And then we computed exactly how length and time will change under a boost from S1S_1 to S2S_2. In this way, we have essentially derived the formula for length contraction and time dilation.

Length Contraction

If an object, or more appropriately space itself, moves at velocity vv relative to SS and moves at velocity 00 relative to SS’, then LS=1v2c2LSL_S = \sqrt{1-\frac{v^2}{c^2}}L_{S’}. LSL_{S’} can be called the proper length.

Time Dilation

In SS’, let the space-distance separation between events A,BA,B to be 00 and the time separation to be ΔtS\Delta t_{S’}. For SS moving at velocity vv relative to SS’, let its time separation of A,BA,B to be ΔtS\Delta t_S, then ΔtS=ΔtS1v2c2\Delta t_S=\frac{\Delta t_{S’}}{\sqrt{1-\frac{v^2}{c^2}}}.

Given these two rather fundamental effects, we may further derive the full Lorentz Transformation like how Galilean Transformation maybe derived from its principles.

If we make speed of light the standard unit for speed (so c=1c=1), full Lorentz Transformations are:

x=x+vt1v2x'=\frac{x+vt}{\sqrt{1-v^2}} t=t+vx1v2t'=\frac{t+vx}{\sqrt{1-v^2}}

We can see the hidden λ=1v2\lambda=\sqrt{1-v^2} term clearly.