Discussion about this post

User's avatar
lcamtuf's avatar

As a second postscript, the simplest way to explain Euler's version of the equation is to make four observations on top of what we discussed in the article. I should note that this isn't a rigorous proof (the second observation is an appeal to intuition), but it should still be quite enlightening:

1) Exponentiation that uses one constant base greater than 1 can be rewritten in another base greater than 1 by including an appropriate scaling factor in the exponent. For example, 8^x can be written as 2^(3*x). We don't need to dwell on how to calculate the scaling factor; the important part is that we effectively rolled the old base into the exponent.

2) A bit less obviously, the same principle extends to the imaginary unit, which can be moved from the base to the exponent if you toss in the right scaling factor. For example, if we want to go from base i to base 10, the solution is i^x = 10^(~0.682 i * x). We can't trivially show why 0.682 is the right number, but *conceptually*, the operation is analogous to #1. In the end, both forms produce the same result: they give us continuous CCW rotation in the complex plane.

3) As discussed in the article, the expression i^x implements rotation. In essence, as x goes from 0 to 4, the resulting point, starting at (1,0), travels the distance of 2*pi along the unit circle (radius 1, circumference 2*pi).

Importantly, this is different from how we usually define sin(x) and cos(x): when using radians, there is a 1:1 correspondence of "input" and "output" distances: 2*pi radians means a full cycle along the circumference of 2*pi. This is why in the formula that equated two methods of rotating a point, we needed a scaling factor on the i^x side for the equality to hold. We wrote:

i^(2/pi*m) = cos(m) + i * sin(m)

The 2/pi factor means that when the parameter of sin() & cos() changes from 0 to 2*pi, the exponent on the left side changes from 0 to 4 (2/pi*pi*2). The rotation speeds are matched and the equality holds.

4) It would be nice to find a different base where the rate of change on the left side naturally matches the right side, so that we could lose the 2/pi bit! And there is only base with an initial 1:1 correspondence between an increment in the exponent and the resulting increment in the value of the expression. It's, by definition, the mathematical constant e.

If you were introduced to e in a different way and are suspicious, you can run a simple numerical test with a small exponent delta of, say, 0.0001:

Rate of change for base 2: (2^0.0001 − 2^0) / 0.0001 = ~0.69

Rate for base e: (e^0.0001 − e^0) / 0.0001 = ~1.00

Rate for base 4: (4^0.0001 − 4^0) / 0.0001 = ~1.39

This rate changes as we get away from real zero: that's the nature of exponential growth. For example, if the exponent is in the vicinity of 2, the rate jumps to (e^2.0001 − e^2) / 0.0001 = ~7.39. That said, only the real part of the exponent changes the growth rate. In the absence of a real part, in accord with #2, the equation produces rotation without acceleration. The rate of change for e^imaginary remains 1:1 at all times.

Now, from #1 + #2, if we want to switch the equation developed in the article from base i to base e, it follows that we can rewrite the scaled expression on the left side -- i^(2/pi*m) -- as e^(<sth>*i*m), where <sth> is a new scaling factor to maintain the same rate of rotation as before.

At first blush, it's not clear what <sth> ought to be. But from #3 + #4, we know that the rate of change for e^(i*m) is always 1, so we don't actually need any further scaling to match the rate-of-change on the sin / cos side. That is to say, <sth> = 1. This gives us Euler's formula:

e^(i*m) = cos(m) + i*sin(m)

Expand full comment
Wyrd Smythe's avatar

I understand Gauss wanted to call the "imaginary" axis the "lateral" axis, which would have been a lot less confusing for math students. There is also the nice fact that understanding multiplication as rotation explains something else that students can find confusing: It makes sense that multiplying positive (real) numbers results in a positive real number, and it sort of makes sense that multiplying a positive real number times a negative real number gives a negative number, but much harder to understand why multiplying two negative numbers results in a positive number. But it's just a matter of 180° rotations.

Expand full comment
5 more comments...

No posts