What’s the rd derivative of $\sin(x)$?
What an absurd question - does it even make sense? I think so, but in order to build up some intuition let’s take a few steps back.
Imagine that you lived in the Middle Ages and were comfortable with the concepts of addition and multiplication. You even understood exponents, as a shorthand for repeated multiplication. Then someone asks you, what’s $2^{1/2}$?
Nonsense, right? $2^3$ means $2 \cdot 2 \cdot 2$. There are three twos. You can’t have half a two.
Well, as I’m sure you know, yes - you can. But think about it for a second. What does it mean? What does it mean to multiply by $2$ half a time?
$x^n$ is the number that you get when you multiply by $x$ $n$ times. Thinking about it this way makes the property $x^a \cdot x^b = x^{a+b}$ obvious. If you multiply by $x$ $a$ times, and then $b$ more times, you’ve multiplied by $x$ $a+b$ times. And that property is nice, because it makes sense even when $a$ is not an integer. If I do something $\frac{1}{2}$ a time and then I do it again $\frac{1}{2}$ a time, how many times have I done it? $1$ time, right?
Which brings us to the (obvious because we already learned it) answer, which is that $2^{1/2} \cdot 2^{1/2} = 2^1 = 2$, i.e. $2^{1/2} = \sqrt{2}$.
Let’s generalize a bit and talk about repeated function application.
Consider the function . What’s ? That’s pretty easy:
Ok, how about ? Given the setup, I bet you can figure it out. It’s some function that, when applied twice, gives us . What might that be? seems like a good guess.
Let’s check it:
Ok, how about another one? If , what’s ? Again, you can guess it. It’s .
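To make the "apply it twice and you get the original" idea concrete, here is a tiny sketch. The functions `f`, `half_f`, `g`, and `half_g` are hypothetical examples picked for illustration; they are not necessarily the ones used above.

```python
def f(x):
    # Hypothetical example function: add 2.
    return x + 2

def half_f(x):
    # Candidate "half application" of f: add 1.
    return x + 1

def g(x):
    # Second hypothetical example: multiply by 4.
    return 4 * x

def half_g(x):
    # Candidate "half application" of g: multiply by 2.
    return 2 * x

for x in (0, 1.5, -3):
    # Applying the half function twice should match one full application.
    print(x, half_f(half_f(x)) == f(x), half_g(half_g(x)) == g(x))
```

Note that half applications are not unique in general: `lambda x: -2 * x` applied twice also multiplies by 4.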
Alright, now let’s level up. Previously we were dealing with functions from a number to a number, but functions can take other types of things too. How about a function which takes, as input, a function and returns a new function? What does it do to the function? Let’s start with something easy, like it shifts it to the right:
Can we guess the answer for ? I’m going to go out on a limb and say yes. If you want to do something twice such that the end result is shifting to the right, shifting to the right each time will probably do the trick.
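The same guess can be checked in code. This is a minimal sketch with made-up shift amounts (1.0 for the full shift, 0.5 for each half) and `math.sin` as an arbitrary input function; `shift_right` is a helper name introduced here.

```python
import math

def shift_right(func, amount):
    """Return a new function whose graph is func's graph moved right by `amount`."""
    return lambda x: func(x - amount)

# One full shift versus two half-sized shifts applied in sequence.
full = shift_right(math.sin, 1.0)
half_twice = shift_right(shift_right(math.sin, 0.5), 0.5)

for x in (0.0, 0.7, 2.0):
    print(x, full(x), half_twice(x))
```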
Ok, now for the finale. What if our function takes the derivative of the input function? In other words:
Eek… that is a bit harder.
Let’s take a quick detour and draw an analogy to linear algebra, specifically eigenvectors. If you want to multiply a vector, $v$, by a matrix, $A$, $n$ times (where $n$ is sufficiently large), a fast way to do it is to follow these three steps:
- Compute the eigenvectors of the matrix $A$. These are the vectors that, when multiplied by $A$, are just scaled by a constant (the constant being the eigenvalue).
- Decompose your vector into a linear combination (weighted sum) of those eigenvectors.
- Your answer is the linear combination of those eigenvectors, where each eigenvector is first scaled by its eigenvalue to the $n$th power.
I tried to explain why this works in depth here, but the quick summary is that we found special inputs (the eigenvectors) which were particularly easy to compute for our function (multiplication by $A$), and then we reformulated our answer as a weighted sum of the function applied to those special inputs ($n$ times). In doing so, we turned our somewhat hard problem into a much easier one.
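The three steps can be sketched for one small, hand-picked matrix. Everything specific here is an assumption for illustration: the matrix $[[2, 1], [1, 2]]$, its eigenpairs (hard-coded rather than computed), and the helper names `decompose` and `apply_power`. Because the eigenvalues are only ever raised to a power, nothing stops `n` from being fractional.

```python
# Assumed example: A = [[2, 1], [1, 2]] has eigenvectors (1, 1) and (1, -1)
# with eigenvalues 3 and 1 (step 1, done by hand for this matrix).
EIGENPAIRS = [(3.0, (1.0, 1.0)), (1.0, (1.0, -1.0))]

def decompose(v):
    """Step 2: weights c1, c2 such that v = c1*(1, 1) + c2*(1, -1)."""
    return ((v[0] + v[1]) / 2, (v[0] - v[1]) / 2)

def apply_power(v, n):
    """Step 3: apply A to v `n` times by scaling each eigenvector by eigenvalue**n."""
    weights = decompose(v)
    result = [0.0, 0.0]
    for weight, (eigenvalue, eigenvector) in zip(weights, EIGENPAIRS):
        scale = weight * eigenvalue ** n
        result[0] += scale * eigenvector[0]
        result[1] += scale * eigenvector[1]
    return result

v = (1.0, 0.0)
once = apply_power(v, 1)                            # one ordinary multiplication by A
half_twice = apply_power(apply_power(v, 0.5), 0.5)  # two "half" multiplications
print(once, half_twice)
```

Applying the half power twice lands on the same vector as multiplying by the matrix once, which is the behavior we want from a "half application".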
One thing to mention is that this only works for linear functions, i.e. functions which have the following two properties:
Does the derivative function have these properties? Actually yes:
The derivative is a linear function (often called a linear operator). So, we can utilize the same trick.
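Written out, with $L$ standing for a generic function, $f$ and $g$ for its inputs, and $c$ for a constant (symbols chosen here for illustration), the two linearity properties are:

$$
L(f + g) = L(f) + L(g), \qquad L(c\,f) = c\,L(f)
$$

For the derivative these are the familiar sum and constant-multiple rules:

$$
\frac{d}{dx}\big(f(x) + g(x)\big) = \frac{d}{dx}f(x) + \frac{d}{dx}g(x), \qquad \frac{d}{dx}\big(c\,f(x)\big) = c\,\frac{d}{dx}f(x)
$$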
Can you think of any functions whose derivative is equal to the function itself (or, maybe, a scaled version of it)?
Yep, you bet: $\frac{d}{dx} e^{x} = e^{x}$, and $\frac{d}{dx} e^{ax} = a e^{ax}$.
$e^{ax}$ is an eigenfunction of the derivative function. How cool!
So, if we could represent our input function as a weighted sum of exponential functions, then we can trivially take the derivative any number of times (where that number doesn’t have to be an integer).
Oh, what’s that you say? The Fourier transform can convert any function into an integral (read: weighted sum) of complex exponential functions (sometimes called complex sinusoids)?
So, we’ve rewritten our function as a weighted sum of eigenfunctions of the derivative operator. The weights are $\hat{f}(\omega)$ and the eigenfunctions are $e^{i\omega x}$. So, now we can trivially[1] take the $n$th derivative:
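Spelled out under one common convention (the $\frac{1}{2\pi}$ normalization and the symbols $\hat{f}$ and $\omega$ are assumptions here; other conventions differ only in constants), the function is a weighted sum of complex exponentials:

$$
f(x) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \hat{f}(\omega)\, e^{i\omega x}\, d\omega
$$

Each $e^{i\omega x}$ is an eigenfunction of the derivative with eigenvalue $i\omega$, so differentiating $n$ times just multiplies each weight by $(i\omega)^n$:

$$
\frac{d^n}{dx^n} f(x) = \frac{1}{2\pi} \int_{-\infty}^{\infty} (i\omega)^n\, \hat{f}(\omega)\, e^{i\omega x}\, d\omega
$$

Nothing on the right-hand side requires $n$ to be an integer, as long as we agree on which value of $(i\omega)^n$ to use (the principal value is the usual choice).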
At this point, we’ve solved how to take the $n$th derivative in the general case, but we haven’t technically answered our original question: what’s the rd derivative of $\sin(x)$?
Lucky for us, the Fourier transform of $\sin(x)$ is quite simple. To get a handle on it, let’s first graph $e^{ix}$. Unfortunately, since $e^{ix}$ is a complex number for a given $x$, in order to graph the function for a range of $x$ values I’d need 3 dimensions. So, instead, I’ll graph $e^{ix}$ as a function of time (time will be my 3rd dimension).
So that’s a single complex exponential function. What if we add one more which rotates at exactly the same rate but in the opposite direction, and then add the two values together?
The imaginary (vertical) components cancel each other out perfectly and all we’re left with is a real number, which is twice a $\sin$ curve.
Why multiply by $-i$ and by $i$? Since $\sin(x)$ starts at 0, I want the counter-clockwise complex exponential ($e^{ix}$) to start out pointing down (and multiplication by $-i$ will rotate it clockwise by $90^\circ$). Similarly, I want the clockwise one ($e^{-ix}$) to start out pointing up (and multiplying by $i$ will do that).
Let’s test our function for a few values of $x$:
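A quick numerical version of that test, as a sketch: it assumes the combination described above, $\frac{1}{2}\left(-i e^{ix} + i e^{-ix}\right)$, and the function name `sin_from_exponentials` is introduced here for illustration.

```python
import cmath
import math

def sin_from_exponentials(x):
    """Half the sum of the two counter-rotating complex exponentials."""
    counter_clockwise = -1j * cmath.exp(1j * x)   # starts out pointing down
    clockwise = 1j * cmath.exp(-1j * x)           # starts out pointing up
    return (counter_clockwise + clockwise) / 2

for x in (0.0, math.pi / 6, math.pi / 2, math.pi):
    value = sin_from_exponentials(x)
    # The imaginary part should vanish and the real part should match sin(x).
    print(f"x={x:.4f}  real={value.real:+.6f}  imag={value.imag:+.6f}  sin={math.sin(x):+.6f}")
```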
So far so good. How about its derivative?
Well, we know what it should come out to, $\cos(x)$. Does it?
Yes, and here’s one way to think about it (you could also plug in a few values of $x$ to really convince yourself). The form of this equation looks similar to the form of our equation for $\sin(x)$, except that the two complex exponential functions aren’t multiplied by $-i$ and $i$, respectively. That just means they both start out pointing directly to the right, instead of one pointing down and one pointing up like in the $\sin$ case. You can look at the animation above and verify for yourself that if you start watching when the red and blue components are both pointing right, the graph looks like a $\cos$ curve.
What this also makes apparent, though, is that $\sin$ and $\cos$ are generated by the same process; it’s just that $\cos$ is “ahead” of $\sin$. This probably sounds familiar - that $\cos(x)$ and $\sin(x + \frac{\pi}{2})$ are the same thing. One easy way to prove this to yourself is to consider the fact that in the (left) right triangle below and that .
Ok, this is interesting and all, but let’s solve the problem.
And, in general:
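One way to carry out the general computation, assuming the principal values $i^n = e^{i n \pi / 2}$ and $(-i)^n = e^{-i n \pi / 2}$, is to differentiate each complex exponential $n$ times and regroup:

$$
\begin{aligned}
\frac{d^n}{dx^n} \sin(x)
&= \frac{1}{2}\left( -i \cdot i^{n} e^{ix} + i \cdot (-i)^{n} e^{-ix} \right) \\
&= \frac{1}{2}\left( -i\, e^{i\left(x + \frac{n\pi}{2}\right)} + i\, e^{-i\left(x + \frac{n\pi}{2}\right)} \right) \\
&= \sin\left(x + \frac{n\pi}{2}\right)
\end{aligned}
$$

Setting $n = 1$ recovers $\sin(x + \frac{\pi}{2}) = \cos(x)$, which matches the ordinary derivative.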
Ok, one last thing (I promise!). We’ve been focusing on fractional derivatives, but how about negative ones? We have a general formula in terms of $n$; is there anything wrong with taking the derivative “-1” times? Nope! That should just correspond to taking the anti-derivative.
So, in conclusion, the $n$th derivative of $\sin(x)$ is (obviously) $\sin(x + \frac{n\pi}{2})$.
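The whole recipe can also be tried numerically. The sketch below is an approximation of the idea, not the continuous Fourier transform itself: it uses a plain discrete Fourier transform over one sampled period of a $2\pi$-periodic, zero-mean signal, and the function name `fractional_derivative`, the sample count, and the choice to drop the constant and Nyquist terms are all assumptions made for illustration.

```python
import cmath
import math

def fractional_derivative(samples, order):
    """Differentiate one sampled period of a 2*pi-periodic signal `order` times."""
    n = len(samples)
    out = [0.0] * n
    for k in range(n):
        freq = k if k < n / 2 else k - n        # signed integer frequency
        if freq == 0 or 2 * k == n:
            continue                            # drop the constant and Nyquist terms
        # Weight of the complex exponential e^{i*freq*x} in the signal.
        weight = sum(samples[j] * cmath.exp(-2j * math.pi * j * k / n)
                     for j in range(n)) / n
        # Scale by the eigenvalue (i*freq) raised to `order` (principal value).
        scaled = weight * complex(0, freq) ** order
        for j in range(n):
            out[j] += (scaled * cmath.exp(2j * math.pi * j * k / n)).real
    return out

n = 16
xs = [2 * math.pi * j / n for j in range(n)]
samples = [math.sin(x) for x in xs]

half = fractional_derivative(samples, 0.5)      # compare with sin(x + pi/4)
anti = fractional_derivative(samples, -1)       # compare with -cos(x)
for x, h, a in zip(xs[:4], half, anti):
    print(f"x={x:.3f}  half={h:+.6f}  sin(x+pi/4)={math.sin(x + math.pi / 4):+.6f}  anti={a:+.6f}")
```

Taking the half derivative twice should reproduce $\cos(x)$, and order $-1$ should give the anti-derivative $-\cos(x)$ (with the constant of integration set to zero because the constant term is dropped).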
[1] Note this is using the mathematician’s definition of trivial, i.e. “theoretically possible”.