Implicit Differentiation

Knowing the chain rule, we're now armed with a powerful tool for performing some very clever algebraic techniques. One such technique is implicit differentiation. With implicit differentiation, we can compute derivatives that we've never seen before. For example, we know from the power rule that:

\dfrac{d}{dx} x^a = a x^{a - 1}

However, we've only applied this rule where ${a}$ is some explicit number. For example, ${x^2,}$ ${x^3,}$ ${x^{-4},}$ etc. But what if the function we're dealing with is something of the form:

f(x) = x^{m/n}, \space (m, n \in \uint)

In other words, what happens when we have rational exponents? Algebraically, we know that where ${m = 1,}$ the expression ${x^{m/n}}$ yields the ${n^{\text{\scriptsize{th}}}}$ root:

x^{1/n} = \sqrt[n]{x}

And more generally:

x^{m/n} = \sqrt[n]{x^m}

Recognizing these relationships, suppose we had the function ${y = x^{m/n}.}$ We can rewrite the function as:

y^n = x^m

Now, we can apply differentiation to the equation above:

\dfrac{d}{dx} y^n = x^m

We performed the manipulation because we simply don't know how to differentiate ${x^{m/n}.}$ Applying the derivative to both sides:

\dfrac{d}{dx} y^n = \dfrac{d}{dx} x^m

In the equation above, ${y}$ is a function of ${x}$ — we have to apply the chain rule:

\left(\dfrac{d}{dy}y^n\right) \dfrac{dy}{dx} = mx^{m-1}

Writing the expression this way, we know that ${\dfrac{d}{dy}y^n = ny^{n-1}.}$ Hence:

ny^{n - 1} \dfrac{dy}{dx} = mx^{m-1}

Solving for ${\dfrac{dy}{dx}:}$

\begin{aligned} \dfrac{dy}{dx} &= \dfrac{mx^{m-1}}{ny^{n-1}} \\[1em] &= \dfrac{m}{n} \dfrac{x^{m-1}}{(x^{m/n})^{n-1}} \\[1em] &= \dfrac{m}{n} x^{m - 1 - \frac{m}{n}(n-1)} \\[1em] &= \dfrac{m}{n} x^{m - 1 - m + \frac{m}{n}} \\[1em] &= \dfrac{m}{n} x^{-1 + \frac{m}{n}} \\[1em] \end{aligned}

We can clean the final result above by suppose that ${a = \dfrac{m}{n}:}$

\dfrac{dy}{dx} = a x^{a - 1}

Let's consider another example: ${x^2 + y^2 = 1.}$ As we know, this is the equation for a circle. Solving for ${y,}$ we obtain the following:

\begin{aligned} y^2 = 1 - x^2 y = \pm \sqrt{1 - x^2} \end{aligned}

We say that ${x^2 + y^2 = 1}$ is the implicit definition for the function, and ${y = \pm \sqrt{1-x^2}}$ is the explicit definition. Of note, the relation ${x^2 + y^2 = 1}$ is not a function. It's more accurately described as a multifunction, but the notion of implicit and explicit definitions remains equally applicable. The circle multifunction consists of two branches, a positive and a negative branch (i.e., the top half and the bottom half of the circle). For the purposes of simplicity, we'll only consider the positive branch, ${y = \sqrt{1-x^2}.}$

In terms of differentiation, function definitions containing radicands are unsightly. Often, the first step to differentiating such functions is to rewrite the definition in terms of rational number exponents:

\begin{aligned} y &= \sqrt{1 - x^2} \\[1em] &= (1 - x^2)^{\frac{1}{2}} \end{aligned}

Writing the definition in terms of fractional exponents, we clearly see an opportunity to apply the chain rule.

\begin{aligned} y' &= \dfrac{1}{2}(1 - x^2)^{- \frac{1}{2}}(-2x) \\[1em] &= \dfrac{(-2x)(1-x^2)^{- \frac{1}{2}}}{2} \\[1em] &= - \dfrac{x}{(1 - x^2)^{\frac{1}{2}}} \\[1em] &= - \dfrac{x}{\sqrt{1 - x^2}} \\[1em] \end{aligned}

The evaluation above yields an explicit solution to computing the derivative. We can, however, take an implicit approach, yielding an implicit solution:

\dfrac{d}{dx} (x^2 + y^2 = 1) = (2x + 2yy' = 0)

Solving for ${y'}$ :

y' = \dfrac{-2x}{2y} = - \dfrac{x}{y} \space \space \space (y = \sqrt{1 - x^2})

When substitute for ${y}$ :

y' = \dfrac{-2x}{2y} = - \dfrac{x}{\sqrt{1-x^2}}

Consider the mechanics of this approach. We left the original equation, ${x^2 + y^2 = 1,}$ intact. Then, we applied the derivative, arriving at an implicit solution. If we wanted the explicit solution, we substituted for ${y.}$ Moreover, the implicit solution can be easily modified to yield both halves of the circle:

y' = \dfrac{-2x}{2y} = - \dfrac{x}{y} \space \space \space (y = \pm \sqrt{1 - x^2})

This analysis evidences the fact that the implicit approach is often much easier than the explicit approach. Let's consider another example:

y^4 + xy^2 - 2 = 0

Like the previous example, we can compute the derivative for this equation with the explicit approach:

y^2 = \dfrac{-x \pm \sqrt{x^2 - 4(-2)}}{2}

Accordingly, isolating ${y:}$

y = \pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}

As we can see, this is a very messy quartic equation. We've got a nasty equation with not just two cases, but four. The implicit method is much, much easier. We differentiate by keeping the original equation intact. The first term:

\dfrac{d}{dx} y^4 = 4y^3y'

Then we differentiate the second term (applying the product rule):

\dfrac{d}{dx}xy^2 = y^2 + x(2yy')

Then we differentiate the third term:

\dfrac{d}{dx} (-2) = 0

This process yields:

4y^3y' + y^2 + x(2yy') - 0 = 0

Now all we have to do is solve for ${y':}$

\begin{aligned} 4y^3y' + y^2 + x(2yy') - 0 &= 0 \\ 4y^3(y') + y^2 + 2xy(y') &= 0 \\ 4y^3(y') + 2xy(y') &= -y^2 \\ (y')(4y^3 + 2xy) &= -y^2 \\ y' &= -\dfrac{y^2}{4y^3 + 2xy} \\ \end{aligned}

Now, this is just an implicit solution. The explicit solution requires substituting for ${y,}$ as we did earlier:

\dfrac{dy}{dx} = y' = -\dfrac{\left(\pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}\right)^2}{4\left(\pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}\right)^3 + 2x\left(\pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}\right)}

We leave it to the reader to try the explicit method. While the implicit approach is often faster, there are limitations. For starters, it doesn't avoid the complexity of quartic equation like the one above. For example, with the original equation, ${y^4 + xy^2 - 2 = 0,}$ we know that one solution is the coordinate ${(1,1).}$ Thus, the point ${(1,1)}$ likes on the graph of ${y^4 + xy^2 - 2 = 0.}$ If we plug in this point to the derivative above:

\begin{aligned} \dfrac{dy}{dx} &= -\dfrac{\left(\pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}\right)^2}{4\left(\pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}\right)^3 + 2x\left(\pm \sqrt{\dfrac{-x \pm \sqrt{x^2 + 8}}{2}}\right)} \\[2em] &= -\dfrac{\left(\pm \sqrt{\dfrac{-(1) \pm \sqrt{(1)^2 + 8}}{2}}\right)^2}{4\left(\pm \sqrt{\dfrac{-(1) \pm \sqrt{(1)^2 + 8}}{2}}\right)^3 + 2(1)\left(\pm \sqrt{\dfrac{-(1) \pm \sqrt{(1)^2 + 8}}{2}}\right)} \\[2em] &= -\dfrac{(1)^2}{4(1)^3 + 2(1)(1)} \\[2em] &= -\dfrac{1}{6} \end{aligned}

Thus, the slope at the point ${(1,1)}$ on the graph of ${y^4 + xy^2 - 2 = 0}$ is ${- \frac{1}{6}.}$ For the point ${x = 2,}$ however, we have no choice but to tackle the complexity head on — we must go through all of the tedious manipulation.

Differentiating Inverse Functions

Suppose we have the function ${y = \sqrt{x}.}$ We can rewrite this function as ${y^2 = x.}$ Alternatively, we could write ${f(x) = \sqrt{x},}$ which in turn can be written as ${g(y) = x \ni g(y) = y^2}$ (the notation ${\ni}$ means “such that”). More generally:

Notation. Suppose ${y: x \to f(x)}$ is a function, written as ${y = f(x).}$ The function ${g: y \to x,}$ written as ${g(y) = x,}$ is called the inverse function of ${y.}$

In other words, if we have a function ${y = f(x),}$ we can rewrite it as ${g(y) = x.}$ The function ${g(y) = x,}$ which is really ${g(f(x)) = x,}$ is called the inverse function of ${y = f(x).}$ The inverse function of ${y = f(x),}$ which is ${g(y) = x,}$ is usually written with the notation ${g = f^{-1}.}$ Similarly, the inverse of ${g(y) = x,}$ which is ${y = f(x),}$ is usually written as ${f = g^{-1}.}$

Given our definition above, we can see that ${f(x) = \sqrt{x}}$ has the inverse function ${f^{-1}(x) = x^2.}$ Comparing these two graphs:

The graph of ${y = \sqrt{x}}$ to the left, and the graph of ${x = y^2}$ to the right.

The graphs above evidence the fact that the inverse functions are really just switching the ${x}$ and ${y}$ values. We can see this more clearly when we plot both functions on the same plane:

Notice how the roles of ${x}$ and ${y}$ are switched. Original: ${x}$ is the input, ${y}$ is the output. Inverse: ${y}$ is the input, ${x}$ is the output.

So what does this have to do with implicit differentiation? Well, as long as we know the derivative of some function ${f,}$ we can find the derivative of the inverse function ${f^{-1}.}$ In other words, we can find the derivative of any inverse function ${f^{-1}}$ provided we know the derivative of ${f.}$

This may not seem like much of an insight given the examples we've seen earlier, but it proves to be immeasurably useful when we confront some fairly complex functions. For example, consider the function ${y = \arctan x.}$ This is the function ${y = \tan^{-1} x,}$ which is the inverse of ${x = \tan y.}$ Plotting both these graphs together, we have:

A more focused view reveals:

First, let's consider the derivative of ${\tan y.}$ From trigonometry, we know that ${\tan y = \dfrac{\sin y}{\cos y}.}$ Accordingly, the derivative of ${\tan y}$ is computed by applying the quotient rule:

\begin{aligned} \dfrac{d}{dy} \tan y &= \dfrac{(\sin y)'(\cos y) - (\sin y)(\cos y)'}{(\cos y)^2} \\[2em] &= \dfrac{(\cos y)(\cos y) - (\sin y)(-\sin y)}{(\cos y)^2} \\[2em] &= \dfrac{\cos^2 y + \sin^2 y}{\cos^2 y} \\[2em] &= \dfrac{1}{\cos^2 y} \\[2em] &= \sec^2 y \end{aligned}

Thus, ${\dfrac{d}{dy} \tan y = \sec^2 y.}$ Now we differentiate:

\begin{aligned} \dfrac{d}{dy}(\tan y = x) &= \left(\dfrac{d}{dy} \tan y\right)\left(\dfrac{dy}{dx}\right) = 1 \\[2em] &= \dfrac{1}{\cos^2 y} \cdot y' = 1 \end{aligned}

Solving for ${y',}$ we have:

y' = \cos^2 y

Substituting for ${y,}$ we have:

\dfrac{d}{dx} \arctan x = y' = \cos^2 (\arctan x)

The derivative above is correct, but it is very complicated. Whenever we work with trigonometric functions, we want to always think of applying a trigonometric identity that simplifies the expressions. In this case, we have an identity that's directly applicable:

\cos y = \dfrac{1}{\sqrt{1 + x^2}}

Accordingly:

\cos^2 y = \dfrac{1}{1 + x^2}

Applying this fact to our derivative:

\dfrac{d}{dx} \arctan x = \dfrac{1}{1 + x^2}

Having seen implicit differentiation as applied to ${f(x) = \arctan x,}$ we can see how easy it is for the other inverse trigonometric functions. For example, consider ${y = \arcsin x.}$

\begin{aligned} y &= \arcsin x \\ \sin y &= x \\ \dfrac{d}{dy} (\sin y = x) &= \left(\dfrac{d}{dy} \sin y = \dfrac{d}{dy} x\right) \\ &= (\cos y) y' = 1 \\ \end{aligned}

Solving for ${y'}$ and substituting for ${y}$ :

\begin{aligned} y' &= \dfrac{1}{\cos y} \\ &= \dfrac{1}{\sqrt{1 - \sin^2 y}} \\ &= \dfrac{1}{\sqrt{1 - x^2}} \end{aligned}

We now have another useful derivative:

definition. Given the function ${y = \arcsin x,}$ the function's derivative ${y'}$ is:
$y' = \dfrac{1}{\sqrt{1-x^2}}$