Inverse functions and differentiation

From HandWiki
Short description: Calculus identity
Rule:
f(x)=1(f1)(f(x))

Example for arbitrary x05.8:
f(x0)=14
(f1)(f(x0))=4

In mathematics, the inverse of a function y=f(x) is a function that, in some fashion, "undoes" the effect of f (see inverse function for a formal and detailed definition). The inverse of f is denoted as f1, where f1(y)=x if and only if f(x)=y.

Their two derivatives, assuming they exist, are reciprocal, as the Leibniz notation suggests; that is:

dxdydydx=1.

This relation is obtained by differentiating the equation f1(y)=x in terms of x and applying the chain rule, yielding that:

dxdydydx=dxdx

considering that the derivative of x with respect to x is 1.

Writing explicitly the dependence of y on x, and the point at which the differentiation takes place, the formula for the derivative of the inverse becomes (in Lagrange's notation):

[f1](a)=1f(f1(a)).

This formula holds in general whenever f is continuous and injective on an interval I, with f being differentiable at f1(a)(I) and wheref(f1(a))0. The same formula is also equivalent to the expression

𝒟[f1]=1(𝒟f)(f1),

where 𝒟 denotes the unary derivative operator (on the space of functions) and denotes function composition.

Geometrically, a function and inverse function have graphs that are reflections, in the line y=x. This reflection operation turns the gradient of any line into its reciprocal.[1]

Assuming that f has an inverse in a neighbourhood of x and that its derivative at that point is non-zero, its inverse is guaranteed to be differentiable at x and have a derivative given by the above formula.

Examples

  • y=x2 (for positive x) has inverse x=y.
dydx=2x    ;    dxdy=12y=12x
dydxdxdy=2x12x=1.

At x=0, however, there is a problem: the graph of the square root function becomes vertical, corresponding to a horizontal tangent for the square function.

  • y=ex (for real x) has inverse x=lny (for positive y)
dydx=ex    ;    dxdy=1y
dydxdxdy=ex1y=exex=1

Additional properties

f1(x)=1f(f1(x))dx+C.
This is only useful if the integral exists. In particular we need f(x) to be non-zero across the range of integration.
It follows that a function that has a continuous derivative has an inverse in a neighbourhood of every point where the derivative is non-zero. This need not be true if the derivative is not continuous.
  • Another very interesting and useful property is the following:
f1(x)dx=xf1(x)F(f1(x))+C
Where F denotes the antiderivative of f.

Higher derivatives

The chain rule given above is obtained by differentiating the identity f1(f(x))=x with respect to x. One can continue the same process for higher derivatives. Differentiating the identity twice with respect to x, one obtains

d2ydx2dxdy+ddx(dxdy)(dydx)=0,

that is simplified further by the chain rule as

d2ydx2dxdy+d2xdy2(dydx)2=0.

Replacing the first derivative, using the identity obtained earlier, we get

d2ydx2=d2xdy2(dydx)3.

Similarly for the third derivative:

d3ydx3=d3xdy3(dydx)43d2xdy2d2ydx2(dydx)2

or using the formula for the second derivative,

d3ydx3=d3xdy3(dydx)4+3(d2xdy2)2(dydx)5

These formulas are generalized by the Faà di Bruno's formula.

These formulas can also be written using Lagrange's notation. If f and g are inverses, then

g(x)=f(g(x))[f(g(x))]3

Example

  • y=ex has the inverse x=lny. Using the formula for the second derivative of the inverse function,
dydx=d2ydx2=ex=y    ;    (dydx)3=y3;

so that

d2xdy2y3+y=0    ;    d2xdy2=1y2,

which agrees with the direct calculation.

See also

References