The Implicit Function Theorem: A Proof

We prove the Implicit Function Theorem for the bivariate case.

See also:



Theorem (Implicit function theorem)

Let F(x,y) be a function of class C1 on an open region UR2. Suppose the point P=(a,b) satisfies the following conditions:
  • F(a,b)=0 (i.e., P is a point on the curve F(x,y)=0),
  • Fy(a,b)0.
Then, there exist an open interval I on the x-axis such that aI and a univariate function y=φ(x) on I such that
  1. F(x,φ(x))=0 for all xI, that is, φ(x)  is an implicit function of F(x,y)=0, and
  2. b=φ(a).
Furthermore, the function φ(x) is differentiable on I and
(Eq:IF)φ(x)=Fx(x,φ(x))Fy(x,φ(x)).

Proof. Since Fy(a,b)0, either Fy(a,b)>0 or Fy(a,b)<0. In the following, we assume Fy(a,b)>0 (the other case is similar). We prove the theorem in three steps.

Step 1. Constructing a potential implicit function φ(x).

Since Fy(a,b)>0 and Fy(x,y) is continuous, we can find a sufficiently small ε>0 such that
|xa|<ε,|yb|<2εFy(x,y)>0.
(That is, if (x,y) is sufficiently close to (a,b), then Fy(x,y)>0.) For all x0satisfying |x0a|<ε, consider the univariate function g(y)=F(x0,y). Then, on the interval (b2ε,b+2ε), g(y)=Fy(x0,y)>0 so g(y) is a strictly monotone increasing function. In particular, F(a,b)=0 implies F(a,bε)<0 and F(a,b+ε)>0.

Again, since Fy(x,y) is continuous, we can find a sufficiently small ε>0 such that, for all x0 such that |x0a|<ε, the following hold:
  • The univariate function (x0,y) of y is strictly monotone increasing on (b2ε,b+2ε),
  •  F(x0,bε)<0 and F(x0,b+ε)>0.
By the Intermediate Value Theorem, for all x0 such that |x0a|<ε, there exists a y0 such that F(x0,y0)=0 in the range |y0b|<ε (i.e., y0(bε,b+ε)). Moreover, there is exactly one such y0 as F(x0,y) is strictly monotone increasing. Define the open interval I=(aε,a+ε). For each x0I, we can determine a unique y0 as the (unique) solution of F(x0,y0)=0. In other words, we can define a function y=φ(x) in this way, and this function satisfies the conditions (1) and (2) of the Theorem.

Step 2. Showing that φ(x) is continuous on I.

It suffices to show that, for all x0I, φ(x0+h)φ(x0) as h0. We prove this by contradiction. 

Suppose limh0φ(x0+h)φ(x0). Then we can find a sequence hn that converges to 0 (limnhn=0) and a positive real number ε>0 such that
  • For all nN, |φ(x0+hn)φ(x0)|ε.
By construction in Step 1, for all nN, φ(x0+hn)[bε,b+ε]. Thus, by the Bolzano-Weierstrass Theorem:
"A sequence that is bounded in a closed interval contains a subsequence that converges to some value in the interval."
we can find a subsequence {hnk} of {hn} such that limkφ(x0+hnk)=α[bε,b+ε] where α is a constant. Since |φ(x0+hnk)φ(x0)|ε for all k, we have |αφ(x0)|ε and, in particular, αφ(x0).

Now, consider a point sequence on R2: {(x0+hnk,φ(x0+hnk))}. This point sequence converges to (x0,α). Since F(x,y) is continuous, F(x0+hnk,φ(x0+hnk)) converges to F(x0,α) as k. However, for all k, we have F(x0+hnk,φ(x0+hnk))=0 and, in particular, F(x0,α)=0. We also have F(x0,φ(x0))=0 by construction. On the other hand, if any y0[bε,b+ε] satisfies F(x0,y0)=0, it must be unique (for a given x0). Therefore, it must be that α=φ(x0), which is a contradiction. Thus, φ(x) is continuous on I.

Step 3. Showing that φ(x) is differentiable on I.

It suffices to show that limh0φ(x+h)φ(x)h exists. By Taylor's Theorem, we can write
F(x+h,y+k)=F(x,y)+Fx(x+θh,y+θk)h+Fy(x+θh,y+θk)k
for some θ(0,1).  (See also: Taylor's theorem and asymptotic expansion)
If we set y=φ(x) and k=φ(x+h)φ(x), then, since F(x,φ(x))=F(x+h,φ(x+h))=0, we have
Fx(x+θh,φ(x)+θk)h+Fy(x+θh,φ(x)+θk)k=0,
that is,
φ(x+h)φ(x)h=kh=Fx(x+θh,φ(x)+θk)Fy(x+θh,φ(x)+θk).
Since φ(x) is continuous, we have k0 as h0. By the continuity of Fx(x,y) and Fy(x,y), we have
(Eq:dphi)limh0φ(x+h)φ(x)h=Fx(x,φ(x))Fy(x,φ(x)).
Therefore, φ(x) is differentiable on I and its derivative is given by the right-hand side of (Eq:dphi).

Comments

Popular posts from this blog

Birth process

Branching processes: Mean and variance

Informal introduction to formal logic