Partial and total differentiation of multivariate functions

A multivariate function may be differentiated with respect to each variable, which is called partial differentiation. By combining all the partial differentiations, we define total differentiation. The essence of (total) differentiation is a linear approximation. In the case of a univariate function, we approximate the function y=f(x) in the neighbor of a point, say x=a, by the tangent line y=f(a)(xa)+f(a). In the case of a multivariate function, we approximate the function y=f(x1,x2,,xn) in the neighbor of a point, say a=(a1,a2,,an), by the tangent hyperplane at the point a.

Partial differentiation

Let f(x,y) be a function on an open region UR2 and (a,b)U. If we fix y=b in f(x,y), we have a univariate function g(x)=f(x,b). Since U is open, there exists δ>0 such that Nδ(a,b)U. Therefore g(x) is defined on the open interval (aδ,a+δ). In other words, the function g(x) is defined in a neighbor of x=a.

Remark. We write Nδ(a,b) (rather than Nδ((a,b)), to save keystrokes!) to mean the δ-neighbor of the point (a,b)R2. □

If g(x) is differentiable at x=a, its differential coefficient is called the partial differential coefficient with respect to x (at (a,b)) and dgdx(a) is denoted as fx(a,b) or fx(a,b).

Remark. Here is one way to understand the partial differential coefficient. We have a surface z=f(x,y) in R3. Find its cross-section with the plane y=b. This cross-section is a curve defined by z=g(x)=f(x,b). The partial differential coefficient fx(a,b)=dgdx(a) is the slope of the tangent line of the curve at x=a. □

Similarly, if we fix x=a in f(x,y), we have a univariate function h(y)=f(a,y) which is defined in a neighbor of y=b. If dhdy(b) exists, it is called the partial differential coefficient with respect to y (at b) and denoted fy(a,b) or fy(a,b).

Example. Let f(x,y)=x2y+2xy2y3. Let us find the partial differential coefficients fx(a,b) and fy(a,b). Letting y=b, we have f(x,b)=x2b+2xb2b3. Differentiating the right-hand side with respect to x, we have 2xb+2b2. Setting x=a, we have fx(a,b)=2ab+2b2. 

Similarly, we have fy(a,b)=a2+4ab3b2.

Partial derivatives

If the partial differential coefficient fx(a,b) exists at every (a,b)U, then it defines a function on U. This function is called the partial derivative of f(x,y) with respect to x and is denoted 

fx(x,y) or fx(x,y).

Similarly, we define the partial derivative of f(x,y) with respect to y, denoted 

fy(x,y) or fy(x,y).

Example. Let f(x,y)=5x3+2x2y3xy2+y3. Then fx(x,y)=15x2+4xy3y2,fy(x,y)=2x26xy+3y2.

Total differentiation

Let us review the notion of differentiation of univariate functions. We defined the differential coefficient of a univariate function f(x) at x=a by
f(a)=dfdx(a)=limxaf(x)f(a)xa.
This is equivalent to
limxaf(x)f(a)f(a)(xa)|xa|=0,
or
f(x)=f(a)+f(a)(xa)+o(|xa|)
where o is Landau's little-o. This equation suggests that the function y=f(x) is approximated by a linear function, namely the tangent of y=f(x) at x=a,
y=f(a)+f(a)(xa).
Conversely, suppose that the function y=f(x) can be approximated by a linear function in a neighbor of x=a:
f(x)=f(a)+m(xa)+o(|xa|).
From this equation, we can see that
limxaf(x)f(a)xa=m.
This means that y=f(x) is differentiable at x=a and f(a)=m.

In summary, f(x) is approximated by the linear function f(a)+f(a)(xa) in a neighbor of x=a, and its slope is the differential coefficient f(a) itself. Such linear approximation is the essence of differentiation.

The same argument applies to multivariate functions. Differentiating the function z=f(x,y) at the point P=(a,b) is to approximate it by a linear function
z=f(a,b)+m(xa)+n(yb).
That is, for the point X=(x,y) in a neighbor of P=(a,b), we consider the linear approximation
(Eq:LA)f(x,y)=f(a,b)+m(xa)+n(yb)+o(XP)
where XP=(xa)2+(yb)2=d(X,P) is the distance between the points X and P. Setting y=b in this equation, we have
limxaf(x,b)f(a,b)xa=m.
That is, fx(a,b)=m. Similarly, we can show that fy(a,b)=n. In summary, if the linear approximation (Eq:LA) holds, it must be
f(x,y)=f(a,b)+fx(a,b)(xa)+fy(a,b)(yb)+o(XP)
in a neighbor of P.

Definition (Total differentiability)

Let U be an open region in R2 and P=(a,b)U. The function f(x,y) on U is said to be (totally) differentiable at (a,b) if there exist constants m and n such that
f(x,y)=f(a,b)+m(xa)+n(yb)+o(XP) as X=(x,y)P=(a,b)
or equivalently,
lim(x,y)(a,b)f(x,y)f(a,b)m(xa)n(yb)(xa)2+(yb)2=0.
f(x,y) is said to be (totally) differentiable on U if it is (totally) differentiable at every point in U.

Remark. The word "totally" in "totally differentiable" is used in contrast to "partially differentiable." However, "totally" may be omitted. If we simply say, "a multivariate function is differentiable," it means the function is totally differentiable. □

From the above discussion, if the function f(x,y) is totally differentiable at (a,b), it is partially differentiable at (a,b), and m=fx(a,b) and n=fy(a,b). (The converse is not necessarily true; We will see such an example in a later post.) The linear function
z=f(a,b)+fx(a,b)(xa)+fy(a,b)(yb)
is the tanget plane of z=f(x,y) at (a,b)

Remark. More generally, when the domain is in Rn, for the function y=f(x)=f(x1,x2,,xn) at the point a=(a1,a2,,an), we have the linear function
y=f(a)+fx1(a)(x1a1)+fx2(a)(x2a2)++fxn(a)(xnan)
that is the tangent hyperplane of y=f(x1,x2,,xn) at a=(a1,a2,,an). □

Example. Let us find the equation of the tangent plane of the surface defined by the function z=2x3+y2 at (1,2,2) (make sure this point indeed belongs to the given surface). Let f(x,y)=2x3+y2. Then
fx(x,y)=6x2,fy(x,y)=2y,
so that fx(1,2)=6 and fy(1,2)=4. Since f(1,2)=2, the tangent plane is given by
z=2+6(x+1)+4(y2),
that is,
6x+4yz=0.


Comments

Popular posts from this blog

Birth process

Branching processes: Mean and variance

Informal introduction to formal logic