Extreme values of multivariate functions



A local maximum (or minimum) value of a function is the maximum (or minimum) value of the function in the neighbor of a point. More formally,

Definition (Local maximum, local minimum)

Let f(x,y) be a function on an open region UR2 and P=(a,b)U.

  1. f(a,b) is said to be a local maximum value of the function f(x,y) if there exists δ>0 such that, for all (x,y)U, if (x,y)Nδ(P)U and (x,y)(a,b), then f(x,y)<f(a,b).
  2. f(a,b) is said to be a local minimum value of the function f(x,y) if there exists δ>0 such that, for all (x,y)U, if (x,y)Nδ(P)U and (x,y)(a,b), then f(x,y)>f(a,b).
Local maximum and minimum values are collectively called extreme values.

Theorem (Necessary condition for extreme values)

Let f(x,y) be a function on an open region U, and (a,b)U. Suppose that fx(a,b) and fy(a,b) exist. If f(a,b) is an extreme value, then fx(a,b)=fy(a,b)=0.
Proof. If f(x,y) has an extreme value at (a,b), then the univariate function f(x,b) of x has an extreme value at x=a. By the corresponding theorem for univariate functions, fx(a,b)=0. Similarly, fy(a,b)=0. ■

The converse of this theorem is not necessarily true.

Example. Consider f(x,y)=x2y2 (See Fig:NoExtreme below). We can see that fx(0,0)=fy(0,0)=0. However, the graph of f(x,0)=x2 is convex whereas that of f(0,y)=y2 is concave. This means that, if we move from the origin along the x-axis, the value of f(x,y) increases, but if we move from the origin along the y-axis, the value decreases. Therefore, f(0,0)a  is neither a a local maximum nor a local minimum. □
Figrue Fig:NoExtreme. The graph of z=x2y2. This function has neither a local maximum nor a local minimum at (x,y)=(0,0). (See the above example.)

In the case of univariate functions, we can determine whether a function takes a local maximum or local minimum value at a given point by the sign of the second differential coefficient at that point. We have a corresponding theorem for the case of bivariate functions, but the situation is a bit more complicated. We remark the following theorem without proof.

Theorem (Criteria of extreme values)

Let f(x,y) be a function of class C2 on an open region U. Suppose that (a,b)U and fx(a,b)=fy(a,b)=0. Let us define the determinant D by
D=fxx(a,b)fyy(a,b)[fxy(a,b)]2.
  1. If D>0,
    1. if fxx(a,b)>0, then f(x,y) has a local minimum value at (x,y)=(a,b);
    2. if fxx(a,b)<0, then f(x,y) has a local maximum value at (x,y)=(a,b).
  1. If D<0, f(x,y) does not have a local extremum at (x,y)=(a,b).

Proof. Omitted (beyond the scope of this module). ■

Remark. Note that the "determinant" D in the above theorem is indeed the determinant of a matrix:D=|fxx(a,b)fxy(a,b)fyx(a,b)fyy(a,b)|. This matrix is called the Hessian of the function. You may guess the Hessian of a general multivariate function f(x1,x2,,xn). □

Remark. This theorem is only applicable to bivariate (two-variable) functions. For general multivariate functions, such as f(x1,x2,,xn), the corresponding theorem is even more complicated (but it exists). That is, if all the eigenvalues of the Hessian are positive (or negative) at a given point, then the function takes a local minimum (or maximum) value at the point. (Note that the Hessian is always symmetric if the function is of class C2. Therefore, all the eigenvalues are guaranteed to be real.) □

Example. Let us find the extreme values of the function f(x,y)=x3+y33(x+y) on R2 (See the figure below).
The graph of z=x3+y33(x+y).

Solving 
fx(x,y)=3(x21)=0,fy(x,y)=3(y21)=0,
we have
(x,y)=(1,1),(1,1),(1,1),(1,1).

From
fxx(x,y)=6x,fxy(x,y)=0,fyy(x,y)=6y,
we have 
D=fxx(x,y)fyy(x,y)[fxy(x,y)]2=36xy.
  • When (x,y)=(1,1), we have D=36>0 and fxx(1,1)=6>0 so that f(1,1)=4 is a local minimum.
  • When (x,y)=(1,1), we have D=36>0 and fxx(1,1)=6<0 so that f(1,1)=4 is a local maximum.
  • When (x,y)=(1,1) or (1,1), we have D=36<0 so f(x,y) does not take extreme values.




Comments

Popular posts from this blog

Birth process

Branching processes: Mean and variance

Informal introduction to formal logic