Introductory university-level calculus, linear algebra, abstract algebra, probability, statistics, and stochastic processes.
Method of Lagrange multipliers
Get link
Facebook
X
Pinterest
Email
Other Apps
-
We have studied how to identify extreme values of two-variable functions . In practice, we may have additional constraints. For example,
Find the extreme values of subject to the constraint .
We can use the method of Lagrange multipliers to solve this type of problem.
Let's consider the following example.
Problem. If moves on the unit circle , find the extreme values of . □
This problem can be restated as follows:
Problem (restated). Let . Find the extreme values of subject to the constraint . □
Let's solve this problem using an "explicit" method.
Solution 1 (Explicit method). Consider the implicit function of on the open interval and substitute it to to have
Since
has a local minimum at and a local maximum at . Therefore, has a local minimum value at point , and a local maximum value at point on the unit circle.
Similarly, using the implicit function on the open interval , we can define and find that has a local minimum value at and a local maximum value at .
Next, we examine the neighbors of the points and . In these cases, we consider the implicit functions (the branch passing through ) and (the branch passing through ), and find the same points as above to give local minimum and maximum.
In summary, has a local minimum value at , and a local maximum value at . □
The above solution is explicit and easy to understand. However, this method applies only when the implicit functions of can be obtained explicitly. If that is not the case, the following method of Lagrange (undetermined) multipliers provides an effective approach. We first show how to solve the above problem using the method of Lagrange multipliers. We will prove the method in general later.
Solution 2 (Lagrange multiplier). First, let us define a new function
where is a new variable (undetermined constant) called the Lagrange multiplier. Note that we have if the constraint is satisfied. We now consider the extreme value problem of this function.
Next, we differentiate with respect to and and solve
Furthermore,
should also be satisfied. The latter equation implies, in particular, that (i.e., and cannot be simultaneously equal to 0). However, from (Eq:lagx) and (Eq:lagy) above, if , then and vice versa, which is a contradiction. Thus, both and are non-zero. Eliminating, for example, from (Eq:lagx) and (Eq:lagy) and some rearrangement gives
Since , we have . For each of these values of , we solve the simultaneous equations (Eq:lagx) and (Eq:lagy) for and :
For , we have .
For , we have .
Combining with (Eq:lambda), we have
For , we have .
For , we have .
In order to see if indeed has extreme values at these points, let us examine its behavior in the neighbor of these points .
Since , each point on the unit circle can be expressed as where is an implicit function of . Thus, we need to solve the extreme value problem of
Differentiating the equation of constraint , or
with respect to , we have
From this, we have
Using the multivariate chain rule, we obtain
If (corresponding to ), we have and so has a local minimum value .
If (corresponding to ), we have and so has a local maximum value . □
It should be stressed that using the method of Lagrange multipliers, we did not need to know the explicit form of the implicit function . That is, the implicit function remains, in fact, implicit. The mere existence of the implicit function is sufficient. Thus, this method is applicable even when the implicit function is hard (if possible) to find.
Remark. In general, constrained extreme value problems can be solved by the following procedure:
Enumerate the possible extreme points by the method of Lagrange multipliers.
For each possible extreme point, consider the implicit function in the neighbor of the point and evaluate the first and second derivatives.
□
We now give the method of Lagrange multipliers as a theorem.
Theorem [Method of Lagrange multipliers]
Let and be functions of class . With a new variable , let us define
Suppose the point satisfies the following conditions:
The function has an extreme value at subject to the constraint (in particular, );
is a regular point of (i.e., it is not the case that ).
Then, there exists such that
Proof. By condition (1), . Thus, it suffices to show that there exists such that
By condition (2), or . Without loss of generality (what does this mean?), we may assume . By the Implicit Function Theorem, there exists a function in a neighbor of such that and . In the neighbor of , any point satisfying is of the form . Therefore, we need to solve the extreme value problem of the univariate function in the neighbor of . Note that
Differentiating with respect to , we have
By assumption, has an extreme value at ,
But so that
It follows that
Based on this, let us define
Clearly, this satisfies (Eq:lagax}) and (Eq:lagay). ■
Defining the birth process Consider a colony of bacteria that never dies. We study the following process known as the birth process , also known as the Yule process . The colony starts with cells at time . Assume that the probability that any individual cell divides in the time interval is proportional to for small . Further assume that each cell division is independent of others. Let be the birth rate. The probability of a cell division for a population of cells during is . We assume that the probability that two or more births take place in the time interval is . That is, it can be ignored. Consequently, the probability that no cell divides during is . Note that this process is an example of the Markov chain with states \({n_0}, {n_0 + 1}, {n_0 + 2}...
Generational growth Consider the following scenario (see the figure below): A single individual (cell, organism, etc.) produces descendants with probability , independently of other individuals. The probability of this reproduction, , is known. That individual produces no further descendants after the first (if any) reproduction. These descendants each produce further descendants at the next subsequent time with the same probabilities. This process carries on, creating successive generations. Figure 1. An example of the branching process. Let be the random variable representing the population size (number of individuals) of generation . In the above figure, we have , , , , We shall assume as the initial condition. Ideally, our goal would be to find how the population size grows through generations, that is, to find the probability for e...
In mathematics, we must prove (almost) everything and the proofs must be done logically and rigorously. Therefore, we need some understanding of basic logic. Here, I will informally explain some rudimentary formal logic. Definitions (Proposition): A proposition is a statement that is either true or false. "True" and "false" are called the truth values, and are often denoted and . Here is an example. "Dr. Akira teaches at UBD." is a statement that is either true or false (we understand the existence of Dr. Akira and UBD), hence a proposition. The following statement is also a proposition, although we don't know if it's true or false (yet): Any even number greater than or equal to 4 is equal to a sum of two primes. See also: Goldbach's conjecture Next, we define several operations on propositions. Note that propositions combined with these operations are again propositions. (Conjunction, logical "and"): Let ...
Comments
Post a Comment