Introductory university-level calculus, linear algebra, abstract algebra, probability, statistics, and stochastic processes.
Partial and total differentiation of multivariate functions
Get link
Facebook
X
Pinterest
Email
Other Apps
-
A multivariate function may be differentiated with respect to each variable, which is called partial differentiation. By combining all the partial differentiations, we define total differentiation. The essence of (total) differentiation is a linear approximation. In the case of a univariate function, we approximate the function \(y = f(x)\) in the neighbor of a point, say \(x = a\), by the tangent line \(y = f'(a)(x - a) + f(a)\). In the case of a multivariate function, we approximate the function \(y = f(x_1, x_2, \cdots, x_n)\) in the neighbor of a point, say \(a = (a_1, a_2, \cdots, a_n)\), by the tangent hyperplane at the point \(a\).
Partial differentiation
Let \(f(x,y)\) be a function on an open region \(U\subset \mathbb{R}^2\) and \((a,b) \in U\). If we fix \(y = b\) in \(f(x,y)\), we have a univariate function \(g(x) = f(x,b)\). Since \(U\) is open, there exists \(\delta > 0\) such that \(N_{\delta}(a,b) \subset U\). Therefore \(g(x)\) is defined on the open interval \((a - \delta, a + \delta)\). In other words, the function \(g(x)\) is defined in a neighbor of \(x = a\).
Remark. We write \(N_{\delta}(a,b)\) (rather than \(N_{\delta}((a,b))\), to save keystrokes!) to mean the \(\delta\)-neighbor of the point \((a,b)\in \mathbb{R}^2\). □
If \(g(x)\) is differentiable at \(x = a\), its differential coefficient is called the partial differential coefficient with respect to \(x\) (at \((a,b)\)) and \(\frac{dg}{dx}(a)\) is denoted as \(\frac{\partial f}{\partial x}(a,b)\) or \(f_x(a,b)\).
Remark. Here is one way to understand the partial differential coefficient. We have a surface \(z = f(x,y)\) in \(\mathbb{R}^3\). Find its cross-section with the plane \(y = b\). This cross-section is a curve defined by \(z = g(x) = f(x,b)\). The partial differential coefficient \(\frac{\partial f}{\partial x}(a,b) = \frac{dg}{dx}(a)\) is the slope of the tangent line of the curve at \(x = a\). □
Similarly, if we fix \(x = a\) in \(f(x,y)\), we have a univariate function \(h(y) = f(a,y)\) which is defined in a neighbor of \(y = b\). If \(\frac{dh}{dy}(b)\) exists, it is called the partial differential coefficient with respect to \(y\) (at \(b\)) and denoted \(\frac{\partial f}{\partial y}(a,b)\) or \(f_y(a,b)\).
Example. Let \(f(x,y) = x^2y + 2xy^2 - y^3\). Let us find the partial differential coefficients \(f_x(a,b)\) and \(f_y(a,b)\). Letting \(y = b\), we have \(f(x,b) = x^2b + 2xb^2 - b^3\). Differentiating the right-hand side with respect to \(x\), we have \(2xb + 2b^2\). Setting \(x=a\), we have \[f_x(a,b) = 2ab + 2b^2.\]
Similarly, we have \[f_y(a,b) = a^2 + 4ab - 3b^2.\] □
Partial derivatives
If the partial differential coefficient \(\frac{\partial f}{\partial x}(a,b)\) exists at every \((a,b)\in U\), then it defines a function on \(U\). This function is called thepartial derivative of \(f(x,y)\) with respect to \(x\) and is denoted
Let us review the notion of differentiation of univariate functions. We defined the differential coefficient of a univariate function \(f(x)\) at \(x=a\) by
where \(o\) is Landau's little-o. This equation suggests that the function \(y = f(x)\) is approximated by a linear function, namely the tangent of \(y=f(x)\) at \(x = a\),
\[y = f(a) + f'(a)(x - a).\]
Conversely, suppose that the function \(y = f(x)\) can be approximated by a linear function in a neighbor of \(x = a\):
\[f(x) = f(a) + m(x - a) + o(|x - a|).\]
From this equation, we can see that
\[\lim_{x \to a}\frac{f(x) - f(a)}{x - a} = m.\]
This means that \(y = f(x)\) is differentiable at \(x=a\) and \(f'(a) = m\).
In summary, \(f(x)\) is approximated by the linear function \(f(a) + f'(a)(x-a)\) in a neighbor of \(x = a\), and its slope is the differential coefficient \(f'(a)\) itself. Such linear approximation is the essence of differentiation.
The same argument applies to multivariate functions. Differentiating the function \(z = f(x,y)\) at the point \(P= (a,b)\) is to approximate it by a linear function
\[z = f(a,b) + m(x-a) + n(y-b).\]
That is, for the point \(X= (x,y)\) in a neighbor of \(P = (a,b)\), we consider the linear approximation
where \(\|X - P\| = \sqrt{(x-a)^2 + (y-b)^2} = d(X,P)\) is the distance between the points \(X\) and \(P\). Setting \(y = b\) in this equation, we have
\[\lim_{x \to a}\frac{f(x, b) - f(a,b)}{x - a} = m.\]
That is, \(f_x(a,b) = m.\) Similarly, we can show that \(f_y(a,b) = n\). In summary, if the linear approximation (Eq:LA) holds, it must be
Let \(U\) be an open region in \(\mathbb{R}^2\) and \(P=(a,b) \in U\). The function \(f(x,y)\) on \(U\) is said to be (totally) differentiable at \((a,b)\) if there exist constants \(m\) and \(n\) such that
\[f(x,y) = f(a,b) + m(x-a) + n(y-b) + o(\|X-P\|) \text{ as $X = (x,y)\to P = (a,b)$}\]
\(f(x,y)\) is said to be (totally) differentiable on \(U\) if it is (totally) differentiable at every point in \(U\).
Remark. The word "totally" in "totally differentiable" is used in contrast to "partially differentiable." However, "totally" may be omitted. If we simply say, "a multivariate function is differentiable," it means the function is totally differentiable. □
From the above discussion, if the function \(f(x,y)\) is totally differentiable at \((a,b)\), it is partially differentiable at \((a,b)\), and \(m = f_x(a,b)\) and \(n = f_y(a,b)\). (The converse is not necessarily true; We will see such an example in a later post.) The linear function
is the tanget plane of \(z = f(x,y)\) at \((a,b)\).
Remark. More generally, when the domain is in \(\mathbb{R}^n\), for the function \(y = f(x) = f(x_1, x_2,\cdots, x_n)\) at the point \(a = (a_1, a_2, \cdots, a_n)\), we have the linear function
that is the tangent hyperplane of \(y = f(x_1, x_2, \cdots, x_n)\) at \(a = (a_1, a_2, \cdots, a_n)\). □
Example. Let us find the equation of the tangent plane of the surface defined by the function \(z = 2x^3 + y^2\) at \((-1, 2, 2)\) (make sure this point indeed belongs to the given surface). Let \(f(x,y) = 2x^3 + y^2\). Then
Open sets In \(\mathbb{R}\), we have the notion of an open interval such as \((a, b) = \{x \in \mathbb{R} | a < x < b\}\). We want to extend this idea to apply to \(\mathbb{R}^n\). We also introduce the notions of bounded sets and closed sets in \(\mathbb{R}^n\). Recall that the \(\varepsilon\)-neighbor of a point \(x\in\mathbb{R}^n\) is defined as \(N_{\varepsilon}(x) = \{y \in \mathbb{R}^n | d(x, y) < \varepsilon \}\) where \(d(x,y)\) is the distance between \(x\) and \(y\). Definition (Open set) A subset \(U\) of \(\mathbb{R}^n\) is said to be an open set if the following holds: \[\forall x \in U ~ \exists \delta > 0 ~ (N_{\delta}(x) \subset U).\tag{Eq:OpenSet}\] That is, for every point in an open set \(U\), we can always find an open ball centered at that point, that is included in \(U\). See the following figure. Perhaps, it is instructive to see what is not an open set. Negating (Eq:OpenSet), we have \[\exists x \in U ~ \forall \delta > 0 ~ (N_{\delta}(x) \not
We would like to study multivariate functions (i.e., functions of many variables), continuous multivariate functions in particular. To define continuity, we need a measure of "closeness" between points. One measure of closeness is the Euclidean distance. The set \(\mathbb{R}^n\) (with \(n \in \mathbb{N}\)) with the Euclidean distance function is called a Euclidean space. This is the space where our functions of interest live. The real line is a geometric representation of \(\mathbb{R}\), the set of all real numbers. That is, each \(a \in \mathbb{R}\) is represented as the point \(a\) on the real line. The coordinate plane , or the \(x\)-\(y\) plane , is a geometric representation of \(\mathbb{R}^2\), the set of all pairs of real numbers. Each pair of real numbers \((a, b)\) is visualized as the point \((a, b)\) in the plane. Remark . Recall that \(\mathbb{R}^2 = \mathbb{R}\times\mathbb{R} = \{(x, y) | x, y \in \mathbb{R}\}\) is the Cartesian product of \(\mathbb{R}\) with i
We can use multiple integrals to compute areas and volumes of various shapes. Area of a planar region Definition (Area) Let \(D\) be a bounded closed region in \(\mathbb{R}^2\). \(D\) is said to have an area if the multiple integral of the constant function 1 over \(D\), \(\iint_Ddxdy\), exists. Its value is denoted by \(\mu(D)\): \[\mu(D) = \iint_Ddxdy.\] Example . Let us calculate the area of the disk \(D = \{(x,y)\mid x^2 + y^2 \leq a^2\}\). Using the polar coordinates, \(x = r\cos\theta, y = r\sin\theta\), \(dxdy = rdrd\theta\), and the ranges of \(r\) and \(\theta\) are \([0, a]\) and \([0, 2\pi]\), respectively. Thus, \[\begin{eqnarray*} \mu(D) &=& \iint_Ddxdy\\ &=&\int_0^a\left(\int_0^{2\pi}rd\theta\right)dr\\ &=&2\pi\int_0^a rdr\\ &=&2\pi\left[\frac{r^2}{2}\right]_0^a = \pi a^2. \end{eqnarray*}\] □ Volume of a solid figure Definition (Volume) Let \(V\) be a solid figure in the \((x,y,z)\) space \(\mathbb{R}^3\). \(V\) is sai
Comments
Post a Comment