Partial derivatives and direction: Difference between revisions
No edit summary |
No edit summary |
||
Line 16: | Line 16: | ||
Notice that one variable is kept fixed, we don't have any increments for it. We are already using the notion of direction by having derivatives parallel to an axis. With a slightly modification to the definition of a partial derivative we define directional derivatives, which allow us to calculate the rate of change for directions that aren't parallel to an axis. | Notice that one variable is kept fixed, we don't have any increments for it. We are already using the notion of direction by having derivatives parallel to an axis. With a slightly modification to the definition of a partial derivative we define directional derivatives, which allow us to calculate the rate of change for directions that aren't parallel to an axis. | ||
'''Notation:''' | |||
==Directional derivative== | |||
With one variable we choose a point, take a step forwards to the next point and then calculate a limit as the distance between the two points go to zero. That's the derivative for one variable. We don't use vectors because we don't need to. For two and more variables we choose a point and then the next point can be in any direction, as long as both points are part of the function's domain. Which direction? We need a vector to know it ''(one is required to know the operation point + vector to understand the directional derivative)'': | |||
<div style="text-align:center;> | |||
[[file:partial_derivative_graph2.png|300px]] | |||
</div> | |||
We have: | |||
<math>(x_0 + a, \ y_0 + b) = (x_1, \ y_1)</math> | |||
Calculating the function at those points: | |||
<math>f(x_0 + a, \ y_0 + b)</math><br /> | |||
<math>f(x_0, \ y_0)</math> | |||
Now we have the two points required to calculate the same limit of a derivative: | |||
<math>\lim_{h \ \to \ 0} \frac{f(x_0 + ha, \ y_0 + hb) - f(x_0, \ y_0)}{h}</math> | |||
From the derivative's definition, remember that <math>h</math> is some ''(positive)'' increment. We are taking the same increment in both axis at the same time. | |||
The other way to write a derivative for a single variable involves a division by <math>x - p</math>. For two variables we can't write <math>(x,y) - (p_x, p_y)</math> because the operation of difference between points doesn't exist. |
Revision as of 20:24, 30 March 2022
The idea of partial derivatives is pretty similar to the regular derivative. The concept of a derivative is that of a rate of change. For multivariable functions we have to look for rates of change on a per variable basis. That's the meaning of "partial". A multivariable function can increase in one direction and decrease in another. We have to study how the function behaves for each direction separately from the others. With the axes being linearly independent we can differentiate in respect to one variable, while the others are treated as constants. The same discussion that we make about conditions for differentiability for a single variable can be made for many variables, albeit we are required to rely on linear algebra to do it properly.
Graphically we have this:
See that partial derivatives, graphically, mean that we are considering derivatives parallel to each axis. While we "walk" parallel to an axis we have variations in one direction but not in the others. That's why multivariable calculus requires vectors, because we have multiple variables and multiple directions. Notice that to differentiate in respect to one variable we keep a constant distance from the axis we are parallel to, the distance itself doesn't matter as long as it is a constant. That's the graphical meaning of treating a variable as a constant.
We can easily extend the same limit that we have to define the derivative for a single variable to many variables:
[math]\displaystyle{ \frac{\partial f}{\partial x} (x, y) = \lim_{x \ \to \ p} \frac{f(x, \ y) - f(p, \ y)}{x - p} }[/math] or [math]\displaystyle{ \lim_{h \ \to \ 0} \frac{f(x + h, \ y) - f(x, \ y)}{h} }[/math]
Notice that one variable is kept fixed, we don't have any increments for it. We are already using the notion of direction by having derivatives parallel to an axis. With a slightly modification to the definition of a partial derivative we define directional derivatives, which allow us to calculate the rate of change for directions that aren't parallel to an axis.
Notation:
Directional derivative
With one variable we choose a point, take a step forwards to the next point and then calculate a limit as the distance between the two points go to zero. That's the derivative for one variable. We don't use vectors because we don't need to. For two and more variables we choose a point and then the next point can be in any direction, as long as both points are part of the function's domain. Which direction? We need a vector to know it (one is required to know the operation point + vector to understand the directional derivative):
We have:
[math]\displaystyle{ (x_0 + a, \ y_0 + b) = (x_1, \ y_1) }[/math]
Calculating the function at those points:
[math]\displaystyle{ f(x_0 + a, \ y_0 + b) }[/math]
[math]\displaystyle{ f(x_0, \ y_0) }[/math]
Now we have the two points required to calculate the same limit of a derivative:
[math]\displaystyle{ \lim_{h \ \to \ 0} \frac{f(x_0 + ha, \ y_0 + hb) - f(x_0, \ y_0)}{h} }[/math]
From the derivative's definition, remember that [math]\displaystyle{ h }[/math] is some (positive) increment. We are taking the same increment in both axis at the same time.
The other way to write a derivative for a single variable involves a division by [math]\displaystyle{ x - p }[/math]. For two variables we can't write [math]\displaystyle{ (x,y) - (p_x, p_y) }[/math] because the operation of difference between points doesn't exist.