Real Analysis/Differentiation in Rn

From testwiki
Jump to navigation Jump to search

Template:Header

We will first revise some important concepts of Linear Algebra that are of importance in Multivariate Analysis. The reader with no background in Linear Algebra is advised to refer the book Linear Algebra.

Vector Space

A set 𝒱 is said to be a Vector Space over a field F if and only if operations addition and scalar multiplication are defined over it so as to satisfy for all 𝐯𝟏,𝐯𝟐,𝒱 and c1,c2F

(i)Commutativity:𝐯𝟏+𝐯𝟐=𝐯𝟐+𝐯𝟏

(ii)Associativity:(𝐯𝟏+𝐯𝟐)+π―πŸ‘=𝐯𝟏+(𝐯𝟐+π―πŸ‘)

(iii)Identity:There exists πŸŽπ’± such that 𝐯𝟏+𝟎=𝐯𝟏=𝟎+𝐯𝟏

(iv)Inverse:There exists π―πŸπ’± such that 𝐯𝟏+(𝐯𝟏)=𝟎

(v):c𝐯𝟏+c𝐯𝟐=c(𝐯𝟏+𝐯𝟐)

(vi)c1𝐯+c2𝐯=(c1+c2)𝐯

(vii)c1(c2𝐯𝟏)=c2(c1𝐯𝟏)=(c1c2)𝐯𝟏

Members of a vector space are called "Vectors" and those of the field are called "Scalars". ℝn, the set of all polynomials etc. are examples of vector spaces

A set of linearly independant vectors that spans the vector space is said to be a Basis for the vector space.

Linear Transformations

Let X,Y be vector spaces.

Let T:XY

We say that T is a Linear transformation if and only if for all 𝐯𝟏,𝐯𝟐X,

(i)T(𝐯𝟏+𝐯𝟐)=T(𝐯𝟏)+T(𝐯𝟐)

(ii)T(c𝐯𝟏)=cT(𝐯𝟏)


As we will see, there are two major ways to define a 'derivative' of a multivariable function. We first present the seemingly more straightforward way of using "Partial Derivatives".

Directional and Partial Derivatives

Let 𝐟:ℝnℝm

Let 𝐚,𝐲ℝn

We say that 𝐟 is differentiable at πšβ„n with respect to vector 𝐲 if and only if there exists 𝐋ℝm that satisfies

limh0𝐟(𝐚+h𝐲)𝐟(𝐚)h=𝐋

𝐋 is said to be the derivative of 𝐟 at 𝐚 with respect to 𝐲 and is written as 𝐟(𝐚;𝐲)

When 𝐲 is a unit vector, the derivative is said to be a partial derivative. Here we will explicitly define partial derivatives and see some of their properties.


Let f be a real multivariate function defined on an open subset Ω of ℝn

f:Ωℝ.

Then the partial derivative at some parameter (x1,...,xn) with respect to the coordinate xi is defined as the following limit

limh0f(x1,,xi+h,,xn)f(x1,,xi,,xn)h=fxi.

f is said to be differentiable at this parameter (x1,...,xn) if the difference f(x1,...,xi+h,...,xn)f(x1,...,xi,...,xn) is equivalent up to first order in h to a linear form L (of h), that is

f(x1,...,xi+h,...,xn)f(x1,...,xi,...,xn)=L×h+o(h).

The linear form L is then said to be the differential of f at (x1,...,xn), and is written as Df|(x1,,xn) or sometimes df(x1,,xn).

In this case, where f is differentiable at (x1,,xn), by linearity we can write

df=fx1dx1++fxndxn

f is said to be continuously differentiable if its differential is defined at any parameter in its domain, and if the differential is varying continuously relative to the parameter (x1,...,xn), that is if it coordinates (as a linear form) fx1 are varying continuously.

In case partial derivatives exists but f is not differentiable, and sometimes not even continuous exempli gratia

f:(x,y)(xy)2(x2+y2)

(and f(0,0)=0) we say that f is separably differentiable.

Total Derivatives

The total derivative is important as it preserves some of the key properties of the single variable derivative, most notably the assertion differentiability implies continuity

Let f:Aℝnℝm

We say that f is differentiable at 𝐚A if and only if there exists a linear transformation, 𝐃f(𝐚):ℝnℝm, called the derivative or total derivative of f at 𝐚, such that

lim𝐑0f(𝐚+𝐑)f(𝐚)𝐃f(𝐚)(𝐑)𝐑=0

One should read 𝐃f(𝐚)(𝐑) as the linear transformation 𝐃f(𝐚) applied to the vector 𝐑. Sometimes it is customary to write this as 𝐃f(𝐚)(𝐑).


Theorem

Suppose Aℝn is an open set and f:Aℝm is differentiable on A. Think of writing f in components so f(x1,,xn)=(f1(x1,,xn),,fm(x1,,xn)). Then the partial derivatives fjxi exist, and the matrix representing the linear transformation 𝐃f(𝐱) with respect to the standard bases of ℝn and ℝm is given by the Jacobian Matrix:

[f1x1f1xnfmx1fmxn].

evaluated at 𝐱=(x1,,xn).

NOTE: This theorem requires the function to be differentiable to begin with. It is a common mistake to assume that if the partial derivatives exist then this would imply that the function is differentiable because we can construct the Jacobian matrix. This however is completely false. Which brings us to the next theorem:

Theorem

Suppose Aℝn is an open set and f:Aℝm. Think of writing f in components so f(x1,,xn)=(f1(x1,,xn),,fm(x1,,xn)). If fjxi exists and is continuous on A for all j{1,,m} and for all i{1,,n}, then f is differentiable on A.


This theorem gives us a nice criteria for a function to be differentiable.