Real Analysis/Differentiation in Rn: Difference between revisions

From testwiki
Jump to navigation Jump to search
imported>Bridget
m Undid edits by 119.160.117.11 (talk) to last version by Adrignola
 
(No difference)

Latest revision as of 05:40, 4 February 2020

Template:Header

We will first revise some important concepts of Linear Algebra that are of importance in Multivariate Analysis. The reader with no background in Linear Algebra is advised to refer the book Linear Algebra.

Vector Space

A set 𝒱 is said to be a Vector Space over a field F if and only if operations addition and scalar multiplication are defined over it so as to satisfy for all 𝐯𝟏,𝐯𝟐,𝒱 and c1,c2F

(i)Commutativity:𝐯𝟏+𝐯𝟐=𝐯𝟐+𝐯𝟏

(ii)Associativity:(𝐯𝟏+𝐯𝟐)+𝐯𝟑=𝐯𝟏+(𝐯𝟐+𝐯𝟑)

(iii)Identity:There exists 𝟎𝒱 such that 𝐯𝟏+𝟎=𝐯𝟏=𝟎+𝐯𝟏

(iv)Inverse:There exists 𝐯𝟏𝒱 such that 𝐯𝟏+(𝐯𝟏)=𝟎

(v):c𝐯𝟏+c𝐯𝟐=c(𝐯𝟏+𝐯𝟐)

(vi)c1𝐯+c2𝐯=(c1+c2)𝐯

(vii)c1(c2𝐯𝟏)=c2(c1𝐯𝟏)=(c1c2)𝐯𝟏

Members of a vector space are called "Vectors" and those of the field are called "Scalars". n, the set of all polynomials etc. are examples of vector spaces

A set of linearly independant vectors that spans the vector space is said to be a Basis for the vector space.

Linear Transformations

Let X,Y be vector spaces.

Let T:XY

We say that T is a Linear transformation if and only if for all 𝐯𝟏,𝐯𝟐X,

(i)T(𝐯𝟏+𝐯𝟐)=T(𝐯𝟏)+T(𝐯𝟐)

(ii)T(c𝐯𝟏)=cT(𝐯𝟏)


As we will see, there are two major ways to define a 'derivative' of a multivariable function. We first present the seemingly more straightforward way of using "Partial Derivatives".

Directional and Partial Derivatives

Let 𝐟:nm

Let 𝐚,𝐲n

We say that 𝐟 is differentiable at 𝐚n with respect to vector 𝐲 if and only if there exists 𝐋m that satisfies

limh0𝐟(𝐚+h𝐲)𝐟(𝐚)h=𝐋

𝐋 is said to be the derivative of 𝐟 at 𝐚 with respect to 𝐲 and is written as 𝐟(𝐚;𝐲)

When 𝐲 is a unit vector, the derivative is said to be a partial derivative. Here we will explicitly define partial derivatives and see some of their properties.


Let f be a real multivariate function defined on an open subset Ω of n

f:Ω.

Then the partial derivative at some parameter (x1,...,xn) with respect to the coordinate xi is defined as the following limit

limh0f(x1,,xi+h,,xn)f(x1,,xi,,xn)h=fxi.

f is said to be differentiable at this parameter (x1,...,xn) if the difference f(x1,...,xi+h,...,xn)f(x1,...,xi,...,xn) is equivalent up to first order in h to a linear form L (of h), that is

f(x1,...,xi+h,...,xn)f(x1,...,xi,...,xn)=L×h+o(h).

The linear form L is then said to be the differential of f at (x1,...,xn), and is written as Df|(x1,,xn) or sometimes df(x1,,xn).

In this case, where f is differentiable at (x1,,xn), by linearity we can write

df=fx1dx1++fxndxn

f is said to be continuously differentiable if its differential is defined at any parameter in its domain, and if the differential is varying continuously relative to the parameter (x1,...,xn), that is if it coordinates (as a linear form) fx1 are varying continuously.

In case partial derivatives exists but f is not differentiable, and sometimes not even continuous exempli gratia

f:(x,y)(xy)2(x2+y2)

(and f(0,0)=0) we say that f is separably differentiable.

Total Derivatives

The total derivative is important as it preserves some of the key properties of the single variable derivative, most notably the assertion differentiability implies continuity

Let f:Anm

We say that f is differentiable at 𝐚A if and only if there exists a linear transformation, 𝐃f(𝐚):nm, called the derivative or total derivative of f at 𝐚, such that

lim𝐡0f(𝐚+𝐡)f(𝐚)𝐃f(𝐚)(𝐡)𝐡=0

One should read 𝐃f(𝐚)(𝐡) as the linear transformation 𝐃f(𝐚) applied to the vector 𝐡. Sometimes it is customary to write this as 𝐃f(𝐚)(𝐡).


Theorem

Suppose An is an open set and f:Am is differentiable on A. Think of writing f in components so f(x1,,xn)=(f1(x1,,xn),,fm(x1,,xn)). Then the partial derivatives fjxi exist, and the matrix representing the linear transformation 𝐃f(𝐱) with respect to the standard bases of n and m is given by the Jacobian Matrix:

[f1x1f1xnfmx1fmxn].

evaluated at 𝐱=(x1,,xn).

NOTE: This theorem requires the function to be differentiable to begin with. It is a common mistake to assume that if the partial derivatives exist then this would imply that the function is differentiable because we can construct the Jacobian matrix. This however is completely false. Which brings us to the next theorem:

Theorem

Suppose An is an open set and f:Am. Think of writing f in components so f(x1,,xn)=(f1(x1,,xn),,fm(x1,,xn)). If fjxi exists and is continuous on A for all j{1,,m} and for all i{1,,n}, then f is differentiable on A.


This theorem gives us a nice criteria for a function to be differentiable.