Density functional theory/Introduction to functional analysis

Functional (mathematics)

In mathematics, and particularly in functional analysis and the calculus of variations, a functional is a function from a vector space into its underlying scalar field, usually the real or complex numbers. In other words, it is a function that takes a vector as its input argument and returns a scalar. Commonly the vector space is a space of functions, so the functional takes a function as its input argument; for this reason a functional is sometimes described as a function of a function. Its use originates in the calculus of variations, where one searches for a function that minimizes a certain functional. A particularly important application in physics is the search for a state of a system that minimizes the energy functional.

Functional details

Duality

The mapping

$$x_0 \mapsto f(x_0)$$

is a function, where $x_0$ is an argument of the function $f$. At the same time, the mapping of a function to the value of the function at a point,

$$f \mapsto f(x_0),$$

is a functional; here $x_0$ is a parameter.

Provided that $f$ is a linear function from a vector space to the underlying scalar field, the above linear maps are dual to each other, and in functional analysis both are called linear functionals.

Definite integral

Integrals such as

$$f \mapsto I[f] = \int_{\Omega} H(f(x), f'(x), \ldots)\,\mu(dx)$$

form a special class of functionals. They map a function f into a real number, provided that H is real-valued. Examples include

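  • the area underneath the graph of a positive function $f$
$$f \mapsto \int_{x_0}^{x_1} f(x)\,dx$$
  • the $L^p$ norm of a function
$$f \mapsto \left(\int |f|^p\,dx\right)^{1/p}$$
  • the arclength of a curve in 2-dimensional Euclidean space
$$f \mapsto \int_{x_0}^{x_1} \sqrt{1 + |f'(x)|^2}\,dx$$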
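As a rough numerical illustration of the three examples above (area, $L^p$ norm, arclength), the following sketch evaluates each functional with a composite midpoint rule. The helper names (`integrate`, `area`, `lp_norm`, `arclength`), the sample function $f(x) = x^2$ on $[0, 1]$, and the choice of quadrature are assumptions made for this example only.

```python
import math

def integrate(g, a, b, n=10_000):
    """Approximate the integral of g over [a, b] with the composite midpoint rule."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def area(f, x0, x1):
    """Area functional: f -> integral of f(x) dx (f assumed positive)."""
    return integrate(f, x0, x1)

def lp_norm(f, x0, x1, p):
    """L^p norm functional: f -> (integral of |f|^p dx)^(1/p)."""
    return integrate(lambda x: abs(f(x)) ** p, x0, x1) ** (1.0 / p)

def arclength(f_prime, x0, x1):
    """Arclength functional: f -> integral of sqrt(1 + |f'(x)|^2) dx.
    The derivative f' is supplied explicitly by the caller."""
    return integrate(lambda x: math.sqrt(1.0 + abs(f_prime(x)) ** 2), x0, x1)

def f(x):
    return x * x

def f_prime(x):
    return 2.0 * x

# Each call maps the whole function f to a single real number.
area_value = area(f, 0.0, 1.0)                # exact value: 1/3
norm_value = lp_norm(f, 0.0, 1.0, 2)          # exact value: sqrt(1/5)
length_value = arclength(f_prime, 0.0, 1.0)   # exact value: sqrt(5)/2 + asinh(2)/4
```

In each case the input is an entire function and the output is one scalar, which is what distinguishes a functional from an ordinary function of a real variable.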

Vector scalar product

Given any vector $x$ in a vector space $X$, the scalar product with another vector $y$, denoted $x \cdot y$ or $\langle x, y \rangle$, is a scalar. The set of vectors $y$ for which this product is zero is a vector subspace of $X$, called the null space or kernel of the functional $y \mapsto \langle x, y \rangle$.

Local vs non-local

If a functional's value can be computed for small segments of the input curve and then summed to find the total value, the functional is called local. Otherwise it is called non-local. For example:

$$F(y) = \int_{x_0}^{x_1} y(x)\,dx$$

is local while

$$F(y) = \frac{\int_{x_0}^{x_1} y(x)\,dx}{\int_{x_0}^{x_1} \left(1 + [y(x)]^2\right)dx}$$

is non-local. This commonly occurs when integrals appear separately in the numerator and denominator of an expression, as in calculations of the center of mass.
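The distinction can be checked numerically. The sketch below evaluates both example functionals on $[0, 1]$ and on the two halves of that interval, then compares the whole with the sum of the parts. The function names, the sample curve $y(x) = x$, and the midpoint quadrature are illustrative assumptions.

```python
def integrate(g, a, b, n=2_000):
    """Approximate the integral of g over [a, b] with the composite midpoint rule."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def F_local(y, a, b):
    """F(y) = integral of y(x) dx: a local functional."""
    return integrate(y, a, b)

def F_nonlocal(y, a, b):
    """F(y) = (integral of y dx) / (integral of (1 + y^2) dx): a non-local functional."""
    return integrate(y, a, b) / integrate(lambda x: 1.0 + y(x) ** 2, a, b)

def y(x):
    return x

# Evaluate on the whole interval and on its two halves separately.
local_whole = F_local(y, 0.0, 1.0)
local_parts = F_local(y, 0.0, 0.5) + F_local(y, 0.5, 1.0)

nonloc_whole = F_nonlocal(y, 0.0, 1.0)
nonloc_parts = F_nonlocal(y, 0.0, 0.5) + F_nonlocal(y, 0.5, 1.0)

# For the local functional the segment values add up to the total;
# for the ratio of integrals they do not.
local_is_additive = abs(local_whole - local_parts) < 1e-9
nonloc_is_additive = abs(nonloc_whole - nonloc_parts) < 1e-9
```

For this sample curve the ratio functional equals $3/8$ on the whole interval, while the sum over the two halves is noticeably larger, so `nonloc_is_additive` comes out false.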

Linear functionals

Linear functionals first appeared in functional analysis, the study of vector spaces of functions. A typical example of a linear functional is integration: the linear transformation defined by the Riemann integral

$$I(f) = \int_a^b f(x)\,dx$$

is a linear functional from the vector space $C[a,b]$ of continuous functions on the interval $[a, b]$ to the real numbers. The linearity of $I(f)$ follows from the standard facts about the integral:

$$I(f+g) = \int_a^b \left(f(x) + g(x)\right)dx = \int_a^b f(x)\,dx + \int_a^b g(x)\,dx = I(f) + I(g)$$
$$I(\alpha f) = \int_a^b \alpha f(x)\,dx = \alpha\int_a^b f(x)\,dx = \alpha I(f).$$

Nonlinear functional

Functional derivative

The functional derivative is defined first; the functional differential is then defined in terms of the functional derivative.

Functional derivative

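Given a manifold $M$ representing (continuous/smooth/with certain boundary conditions/etc.) functions $\rho$ and a functional $F$ defined as

$$F\colon M \to \mathbb{R} \quad\text{or}\quad F\colon M \to \mathbb{C},$$

the functional derivative of $F[\rho]$, denoted $\delta F/\delta\rho$, is defined by[1]

$$\int \frac{\delta F}{\delta\rho(x)}\,\phi(x)\,dx = \lim_{\varepsilon\to 0}\frac{F[\rho+\varepsilon\phi] - F[\rho]}{\varepsilon} = \left[\frac{d}{d\varepsilon}F[\rho+\varepsilon\phi]\right]_{\varepsilon=0},$$

where $\phi$ is an arbitrary function. The quantity $\varepsilon\phi$ is called the variation of $\rho$.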
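The defining relation can be tested on a simple case. The sketch below takes the assumed example functional $F[\rho] = \int_0^1 \rho(x)^2\,dx$, whose functional derivative is $2\rho(x)$, and compares the difference quotient $(F[\rho+\varepsilon\phi]-F[\rho])/\varepsilon$ with the integral of $2\rho(x)\phi(x)$. The grid, the sample functions, and the value of `eps` are choices made for illustration only.

```python
import math

N = 1000                                  # number of midpoint samples on [0, 1]
H = 1.0 / N
XS = [(i + 0.5) * H for i in range(N)]

def F(rho):
    """F[rho] = integral of rho(x)^2 dx, midpoint rule on sampled values."""
    return H * sum(r * r for r in rho)

rho = [math.sin(math.pi * x) for x in XS]   # the function being varied
phi = [x * (1.0 - x) for x in XS]           # test function, zero at both ends

# Left-hand side of the definition: integral of (dF/drho)(x) * phi(x) dx,
# using the analytic functional derivative 2 * rho(x).
pairing = H * sum(2.0 * r * p for r, p in zip(rho, phi))

# Right-hand side: difference quotient in the direction phi for small eps.
eps = 1e-6
perturbed = [r + eps * p for r, p in zip(rho, phi)]
directional = (F(perturbed) - F(rho)) / eps
```

The two numbers agree up to a term of order `eps`, which is the second-order part of the expansion that the limit discards.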

Functional differential

The differential (or variation or first variation) of the functional $F[\rho]$ is,[2] [Note 1]

$$\delta F = \int \frac{\delta F}{\delta\rho(x)}\,\delta\rho(x)\,dx,$$

where $\delta\rho(x)$ is the variation of $\rho(x)$. This is similar in form to the total differential of a function $F(\rho_1, \rho_2, \ldots, \rho_n)$,

$$dF = \sum_{i=1}^{n}\frac{\partial F}{\partial\rho_i}\,d\rho_i,$$

where $\rho_1, \rho_2, \ldots, \rho_n$ are independent variables. Comparing the last two equations, the functional derivative $\delta F/\delta\rho(x)$ has a role similar to that of the partial derivative $\partial F/\partial\rho_i$, where the variable of integration $x$ is like a continuous version of the summation index $i$.[3]

Formal description

The definition of a functional derivative may be made more mathematically precise and formal by defining the space of functions more carefully. For example, when the space of functions is a Banach space, the functional derivative becomes known as the Fréchet derivative, while one uses the Gâteaux derivative on more general locally convex spaces. Note that the well-known Hilbert spaces are special cases of Banach spaces. The more formal treatment allows many theorems from ordinary calculus and analysis to be generalized to corresponding theorems in functional analysis, as well as numerous new theorems to be stated.

Properties

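Like the derivative of a function, the functional derivative satisfies the following properties, where $F[\rho]$ and $G[\rho]$ are functionals:

  • Linearity:
$$\frac{\delta(\lambda F + \mu G)}{\delta\rho(x)} = \lambda\frac{\delta F}{\delta\rho(x)} + \mu\frac{\delta G}{\delta\rho(x)}, \qquad \lambda, \mu \text{ constant},$$
  • Product rule:[5]
$$\frac{\delta(FG)}{\delta\rho(x)} = \frac{\delta F}{\delta\rho(x)}\,G + F\,\frac{\delta G}{\delta\rho(x)},$$
  • Chain rules:
If $f$ is a differentiable function, then
$$\frac{\delta F[f(\rho)]}{\delta\rho(x)} = \frac{\delta F[f(\rho)]}{\delta f(\rho(x))}\,\frac{df(\rho(x))}{d\rho(x)},$$[6]
$$\frac{\delta f(F[\rho])}{\delta\rho(x)} = \frac{df(F[\rho])}{dF[\rho]}\,\frac{\delta F[\rho]}{\delta\rho(x)}.$$[7]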

Determining functional derivatives

We give a formula to determine functional derivatives for a common class of functionals that can be written as the integral of a function and its derivatives. This is a generalization of the Euler–Lagrange equation: indeed, the functional derivative was introduced in physics within the derivation of the Lagrange equation of the second kind from the principle of least action in Lagrangian mechanics (18th century). The first three examples below are taken from density functional theory (20th century), the fourth from statistical mechanics (19th century).

Formula

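Given a functional

$$F[\rho] = \int f(\mathbf{r}, \rho(\mathbf{r}), \nabla\rho(\mathbf{r}))\,d\mathbf{r},$$

and a function $\phi(\mathbf{r})$ that vanishes on the boundary of the region of integration, the definition given in a previous section yields

$$\begin{aligned}
\int \frac{\delta F}{\delta\rho(\mathbf{r})}\,\phi(\mathbf{r})\,d\mathbf{r}
&= \left[\frac{d}{d\varepsilon}\int f(\mathbf{r}, \rho+\varepsilon\phi, \nabla\rho+\varepsilon\nabla\phi)\,d\mathbf{r}\right]_{\varepsilon=0} \\
&= \int\left(\frac{\partial f}{\partial\rho}\,\phi + \frac{\partial f}{\partial\nabla\rho}\cdot\nabla\phi\right)d\mathbf{r} \\
&= \int\left[\frac{\partial f}{\partial\rho}\,\phi + \nabla\cdot\left(\frac{\partial f}{\partial\nabla\rho}\,\phi\right) - \left(\nabla\cdot\frac{\partial f}{\partial\nabla\rho}\right)\phi\right]d\mathbf{r} \\
&= \int\left[\frac{\partial f}{\partial\rho}\,\phi - \left(\nabla\cdot\frac{\partial f}{\partial\nabla\rho}\right)\phi\right]d\mathbf{r} \\
&= \int\left(\frac{\partial f}{\partial\rho} - \nabla\cdot\frac{\partial f}{\partial\nabla\rho}\right)\phi(\mathbf{r})\,d\mathbf{r}.
\end{aligned}$$

The second line is obtained using the total derivative, where $\partial f/\partial\nabla\rho$ is a derivative of a scalar with respect to a vector.[Note 2] The third line was obtained by use of a product rule for divergence. The fourth line was obtained using the divergence theorem and the condition that $\phi = 0$ on the boundary of the region of integration. Since $\phi$ is also an arbitrary function, applying the fundamental lemma of calculus of variations to the last line, the functional derivative is

$$\frac{\delta F}{\delta\rho(\mathbf{r})} = \frac{\partial f}{\partial\rho} - \nabla\cdot\frac{\partial f}{\partial\nabla\rho}$$

where $\rho = \rho(\mathbf{r})$ and $f = f(\mathbf{r}, \rho, \nabla\rho)$. This formula is for the case of the functional form given by $F[\rho]$ at the beginning of this section. For other functional forms, the definition of the functional derivative can be used as the starting point for its determination. (See the example Coulomb potential energy functional.)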

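The above equation for the functional derivative can be generalized to the case that includes higher dimensions and higher order derivatives. The functional would be

$$F[\rho(\mathbf{r})] = \int f(\mathbf{r}, \rho(\mathbf{r}), \nabla\rho(\mathbf{r}), \nabla^{(2)}\rho(\mathbf{r}), \ldots, \nabla^{(N)}\rho(\mathbf{r}))\,d\mathbf{r},$$

where the vector $\mathbf{r} \in \mathbb{R}^n$, and $\nabla^{(i)}$ is a tensor whose $n^i$ components are partial derivative operators of order $i$,

$$\left[\nabla^{(i)}\right]_{\alpha_1\alpha_2\cdots\alpha_i} = \frac{\partial^{\,i}}{\partial r_{\alpha_1}\partial r_{\alpha_2}\cdots\partial r_{\alpha_i}} \qquad\text{where}\qquad \alpha_1, \alpha_2, \ldots, \alpha_i = 1, 2, \ldots, n.$$[Note 3]

An analogous application of the definition of the functional derivative yields

$$\begin{aligned}
\frac{\delta F[\rho]}{\delta\rho}
&= \frac{\partial f}{\partial\rho} - \nabla\cdot\frac{\partial f}{\partial(\nabla\rho)} + \nabla^{(2)}\cdot\frac{\partial f}{\partial\left(\nabla^{(2)}\rho\right)} + \cdots + (-1)^N\,\nabla^{(N)}\cdot\frac{\partial f}{\partial\left(\nabla^{(N)}\rho\right)} \\
&= \frac{\partial f}{\partial\rho} + \sum_{i=1}^{N}(-1)^i\,\nabla^{(i)}\cdot\frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)}.
\end{aligned}$$

In the last two equations, the $n^i$ components of the tensor $\partial f/\partial\left(\nabla^{(i)}\rho\right)$ are partial derivatives of $f$ with respect to partial derivatives of $\rho$,

$$\left[\frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)}\right]_{\alpha_1\alpha_2\cdots\alpha_i} = \frac{\partial f}{\partial\rho_{\alpha_1\alpha_2\cdots\alpha_i}} \qquad\text{where}\qquad \rho_{\alpha_1\alpha_2\cdots\alpha_i} \equiv \frac{\partial^{\,i}\rho}{\partial r_{\alpha_1}\partial r_{\alpha_2}\cdots\partial r_{\alpha_i}},$$

and the tensor scalar product is

$$\nabla^{(i)}\cdot\frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} = \sum_{\alpha_1,\alpha_2,\ldots,\alpha_i=1}^{n}\frac{\partial^{\,i}}{\partial r_{\alpha_1}\partial r_{\alpha_2}\cdots\partial r_{\alpha_i}}\,\frac{\partial f}{\partial\rho_{\alpha_1\alpha_2\cdots\alpha_i}}.$$[Note 4]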

Examples

Thomas–Fermi kinetic energy functional

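The Thomas–Fermi model of 1927 used a kinetic energy functional for a noninteracting uniform electron gas in a first attempt at a density-functional theory of electronic structure:

$$T_{\mathrm{TF}}[\rho] = C_{\mathrm{F}}\int \rho^{5/3}(\mathbf{r})\,d\mathbf{r}.$$

Since the integrand of $T_{\mathrm{TF}}[\rho]$ does not involve derivatives of $\rho(\mathbf{r})$, the functional derivative of $T_{\mathrm{TF}}[\rho]$ is[8]

$$\frac{\delta T_{\mathrm{TF}}}{\delta\rho(\mathbf{r})} = C_{\mathrm{F}}\frac{\partial\rho^{5/3}(\mathbf{r})}{\partial\rho(\mathbf{r})} = \frac{5}{3}C_{\mathrm{F}}\,\rho^{2/3}(\mathbf{r}).$$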
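The result $\frac{5}{3}C_{\mathrm{F}}\rho^{2/3}$ can be checked with a small numerical sketch. It discretizes the Thomas–Fermi functional on a one-dimensional grid (a simplification; the physical functional is three-dimensional), changes the density in a single grid cell, and divides the resulting change by the cell width, which mimics the role of the functional derivative as a continuous analogue of a partial derivative. The value used for `C_F` is the usual atomic-units constant $\frac{3}{10}(3\pi^2)^{2/3}$, which the text above does not state; the grid, the sample density, and all function names are likewise assumptions for illustration.

```python
import math

# Assumed value of the Thomas-Fermi constant in atomic units (not given in the text).
C_F = 0.3 * (3.0 * math.pi ** 2) ** (2.0 / 3.0)

N = 200                                   # number of grid cells on [0, 1]
H = 1.0 / N                               # cell width
XS = [(i + 0.5) * H for i in range(N)]

def T_TF(rho):
    """Discretized T_TF[rho] = C_F * integral of rho^(5/3) dx (1D toy version)."""
    return C_F * H * sum(r ** (5.0 / 3.0) for r in rho)

def functional_derivative_fd(functional, rho, i, eps=1e-6):
    """Central difference in the single grid value rho[i], divided by the cell width."""
    up = list(rho)
    up[i] += eps
    dn = list(rho)
    dn[i] -= eps
    return (functional(up) - functional(dn)) / (2.0 * eps * H)

# A strictly positive sample density.
rho = [1.0 + 0.5 * math.sin(2.0 * math.pi * x) for x in XS]

i = 37                                    # arbitrary grid cell
numeric = functional_derivative_fd(T_TF, rho, i)
analytic = (5.0 / 3.0) * C_F * rho[i] ** (2.0 / 3.0)
```

Dividing by `H` is the step that turns an ordinary partial derivative with respect to one grid value into an approximation of the functional derivative at that point.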

Coulomb potential energy functional

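For the electron-nucleus potential, Thomas and Fermi employed the Coulomb potential energy functional

$$V[\rho] = \int \frac{\rho(\mathbf{r})}{|\mathbf{r}|}\,d\mathbf{r}.$$

Applying the definition of functional derivative,

$$\int \frac{\delta V}{\delta\rho(\mathbf{r})}\,\phi(\mathbf{r})\,d\mathbf{r} = \left[\frac{d}{d\varepsilon}\int \frac{\rho(\mathbf{r}) + \varepsilon\phi(\mathbf{r})}{|\mathbf{r}|}\,d\mathbf{r}\right]_{\varepsilon=0} = \int \frac{1}{|\mathbf{r}|}\,\phi(\mathbf{r})\,d\mathbf{r}.$$

So,

$$\frac{\delta V}{\delta\rho(\mathbf{r})} = \frac{1}{|\mathbf{r}|}.$$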

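For the classical part of the electron-electron interaction, Thomas and Fermi employed the Coulomb potential energy functional

$$J[\rho] = \frac{1}{2}\iint \frac{\rho(\mathbf{r})\rho(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}\,d\mathbf{r}'.$$

From the definition of the functional derivative,

$$\begin{aligned}
\int \frac{\delta J}{\delta\rho(\mathbf{r})}\,\phi(\mathbf{r})\,d\mathbf{r}
&= \left[\frac{d}{d\varepsilon}J[\rho+\varepsilon\phi]\right]_{\varepsilon=0} \\
&= \left[\frac{d}{d\varepsilon}\left(\frac{1}{2}\iint \frac{[\rho(\mathbf{r})+\varepsilon\phi(\mathbf{r})]\,[\rho(\mathbf{r}')+\varepsilon\phi(\mathbf{r}')]}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}\,d\mathbf{r}'\right)\right]_{\varepsilon=0} \\
&= \frac{1}{2}\iint \frac{\rho(\mathbf{r}')\phi(\mathbf{r})}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}\,d\mathbf{r}' + \frac{1}{2}\iint \frac{\rho(\mathbf{r})\phi(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}\,d\mathbf{r}'.
\end{aligned}$$

The first and second terms on the right hand side of the last equation are equal, since $\mathbf{r}$ and $\mathbf{r}'$ in the second term can be interchanged without changing the value of the integral. Therefore,

$$\int \frac{\delta J}{\delta\rho(\mathbf{r})}\,\phi(\mathbf{r})\,d\mathbf{r} = \int\left(\int \frac{\rho(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}'\right)\phi(\mathbf{r})\,d\mathbf{r}$$

and the functional derivative of the electron-electron Coulomb potential energy functional $J[\rho]$ is[9]

$$\frac{\delta J}{\delta\rho(\mathbf{r})} = \int \frac{\rho(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}'.$$

The second functional derivative is

$$\frac{\delta^2 J[\rho]}{\delta\rho(\mathbf{r}')\,\delta\rho(\mathbf{r})} = \frac{\partial}{\partial\rho(\mathbf{r}')}\left(\frac{\rho(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\right) = \frac{1}{|\mathbf{r}-\mathbf{r}'|}.$$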

Weizsäcker kinetic energy functional

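In 1935 von Weizsäcker proposed adding a gradient correction to the Thomas–Fermi kinetic energy functional to make it better suit a molecular electron cloud:

$$T_{\mathrm{W}}[\rho] = \frac{1}{8}\int \frac{\nabla\rho(\mathbf{r})\cdot\nabla\rho(\mathbf{r})}{\rho(\mathbf{r})}\,d\mathbf{r} = \int t_{\mathrm{W}}\,d\mathbf{r},$$

where

$$t_{\mathrm{W}} \equiv \frac{1}{8}\frac{\nabla\rho\cdot\nabla\rho}{\rho} \qquad\text{and}\qquad \rho = \rho(\mathbf{r}).$$

Using a previously derived formula for the functional derivative,

$$\frac{\delta T_{\mathrm{W}}}{\delta\rho(\mathbf{r})} = \frac{\partial t_{\mathrm{W}}}{\partial\rho} - \nabla\cdot\frac{\partial t_{\mathrm{W}}}{\partial\nabla\rho} = -\frac{1}{8}\frac{\nabla\rho\cdot\nabla\rho}{\rho^2} - \left(\frac{1}{4}\frac{\nabla^2\rho}{\rho} - \frac{1}{4}\frac{\nabla\rho\cdot\nabla\rho}{\rho^2}\right) \qquad\text{where}\quad \nabla^2 = \nabla\cdot\nabla,$$

and the result is[10]

$$\frac{\delta T_{\mathrm{W}}}{\delta\rho(\mathbf{r})} = \frac{1}{8}\frac{\nabla\rho\cdot\nabla\rho}{\rho^2} - \frac{1}{4}\frac{\nabla^2\rho}{\rho}.$$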

Entropy

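The entropy of a discrete random variable is a functional of the probability mass function:

$$H[p(x)] = -\sum_x p(x)\log p(x).$$

Thus,

$$\begin{aligned}
\sum_x \frac{\delta H}{\delta p(x)}\,\phi(x)
&= \left[\frac{d}{d\varepsilon}H[p(x)+\varepsilon\phi(x)]\right]_{\varepsilon=0} \\
&= \left[-\frac{d}{d\varepsilon}\sum_x\,[p(x)+\varepsilon\phi(x)]\,\log[p(x)+\varepsilon\phi(x)]\right]_{\varepsilon=0} \\
&= -\sum_x\,[1+\log p(x)]\,\phi(x).
\end{aligned}$$

Thus,

$$\frac{\delta H}{\delta p(x)} = -1 - \log p(x).$$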
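Because the random variable is discrete, the functional derivative here reduces to an ordinary gradient, and the result $-1 - \log p(x)$ can be compared directly with finite differences. The sketch below does this for an arbitrarily chosen four-point distribution. As in the derivation, the components of $p$ are treated as unconstrained (the normalization $\sum_x p(x) = 1$ is not imposed during the perturbation); the function names and the step size are assumptions for illustration.

```python
import math

def entropy(p):
    """H[p] = -sum over x of p(x) * log p(x), natural logarithm."""
    return -sum(px * math.log(px) for px in p)

def gradient_fd(p, eps=1e-6):
    """Central finite-difference derivative of the entropy in each component of p."""
    grad = []
    for i in range(len(p)):
        up = list(p)
        up[i] += eps
        dn = list(p)
        dn[i] -= eps
        grad.append((entropy(up) - entropy(dn)) / (2.0 * eps))
    return grad

p = [0.1, 0.2, 0.3, 0.4]                       # sample probability mass function
numeric = gradient_fd(p)
analytic = [-1.0 - math.log(px) for px in p]   # formula derived above

max_error = max(abs(n - a) for n, a in zip(numeric, analytic))
```

If the normalization constraint matters, as in maximum-entropy problems, it is typically handled separately with a Lagrange multiplier rather than inside the derivative itself.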

Exponential

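Let

$$F[\varphi(x)] = e^{\int \varphi(x)g(x)\,dx}.$$

Using the delta function as a test function,

$$\begin{aligned}
\frac{\delta F[\varphi(x)]}{\delta\varphi(y)}
&= \lim_{\varepsilon\to 0}\frac{F[\varphi(x)+\varepsilon\delta(x-y)] - F[\varphi(x)]}{\varepsilon} \\
&= \lim_{\varepsilon\to 0}\frac{e^{\int(\varphi(x)+\varepsilon\delta(x-y))g(x)\,dx} - e^{\int\varphi(x)g(x)\,dx}}{\varepsilon} \\
&= e^{\int\varphi(x)g(x)\,dx}\,\lim_{\varepsilon\to 0}\frac{e^{\varepsilon\int\delta(x-y)g(x)\,dx} - 1}{\varepsilon} \\
&= e^{\int\varphi(x)g(x)\,dx}\,\lim_{\varepsilon\to 0}\frac{e^{\varepsilon g(y)} - 1}{\varepsilon} \\
&= e^{\int\varphi(x)g(x)\,dx}\,g(y).
\end{aligned}$$

Thus,

$$\frac{\delta F[\varphi(x)]}{\delta\varphi(y)} = g(y)\,F[\varphi(x)].$$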

This is particularly useful in calculating the correlation functions from the partition function in quantum field theory.

Functional derivative of a function

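A function can be written in the form of an integral like a functional. For example,

$$\rho(\mathbf{r}) = F[\rho] = \int \rho(\mathbf{r}')\,\delta(\mathbf{r}-\mathbf{r}')\,d\mathbf{r}'.$$

Since the integrand does not depend on derivatives of $\rho$, the functional derivative of $\rho(\mathbf{r})$ is

$$\frac{\delta\rho(\mathbf{r})}{\delta\rho(\mathbf{r}')} \equiv \frac{\delta F}{\delta\rho(\mathbf{r}')} = \frac{\partial}{\partial\rho(\mathbf{r}')}\left[\rho(\mathbf{r}')\,\delta(\mathbf{r}-\mathbf{r}')\right] = \delta(\mathbf{r}-\mathbf{r}').$$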

Application in calculus of variations

In the calculus of variations, functionals are usually expressed in terms of an integral of functions, their arguments, and their derivatives. In an integrand $L$ of a functional, if a function $f$ is varied by adding to it another function $\delta f$ that is arbitrarily small, and the resulting $L$ is expanded in powers of $\delta f$, the coefficient of $\delta f$ in the first order term is called the functional derivative.

For example, consider the functional

$$J[f] = \int_a^b L[x, f(x), f'(x)]\,dx,$$

where $f'(x) \equiv df/dx$. If $f$ is varied by adding to it a function $\delta f$, and the resulting integrand $L(x, f+\delta f, f'+\delta f')$ is expanded in powers of $\delta f$, then the change in the value of $J$ to first order in $\delta f$ can be expressed as follows:[11][Note 5]

$$\delta J = \int_a^b \frac{\delta J}{\delta f(x)}\,\delta f(x)\,dx.$$

The coefficient of $\delta f(x)$, denoted as $\delta J/\delta f(x)$, is called the functional derivative of $J$ with respect to $f$ at the point $x$.[3] For this example functional, the functional derivative is the left hand side of the Euler–Lagrange equation,[12]

$$\frac{\delta J}{\delta f(x)} = \frac{\partial L}{\partial f} - \frac{d}{dx}\frac{\partial L}{\partial f'}.$$
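A concrete instance may help. For the assumed integrand $L = \tfrac{1}{2}f'(x)^2$, the Euler–Lagrange expression $\partial L/\partial f - \frac{d}{dx}\,\partial L/\partial f'$ reduces to $-f''(x)$. The sketch below discretizes $J[f]$ on a uniform grid with forward differences, changes one interior grid value, and divides the change in $J$ by the grid spacing. For $f(x) = \sin x$ the result should be close to $-f''(x) = \sin x$. The grid size, the sample function, and all names are choices made for this illustration.

```python
import math

N = 100                                   # number of grid intervals on [0, 1]
H = 1.0 / N                               # grid spacing
XS = [i * H for i in range(N + 1)]        # grid nodes, including both endpoints

def J(f):
    """Discretized J[f] = integral of (1/2) f'(x)^2 dx, using forward differences."""
    return sum(0.5 * ((f[i + 1] - f[i]) / H) ** 2 * H for i in range(N))

def functional_derivative_fd(f, i, eps=1e-4):
    """Central difference of J in the single interior value f[i], divided by H."""
    up = list(f)
    up[i] += eps
    dn = list(f)
    dn[i] -= eps
    return (J(up) - J(dn)) / (2.0 * eps * H)

f = [math.sin(x) for x in XS]

i = 40                                    # interior node near x = 0.4
numeric = functional_derivative_fd(f, i)
analytic = math.sin(XS[i])                # Euler-Lagrange expression: -f''(x) = sin(x)
```

Only interior nodes are varied, which corresponds to keeping the variation $\delta f$ zero at the endpoints $a$ and $b$.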

Using the delta function as a test function

In physics, it is common to use the Dirac delta function $\delta(x-y)$ in place of a generic test function $\phi(x)$ to obtain the functional derivative at the point $y$ (this is a point of the whole functional derivative, just as a partial derivative is a component of the gradient):

$$\frac{\delta F[\rho(x)]}{\delta\rho(y)} = \lim_{\varepsilon\to 0}\frac{F[\rho(x)+\varepsilon\delta(x-y)] - F[\rho(x)]}{\varepsilon}.$$

This works in cases where $F[\rho(x)+\varepsilon f(x)]$ can formally be expanded as a series (or at least up to first order) in $\varepsilon$. The formula is, however, not mathematically rigorous, since $F[\rho(x)+\varepsilon\delta(x-y)]$ is usually not even defined.

The definition given in a previous section is based on a relationship that holds for all test functions $\phi$, so one might think that it should also hold when $\phi$ is chosen to be a specific function such as the delta function. However, the latter is not a valid test function.

In the definition, the functional derivative describes how the functional F[φ(x)] changes as a result of a small change in the entire function φ(x). The particular form of the change in φ(x) is not specified, but it should stretch over the whole interval on which x is defined. Employing the particular form of the perturbation given by the delta function has the meaning that φ(x) is varied only in the point y. Except for this point, there is no variation in φ(x).

Functional equation

The traditional usage also applies when one talks about a functional equation, meaning an equation between functionals: an equation F=G between functionals can be read as an 'equation to solve', with solutions being themselves functions. In such equations there may be several sets of variable unknowns, like when it is said that an additive function f is one satisfying the functional equation

f(x+y)=f(x)+f(y).

Functional derivative and functional integration

Functional derivatives are used in Lagrangian mechanics. They are derivatives of functionals: that is, they carry information on how a functional changes when the input function changes by a small amount. See also calculus of variations.

Richard Feynman used functional integrals as the central idea in his sum over the histories formulation of quantum mechanics. This usage implies an integral taken over some function space.
