Calculus of Variations/CHAPTER XIII

From testwiki
Revision as of 21:47, 26 April 2022 by imported>ShakespeareFan00
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

CHAPTER XIII: STATEMENT OF THE PROBLEM. DERIVATION OF THE NECESSARY CONDITIONS.

  • 179 The general problem stated.
  • 180 Existence of substitutions by which one integral remains unchanged while the other is caused to vary. An exceptional case.
  • 181 Case of two variables. Convergence of the series that appear.
  • 182 The nature of the substitutions that have been introduced.
  • 183 Formation of certain quotients which depend only upon the nature of the curve.
  • 184 Generalization, in which several integrals are to retain fixed values.
  • 185 The quotient of two definite integrals being denoted by λ, it is shown that λ has the same constant value for the whole curve.
  • 186 The differential equation G(0)λG(1)=0.
  • 187 Extension of the theorem of Article 97.
  • 188 Discontinuities, etc.
  • 189 The second variation: the three conditions formulated in Article 135 are also necessary here.

Article 179.
The nature of many problems whicli arise in the Calculus of Variations presents subsidiary conditions which limit the arbitrariness that we have hitherto employed in the indefinitely small variations of the analytical structure. Such problems are the most difficult and at the same time the most interesting that occur. These last conditions which enter into the requirement for a maximum or a minimum are in general of a double nature. On the one hand, it may be proposed that among the variables there are to exist equations of condition, as indicated in Arts. 176 and 177. On the other hand, we may require that the maximum or the minimum in question satisfy a further condition, viz., it must cause another given integral to have a prescribed value. Such cases are usually called Relative Maxima and Minima.

If we limit our discussion to the region of two variables, then the problem which we have to consider may be expressed as follows (cf. Art. 17):

Let F(0)(x,y,x,y) and F(1)(x,y,x,y) be two functions of the same nature as the function F(x,y,x,y) hitherto treated. The variables x and y are to be so determined as one-valued functions of t that the curve defined through the equations x=x(t),y=y(t) will cause the integral

1)I(0)=t0t1F(0)(x,y,x,y)dt

to be a maximum or a minimum, while at the same time for the same equations the integral

2)I(1)=t0t1F(1)(x,y,x,y)dt

will have a prescribed value; that is, for every indefinitely small variation of the curve for which the second integral retains its sign unaltered, the first integral, according as a maximum or a minimum is to enter, must be continuously smaller or continuously greater than it is for the curve x=x(t),y=y(t).

Article 180.
We must first show that it is possible to represent analytically the variations of a curve for which the integral I(1) retains a constant value.

In the place of the variables x,y let us make the substitution x+ξ¯,y+η¯. The variation of the second integral is accordingly

3)ΔI(1)=t0t1G(1)w¯dt+t0t1(ξ¯,η¯,ξ¯,η¯)2dt

where (ξ¯,η¯,ξ¯,η¯)2 denotes that the terms within the brackets are of the second and higher dimensions in ξ¯,η¯,ξ¯,η¯.

We have so to determine ξ¯ and η¯; that ΔI(1)=0. For this purpose we write

4)ξ¯=ϵξ+ϵ1ξ1+ϵ2ξ2+cdotsη¯=ϵη+ϵ1η1+ϵ2η2

where ϵ,ϵ1, are arbitrary constants and the functions ξ,ξ1,,η,η1, are functions similar to the quantities of the preceding Chapters and vanish for t=t0 and t=t1. Now write

wi=yξixηi

and

w¯=yξ¯xη¯=ϵw+ϵ1w1+ϵ2w2+

Hence, from 3) we have

ΔI(1)=ϵt0t1G(1)wdt+ϵ1t0t1G(1)w1dt++(ϵ,ϵ1,)2

If we write

5)Wi(1)=t0t1G(1)widt

it follows that

ΔI(1)=ϵW(1)+ϵ1W1(1)+ϵ2W2(1)++(ϵ,ϵ1,ϵ2,)2

The functions Wi(i) are completely determined as soon as definite values are given to ξ,ξ1,; and, in order that ΔI(1)=0, it is necessary that

(A)ϵW(1)+ϵ1W1(1)+ϵ2W2(1)++(ϵ,ϵ1,ϵ2,)2=0

If any of the quantities Wi(1), for example Wλ(1), are different from zero, we are able to express ϵλ in a power-series of the remaining ϵ's, when these quantities have been chosen sufficiently small.[1] The equation ΔI(1)=0 may consequently be satisfied for sufficiently small systems of values of the ϵ's.

Substitute one of these systems of values in 4) and it is seen that indefinitely small variations of the curve x=x(t),y=y(t) exist for which the integral I(1) remains unaltered. These variations may be analytically represented (see the next Article).

This proof is deficient in the case where all the quantities W(1),W1(1), are zero for all values of ξi,ηi, however ξi,ηi may have been chosen. When this is the case, G(1) must be zero along the whole curve. But this is one of the necessary conditions that the integral I(1)have a maximum or a minimum value.

If, then, for the curve which is derived through the solution of the differential equation G(0)=0 there also enters a maximum or a minimum value of the integral I(1) and consequently G(1)=0, it is in general not possible so to vary the curve that the second integral remains unaltered.

This case is excluded from the present discussion, and is left for special investigation in each particular problem.

Article 181.
Let us limit ourselves for the present to the simplest case where

ξ¯=ϵξ+ϵ1ξ1η¯=ϵη+ϵ1η1

and if we denote the integrals in the expansion of ΔI(1) that are associated with the coefficients eie1j by Wij(1), the equation correresponding to (A) of the last article is

(Aa)0=W10(1)+ϵ1W01(1)+ϵ2W20(1)+ϵϵ1W11(1)+ϵ12W02(1)+

which series we suppose convergent for sufficiently small values of ϵ and ϵ1.

Suppose next we express ϵ1 in terms of ϵ by the series

ϵ1=h1ϵ+h2ϵ2+h3ϵ3+

Then, when this value of ϵ1 is substituted in Aa, by equating the coefficients of the different powers of ϵ to zero, we have

W10(1)+h1W01(1)=0
W20(1)+h1W11(1)+h12W02(1)+h2W01(1)=0

.....................................

Hence, denoting the quotients Wij(1)W01(1) by Vij, where W01(1)0, we have

h1=V10
h2=V20+h1V11+h12V02

.............................

Further, the equation (Aa) may be written

(Ab)ϵ1=ϵV10+ϵ2V20+ϵϵ1V11+ϵ12V02+

Let us compare this series with the series

(C)ϵ1=g1(ϵr+ϵ1r1)ggϵ1r1
=g[ϵr+(ϵr+ϵ1r1)2+(ϵr+ϵ1r1)3+]

Suppose from this series we have ϵ1 expressed in terms of ϵ in the form

(Bb)ϵ1=h1ϵ+h2ϵ2+h3ϵ3+

where the hs have been derived from the coefficients of powers of ϵ and ϵ1 as the hs in (B) are formed from the coefficients V in (Ab).

The series (Bb) is convergent for

|ϵr|+|ϵ1r1|<1

If, then, the coefficients V of (Ab) are in absolute value less than the corresponding coefficients in (C), the coefficients h in (B) are less in absolute value than the coefficients h in (Bb), and therefore the series (B) is convergent.

Now the coefficients of ϵkϵ1μ in (Ab) and (C) are respectively

Vk,μ and (k+μk)grkr1μ

where the symbol (mn) denotes m(m1)(mn+1)n!. Hence for sufficiently small values of r and r1, if

|ϵr|+|ϵ1r1|<1

and

Vk,μ<(k+μk)grkr1μ

the series (B) is convergent, and when substituted in the expression for ΔI(1) causes this expression to vanish.

Article 182.
The expression for ϵ1 as a function of ϵ is had from the relation

ϵ1=g1(ϵr+ϵ1r1)ggϵ1r1

Hence, it follows that

(ϵ1r1)2+(ϵrr1r1+g)ϵ1r1+gϵr(r1+g)=0

or

ϵ1r1=12[r1r1+gϵr±(ϵrr1r1+g)24gϵr(r1+g)]

Of the two roots we choose the one with the lower sign in order that ϵ1 equal zero with ϵ. This root may be written

ϵ1r1=12[r1r1+gϵr(r1r1+gϵr)1(4gϵr(r1+g))(r1r1+gϵr)2]

It is seen that the expression under the radical is finite, continuous and one-valued for values of ϵ such that

ϵr<r1r1+g and rgϵr(r1+g)<(r1r1+gϵr)2

Article 183.
Returning to the substitutions

ξ¯=ϵξ+ϵ1ξ1η¯=ϵη+ϵ1η1

we assume that the functions ξ,ξ1,η,η1 become zero at the endpoints (or limits) of the curve and are so chosen that W01(1) does not vanish within the limits of integration. We have then at once from (Aa) the power-series

ϵ1=W10(1)W01(1)ϵ+ϵP(ϵ)

where the power-series P(ϵ) vanishes with ϵ.

From this we have

6)ξ¯=ϵ(ξW10(1)W01(1)ξ1)+ϵξ1P(ϵ)η¯=ϵ(ηW10(1)W01(1)η1)+ϵη1P(ϵ)

If we subject the integral I(0) to the same variation, we have [cf. formula (Aa)]

ΔI(0)=ϵW10(0)+ϵ1W01(0)+(ϵ,ϵ1)2

and consequently

ΔI(0)=ϵ(W10(0)W10(1)W01(1)W01(0))+(ϵ)2

If then, the integral I(0) is to have a maximum or a minimum value, it is necessary that

W10(0)W10(1)W01(1)W01(0)

be equal to zero.

We have, therefore, the necessary condition

W10(0)W10(1)=W01(0)W01(1)

From this it is seen that the quotient W10(0)W10(1), is independent of the arbitrary functions ξ,η, since it does not vary if we write for ξ,η as functions of t other functions ξ1,η1. Consequently it follows that the value of the above quotient depends only upon the nature of the curve x=x(t),y=y(t).

Article 184.
We might generalize the problem treated above by requiring the curve x=x(t),y=y(t) which minimizes or maximizes the integral

I(0)=t0t1F(0)(x,y,x,y)dt

while at the same time the following integrals have a prescribed value:

I(1)=t0t1F(1)(x,y,x,y)dt
I(2)=t0t1F(2)(x,y,x,y)dt

...............................................

I(μ)=t0t1F(μ)(x,y,x,y)dt

the functions F(0),F(1),,F(μ), being of the same nature as the function defined in Chapter I.

We must now consider the deformation of the curve caused by the variations

ξ¯=ϵξ+ϵ1ξ1+ϵ2ξ2++ϵμξμη¯=ϵη+ϵ1η1+ϵ2η2++ϵμημ

We have, then, if we write wi=yξixηi(i=1,2,,μ), and suppose that the ξ's and η's vanish for t=t0 and t=t1

ΔI(0)=ϵt0t1G(0)wdt+ϵ1t0t1G(0)w1dt++ϵμt0t1G(0)wμdt+(ϵ,ϵ1,,ϵμ)2
ΔI(1)=0=ϵt0t1G(1)wdt+ϵ1t0t1G(1)w1dt++ϵμt0t1G(1)wμdt+(ϵ,ϵ1,,ϵμ)2
ΔI(2)=0=ϵt0t1G(2)wdt+ϵ1t0t1G(2)w1dt++ϵμt0t1G(2)wμdt+(ϵ,ϵ1,,ϵμ)2

.................................................

ΔI(μ)=0=ϵt0t1G(μ)wdt+ϵ1t0t1G(μ)w1dt++ϵμt0t1G(μ)wμdt+(ϵ,ϵ1,,ϵμ)2

By means of the last μ equations, if the determinant

|t0t1G(i)wjdt|i,j=1,2,,μ

is different from zero, we may, for sufficiently small values of ϵ1,ϵ2,,ϵμ, express these quantities as convergent power-series in ϵ[2]

These power-series when substituted in ΔI(0) cause it to have the form

ΔI(0)=ϵD+(ϵ)2

where

D=|t0t1G(i)wjdt|i,j=0,1,,μandw0=w

In order that the integral I(1) have a maximum or a minimum value, it is therefore necessary that

D=

This determinant, when expanded, may be written in the form

t0t1[λ0G(0)+λ1G(1)++λμG(μ)]wdt=0

where λi is the first minor of t0t1G(i)wdt in the determinant D.

Hence, as before (cf. Art. 79, where we had G=0), we have here

λ0G(0)+λ1G(1)++λμG(μ)=0

Article 185.
Similarly, if in Art. 183 we denote the quotient W10(0)W10(1) by λ and then give to W01(0) and W01(1) their values, we have

t0t1(G(0)λG(1))wdt=0

From this it follows that

G(0)λG(1)

We may prove a very important theorem regarding the constant λ, viz: -it has one and the same value for the whole curve; i. e., we always have the same value of λ, whatever part of the curve x=x(t),y=y(t) we may vary. Consider the values of t laid off on a straight line, and suppose that the constant λ has a definite value for, say, the interval t2t3 which also corresponds to a certain portion of curve. This value (see Art. 183) is independent of the manner in which the portion of curve t2t3 has been varied. Next consider an interval tt which includes the interval t2t3; then, there belongs to all the possible variations of the interval tt, also that variation by which tt2 and t3t remain unchanged and only t2t3, varies. As λ has a definite value for this interval and is independent of the manner in which the curve has been varied, it must have the same value for tt.

Article 186.
The differential equation G(0)λG(1)=0 is the same as the one we would have if we require that the integral

t0t1f(x,y,x,y)dt

have a maximum or a minimum value, where F is written for the function

F(0)λF(1)

Through this differential equation (See Art. 90) ;x and y are expressible in terms of t and λ and two constants of integration α and β in the form

x=ϕ(t,α,β,λ)y=ψ(t,α,β,λ)

The curve represented by these equations is a solution of the problem, when indeed a solution is possible.

Article 187.
We prove next a very important theorem which often gives a criterion whether a sudden change in direction can take place or not within a stretch where the variation is unrestricted (cf. Art. 97). Suppose that on a position t=t, where the variation is unrestricted, a sudden change in direction is experienced. On either side of t take two points t1 and t2 so near to t that within the intervals t1t and tt2 a similar discontinuity in change of direction is not had. Among the possible variations there is one such that the whole curve remains unchanged except the interval t1t2, which is, of course, varied in such a way that the integral I(1) retains its value. The variation of the integral I(0) depends then only upon the variation of the sum of integrals

t1tF(0)(x,y,x,y)dt+t't2F(0)(x,y,x,y)dt

We cause a variation in the stretch t1t2 by writing

ξ¯=ϵξ+ϵ1ξ1η¯=ϵη+ϵ1η1

where we assume that

(A)

ξ,ξ1,η,η1 are all zero for t=t1 and t=t2
ξ,ξ1,η1 are zero for t=t1
η0 for t=t1

We may then always determine ϵ1 as a power-series in ϵ so that ΔI(1)=0.

If by ϕ we denote an expression of the form ϕ(0)λϕ(1), we have (Art. 79)

ΔI(0)=ϵt1tGwdt+epsilontt2Gwdt+ϵ[(ξλξ1)Fx+(ηλη1)Fy]t1t+ϵ[(ξλξ1)Fx+(ηλη1)Fy]tt2+ϵ(ϵ)

If the curve x=x(t),y=y(t) minimizes or maximizes the integral I(0), it is necessary that the coefficient of ϵ on the right-hand side of the above expression be zero. Since G=0 for unrestricted variation, it follows from the assumption (A) that

ηt[(Fy)t(Fy)t+]=0

If in the assumptions (A) we assume for t=t1 that η=0 and ξ0, we have an analogous equation for x.

It therefore follows (cf. Art. 97) that

[(F(0)λF(1))x]t=[(F(0)λF(1))x]t+
[(F(0)λF(1))y]t=[(F(0)λF(1))y]t+

We have then the theorem : Along those positions which are free to vary of the curve which satisfies the differential equation G=0, the quantities Fx and Fy vary everywhere in a continuous manner, even on such positions of the curve where a sudden change in its direction takes place.

Article 188.
It is obvious that these discontinuities may all be avoided, if we assume that ξ,η,ξ1,η1 vanish at such points. This we may suppose has been done. We may also impose many other restrictions upon the curve ; for example, that it is to go through certain fixed points, or that it is to contain certain given portions of curve, or that it is to pass through a certain limited region. In all these cases there are points on the curve which cannot vary in a free manner. But whatever condition may be imposed upon the curve, the following theorem is true.

All points which are free to vary and there always exist such points must satisfy the differential equation G(0)λG(1)=0, and for all such points the constant λ has the same value.

Article 189.
The second variation. We assume that the variations at the limits and at all points of the curve where there is a discontinuity in the direction, vanish. We also suppose that the variations ξ¯,η¯ have been so chosen that ΔI(1)=0.

We then have (cf. Art. 115):

ΔI(0)=ϵδI(0)+ϵ22t0t1[F1(0)(dwdt)2+F2(0)w2]dt+(ϵ)3
0=ϵδI(1)+ϵ22t0t1[F1(1)(dwdt)2+F2(1)w2]dt+(ϵ)3

and consequently

ΔI(0)=ϵ[δI(0)λδI(1)]+ϵ22t0t1[F1(dwdt)2+F2w2]dt+(ϵ)3

Since

δI(0)λδI(1)=t0t1(G(0)λG(1))wdt=0

it follows that

ΔI(0)=ϵ22t0t1[F1(dwdt)2+F2w2]dt+(ϵ)3

This last integral may be written at once (Art. 119) in the form

ΔI(0)=ϵ22t0t1F1(dwdtwududt)2dt

where u is determined from the differential equation (Art. 118)

J=F1d2udt2+dF1dtdudtF2u=0

It follows here as a necessary condition for the existence of a maximum or a minimum that F1 for all portions of the curve at which there is free variation, must in the first case be everywhere negative' and in the second case everywhere positive' and must also be different from 0 and . In order that this transformation of the integral be possible the equation J=0 must admit of being integrated in such a way that u is different from zero on all portions of curve, which vary freely (Art. 128).

We shall determine in Chapter XVII whether the three necessary conditions thus formulated are also sufficient for a maximum or a minimum value of the integral I(1). By means of the example in the next Chapter, we shall also show that if there exists a curve, for which the first integral has a maximum or a minimum value while the second integral retains a given value, then the curve is determined through the three conditions, which are the same here as those formulated in Art. 135. The behavior of the -function is then decisive regarding whether there in reality exists a maximum or a minimum.

Template:BookCat

  1. Cf. Lectures on the Theory of Maxima and Minima, etc., p. 20.
  2. Cf. Lectures on the Theory of Maxima and Minima of Functions of Several Variables, p. 21.