Calculus of Variations/CHAPTER XII

From testwiki
Revision as of 21:46, 26 April 2022 by imported>ShakespeareFan00
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

CHAPTER XII: A FOURTH AND FINAL CONDITION FOR THE EXISTENCE OF A MAXIMUM OR A MINIMUM, AND A PROOF THAT THE CONDITIONS WHICH HAVE BEEN GIVEN ARE SUFFICIENT.

  • 153 The notion of a field continued from the preceding Chapter.
  • 154 The function (x,y,p,q,p¯,q¯).
  • 155 The function must have the same sign for every point of the curve.
  • 156 The sufficiency of the above condition.
  • 157 Another form of the function .
  • 158 Still another form.
  • 159,160 The signs of the function and F1.
  • 161 Another proof of the sufficiency of the condition as given in Article 156.
  • 162 The function cannot be zero along an entire curve in the given field.
  • 163 The envelope of conjugate points.
  • 164 The curve may be composed of a finite number of regular traces.
  • 165 Cases where the traces are not regular.
  • 166 Generalizations in the Integral Calculus.
  • 167,168,169,170,171,172 Applications to the four problems already considered.
  • 173 When F(x,y,x,y) is a rational function of x and y, there can exist neither a maximum nor a minimum value of the integral.
  • 174 General summary.
  • 175 Extensions and generalizations: Instead of the determination of a structure of the first kind in the domain of two variables, it may be required to determine a structure of the first kind in the domain of n quantities.
  • 176 When equations of condition exist among the variables.
  • 177 When the second and higher derivatives appear.
  • 178 The Calculus of variations applied to the determination of structures of a higher kind. The minimal surfaces.

Article 153.
In the preceding Chapter we considered the family of curves that have the same initial point A and satisfy the differential equation G=0. These deviate very little from one another in their initial direction. We saw that the curves again intersect only in the neighborhood of points that are the conjugates of A, the conjugate point along any curve being the limiting position of the point of intersection of this curve and a neighboring curve when the angle between their initial directions becomes infinitesimally small. All points that lie on these curves before the points that are conjugate to A form a connected portion of surface ; that is, if P1 is a point belonging to this collectivity of points, a boundary may be described about P1 so that all points within this boundary also belong to the collectivity of points.

For, let

x=ϕ(t,α,β)y=ψ(t,α,β)

be the equations of a given curve which satisfies G=0, and let the coordinates of a point on this curve be

x1=ϕ(t1,α,β)y1=ψ(t1,α,β)

Further, let x1+ξ,y1+η be the coordinates of another point P2 that lies in the neighborhood of P1, so that ξ,η are quantities arbitrarily small.

We may then (Art. 151) draw a curve between A and P2 which satisfies the differential equation G=0, if we can determine four quantities τ,τ,α,β as power-series in ξ,η in such a way that the following equations are true:

0=ϕ(t0)τ+ϕ1(t0)α+ϕ2(t0)β+(τ,α,β)20=ψ(t0)τ+ψ1(t0)α+ψ2(t0)β+(τ,α,β)2ξ=ϕ(t1)τ+ϕ1(t1)α+ϕ2(t1)β+(τ,α,β)2η=ψ(t1)τ+ψ1(t1)α+ψ2(t1)β+(τ,α,β)2

Since the determinant of these equations (Art. 151) is Θ(t1,t0) and is different from zero, the point t1 not being conjugate to t0, it follows that the quantities τ,τ,α,β may be developed in powerseries in ξ,η which are convergent for small values of these quantities.

Consequently, a curve may be drawn through A and P2 which satisfies the differential equation G=0, and this curve will be neighboring the first curve and will deviate as little as we wish in direction from its initial direction, if ξ,η, and consequently also τ,τ,α,β are sufficiently small.

If we form the determinant for the curve AP2, which, when put equal to zero, is the equation for the determination of the point conjugate to A, it is seen that this determinant also may be developed as a power-series in ξ,η, which becomes Θ(t1,t0) when ξ=η=0. The function Θ(t1,t0) is different from zero when sufficiently small values are ascribed to ξ,η. Consequently, within the interval AP2, there is present no point which is conjugate to A.

We may therefore envelop the interval situated between two conjugate points of the original curve by a narrow surface area, which is of such a nattire that a curve, and only one, may be drawn from the point A to any point within it, which satisfies the differential equation G=0, is neighboring the first curve and deviates in its initial direction only a little from it.

Article 154.
Let a portion of curve P0P1. satisfying the differential equation G=0, be given, which is of such a nature that for no point on it F1=0 or , and suppose that the point conjugate to P0 does not lie before P1. Between P0 and P1 take an arbitrary point P2 and draw through P2 a regular curve.[1] On this curve we choose a point P3 so close to P2 that a curve may be drawn through P0 and P3 which satisfies the differential equation G=0, and which lies entirely within the strip of surface defined above. Let us consider the change in the integral when we take it over P0P3+P3P2 instead of over P0P2. We may denote an integral taken over a curve that satisfies the differential equation by I, and one over an arbitrary curve by I¯, and we may denote the direction of integration by added indices. We have therefore to compute the expression

ΔI=I03+I¯32I02

or

ΔI=(P0P3FdtP0P2Fdt)+P0P2Fdt

an expression which (Art. 79)

=ϵ(P0P2Gwds+[ξFx+ηFy]t2)+(ϵ2)+P0P2Fdt

where ξ,η are measured in the direction from P2 to P3.

At the point P2 and along the curve P3P2 in the direction P3P2 we have

ϵξ=x¯2dt¯+(dt¯)2ϵη=y¯2dt¯+(dt¯)2

dt¯ denoting that this differential is taken with respect to the curve P3P2.

If we consider the arguments in F expressed as functions of t¯ along the curve P3P2, it follows that

P0P2Fdt=F(x2,y2,x¯2,y¯2)dt¯+(dt¯)2

Hence at the point P2, which is an arbitrary point of the curve P0P1, we have

ΔI=(F(x2,y2,x¯2,y¯2)[x¯2x2F(x2,y2,x2,y2)+y¯2y2F(x2,y2,x2,y2)])dt¯+(dt¯)2(a)

The function F is homogeneous of the first order (Art. 68) with respect to its third and fourth arguments, so that (see Art. 72)

F(x2,y2,x¯2,y¯2)=x¯2F(1)(x2,y2,x¯2,y¯2)+y¯2F(2)(x2,y2,x¯2,y¯2)

We define by (x,y,x,y,x¯,y¯) the expression

1)(x,y,x,y,x¯,y¯)=x¯(F(1)(x2,y2,x¯2,y¯2)F(1)(x2,y2,x2,y2))+y¯(F(2)(x2,y2,x¯2,y¯2)F(2)(x2,y2,x2,y2))

Hence at the point P2 it follows that

ΔI=(x,y,x,y,x¯,y¯)dt¯+(dt¯)2

when in the function we have substituted for the arguments those values that belong to the point P2. The direction-cosines of the curve P0P2 at P are denoted by

p2=x2x2'2+x2'2andq2=y2x2'2+x2'2

and those of the curve P3P2 at P2 by p¯2 and q¯2. It is evident from a consideration of the right-hand side of the formula defining above (and cf. Art. 68) that

(x,y,x,y,x¯,y¯)x¯2'2+y¯2'2=(x,y,p,q,q¯,q¯)

Article 155.
If further we denote by σ the differential of arc P3P2, we have finally

2)ΔI=(x,y,p,q,p¯,q¯)σ+(σ)2

Accordingly, if we take σ sufficiently small; that is, if we choose the point P3 very close to P2, then we may always bring it about that the change in the integral has the same sign as that of the function .

The point P2 was an arbitrary point on the curve P0P1, and P2P3 also represented an arbitrary direction.

It follows that if for any point P2 and for any direction at P2 the function were negative, and for any other point and direction positive, then the given curve could vary in such a manner that the change in the integral is at one time positive and at another time negative. We have, therefore, the following theorem :

If the integral taken over the curve P0P1 which satisfies the differential equation G=0 is to be a maximum or a minimum, then the function must have the same sign for every point of the curve, and at every point of the curve for any direction, and this sign m.ust be negative for a maximum and positive for a minimum,.

Article 156.
That the above condition is sufficient to assure the existence of a maximum or a minimum may be shown as follows : Let P0(I)P1 be a curve which satisfies the four conditions already established (and recapitulated in Art. 174), and let P0(II)P1 be any arbitrary curve that lies in the field about the curve P0(I)P1. It is subject only to the condition that it must be a regular curve and lie wholly in the given field.

By varying the parameters α and β we can construct a system of curves as near as we like to one another, all satisfying the differential equation G=0. These curves cut the curve P0(II)P1 in two (or perhaps more) points. They do not cut the curve P0(I)P1 or intersect among themselves within the field in question. The function must have the same sign along each of these curves as it has along the curve P0(I)P1. For, take an arbitrary point P on any of these curves. Then on the curve P0(I)P1 there is a point for which the quantities x,y,p,q differ only a little from the quantities that belong to the point P, and consequently has the same sign for both points.

Consider now the variation in our integrals as we pass from P0(I)P1 to P0P2P3P1 and from P0P2P3P1 to P0P4P5P1, etc. As we saw in the preceding article, the variation caused by passing from P0(I)P1 to P0P2P3P1

=t0t2Fdt+ϵ[Fxξ+Fyη]t2t3+t3t1Fdt
=t0t2Fdt[p¯Fx+q¯Fy]t2σ+t3t1Fdt[p¯Fx+q¯Fy]t3σ

p¯,q¯ being the direction cosines of the tangent to the curve P0(II)P1 at the points P2 and P3, which, we notice, have opposite signs at these points.

If we denote the integration along the curves by the curves themselves, it is seen at once that the variation in these integrals may be expressed by

P0P2P31P1P0(I)P1=[]t0σ+[]t1σ+(σ)2

where the first is the length from P0 to P2 and the second from P3 to P1.

Similarly the differences in the integrals along

P0P4P51P1P0P2P3P1=[]t2σ+[]t3σ+(σ)2
P0P6P71P1P0P4P5P1=[]t4σ+[]t5σ+(σ)2
...................................
P0P2νP2ν+11P1P0P2ν2P2ν1P1=[]t2ν2σ+[]t2ν1σ+(σ)2
P0(II)P1P0P2νP2ν+1P1=[]t2νσ+[]t2ν+1σ+(σ)2

Adding these results together, we have the difference in the integrals along

P0(II)P1P0(II)P1=P0(II)P1σ+(σ)2

σ being a differential of arc along the curve P0(I)P1. This is a verification of the theorem stated at the end of the last Article.

We also see that, if we had not assured ourselves that none of the intermediary curves intersect, the signs of the σ's would not all have been alike, and consequently the sum total of all these σ's would not have constituted the curve P)(II)P1.

Article 157.
Another form of the function .

We have seen in the Integral Calculus that

f(p1,q1)f(p0,q0)=p0,q0p1,q1df(p,q)
=p0,q0p1,q1(f(p,q)pdp+f(p,q)qdq)
=k=0k=1(f(1)[p0+k(p1p0),q0+k(q1q0)](p1p0)+f(2)[p0+k(p1p0),q0+k(q1q0)](q1q0))dk

Hence, if we write

pk=p+k(p¯p)=(1k)p+kp¯
qk=q+k(q¯q)=(1k)q+kq¯

it is seen that

F(1)(x,y,p¯,q¯)F(1)(x,y,p,q)=k=0k=1(F(11)(x,y,pk,qk)(p¯p)+F(12)(x,y,pk,qk)(q¯q))dk
F(2)(x,y,p¯,q¯)F(2)(x,y,p,q)=k=0k=1(F(21)(x,y,pk,qk)(p¯p)+F(22)(x,y,pk,qk)(q¯q))dk

Note that (see Art. 73)

F(11)=qk2F1F(12)=pkqkF1F(22)=pk2F1

and further that F(12)=F(21).

By substituting these values in the above expressions, and in turn the resulting quantities in the expression for , we have

(x,y,p,q,p¯q¯)=p¯[F(1)(x,y,p¯,q¯)F(1)(x,y,p,q)]+q¯[F(2)(x,y,p¯,q¯)F(2)(x,y,p,q)]
=k=0k=1F1(x,y,pk,qk)([qk(p¯p)pk(q¯q)]qkp¯+[qk(p¯p)+pk(q¯q)]pkq¯)dk

The expression in the square brackets is

(qkp¯pkq¯)[qk(p¯p)pk(q¯q)]=(1k)(qp¯pq¯)2

and consequently

3)(x,y,p,q,p¯,q¯)=(qp¯pq¯)2k=0k=1F1(x,y,pk,qk)dk

This expression for in the form of a definite integral is defective, in that it has a meaning only when F1 remains finite for all values of pk and qk, as k varies between 0 and 1. For example, if k=1/2, then p1/2=p+(p¯p)/2=(p+p¯)/2, andi if p¯=p, then p1/2=0; in the same way for k=1/2 and q¯=q, then also q1/2=0. These two arguments being zero, F1 becomes infinite (cf. Art. 73). Further, if the two directions p,q and p¯,q¯ coincide, then becomes zero of the second order.

If OP and OP¯ are vectors of unit length with components p,q and p¯,q¯, then the components of OP, when P travels along the line PP¯, are pk,qk,k varying between 0 and 1.

Article 158.
Another form was given by Weierstrass to the expression , in which he avoided the defect mentioned above, by integrating along the arc of a circle instead of along the straight line PP¯. If we integrate along the arc of a circle of unit radius from the point P to the point P¯ we obtain an expression for which is universally true.

We have as before, if POX=τ, P¯OX=τ¯, ω=τ¯τ(mod 2π), and πω=π,

(x,y,p,q,p¯,q¯)=p¯[F(1)(x,y,p¯,q¯)F(1)(x,y,p,q)]+q¯[F(2)(x,y,p¯,q¯)F(2)(x,y,p,q)]
=cosτ¯[F(1)(x,y,cosτ¯,sinτ¯)F(1)(x,y,cosτ,sinτ)]+sinτ¯[F(2)(x,y,cosτ¯,sinτ¯)F(2)(x,y,cosτ,sinτ)]
=cosτ¯λ=0λ=ωdλF(1)[x,y,cos(τ+λ),sin(τ+λ)]+sinτ¯λ=0λ=ωdλF(2)[x,y,cos(τ+λ),sin(τ+λ)]

But, if F(1) denotes the derivative of F with respect to its third argument, etc.,

dλF(1)[x,y,cos(τ+λ),sin(τ+λ)]
=[Fcos,cos(11)sin(τ+λ)+Fcos,sin(12)cos(τ+λ)]dλ
=[sin3(τ+λ)sin(τ+λ)cos2(τ+λ)]F1dλ
=sin(τ+λ)F1[x,y,cos(τ+λ),sin(τ+λ)]dλ

similarly,

dλF(2)[x,y,cos(τ+λ),sin(τ+λ)]=cos(τ+λ)F1[x,y,cos(τ+λ),sin(τ+λ)]dλ

Hence, it follows that

(x,y,p,q,p¯,q¯)=λ=0λ=ω[cosτ¯sin(τ+λ)+sinτ¯cos(τ+λ)]F1dλ
=λ=0λ=ωsin(τ¯τλ)F1dλ=λ=0λ=ωsin(ωλ)F1[x,t,cos(τ+λ),sin(τ+λ)]dλ

If we write

ωλ=λ

the integral just written is

λ=0λ=ωsinλF1[x,y,cos(τ¯λ),sin(τ¯λ)]dλ=F1[x,y,cos(τ¯λ2),sin(τ¯λ2)]λ=0λ=ωdcosλ

where λ2 is intermediary between 0 and ω.

We therefore have finally

4)(x,y,p,q,p¯,q¯)=(1cosω)F1[x,y,cos(τ¯λ2),sin(τ¯λ2)]

If then F1[x,y,cos(τ¯λ),sin(τ¯λ')] has a constant sign between 0 and ω it follows also that (x,y,p,q,p¯,q¯) has this sign, since λ2 is one of the values of λ within this interval.

The above formula is true for all values of ω situated between π and +π, and since cos(τ¯λ2) and sin(τ¯λ2) cannot both be zero at the same time, it is seen that

F1[x,y,cos(τ¯λ2),sin(τ¯λ2)]

and consequently the expression 4 for has not the same defect as the one given in the preceding article.

Article 159.
For any displacement of the curve ω0, and consequently 1cosω is a positive quantity. Hence has the same sign as F1. If F1[x,y,cos(τ¯λ),sin(τ¯λ)] is found by examination to have always the same sign independently of cos(τ¯λ), sin(τ¯λ) for every point of the curve within the interval in question, then we may be convinced that there is a maxim.um, or a minimumof the integral without the derivation and examination of the function . By this process, however, we have shown without the second variation that the function F1(x,y,p,q) can change its sign for no point on the curve, and for no direction of the tangent to the curve at a point.

Article 160.
It is evident that if F1, considered as a function of its third and fourth arguments, has a definite sign, then has also the same sign; but if retains a definite sign, p and q being fixed while p¯ and q¯ are varied, it does not then follow that F1 always has a definite sign. This is illustrated in the following example, due to Schwarz :

Let

F(x,y,x,y)=αx'2+y'2+betaxy'2x'2+y'2=(α+βcosτsin2τ)x'2+y'2

It follows that

F1(x,y,x,y)=α(x'2+y'2)3/2+2βx(x'23y'2)(x'2+y'2)3=(α+2βcos3τ)1(x'2+y'2)3/2

and, since x'2+y'2=cos2λ+sin2λ=1,

(x,y,p,q,p¯,q¯)=λ=0λ=ωsin(ωλ)F1[x,y,cos(τ+λ),sin(τ+λ)]dλ
=λ=0λ=ωsin(ωλ)(α+2βcos3λ)dλ

where we have written τ+λ=λ or τ=0; i.e., we have taken the X-axis as the initial direction, from which ω is measured.

Noting that

sin(ω+2λ)+sin(ω4λ)=2sin(ωλ)cos3λ

it is seen that

(x,y,p,q,p¯,q¯)=(1cosω)[α+β(cosω+cos2ω)]

The greatest and least values that cosω+cos2ω can have are 2 and 1/4, the corresponding values of ω being 0 and 2π/3. Hence, if we we make α=l and /beta=1, the function is situated between the values

34(1cosω)and3(1cosω)

and can consequently vanish only for ω=0, and is never negativeOn the other hand, 1+2cos3τ changes sign repeatedly, for example, when τ=40.

Article 161.
The proof stated at the end of Art. 155 is of paramount importance in the determination whether there exists a true maximum or minimum. The proof of the sufficiency of this theorem, as illustrated in Art. 156, was given in a somewhat different form by Prof. Schwarz. Owing to its importance we add another proof, taken from the lectures of Weierstrass.

Let OO11 be the curve which satisfies the differential equation G=0, and let 0131 be the arbitrary curve in the field, as defined in Art. 156. Let 3 be any point on the arbitrary curve, whose coordinates we consider as functions of length of arc s (instead of t, as before). The point O1 is taken between 0 and 1 so that the curve 0131 may lie wholly within the field, since the field might terminate in a point at 0. From the point 0 we draw a curve to 3 which satisfies the differential equation G=0. We consider the sum of integrals I03+I¯31 as a function of s. This function we denote by (s). Further, take on the arbitrary curve a point 2 in the neighborhood of the point 3 and before it. Join the points 0 and 2 by a curve which satisfies the differential equation G=0. Then, if we denote the increment of s by σ, it is seen that

5)(sσ)(s)=I02+I¯21I03I¯31=I02I03+I¯23=(x3,y3,p3,q3,p¯3,q¯3)σ+(σ)2

In the same manner take a point 4 immediately after the point 3 on the arbitrary curve and join this point with the point 0 by a curve which satisfies the differential equation G=0. Then we have

6)(sσ)(s)=I04I03I¯34=(x3,y3,p3,q3,p¯3,q¯3)σ+(σ)2

It therefore follows that

7)limσ=0(sσ)(s)σ=limσ=0(s+σ)(s)σ=(x3,y3,p3,q3,p¯3,q¯3)

that is, the quantity (x3,y3,p3,q3,p¯3,q¯3) is the differential quotient of the function (s) at the point 3.

If, then, along the curve O131 the function is nowhere positive, the function (s) continuously diminishes when the point 3 slides from O1 toward the point 1.

Let the point </math>O_{1}</math>, which was taken very near the point 0, coincide with this point; then we can say :

If the function is nowhere positive and is not zero at every point of the arbitrary curve 031 the integral taken over the original curve is always greater than the integral extended over the curve 031 ; and if the function is not negative and not zero at evevy point of the curve 031 then the integral taken over the original curve 01 is continuously less than the integral extended over the arbitrary curve 031.

Article 162.
It remains yet to see if it is possible for the function to vanish along the whole curve 031. It appears from the formula 3) that this is possible only when along the whole curve we have

(pq¯qp¯)2=0 or pq¯qp¯=0

In this case every curve 03 which satisfies the differential equation G=0 has a common tangent at the point 3 with the curve 031.

We shall show that the curve MN which is formed of the points conjugate to the point 0 has this property, and that no curve having this property can be drawn from 0 within the region that is bounded by MN. In other words, is equal to zero along the curve MN, but is not equal to zero for all the points of any other curve that can be drawn within the region that is enveloped by MN.

All the curves that satisfy the differential equation G=0, which pass through one point, and whose initial directions differ from one another by very small quantities, may be represented (Art. 148) in the form

x=ϕ(t,k)y=ψ(t,k)

where the values of k are within certain limits.

To each curve corresponds a different value of k. If, therefore, we fix a value of k and take a second value k+k the curve which corresponds to this value may be expressed by the equations

x+ξ=ϕ(t+τ,k+k)y+η=ψ(t+τ,k+k)

where the same value of t corresponds to the initial directions of both curves.

If the latter curve is cut by the former we must have

0=ϕ(t)τ+ϕkk+(τ,k)20=ψ(t)τ+ψkk+(τ,k)2

The determinant of the linear terms of the equations just written gives, when put equal to zero, the equation for the determination of the point conjugate to the initial point, i. e.,

ϕ(t)ψkψ(t)ϕk=0 \qquad (A)

The smallest root of this equation, which is greater than the value t0 of t, gives the value of t, which belongs to the conjugate point. If this value is t1, then the coordinates of the point are

x¯=ϕ(t1,k)y¯=ψ(t1,k)

If we consider t1 as a function of k, defined through the equation (A), and if we give to k a series of values, the two equations just written represent the curve that is constituted of the points conjugate to 0.

The direction-cosines of the tangent to this curve are proportional to the quantities x¯k,y¯k. But we also have

x¯k=ϕ(t1,k)t1dt1dk+ϕ(t1,k)ky¯k=ψ(t1,k)t1dt1dk+ψ(t1,k)k

Multiply the first of these equations by ψ(t1,k)t1=ψ(t1), and subtract from it the second after it has been multiplied by ϕ(t1,k)t1=ϕ(t1). We have then, with the aid of (A),

ϕ(t1)dy¯dkψ(t1)dx¯dk=0

Since ϕ(t1),ψ(t1) are proportional to the direction-cosines of the tangent at a point t1 of the curve through t0 and t1, which satisfies the differential equation G=0, it follows from the above equation that the tangents to both curves at the point t1 coincide. Hence, the locus of the conjugate points to is the envelope of the curves through 0, which satisfy the differential equation G=0.

Article 163.
Let x¯=f(u) and y¯=g(u) be an arbitrary curve 031, which passes through the point 0, and is situated entirely within the region bounded by the envelope. Further, suppose that 031 does not coincide throughout its whole extent with any of the curves passing through 0, which satisfy the differential equation G=0. Suppose, however, that 031 is touched by the curves that pass through and satisfy the differential equation G=0. At the point of contact we must have

ϕ(t,k)=f(u)ψ(t,k)=g(u)

and

ϕtdgduψtdfdu=0 \qquad (B)

The values of t and u, which belong to the point of contact, are determined as functions of k through the first two equations.

These equations, being true for sufficiently small values of k, may be differentiated with respect to k, and we thus have:

ϕtdtdk+ϕk=dfdududkψtdtdk+ψk=dgdududk

If we multiply the first of these equations by dgdu and the second by dfdu and add we have with the aid of (B)

ϕkdgduψkdfdu=0

If between this equation and the equation (B) we eliminate the quantities dgdu and dfdu, we have

ϕtψkψtϕk=0

an equation, which served for the determination of the point conjugate to the initial point. Consequently the point of contact of the curve, that passes through 0 and satisfies the differential equation G=0, with the arbitrary curve must be the point conjugate to 0.

But this is possible only if the curve x¯=f(u),y¯=g(u) coincides with the envelope ; while according to our supposition the curve 031 is to lie entirely within the region that is bounded by the envelope. It follows that there can be within the region no curve 031 such that each of the curves which satisfies the differential equation G=0, and which joins the point 0 with a point of 031, touches 031 at the same time.

Hence, the quantity qp¯pq¯ can be everywhere zero only when the arbitrary curve between 0 and 1 coincides throughout its whole extent with one of the curves that passes through 0 and satisfies the differential equation G=0. But since, within the strip of surface inclosing the field as we have defined it, there can be only one curve drawn through 0 and 1 which satisfies the differential equation G=0, it follows that the arbitrary curve 031 can coincide only with the original curve 01, and then it is not a variation of that curve. It therefore follows that the function cannot vanish for all the points of the curve that has been subjected to variation.

Article 164.
It is not necessary that the curve 031 be a single trace of a regular curve in its whole extent. If we assume that 031 is composed of an arbitrary number of regular portions of curve, the integral may be regarded as the sum of the integrals over the single portions, and the conclusions made above are also applicable.

It may happen that one of the portions of curve coincides throughout its whole extent with a portion of one of the curves that goes through 0 and satisfies the differential equation G=O. If this is the case for 23, for example, so that is equal to zero along 23, then we may replace this portion of curve by an arbitrary portion of curve 23, which lies very near 23. Then the theorem proved above is true for the curve 0231, viz., that

I03<>I02+I¯23

according as the function is nowhere positive or nowhere negative along the curve 0231. Now, if we bring the curve 23 as near to the curve 23 as we wish, the absolute value of the difference I03I02I¯23 can be made smaller than any arbitrarily small quantity δ; and, in accordance with what was proved above, in the first case the difference I03I02I¯23 is certainly not negative, and in the second case it is not positive.

If we shove the point 3 further along the arbitrary curve toward 1, then, when 3 takes a position in the neighborhood of 4, it follows again that I04I03I¯34 is greater or less than zero, and, as above, we see that the integral I01, extended over the curve that satisfies the differential equation G=0, is greater or less than the integral taken over the arbitrary curve 0231, according as the function is nowhere negative or nowhere positive.

Article 165.
Further, it is not necessary that the single portions of the curve which has been subjected to variation be regular in order that our conclusions be correctly drawn, if only the coordinates can be expressed as functions of some quantity, and if these functions have derivatives. Finally, if we consider the variation made quite arbitrary, so that only the positions of the points are given, while it is not known whether their coordinates have derivatives, then indeed the integral taken over this curve has no longer any meaning. But the meaning of the integral may be extended so that it has a signification even in this case. For if at first we assume that the coordinates of the curve, which has been subjected to variation, are expressible through functions that have derivatives, then the integral taken over the curve is

t0t1F[f(t),g(t),f(t),g(t)]dt

This integral distributed into a sum of integrals (corresponding to the intervals t0τ1,τ1τ2,,τnt1 is equal to

t0τ1Fdt+τ1τ2Fdt++τnt1Fdt \qquad (C)

We assume that the points x0,y0;x1,y1;xn,yn;xn+1,yn+1 correspond to the values t0mτ1,τn,t1.

We then have:

x1x0=f(t0)(τ1t0)+(τ1t0)[τ1t0]
..........................................
xn+1xn=f(τn)(t1τn)+(t1τn)[t1τn]
y1y0=g(t0)(τ1t0)+(τ1t0)[τ1t0]
..........................................
yn+1yn=g(τn)(t1τn)+(t1τn)[t1τn]

where [τvτv1] denotes a quantity which becomes indefinitely small at the same time with τvτv1.

For the first of the integrals in the expression (C) we write:

x=x0+x0(tt0)+(tt0)[tt0]
y=y0+y0(tt0)+(tt0)[tt0]
x=x0+x0(tt0)+(tt0)[tt0]
y=y0+y0(tt0)+(tt0)[tt0]

for the second integral we write

x=x1+x1(tτ1)+(tτ1)[tτ1]
y=y1+y1(tτ1)+(tτ1)[tτ1]

and similarly for the other integrals.

These expressions we write in the sum of integrals (C), and, developing them in power-series, we have through integration

(τ1t0)F(x0,y0,x0,y0)+(τ2τ1)F(x1,y1,x1,y1)++(t1τn)F(xn,yn,xn,yn)

plus a similar number of terms, which become indefinitely small of the second order with respect to the quantities τvτv1.

We may therefore write the integral in the form

limn(v=1n+1F(xv1,yv1,xv1,yv1))

where we must understand by τ0 the value t0, and by tn+1 the value t1.

Since τvτv1 are positive quantities, and the functions F in regard to x1,y1 are homogeneous of the first degree, we may write the above limit in the form

limn(v=1n+1F(xv1,yv1,(τvτv1)xv1,(τvτv1)yv1))

or, since

xvxv1=(τvτv1)xv1+(τvτv1)[τvτv1]

the above expression is

limn(F(x0,y0,x1x0,y1y0)++F(xn,yn,xn+1xn,yn+1yn))

Article 166.
The integral in the above form has a more general meaning than the one hitherto employed, with which, however, it coincides in every particular where that one has a meaning. We may assume, with respect to any arbitrary variation, a series of points x0,y0;x1,y1;xn,yn;xn+1,yn+1 of such a nature that the distance between, say, two successive points does not exceed a certain quantity δ.

We then form the sum

F(x0,y0,x1x0,y1y0)++F(xn,yn,xn+1xn,yn+1yn)

If we make δ smaller and smaller by increasing the number of points, it may happen that this sum approaches a definite limit. We call this limit the value of the integral taken over the curve. It may also happen that the limit does not approach a definite value; for example, it may vacillate between two values. We then say the integral taken over this curve has no meaning.

If we think of the series of points that are taken upon the curve, joined together successively by a broken line, the integral taken over this broken line will approach the same limit as will the integral taken over the curve, if the integral has a meaning.

If, therefore, a curve 01 is given, which satisfies all the conditions that have hitherto been made for a maximum or a minimum, and if this curve varies in an arbitrary manner, then if the integral taken over the curve, which has been subjected to variation, has a meaning as defined above, we ma)' draw a broken line, the integral over which deviates as little as we wish from the integral taken over the curve that has been caused to vary and to which the theorem of Art. 161 is applicable. Consequently, we may say, in the case of a maximum, the integral taken over the curve subjected to variation cannot be greater than the integral taken over the original curve, and in the case of a m,inimum,, it cannot be less than the integral taken over the original curve.

Since we may make the region as narrow as we wish within which all the variations are to lie, we ma)' assume that upon the curve which has been varied a point 3 lies so near to 01 (but not upon it) that two curves 03, 31 can be drawn between the points and 3 and between 3 and 1, which also satisfy all the conditions of the problem.

For the sake of brevity, let us assume that we have to do with a maximum. Then, as we have just seen, the integrals over 03 and 31 cannot at all events be smaller than the integrals over the corresponding parts of the curve which has been varied ; but, after the preceding theorems, the integral taken over 01 is greater than the sum of the integrals taken over 03 and 31, and consequently also greater than the integral over the curve that has been varied. A maximum is therefore in reality present.

Article 167.
We may now investigate the behavior of the function in the case of the four problems which we last considered in Arts. 140–144.

The problem of the surface of rotation of minimum area.

We saw that the catenary between limits, within which were situated no pair of conjugate points, was the curve that described a surface of minimum area when rotated around the axis of the half-plane. From the point P0 we may draw in any direction a curve which satisfies the differential equation G=0 (a catenary); the function F1 is positive for each of these curves as soon as we limit ourselves to the half-plane in which y is positive. A true minimum will therefore in reality enter. For if p,q are the direction-cosines of the tangent to the catenary at any point, p¯,q¯ those of the tangent to any arbitrary curve through the same point, then, owing to the relations

F(1)(x,y,x,y)=yxx'2+y'2F(2)(x,y,x,y)=yyx'2+y'2

it follows that

F(1)(x,y,x,y)=ypF(2)(x,y,x,y)=yq

since

p2+q2=1

and consequently

(x,y,p,q,p¯,q¯)=y((p¯p)p¯+q¯q)q¯)=y(1(pp¯+qq¯))

The expression pp¯+qq¯ is the cosine of the angle between the two tangents. Hence we see that the function is negative for no point which comes under consideration, and for no two directions p,q and p¯,q¯.

If, therefore, y=0 for no point of the curve, our former conclusions are applicable, and a true minimum of the integral has, in reality, been found.

Article 168.
The Brachistochrone. We saw that this curve is the cycloid

x=g+r(1sint)y+a=r(1cost)

We assume that the point A, from which the moving point starts, having an initial velocity proportional to the quantity a, is the origin of coordinates, and that the Y-axis is the direction of gravity. We saw that the cycloid could then be generated by a point described by a circle which rolls upon the straight line y=a. If a is different from zero, an arc of a cycloid may be constructed through A in any direction. If the curve passes through a singular point it does not minimize the integral, as was shown in Art. 104. If A and B are not singular points, the function F1 has a positive value different from zero everywhere along this curve and in the neighborhood of it in every direction.

Between two arbitrary points (see Art. 105), when the quantity a is given, there can alwaj'-s be drawn one, and only one, arc of a cycloid which has no singular points between these two points. If, therefore, a is different from zero, and consequently A and B are not singular points, then (see Art. 159) it follows that the curve, in reality, causes the integral to have a minimum value. Suppose that A or B is a singular point; then at this point F1 becomes infinite, a case which we consider in the next Article.

Article 169.
Suppose A is a singular point and a=0. Draw an arbitrary curve between A and B. Take upon this curve in the neighborhood of A a point A1, and through A1 and B draw a cycloid which cuts the X-axis at A1. The material point under the action of gravity passes through A1 with the same velocity which it would have at an equal distance below the X-axis if it traversed the cycloid drawn through A and B.

The following notation may be introduced :

I01 to denote the time of falling between A1 and B upon the cycloid A1B,

I01 to denote the time of falling between A and A1 upon the arbitrary curve AB,

I to denote the time of falling between A1 and B upon the cycloid A1B,

I to denote the time of falling between A1 and B upon the arbitrary curve A1B.

We proved that

I>I

and therefore, if we write

I¯=I01+I

it follows that

I¯>I+I01

Now, let the point A1 approach nearer and nearer the point A, so that the integral I approaches the limit I01, while I01 becomes indefinitely small. We must then have

I¯I01

That I¯ is greater than I01 may be seen as follows: As soon as G0 along a portion of curve, we may always vary it in such a way that the increment in the corresponding integral may have any sign. If, then, G0 along the whole curve AA1B, we may substitute another curve, for which, if I is the value of the integral which belongs to it,

I<I¯

But since we also have

II01

it follows that

I¯>I01

If, on the other hand, G=0 along the whole curve AA1B, then this curve must consist of several cycloidal arcs ; since, if it were only one, the curves AA1B and AB would be identical. These arcs must have different tangents at the point where they come together ; for, since this point cannot lie on the X-axis, a consecutive point having the same direction must lie on the same cycloidal arc. If corners were present, however, they could be so rounded off that there would be a shorter path between the two points, and consequently, the velocity being the same, the time of falling would be shorter.

Hence the arc of a cycloid also minimizes the time of falling between A and B in the case where A is a singular point ; that is, when the material point starts from A with an initial velocity that is zero.

The conclusions just made are also applicable, if B is a singular point ; for it makes no difference whether the material point ascends from B to A or falls from A to B, if we allow the material point to go back with the same initial velocity with which it arrived at B. On the way back it will reach A with its original velocity. Its velocity will be the same in both cases at all points of the curve, but directed toward opposite directions. The integral taken over the curve has the same value in both cases ; and consequently the curve which caused the integral to have a minimum value will also, in the second case, minimize the integral.

Article 170.
The problem of the geodesic line on a sphere offers here nothing of special interest. It is found that the function retains a positive sign along the arc of a great circle situated between two poles.

Article 171.
Problem of the surface of revolution which offers the least resistance.

In this problem

F()x,y,x,y)=xx'3x'2+y'2

and since

p2+q2=1

it follows that

F(x,y,p,q)=xp3p2+q2=xp3
Fp=x(p4+3p2q2)Fq=2xp3q

Substituting these values in

(x,y,p,q,p¯,q¯)=F(x,y,p¯,q¯)p¯Fpq¯Fq

we have

(x,y,p,q,p¯,q¯)=x[p¯3p¯p22p¯p2q2+2q¯p3q]
=x[p¯(p¯2p2)+2p2q(pq¯p¯q)]
=x[p¯(p¯2(p2+q2)+2p2q(pq¯p¯q))+2p2q(pq¯p¯q)]
=x(p¯qpq¯)[p¯(p¯q+pq¯)2p2q]
=x(p¯qpq¯)[q¯(q¯qpq¯)(p2+q2)+2pp¯q¯(p2+q2)2p2q(p¯2+q¯2)]
=x(p¯qpq¯)[p¯(p¯qpq¯)(p2+q2)2p2p¯(p¯qpq¯)+2pqq¯(p¯qpq¯)]
=x(p¯qpq¯)[p¯(p2+q2)2p2p¯+2pqq¯]
=x(p¯qpq¯)[p¯(q2p2)+2pqq¯]

Writing

cosτ¯=p¯sinτ¯=q¯cosτ=psinτ=q

we have

(x,y,p,q,p¯,q¯)=xsin(τ¯τ)2cos(τ¯+2τ)

Therefore, the sign of is the same as that of cos(τ¯+2τ), and may be either positive or negative by properly choosing τ¯, an angle which depends upon p¯,q¯.

At every point of the curve for which x0 the function can have different signs, and consequently a maximum or a minimum value of the integral does not exist. We saw in Art. 109 that x must be different from zero for all points of the arc.

Article 172.
Legendre (Mimoire sur la manihre de distinguer les maxima des minima dans le Calcul des Variations) showed that by taking a zigzag line for the generating curve, the resistance could be made as small as we wish.

Suppose that the arc P1P2, had the desired property of generating a surface of least resistance, and suppose that the tangent to this curve is nowhere parallel to the X-axis. Writing p=dxdy, it follows that p0 along the arc P1P2.

We have then (Art. 108)

I1,2=t1t2xx'3x'2+y'2dt=x1x2p2x1+p2dx

Since p is finite and continuous along the arc in question, it follows that p21+p2 has the same properties along the arc, and therefore

I1,2=p02x1+p02x1x2xdx=p02x1+p02x22x122

where p0 is a mean value of p, lying between the points P1 and P2 of the curve.

Between the ordinates at P1 and P2 draw a line parallel to the Y-axis, and on this line take a point P3 whose ordinate is longer than those of the points P1 and P2. Draw the straight lines P1P3 and P2P3, and let p1 and p2 be the values of dxdy for these lines. The integral Fdt taken over the broken line P1P3P2 may be denoted by I13+I32, where

I13=x1x3p12x1+p12dx=p121+p12x32x122

and

I32=x3x2p22x1+p22dx=p221+p22x22x322

We have then

I132I12=I13+I32I12=p12(x32x12)2(1+p12)+p22(x22x32)2(1+p22)p02(x22x12)2(1+p02)

The first two terms of this expression may be made as small as we choose by sufficiently diminishing the quantities p1 and p2, which is done by removing indefinitely the point P3 along its ordinate. Hence, their sum is less than the third term, so that, consequently,

I132<I12

This result may also be derived as follows :

I13+I32I12=p12(1+p02)p02(1+p12)x32x12x22x12+p22(1+p02)p02(1+p22)x22x32x22x12<p12(1+p02)p02(1+p12)+p22(1+p02)p02(1+p22)

since x1<x3<x2.

Hence also for a greater reason

I13+I32I12=<(p12+p22)1+p02p02

From this it is seen that the ratio I13+I32I12 may be indefinitely dimmished by properly choosing p1 and p2. There is then no limit to the least possible resistance.

The method just given does not replace the -criterion which shows that no surface of minimal resistance exists. It shows simply that no rotational surface exists, which gives an absolute minimum of resistance—a resistance less than any other neighboring surface. The -criterion shows that no minimum exists in the sense of giving a resistance less than that given any neighboring curve within a limited neighborhood.

Article 173.
In the general case, when F(x,y,x,y) is a rational function of x and y, neither a maximum nor a minimum can exist. For in this case

=F(x,y,p¯,q¯)p¯Fpq¯Fq

is also a rational function of p¯ and q¯ and homogeneous in these quantities of the first degree. Consequently

(x,y,kp¯,kq¯)=k(x,y,x¯,y¯)

and therefore

(x,y,p¯,q¯)=(x,y,p¯,q¯)

It is thus seen that we have only to reverse the direction of the displacement to effect a change of sign in the function .

Article 174.
We have now completely solved the four problems that were proposed in Chapter I, and at the same time one of the principal parts of the Calculus of Variations has been finished. After stating succinctly the four criteria that have been established, we shall take up the second part, which has as its object the theoretical and practical solution of problems, a general type of which were the Problems V and VI of Chapter I.

These criteria may be summarized as follows (cf. Art. 125): There exists a minimum or a maximum value of the integral

I=t0t1F(x,y,x,y)dt

where F is a one-valued, regular function of its four arguments and homogeneous of the first degree in x and y, if

1) the differential equation G=0 is satisfied for every point of the curve;

2) F1 is positive or negative throughout the whole interval t0t1;

3) there are no conjugate points of the curve within the interval t0t1 (limits included);

4) the function is positive or negative throughout the whole interval t0t1

In this discussion we have excluded the cases where

1) the extremities of the curve are conjugate points;

2) F1=0 for some point of the curve;

3) F1=0 for some stretch of the curve;

4) =0 for some point or stretch of the curve.

A general treatment of the first three cases would require the extension of the theory to variations of a higher order. Otherwise particular devices must be employed in every example in which one of the above exceptional cases is found.

Article 175.
Before we begin the consideration of Relative Maxima and Minima, we may, at least, indicate the natural extensions and generalizations of the theory which has already been presented : Instead of the determination of a structure of the first kind[2] in the domain of two quantities, it may be required to determine a structure of the first hind in the domain of n quantities.

If a structure of the first kind is determined in the domain of the n quantities x1,x2,,x2, then n1 of these quantities may be expressed as functions of the remaining one, say, x1.

Writing

u=F(x1,x2,,xn,dx2dx1,dx3dx1,,dxndx1)dx1

it is seen that u is so connected with the n1 functions that

dudx1=F(x1,x2,,xn,dx2dx1,dx3dx1,,dxndx1)

The difference of the values of u at the initial-point and at the end-point of the structure is expressed by a definite integral.

This integral takes the form, when we consider the x's expressed as functions of t, say, x1=x1(t),x2=x2(t),,xn=xn(t),

I=t0t1F(x1,x2,,xn,x1,x2,,xn)dt

The function F must be a one-valued, regular function of its arguments in the whole or a limited portion of the fixed domain.

The value of the integral I is independent of the manner in which the variables x1,x2,,xn have been expressed as functions of t. It therefore follows after the analogon of Art. 68 that the function F is subjected to the further restriction :

kF(x1,x2,,xn,x1,x2,,xn)=F(x1,x2,,xn,kx1,kx2,,kxn)

where k is a positive constant.

The indicated generalization of the problem given in Art. 13 may accordingly be expressed as follows :

The n quantities x1,x2,,xn are to be determined as functions of a quantity t in such a manner that for the analytical structure that is defined through the equations

x1x1(t),x2=x2(t),,xn=xn(t)

the value of the integral

I=t0t1F(x1,x2,,xn,x1,x2,,xn)dt

is a maximum or a m.inim,um,; in other words, if one causes the above analytical structure to vary indefinitely little, the change in the integral thereby produced must in the case of a maximum be constantly negative, and in the case of a minimum it must be constantly positive. Further, the function F is to be considered a one-valued, regular function of its arguments, and indeed, with respect to x1,x2,,xn, a homogeneous function of the first degree.

Article 176.
The treatment of the above problem is found to be the complete analogon of the problem given in Art. 13. A greater complication arises when there are present equations of condition among the variables x1,x2,,xn. An example of this kind we had in Problem III of Chapter I.

This problem may be expressed thus : Among all the curves in space which belong to the surface

f(x,y,z)=0

determine that one for which the integral

t0t1x'2+y'2+z'2dt

is a minimum.

The general problem may be formulated as follows : Among the structures of the first kind in the domain of the quantities x1,x2,,xn, for which the m equations

fμ(x1,x2,,xn)=0μ=1,2,,m;m<n1

exist, that one is to be determined for which the integral

I=t0t1F(x1,x2,,xn,x1,x2,,xn)dt

is a maximum or a minimum.

This problem may be reduced to the one of the preceding Article in an analogous manner as is done in Art. 10 for the case of the shortest line upon a surface. The m equations of condition may be satisfied by introducing for the variables x1,x2,,xn functions of nm new variables after the method given in the Lectures on the Theory of Maxima and Minima, etc., Chapter I, Art. 15. The new variables are independent of one another, so that the above integral may be replaced by one in which the variables are free from extraneous conditions ; or we may proceed as was done in the Theory of Maxima and Minima where the variables are subject to subsidiary conditions (loc. cit., p. 54).

Article 177.
The more general problem of the Calculus of Variations, in so far as it has to do with the structures of the first kind, may be stated as follows:

Among the structures of the first kind in the domain of the n quantities x1,x2,,xn, for which definite equations of condition exist, not only among the n quantities themselves, but also among their first derivatives, that structure is to be determined for which the integral

I=t0t1F(x1,x2,,xn,x1,x2,,xn)dt

becomes a maximum or a minimum

It may be easily shown that the apparently more general case in which F is a function of x1,x2,,xn and of the first and higher derivatives of these quantities, is contained in the problem just stated. For the sake of simplicity, take the case where only two variables are involved and write

u=x1x2F(x,y,dydx,d2ydx2,)dx

If in this integral we express x and y as functions of t, we have

dydt=dydxxd2ydt2=d2ydx2x'2+dydxx

We may consequently change the integral u into

u=t0t1F(x,y,x,y,x,y,)dt

We further have

dxdtx=0dydty=0

We may therefore write

u=t0t1F(x,y,x1,y1,dx1dt,dy1dt)dt

with the equations of condition :

dxdt=x1dydt=y1

If, then, there appear in F only the first and second derivatives, it is seen that F depends upon the four functions x,y,x1,y1 which are to be determined, while at the same time the two equations of condition just written must be satisfied. One of the classes of problems belonging to the general problem just stated is the one which was formulated in Art. 17 and which is treated in the following Chapters.

Article 178.
It may be mentioned finally that the problem of the Calculus of Variations may be further generalized, if we require the determination of structures of a higher kind. For example, in the simplest case the three quantities x,y,z may be determined as functions of two independent variables u and v. We have then instead of the single integral the double integral

F(x,y,z,xu,yu,zu,xv,yv,zv)dudv

which must be a maximum or a minimum.

The treatment of this problem would give a theory of Minimal Surfaces.

Template:BookCat

References

Template:Reflist

  1. The shaded curves do not satisfy the differential equation G=0.
  2. See my Lectures on the Theory of Maxima and Minima of Functions of Several Variables, pp. IS and 86.