Calculus of Variations/CHAPTER V

From testwiki
Revision as of 02:00, 27 October 2017 by imported>JackBot (Formatting, {{BookCat}})
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

CHAPTER V: THE VARIATION OF CURVES EXPRESSED ANALYTICALLY. THE FIRST VARIATION.

  • 74 General forms of the variations hitherto employed.
  • 75 The functions ξ and η. Their continuity.
  • 76 Neighboring curves. The first variation.
  • 77 The functions G, G1 and G2.
  • 78 Proof of an important lemma.
  • 79 The vanishing of the first variation and the differential equation G=0.
  • 80 The curvature expressed in terms of F and F1.
  • 81 The components wN and wT in the directions of the normal and the tangent.
  • 82 Variations in the direction of the tangent and in the direction of the normal.
  • 83 Discontinuities in the path of integration. Irregular curves.
  • 84,85 Problem of Euler illustrating the preceding article.
  • 86 Summary.

Article 74.
In Chapter II we considered examples of special variations. The method followed provided for the displacement of a curve in one direction only, in the direction parallel to the X-axis, and is consequently applicable only to the comparison of integrals along curves obtained from one another by such a deformation.

We shall now give a more general form to the variations employed and shall seek strenuous methods for the solution of the general problem of variations proposed in Chapter I. After deriving the necessary conditions we shall then proceed to discuss the sufficient conditions. In order to develop the conditions for the appearance of a maximum or a minimum of the integral

1)I=t0t1F(x,y,x,y)dt,

it is necessary to study more closely the conception of the variation of a curve and fix this conception analytically.

By writing instead of each point x, y of a curve (presupposed regular) another point x+ξ, y+η, we transform the first curve into another regular curve. This second curve is neighboring the first curve if we make sufficiently small the quantities ξ and η which like x and y we consider as one-valued continuous functions of t.

Article 75.
The following is one of the methods of effecting this result. Let ξ and η be continuous functions of t and also of a quantity k. We further suppose that ξ and \eta vanish when k=0 for ever value of t, for example

ξ=ku(t);η=kv(t),

u and v being finite and continuous functions of t.

The functions u and v and consequently also ξ and η are subject to further conditions. It is in general required to construct a curve between two given points which first may be regarded as fixed. Later the condition of their variability may be introduced. Consequently we have to consider only such values of t that ξ, η and consequently u, v vanish on the limits.

If for x, y we write x+ξ, y+η, then for x, y we must write x+ξ, y+η. Further, the function F(x+ξ,y+η,x+ξ,y+η) must be developed in powers of ξ, η, ξ and η. For the convergence of this series, it is necessary that ξ, η, ξ and η have finite values.

Now, if we write

ξ=ksintkn;η=kcostkn,

then we have

dξdt=k1ncostkn;dηdt=k1nsintkn,

so that, whereas ξ and η have infinitely small values for infinitely small values of k, the quantities dξdt, dηdt vacillate for n1 between +1 and 1 and become infinite for n>1. We shall consequently consider only such special variations in which u and v are functions of t alone, and which with their derivatives are finite and continuous between the limits t0 and t1. We thus restrict, in a great measure, the arbitrariness of the indefinitely small variations of the curve, and thus exclude a great many neighboring curves from the discussion. However, there exist among all the possible neighboring curves also such which satisfy the above conditions, and with these we shall first establish the necessary conditions and later show that the necessary conditions thus established are also sufficient for the establishment of the existence of a maximum or a minimum value of the integral. (See Arts. 134 et seq.)

Article 76.
We have instead of one neighboring curve a whole bundle of such curves if we make the substitutions

xx+ϵξ, yy+ϵη, xx+ϵξ, yy+ϵη,

and let ϵ, a quantity independent of the variables in F(x,y,x,y), vary between +1 and 1.

The total variation that is thereby introduced in the integral of the preceding article is

I+ΔI=t0t1F(x+ϵξ,y+ϵη,x+ϵξ,y+ϵη)dt,

which developed by Maclaurin's Theorem is

=t0t1(F+ϵ[Fxξ+Fyη+Fxξ+Fyη]+ϵ2())dt.

Further (see Art. 25)

ΔI=ϵδI+ϵ22!δ2I+;

hence, equating coefficients of ϵ,

2)δI=t0t1[Fxξ+Fyη+Fxξ+Fyη]dt.

But

t0t1Fxdξdtdt=[Fxξ]t0t1t0t1ξddt(Fx)dt;

So that 2) becomes

2)δI=t0t1([Fxddt(Fx)]ξ+[Fyddt(Fy)]η)dt+[Fxξ+Fyη]t0t1;

or

3)δI=t0t1(G1ξ+G2η)dt+[Fxξ+Fyη]t0t1,

where

G1=Fxddt(Fx), G2=Fyddt(Fy).

Article 77.
Owing to the hypotheses that x, y, dξdt, dηdt vary in a continuous manner with t [that is, within the portion of curve considered no sudden change enters in the direction of the curve] , the first variation of the integral I may be transformed in a remarkable manner.

We had

>G1=Fxddt(Fx)

and also (Art. 72)

F(x,y,x,y)=xFx+yFy.

Therefore

Fx=x2Fxx+y2Fyx;

and differentiating Fx with respect to t, we have

ddt(Fx)=2Fxxdxdt2Fyxdydt2Fx'2dxdt2Fyxdydt.

Hence,

G1=y(2Fxy2Fyx)(2Fx'2dxdt+2Fyxdydt).

Writing 2Fx'2=y'2F1, (Art. 73 ), and 2Fyx=xyF1, and defining G by the equation

G=2Fxy2FyxF1(ydxdtxdydt),

it is seen that

G1=yG.

In a similar manner it may be shown that

G2=xG.

Article 7.
Lemma. If ϕ(t) and ψ(t) are two continuous functions of t between the limits t0 and t1 and if the integral

t0t1ϕ(t)ψ(t)dt

is always zero, in whatever manner ψ(t) is chosen, then necessarily ϕ(t) must vanish for every value of t between t0 and t1.

The following proof, due to Prof. Schwarz, is a geometrical interpretation of a method due to Heine.[1]

Suppose it possible that the function ϕ(t) has a finite value for a point t=t situated between t0 and t1. Then owing to the continuity of ϕ(t) we can find an interval tdt+d within which ϕ(t) also has a finite value.

We write the integral in the form

t0t1ϕ(t)ψ(t)dt=t0tdϕ(t)ψ(t)dt+tdt+dϕ(t)ψ(t)dt+t+dt1ϕ(t)ψ(t)dt.

The second integral on the right hand side may be written

M=tdt+dψ(t)dt,

where M is the mean value of ϕ(t) for a value of t within the interval tdt+d.

We shall show that it is possible to determine a function ψ(t) which will render this integral positive and greater than the sum of the first and third integrals in the above expression, while at the same time ψ(t0)=psi(t1)=0.

Let us form the equation

(t1+x2d2)y=0,

which represents the parabola y=1x2d2 and the X-axis.

Consider next the equation

(t1+x2d2)y=ϵ2,

where ϵ is a small quantity. By taking ϵ sufficiently small, this curve can be made to approach as near as we wish the parabola and the X-axis.

Solving the above equation for y, we have as the two roots (two branches)

y=12(1x2d2)±14(1x2d2)2+ϵ2.

The branch

y=12(1x2d2)+14(1x2d2)2+ϵ2

is symmetrical with respect to the Y-axis, and for values of x, such that dx+d, the ordinate of any point of the curve is greater than the corresponding ordinate of the parabola.

For the parabola y=1x2d2, the integral

d+dydx=43d

It follows then that for the curve we must have

d+d[frac12(1x2d2)+14(1x2d2)2+ϵ2]dx>43d;

for |x|=d, we have y=ϵ; and from the inequality

y=frac12(1x2d2)+14(1x2d2)2+ϵ2<frac12(1x2d2)+14(1x2d2)2+ϵ,

it follows for |x|<d, that y is positive and <ϵ. For the lower branch y is negative and the curve

y=frac12(1x2d2)14(1x2d2)2+ϵ2

follows the parabola to infinity as shown in the figure. It is however the upper branch which we use, since y is less than ϵ, as soon as x passes the value d on either side of the origin.

Instead of the integral last written, take the integral which has the same value

tdt+d(12(1(tt)2d2)+14(1(tt)2d2)2+ϵ2)dt>43d.

Writing

χ(t)=12(1(tt)2d2)+14(1(tt)2d2)2+ϵ2,

we have

t0t1ϕ(t)χ(t)dt=t0tdϕ(t)χ(t)dt+tdt+dϕ(t)χ(t)dt+t+dt1ϕ(t)χ(t)dt
=Mt0tdχ(t)dt+Mtdt+dχ(t)dt+Mt+dt1χ(t)dt
=M[<ϵ](tdt0)+M[>43d]+M[<ϵ](t1td),

where M, M and M are mean values of ϕ(t) in the respective intervals and where [<ϵ] denotes that the quantity that stands within the brackets is less than ϵ.

It is seen that by taking ϵ sufficiently small that the sign of the integral t0t1ϕ(t)χ(t)dt is determined by that of M43dand consequently this integral is different form zero.

Instead of the function χ(t), write

ϕ(t)=(tt0tt0)m(t1tt1t)nχ(t),

where m and n are positive integers.

We see that ϕ(t0)=0=ϕ(t1), and as above, it follows that

t0t1ϕ(t)ψ(t)dt0.

Hence, on the supposition that ϕ(t)0 for a point of the curve alonjj which we integrate, it follows that a function ψ(t) can be found which causes the above integral to be different from zero.

But as this integral was supposed to be zero for all functions ψ(t), it follows that we must have ϕ(t)=0 for all values of t between t0 and t1.

Article 79.
In the expression

ΔI=ϵδI+ϵ22!δ2I+,

unless <math\delta I</math> and ϵ always retain the same sign, it is necessary that δI be zero in order that ΔI be continuously negative or continuously positive; i.e., in order that the integral I be a maximum or a minimum (see Art. 26).

Substitute for G1 and G2 their values in terms of G from Art. 77, in the expression for δI of Art. 76, and we have

δI=t0t1G(tξxη)dt+[ξFx+ηFy]t0t1.

If we suppose that the points P0 and P1 are fixed so that ξ=0=η for them, then the boundary terms vanish. Further, since yξxη is an arbitrary and continuous function of t, it follows from the above lemma that, in order for δt to be zero, G=0 for every point of the curve within the interval t0t0. G can not have a finite value different from zero for isolated points on the curve, since this portion of curve must be continuous in order that the integral may have a meaning.

The differential equation G=0 of the second order is a necessary condition for a maximum or a minimum value of I, and will afford the required curve if such curve exists. We note that it is independent of the manner of variation, as the quantities ξ and η do not appear in it.

From the relation (see Art. 77)

δI=t0t1(G1ξ+G2η)dt=0,

it follows that

t0t1G1ξdt+t0t1G2ηdt=0.

Among all possible variations there are those for which η=0, and consequently

t0t1G1ξdt=0.

As above we have then

G1=0,

and similarly

G2=0.

Further, if we multiply yG=G1 and xG=G2 respectively by v and x, we have by addition

(y'2+x'2)G=0,

and as x and y cannot both vanish simultaneously, it follows that G=0. Hence the equations G1=0, G2=0 on the one hand, and G=0 on the other, are necessary consequences of one another.

The equations G1=0=G2 often more convenient than G=0; especially is this the case if the function F does not contain explicitly one of the two quantities x and y. If x, for instance, is wanting, then Fx=0, and from

G1=FxddtFx=0,

it follows that Fx= constant.

Article 80.
The curvature at any point of a curve is denoted by

κ=1ρ=xyyx[x'2y'2]3/2,

and owing to the equation

G=0=2Fyx2FxyF(xyxy),

we have

1ρ=2Fyx2Fxy[x'2+y'2]3/2F1,

an expression which depends upon x,y,x,y alone and not upon the higher derivatives.

It is thus seen that through the equation G=0, a definite relation is expressed between the curvature of a curve at a point, the coordinates and the direction of the tangent of the curve at this point.

Article 81.
Let a point P on the curve be transformed by a variation into the point P and let the displacement pp be denoted by v; further, let the components in the x and y directions be ξ and η, while wN and wT denote the components of this displacement in the direction of the normal and the tangent to the curve at the point P.

Let λ denote the angle between these directions and let the direction cosines of the normal be denoted by

dx(dxdt)2+(dydt)2 and dy(dxdt)2+(dydt)2; i.e., by xs and ys.

Then from analytical geometry,

wT=ξcos(λ)+ηsin(λ)=xξ+yηx'2+y'2,

and therefore

wN=ηcos(λ)ξsin(λ)=xηyξx'2+y'2;

and

wT2+wN2=ξ2+η2.

These expressions substituted in the formula for δI (Art. 79) give

δI=t0t1wNGds+[FwTx'2+y'2+xFyyFxx'2+y'2]t0t1.

From this it is seen that only the component of the variation which is in the direction of the normal enters under the integral sign.

Article 82.
By means of the formula below we will prove that the variation in the direction of the tangent brings forth only such terms for the first variation that are free from, the sign of integration.

As in Art. 76, write

δI=t0t1(Fxξ+Fxξ+Fyη+Fyη)dt,

and, substituting in this expression ξ=vdxds, η=vdyds, we have

δI=t0t1[v(Fxdxds+Fydyds)+Fxddt(vdxds)+Fyddt(vdyds)]dt.

Noting that

t0t1Fxddt(vdxds)dt=[Fxvdxds]t0t1t0t1vdxdsddt(Fx)dt,

it is seen that

δI=t0t1(vdxds[Fxddt(Fx)]+vdyds[Fyddt(Fy)])dt+[Fxvdxds+Fyvdyds]t0t1
=t0t1G(yξxη)dt+[Fxvdxds+Fyvdyds]t0t1.

But

tξxη=yvdxdsxvdyds=vds[ydxdtxdydt]dt0,

so that everything under the sign of integration drops out, leaving

δI=[Fxvdxds+Fyvdyds]t0t1.

Hence, if we make a sliding of the curve by the substitution

xx+ϵξyy+ϵη,

and resolve this sliding into two components, of which the one is parallel to the direction of the tangent and the other is parallel to the direction of the normal, then the result of the sliding in the direction of the tangent is seen only in the terms which have reference to the limits, and all these terms are exact differentials under the sign of integration, while the effect due to a sliding in the direction of the normal is shown in the formula of the preceding article.

Article 83.
The expression for the first variation has been obtained on the hypothesis that the elements of integration have for each point of the path of integration a one-valued meaning. In case the path involved, discontinuities, it could be resolved into a finite number of portions of regular curve, and along each portion δI would have a meaning similar to that of the preceding article. It is assumed, then, that the required curve, which is to furnish a maximum or a minimum value of the integral, is regular in its whole trace, or, at least, that it consists of regular portions of curve. In the latter case we shall at first limit ourselves to the consideration of one such portion. Within this portion of curve not only x,y but also x,y will be one-valued functions of t. This assumption is already implicitly contained in the assumption of the possibility of the development of ΔI by Taylor's Theorem; for otherwise the derivatives of F according to x and y for the curve which has to be developed could not be formed.

That these assumptions have been made is due to the fact that otherwise the curve could not be the object of a mathematical investigation, since there are no methods of representing irregular curves in their entire generality. If therefore one is contented with the rules of differentiation and integration, he must extend these considerations over only such functions to which the rules may be applied, that is, to the functions having the properties above. There are many problems in geometry and mechanics for which the above hypotheses cannot be made.

Article 84.
The following problem proposed by Euler illustrates what has just been said:

Required a curve connecting two fixed points such that the area between the curve, its evolute and the radii of curvature at its extremities may be a minimum.

The analytical solution of this problem is the arc of a cycloid, if indeed there exists a minimum. We shall now show that such is not the case. For join the two fixed points A and B by a straight line which divide into n equal parts, and draw alternately

above and below the line n semi-circles having the n parts of the line as diameters.

All the radii of curvature of each semi-circle, i.e., of each portion of curve which is to be a minimum, intersect on the line AB and it is evident that

n2π(AB2n)2=πAB28n

must be a minimum.

If we increase the number n sufficiently, we may make the above expression become arbitrarily small; and in the limit n= the curve will tend to become the straight line AB. From this it is evident that there is not present a minimum surface area.

Article 85.
The same result would have been obtained, if instead of the straight line AB we had taken the arc of a cycloid through these points, and had then drawn a system of small cycloids having their cusps along the large cycloid. (See Todhunter, Researches in the Calculus of Variations, p. 252.) The reason that a minimum is not given through the large cycloid is due to the fact that such a minimum is ofiEered by an irregular curve, and that this irregular curve is not included in our analytical research.[2] It follows that our assumption made regarding the regularity of the curve is out of place and leads to something untrue.

But in spite of the not improbable possibility that the curve which is to satisfy a given proposition is irregular, we must make the hypothesis that the curve is regular, since we come to analytical difEerential equations only by limiting our investigations to such regular curves, and the most general theory of functions teaches that in turn through these diflEerential equations are defined the analytical functions which in their whole extent have existing derivatives.

Article 86.
To avoid any misunderstanding, we repeat what we have already said in the previous Chapter: it is not asserted that there is anything in the nature of the problem whereby one may a priori conclude that the required curve must be regular. Having these hypotheses, we fix our ideas and draw deductions. After the solution of the problem has been effected, we have to make in addition a special proof that the derived curve has all the required properties, and that this curve is the only one which has them.

The chief difficulty in all such problems, as we have shown above in the special problem of approximation (or of the passing to a limit), consists in showing that the regular curve that has been found, found indeed from the necessary conditions, also at the same time satisfies the sufficient conditions, and therefore satisfies all the requirements of the problem.

Template:BookCat

  1. Heine, Crelle, bd. 54, p. 338.
  2. See also Moigno et Liadelof , Calcul de Variations, p. 252.