Calculus/Conic sections

From testwiki
Jump to navigation Jump to search

Template:Calculus/Top Nav

The three methods to intersect a cone with a plane. Note that the circle is a special form of an ellipse.

Conic sections are the intersections of a surface of a cone and a plane. There are three ways to intersect. The first method is to intersect the cone vertically, which the intersection will yield a hyperbola. The second method is to intersect the cone parallel to the outermost line of the cone, which will yield a parabola. The third method is to intersect the cone horizontally or slightly horizontally, which will yield an ellipse. For more information, see Conic section. If you have knowledge on this particular subject, you can help expand here.

In future chapters, you will encounter more about conic sections. As you progress into polar coordinates, parametric equations, and three-dimensional quadric surfaces, conic sections will again be a difficult subject to discuss. In this chapter, we will only talk about the basic Cartesian-coordinate conic sections.

Standard equations

Ellipse

Ellipses are shapes that have an interesting property. In order to find the standard equation for the ellipse, we must know what an ellipse is. Apart from the intersection of a inclined plane and the surface of a cone, there is another way to construct an ellipse. Template:Calculus/Def The image on the right is the graph of an ellipse. If there is a point P=(x,y), then PF1+PF2=C, where C is a constant.

Knowing the defining characteristic of an ellipse, we can start find the equation.

Ellipse: notations

Derivation

To make things simple, let's set the center of the ellipse on the origin and the points F1 and F2 on the x-axis. (see image on the right)

Since

PF1+PF2=C

and

F1=(c,0),F2=(c,0),P=(x,y)

, using the distance formula, we can get:

PF1=(xc)2+(y0)2=x22xc+c2+y2

PF2=(x+c)2+(y0)2=x2+2xc+c2+y2

PF1+PF2=x22xc+c2+y2+x2+2xc+c2+y2

Now for the constant C. Let the length of the semi-major axis be a and the length of the semi-minor axis be b. Imagine that point P is now at the vertex, so P0=(a,0). At this particular point,

P0F1=ac

P0F2=a+c

C=P0F1+P0F2=2a

Because the definition says that PF1+PF2=C for any P, we can safely say that C=2a. Thus, we can start solving the equation.

x22xc+c2+y2+x2+2xc+c2+y2=2a

x22xc+c2+y2=2ax2+2xc+c2+y2

x22xc+c2+y2=4a24ax2+2xc+c2+y2+x2+2xc+c2+y2 (Square both sides)

4ax2+2xc+c2+y2=4a2+4xc (Simplify it)

ax2+2xc+c2+y2=a2+xc

a2(x2+2xc+c2+y2)=a4+2a2xc+x2c2 (Square both sides)

a2x2+2a2xc+a2c2+a2y2=a4+2a2xc+x2c2

a2x2c2x2+a2y2=a4a2c2 (Simplify it)

x2(a2c2)+a2y2=a2(a2c2) (Factor it)

Finally, the equation is

x2a2+y2a2c2=1

This should be the equation. But a2c2 can be further simplified. To do so, imagine again that our point P is on the co-vertex, so P1=(0,b). Thus,

P1F1=c2+b2

P1F2=c2+b2

C=P1F1+P1F2=2c2+b2

Since we have already established that C=2a, we can write down an equation that links a,b,c together:

2c2+b2=2a

c2+b2=a2

a2c2=b2

Now we substitute a2c2 as b2, we get the final result:

x2a2+y2b2=1

This equation is the standard form of the ellipse. It is considered standard because all key points are on the axes.

Terminology and properties

We already derived the equation. So, the terminology and properties will be based on that.

x2a2+y2b2=1
  • Focus (plural: foci): the points F1,F2 which have coordinates (c,0),(c,0) respectively. The definition gives those points their function: PF1+PF2=2a.
  • Semi-major axis: the axis with length a.
  • Semi-minor axis: the axis with length b, b<a.
  • Vertex (plural: vertices): the endpoint of the semi-major axis. It has the coordinate (±a,0).
  • Co-vertex: the endpoint of the semi-minor axis. It has the coordinate (±b,0), b<a.
  • Center: the middle point between the two foci. It has the coordinate (0,0).

Note that any changes towards the equation will change the coordinates for the key values. The coordinates above are strictly based upon the standard equation of the ellipse.

In the derivation, we stumbled upon a property, which is the relationship between the constants a,b,c. The property is c2=a2b2. In the derivation of the equation of the hyperbola, we will encounter this property again. However, it will be slightly different because of the signs. In the ellipse, a>c, so the property c2=a2b2 ensures that a2>0,b2>0,c2>0. In the hyperbola, as we will see, c>a, which means the length of the foci to the center is larger then that of the vertices to the center. In order to ensure that a2>0,b2>0,c2>0 for convenience, the property will be slightly adjusted.

Transformations

If we want the ellipse to be more "vertical" instead of "horizontal", the equation of the ellipse needs to be changed. To be more "vertical", the foci of the ellipse should be on the

y

-axis, having coordinates

(0,±c)

. Using the same method for derivation, we get:

x2b2+y2a2=1

If we want the ellipse to translate (move without rotating) in the plane, using what we've learned in Chapter Template:Calculus/map page, we can modify the equation into:

(xm)2a2+(yn)2b2=1

, where

(m,n)

is the center of the ellipse.

Parabola

Parabolas can be interpreted as the more general form of the quadratic function. However, they are essentially different. While quadratic function is a function which describes a relationship between a variable and another, parabola is a curve in 2D space. Template:Calculus/Def To make things simple for derivation, we put the point F on the y-axis and the vertex of the parabola on the origin (see image on the right), so F=(0,p) Now, imagine that point P is on the vertex, so P0=(0,0). Because PF=Pl and PF=p, the equation for line l is l:y=p.

Now, we can start deriving the standard equation for the parabola.

Part of a parabola (blue), with various features (other colours). The complete parabola has no endpoints. In this orientation, it extends infinitely to the left, right, and upward.

Derivation

Since we know that

F=(0,p)

,

l:y=p

, and

P=(x,y)

, we can solve

PF=Pl
PF=(x0)2+(yp)2=x2+y22yp+p2 Pl=y+p

So the equation

PF=Pl

can be turned into

x2+y22yp+p2=y+p

x2+y22py+p2=y2+2py+p2 (square both sides)

x2=4py

The standard equation for a parabola is

x2=4py

In Chapter Template:Calculus/map page, we talked about the various forms of quadratic functions, which if viewed as a geometric curve, is a parabola. To make the standard equation more familiar, we can adjust it to

y=14px2

Terminology and properties

Similar to the ellipse, we will use the standard equation to demonstrate key terms in the parabola.

y=14px2
  • Focus: the point F with coordinates (0,p).
  • Directrix: the line l with the equation y=p.
  • Vertex, the lowest point on the parabola with coordinates (0,0).

Recall that the vertex form of the quadratic function is y=a(xh)2+k. We can find that 14p=a.

Transformations

If we want the curve to face horizontally (the axis of symmetry is the

x

-axis instead of the

y

-axis), we change the coordinates for

F

to

F=(p,0)

and the line to

l:x=p

. After some calculations, we get:

x=14py2

If we want to translate the parabola in the plane, using what we've learnt, we get:

yk=14p(xh)2

, where the vertex is

(h,k)

.

This resembles the vertex of the quadratic function

y=a(xh)2+k

. However, it is important to realize that parabolas and quadratic functions are fundamentally different. One is a geometric curve while the other is a transformation between two variables.

Hyperbola

The relationship between the hyperbola and the reciprocal function is similar to that between the parabola and the quadratic function. Template:Calculus/Def Again, to make things easier, we put the points F1,F2 on the x-axis and the center on the origin, so F1=(c,0),F2=(c,0). Also, the length between the vertex and the center is a. See the image on the right for more clarification.

The hyperbola with key values.

Derivation

Since we know that

F1=(c,0),F2=(c,0)

, we can solve for

PF1 and PF2

.

PF1=(xc)2+y2=x22xc+c2+y2

PF2=(x+c)2+y2=x2+2xc+c2+y2

|PF1PF2|=|x22xc+c2+y2x2+2xc+c2+y2|

Similar to the ellipse derivation case, we need to turn the constant

C

into something in terms of either

a

or

c

. In this case, imagine point

P

is on the vertex, so

P0=(a,0)

.

P0F1=ca

P0F2=a+c

|P0F1P0F2|=|(ca)(a+c)|=2a

So,

C=2a

. Now, we derive.

|x22xc+c2+y2x2+2xc+c2+y2|=2a

Let's assume that PF2>PF1, which means the curve is on the positive x-axis, and

x2+2xc+c2+y2x22xc+c2+y2=2a
x2+2xc+c2+y2=2a+x22xc+c2+y2

x2+2xc+c2+y2=4a2+4ax22xc+c2+y2+x22xc+c2+y2 (square both sides)

ax22xc+c2+y2=a2+xc

a2x22a2xc+a2c2+a2y2=a42a2xc+x2c2 (square both sides)

x2(a2c2)+a2y2=a2(a2c2)

x2a2+y2a2c2=1

For convenience, we substitute

c2a2

as

b2

because looking at the graph,

c>a

, and we want our constants to be positive. So

x2a2y2b2=1

What about when PF2<PF1? That means the curve is on the negative x-axis, and

x22xc+c2+y2x2+2xc+c2+y2=2a

Using the same method, we get:

x22xc+c2+y2=2a+x2+2xc+c2+y2

x22xc+c2+y2=4a2+4ax2+2xc+c2+y2+x2+2xc+c2+y2

ax2+2xc+c2+y2=a2+xc

a2x2+2a2xc+a2c2+a2y2=a4+2a2xc+x2c2

x2(a2c2)+a2y2=a2(a2c2)

x2a2+y2a2c2=1

Because we let

c2a2=b2

, so

x2a2y2b2=1

All possibilities have been discussed, and we can safely say that the equation of a hyperbola is

x2a2y2b2=1

Terminology and properties

x2a2y2b2=1

  • Focus (plural: foci): the points F1=(c,0),F2=(c,0).
  • Vertex (plural: vertices): the endpoint of the semi-major axis. It has coordinates (±a,0).
  • Semi-major axis: the axis that has a length of a.
  • Asymptote: the line where the hyperbola is approaching but never intersects.

We briefly discussed the property of the hyperbola when talking ellipses. In that discussion, we said that we have to change the property slightly to make sure that all constants are positive. In this case, since c>a, instead of the ellipse's c2=a2b2, we have c2=a2+b2.

The asymptote is a little bit difficult to calculate because we need limits to find it.

Since

x2a2y2b2=1

, the relationship between

y

and

x

is:

y=±b2a2(x2a2)=±ba(x2a2)

Thus, the slope of the line passing through a point on the hyperbola and the center is:

m=ΔyΔx=±bax2a2x

The asymptote is the line where the hyperbola is approaching but never intersects. So, we can imagine that

x

is infinitely large to a point that it intersects the asymptote (see Chapter Template:Calculus/map page and Template:Calculus/map page for more).

±bax2a2x

(where

x

is infinitely large)

=±ba

Formally, if we want to express "where

x

is infinitely large", we write it like this:

limx±bax2a2x=±ba

. We will discuss this expression in the next unit. The slope of the asymptote is calculated to be

±ba

.

As a result, the asymptote equation for the hyperbola is

y=±bax

Transformations

If we want the hyperbola to face north-south instead of east-west, changing the coordinates of key points and using the same method for derivation will yield:

y2a2x2b2=1

If we want to translate the hyperbola in the plane, the equation will become:

(xm)2a2(yn)2b2=1

, where the center is

(m,n)

and the asymptote is

yn=±ba(xm)

Axes rotation

The rotation of the x and y axes

When we start rotating curves like conic sections, it is extremely difficult to intuitively visualize the process of rotating curves: we are more used to translation than rotation. So, instead of rotating the curve back to something like (xm)2a2(yn)2b2=1, we rotate the axes to make the curve look like (xm)2a2(yn)2b2=1. Note that x,y are the rotated coordinates.

An xy-Cartesian coordinate system rotated through an angle θ to an x'y'-Cartesian coordinate system

In order to do so, we need to find out the relationship between the pre-rotation axes and the post-rotation axes. In other words, suppose there is a point with coordinates (x,y). After the rotation, the coordinates are (x,y). Express (x,y) in terms of x,y.

Now, we need to establish some coordinates and some of the properties.

  1. The coordinate of the point is (x,y) in a x0y plane
  2. The distance between the point and the origin is r
  3. The angle between the line segment connecting the point and the origin and the positive x -axis is α
  4. The axes rotated θ counterclockwise to establish a new plane: x0y
  5. Bullet points 3 and 4 can help us know that the angle between the line segment connecting the point and the origin and the positive x -axis is αθ

Now, we try to evaluate

(x,y)

According to trigonometry (see Chapter Template:Calculus/map page):

x=rcos(αθ)=rcosαcosθ+rsinαsinθ

y=rsin(αθ)=rsinαcosθrcosαsinθ

We can also evaluate

x,y

in terms of

r,α

:

x=rcosαy=rsinα

Then substitute into the first two equations, we get:

x=xcosθ+ysinθ

y=ycosθxsinθ

Finally, the coordinate for the point

(x,y)

after the rotation is

(xcosθ+ysinθ,ycosθxsinθ)

.

The inverse transformation is x=xcosθysinθ,y=xsinθ+ycosθ.

General Cartesian form

Template:Calculus/Def The best way to get an understanding of the general form is to look at the following example. It covers most of the content in this chapter and is relatively difficult.

Example: Find the foci for this conic section with the equation 7x22xy+7y2+62x+62y18=0

To find the foci, we need the standard form. However, this is the general form. To make things worse, this is a rotated curve, which makes it impossible for humans to factor the equation into the translated standard form.

We know that the curve is rotated because after factoring, there is a (ax+by)2 factor. And the translated standard form does not have that particular factor. So, we should start rotating the axes so that after the rotation, the Bxy part can be cancelled.

First, we should list down what we know.

A=7,B=2,C=7,D=62,E=62,F=18

Then, we start rotating the axes.

Assume that we've rotated the axes

x0y

into

x0y

by rotating the axes by

θ

counterclockwise. The new rotated equation should be

Ax'2+Bxy+Cy'2+Dx+Ey+F=0

Since after factoring the equation, we don't any (ax+by)2 factor, we need to make Bxy=0. To make it simpler, we just need to make B=0.

Then, we need to express

B

in terms of the constants

A,B,C,D,E,F

, and

θ

so that we can know how much we should rotate the axes.

x=xcosθysinθ,y=xsinθ+ycosθ Ax2+Bxy+Cy2+Dx+Ey+F=0

After substituting, we get:

A(xcosθysinθ)2+B(xcosθysinθ)(ycosθ+xsinθ)+C(ycosθ+xsinθ)2+D(xcosθysinθ)+E(ycosθ+xsinθ)+F=0

After a lot of calculations and algebraic manipulations, we get:

A=Acos2θ+Bcosθsinθ+Csin2θ

B=(A+C)2cosθsinθ+B(cos2θsin2θ)

C=Asin2θBcosθsinθ+Ccos2θ

D=Dcosθ+Esinθ

E=EcosθDsinθ

F=F

We want

B=0

.

B=(A+C)2cosθsinθ+B(cos2θsin2θ)=0

Recall the trigonometric identity: double-angle formulae that

2cosθsinθ=sin2θ,cos2θsin2θ=cos2θ

So, the equation can be turned into

cot2θ=A+CB

For convenience, we will just solve for the simplest value.

θ=π4

Knowing the rotation angle and the other constants, we can solve for the new constants.

A=7cos2π42cosπ4sinπ4+7sin2π4=6

B=(7+7)2cosπ4sinπ42(cos2π4sin2π4)=0

C=7sin2π4+2cosπ4sinπ4+7cos2π4=8

D=62cosπ4+62sinπ4=12

E=62cosπ462sinπ4=0

F=18

The new rotated equation for the curve is

6x'2+8y'2+12x18=0

We can now factor the equation into the translated standard form.

6x'2+8y'2+12x18=0

6x'2+12x+6+8y'2=24

6(x+1)2+8y'2=24

(x+1)24+y'23=1

This is the standard equation for the ellipse. Recall that in an ellipse,

c2=a2b2

. Finally, we can determine the rotated coordinates of the foci.

The rotated coordinate of the center is

(1,0)

Because c2=a2b2, c=1.

The rotated coordinates of the foci are

F1=(2,0),F2=(0,0)

Now, we rotate the axes back to the original state.

x=xcosθysinθ,y=xsinθ+ycosθ

F1=(2,2)

F2=(0,0)

Therefore, the foci are

F1=(2,2)

and

F2=(0,0)

.

Template:Calculus/Top Nav

Template:BookCat