Chapter 1 begins the study of ordinary differential equations (ODEs) by deriving them from physical or other problems (modeling), solving them by standard mathematical methods, and interpreting solutions and their graphs in terms of a given problem. The simplest ODEs to be discussed are ODEs of the first order because they involve only the first derivative of the unknown function and no higher derivatives. These unknown functions will usually be denoted by y(x) or y(t)when the independent variable denotes time t. The chapter ends with a study of the existence and uniqueness of solutions of ODEs in Sec. 1.7.
Understanding the basics of ODEs requires solving problems by hand (paper and pencil, or typing on your computer, but first without the aid of a CAS). In doing so, you will gain an important conceptual understanding and feel for the basic terms, such as ODEs, direction field, and initial value problem. If you wish, you can use your Computer Algebra System (CAS) for checking solutions.
COMMENT. Numerics for first-order ODEs can be studied immediately after this chapter. See Secs. 21.1–21.2, which are independent of other sections on numerics.
Prerequisite: Integral calculus.
Sections that may be omitted in a shorter course: 1.6, 1.7.
References and Answers to Problems: App. 1 Part A, and App. 2.
If we want to solve an engineering problem (usually of a physical nature), we first have to formulate the problem as a mathematical expression in terms of variables, functions, and equations. Such an expression is known as a mathematical model of the given problem. The process of setting up a model, solving it mathematically, and interpreting the result in physical or other terms is called mathematical modeling or, briefly, modeling.
Modeling needs experience, which we shall gain by discussing various examples and problems. (Your computer may often help you in solving but rarely in setting up models.)
Now many physical concepts, such as velocity and acceleration, are derivatives. Hence a model is very often an equation containing derivatives of an unknown function. Such a model is called a differential equation. Of course, we then want to find a solution (a function that satisfies the equation), explore its properties, graph it, find values of it, and interpret it in physical terms so that we can understand the behavior of the physical system in our given problem. However, before we can turn to methods of solution, we must first define some basic concepts needed throughout this chapter.
An ordinary differential equation (ODE) is an equation that contains one or several derivatives of an unknown function, which we usually call y(x) (or sometimes y(t) if the independent variable is time t). The equation may also contain y itself, known functions of x (or t), and constants. For example,
are ordinary differential equations (ODEs). Here, as in calculus, y′ denotes dy/dx, yn = d2y/dx2, etc. The term ordinary distinguishes them from partial differential equations (PDEs), which involve partial derivatives of an unknown function of two or more variables. For instance, a PDE with unknown function u of two variables x and y is
PDEs have important engineering applications, but they are more complicated than ODEs; they will be considered in Chap. 12.
An ODE is said to be of order n if the n th derivative of the unknown function y is the highest derivative of y in the equation. The concept of order gives a useful classification into ODEs of first order, second order, and so on. Thus, (1) is of first order, (2) of second order, and (3) of third order.
In this chapter we shall consider first-order ODEs. Such equations contain only the first derivative and may contain y and any given functions of x. Hence we can write them as
or often in the form
This is called the explicit form, in contrast to the implicit form (4). For instance, the implicit ODE x−3y′ − 4y2 = 0 (where x ≠ 0) can be written explicitly as y′ = 4x3y2.
A function
is called a solution of a given ODE (4) on some open interval a < x < b if h(x) is defined and differentiable throughout the interval and is such that the equation becomes an identity if y′ and are replaced with h and h′, respectively. The curve (the graph) of h is called a solution curve.
Here, open interval a < x < b means that the endpoints a and b are not regarded as points belonging to the interval. Also, a < x < b includes infinite intervals −∞ < x < b, a < x < ∞, −∞ < x < ∞ (the real line) as special cases.
EXAMPLE 1 Verification of Solution
Verify that y = c/x (c an arbitrary constant) is a solution of the ODE xy′ = −y for all x ≠ 0. Indeed, differentiate y = c/x to get y′ = −c/x2. Multiply this by x, obtaining xy′ = −c/x; thus, xy′ = −y, the given ODE.
EXAMPLE 2 Solution by Calculus. Solution Curves
The ODE y′ = dy/dx = cos x can be solved directly by integration on both sides. Indeed, using calculus, we obtain y = ∫ cos x dx = sin x + c, where c is an arbitrary constant. This is a family of solutions. Each value of c, for instance, 2.75 or 0 or −8, gives one of these curves. Figure 3 shows some of them, for c = −3, −2, −1, 0, 1, 2, 3, 4.
EXAMPLE 3 (A) Exponential Growth. (B) Exponential Decay
From calculus we know that y = ce0.2t has the derivative
Hence y is a solution of y′ = 0.2y (Fig. 4A). This ODE is of the form y′ = ky. With positive-constant k it can model exponential growth, for instance, of colonies of bacteria or populations of animals. It also applies to humans for small populations in a large country (e.g., the United States in early times) and is then known as Malthus's law.1 We shall say more about this topic in Sec. 1.5.
(B) Similarly, y′ = −0.2 (with a minus on the right) has the solution y = ce−0.2t, (Fig. 4B) modeling exponential decay, as, for instance, of a radioactive substance (see Example 5).
We see that each ODE in these examples has a solution that contains an arbitrary constant c. Such a solution containing an arbitrary constant c is called a general solution of the ODE.
(We shall see that c is sometimes not completely arbitrary but must be restricted to some interval to avoid complex expressions in the solution.)
We shall develop methods that will give general solutions uniquely (perhaps except for notation). Hence we shall say the general solution of a given ODE (instead of a general solution).
Geometrically, the general solution of an ODE is a family of infinitely many solution curves, one for each value of the constant c. If we choose a specific c (e.g., c = 6.45 or 0 or −2.01) we obtain what is called a particular solution of the ODE. A particular solution does not contain any arbitrary constants.
In most cases, general solutions exist, and every solution not containing an arbitrary constant is obtained as a particular solution by assigning a suitable value to c. Exceptions to these rules occur but are of minor interest in applications; see Prob. 16 in Problem Set 1.1.
In most cases the unique solution of a given problem, hence a particular solution, is obtained from a general solution by an initial condition y(x0) = y0, with given values x0 and y0, that is used to determine a value of the arbitrary constant c. Geometrically this condition means that the solution curve should pass through the point (x0, y0) in the xy-plane. An ODE, together with an initial condition, is called an initial value problem. Thus, if the ODE is explicit, y′ = f(x, y), the initial value problem is of the form
EXAMPLE 4 Initial Value Problem
Solve the initial value problem
Solution. The general solution is y(x) = ce3x; see Example 3. From this solution and the initial condition we obtain y(0) = ce0 = c = 5.7. Hence the initial value problem has the solution y(x) = 5.7e3x. This is a particular solution.
The general importance of modeling to the engineer and physicist was emphasized at the beginning of this section. We shall now consider a basic physical problem that will show the details of the typical steps of modeling. Step 1: the transition from the physical situation (the physical system) to its mathematical formulation (its mathematical model); Step 2: the solution by a mathematical method; and Step 3: the physical interpretation of the result. This may be the easiest way to obtain a first idea of the nature and purpose of differential equations and their applications. Realize at the outset that your computer (your CAS) may perhaps give you a hand in Step 2, but Steps 1 and 3 are basically your work. And Step 2 requires a solid knowledge and good understanding of solution methods available to you—you have to choose the method for your work by hand or by the computer. Keep this in mind, and always check computer results for errors (which may arise, for instance, from false inputs).
EXAMPLE 5 Radioactivity. Exponential Decay
Given an amount of a radioactive substance, say, 0.5 g (gram), find the amount present at any later time.
Physical Information. Experiments show that at each instant a radioactive substance decomposes—and is thus decaying in time—proportional to the amount of substance present.
Step 1. Setting up a mathematical model of the physical process. Denote by y(t) the amount of substance still present at any time t. By the physical law, the time rate of change y′(t) = dy/dt is proportional to y(t). This gives the first-order ODE
where the constant k is positive, so that, because of the minus, we do get decay (as in [B] of Example 3). The value of k is known from experiments for various radioactive substances (e.g., k = 1.4 · 10−11 sec−1, approximately, for radium ).
Now the given initial amount is 0.5 g, and we can call the corresponding instant t = 0. Then we have the initial condition y(0) = 0.5. This is the instant at which our observation of the process begins. It motivates the term initial condition (which, however, is also used when the independent variable is not time or when we choose a t other than t = 0). Hence the mathematical model of the physical process is the initial value problem
Step 2. Mathematical solution. As in (B) of Example 3 we conclude that the ODE (6) models exponential decay and has the general solution (with arbitrary constant c but definite given k)
We now determine c by using the initial condition. Since y(0) = c from (8), this gives y(0) = c = 0.5. Hence the particular solution governing our process is (cf. Fig. 5)
Always check your result—it may involve human or computer errors! Verify by differentiation (chain rule!) that your solution (9) satisfies (7) as well as y(0) = 0.5:
Step 3. Interpretation of result. Formula (9) gives the amount of radioactive substance at time t. It starts from the correct initial amount and decreases with time because k is positive. The limit of y as t → ∞ is zero.
1–8 CALCULUS
Solve the ODE by integration or by remembering a differentiation formula.
9–15 VERIFICATION. INITIAL VALUE PROBLEM (IVP)
(a) Verify that y is a solution of the ODE. (b) Determine from y the particular solution of the IVP. (c) Graph the solution of the IVP.
17–20 MODELING, APPLICATIONS
These problems will give you a first impression of modeling. Many more problems on modeling follow throughout this chapter.
(a) Given 1 gram, how much will still be present after 1 day?
(b) After 1 year?
A first-order ODE
has a simple geometric interpretation. From calculus you know that the derivative y′(x) of y(x is the slope of y(x). Hence a solution curve of (1) that passes through a point (x0, y0) must have, at that point, the slope y′(x0) equal to the value of f at that point; that is,
Using this fact, we can develop graphic or numeric methods for obtaining approximate solutions of ODEs (1). This will lead to a better conceptual understanding of an ODE (1). Moreover, such methods are of practical importance since many ODEs have complicated solution formulas or no solution formulas at all, whereby numeric methods are needed.
Graphic Method of Direction Fields. Practical Example Illustrated in Fig. 7. We can show directions of solution curves of a given ODE (1) by drawing short straight-line segments (lineal elements) in the xy-plane. This gives a direction field (or slope field) into which you can then fit (approximate) solution curves. This may reveal typical properties of the whole family of solutions.
Figure 7 shows a direction field for the ODE
obtained by a CAS (Computer Algebra System) and some approximate solution curves fitted in.
If you have no CAS, first draw a few level curves f(x, y) = const of f(x, y), then parallel lineal elements along each such curve (which is also called an isocline, meaning a curve of equal inclination), and finally draw approximation curves fit to the lineal elements.
We shall now illustrate how numeric methods work by applying the simplest numeric method, that is Euler's method, to an initial value problem involving ODE (2). First we give a brief description of Euler's method.
Given an ODE (1) and an initial value y(x0) = y0, Euler's method yields approximate solution values at equidistant x-values x0, x1 = x0 + h, x2 = x0 + 2h, …, namely,
In general,
where the step h equals, e.g., 0.1 or 0.2 (as in Table 1.1) or a smaller value for greater accuracy.
Table 1.1 shows the computation of n = 5 steps with step h = 0.2 for the ODE (2) and initial condition y(0) = 0, corresponding to the middle curve in the direction field. We shall solve the ODE exactly in Sec. 1.5. For the time being, verify that the initial value problem has the solution y = ex − x − 1. The solution curve and the values in Table 1.1 are shown in Fig. 9. These values are rather inaccurate. The errors y(xn) − yn are shown in Table 1.1 as well as in Fig. 9. Decreasing h would improve the values, but would soon require an impractical amount of computation. Much better methods of a similar nature will be discussed in Sec. 21.1.
1–8 DIRECTION FIELDS, SOLUTION CURVES
Graph a direction field (by a CAS or by hand). In the field graph several solution curves by hand, particularly those passing through the given points (x, y).
9–10 ACCURACY OF DIRECTION FIELDS
Direction fields are very useful because they can give you an impression of all solutions without solving the ODE, which may be difficult or even impossible. To get a feel for the accuracy of the method, graph a field, sketch solution curves in it, and compare them with the exact solutions.
12–15 MOTIONS
Model the motion of a body B on a straight line with velocity as given, y(t) being the distance of B from a point y = 0 at time t. Graph a direction field of the model (the ODE). In the field sketch the solution curve satisfying the given initial condition.
(a) Graph portions of the direction field of the ODE (2) (see Fig. 7), for instance, −5 x 2, −1 y 5. Explain what you have gained by this enlargement of the portion of the field.
(b) Using implicit differentiation, find an ODE with the general solution x2 + 9y2 = c(y > 0). Graph its direction field. Does the field give the impression that the solution curves may be semi-ellipses? Can you do similar work for circles? Hyperbolas? Parabolas? Other curves?
(c) Make a conjecture about the solutions of y′ = −x/y from the direction field.
(d) Graph the direction field of and some solutions of your choice. How do they behave? Why do they decrease for? y > 0?
17–20 EULER'S METHOD
This is the simplest method to explain numerically solving an ODE, more precisely, an initial value problem (IVP). (More accurate methods based on the same principle are explained in Sec. 21.1.) Using the method, to get a feel for numerics as well as for the nature of IVPs, solve the IVP numerically with a PC or a calculator, 10 steps. Graph the computed values and the solution curve on the same coordinate axes.
Many practically useful ODEs can be reduced to the form
by purely algebraic manipulations. Then we can integrate on both sides with respect to x, obtaining
On the left we can switch to y as the variable of integration. By calculus, y′ dx = dy, so that
If f and g are continuous functions, the integrals in (3) exist, and by evaluating them we obtain a general solution of (1). This method of solving ODEs is called the method of separating variables, and (1) is called a separable equation, because in (3) the variables are now separated: x appears only on the right and y only on the left.
The ODE y′ = 1 + y2 is separable because it can be written
It is very important to introduce the constant of integration immediately when the integration is performed. If we wrote arctan y = x, then y = tan x, and then introduced c, we would have obtained y = tan x + c, which is not a solution (when c ≠ 0). Verify this.
The ODE y′ = (x + 1)e−xy2 is separable; we obtain y−2 dy = (x + 1)e−x dx.
By integration, .
EXAMPLE 3 Initial Value Problem (IVP). Bell-Shaped Curve
Solve y′ = −2xy, y(0) = 1.8.
Solution. By separation and integration,
This is the general solution. From it and the initial condition, y(0) = ce0 = c = 1.8. Hence the IVP has the solution . This is a particular solution, representing a bell-shaped curve (Fig. 10).
The importance of modeling was emphasized in Sec. 1.1, and separable equations yield various useful models. Let us discuss this in terms of some typical examples.
EXAMPLE 4 Radiocarbon Dating2
In September 1991 the famous Iceman (Oetzi), a mummy from the Neolithic period of the Stone Age found in the ice of the Oetztal Alps (hence the name “Oetzi”) in Southern Tyrolia near the Austrian–Italian border, caused a scientific sensation. When did Oetzi approximately live and die if the ratio of carbon to carbon in this mummy is 52.5% of that of a living organism?
Physical Information. In the atmosphere and in living organisms, the ratio of radioactive carbon (made radioactive by cosmic rays) to ordinary carbon is constant. When an organism dies, its absorption of by breathing and eating terminates. Hence one can estimate the age of a fossil by comparing the radioactive carbon ratio in the fossil with that in the atmosphere. To do this, one needs to know the half-life of , which is 5715 years (CRC Handbook of Chemistry and Physics, 83rd ed., Boca Raton: CRC Press, 2002, page 11–52, line 9).
Solution. Modeling. Radioactive decay is governed by the ODE y′ = ky (see Sec. 1.1, Example 5). By separation and integration (where t is time and y0 is the initial ratio of to )
Next we use the half-life H = 5715 to determine k. When t = H, half of the original substance is still present. Thus,
Finally, we use the ratio 52.5% for determining the time t when Oetzi died (actually, was killed),
Other methods show that radiocarbon dating values are usually too small. According to recent research, this is due to a variation in that carbon ratio because of industrial pollution and other factors, such as nuclear testing.
Mixing problems occur quite frequently in chemical industry. We explain here how to solve the basic model involving a single tank. The tank in Fig. 11 contains 1000 gal of water in which initially 100 lb of salt is dissolved. Brine runs in at a rate of 10 gal min, and each gallon contains 5 lb of dissoved salt. The mixture in the tank is kept uniform by stirring. Brine runs out at 10 gal min. Find the amount of salt in the tank at any time t.
Solution. Step 1. Setting up a model. Let y(t) denote the amount of salt in the tank at time t. Its time rate of change is
5 lb times 10 gal gives an inflow of 50 lb of salt. Now, the outflow is 10 gal of brine. This is 10/1000 = 0.01 (= 1%) of the total brine content in the tank, hence 0.01 of the salt content y(t), that is, 0.01 y(t). Thus the model is the ODE
Step 2. Solution of the model. The ODE (4) is separable. Separation, integration, and taking exponents on both sides gives
Initially the tank contains 100 lb of salt. Hence y(0) = 100 is the initial condition that will give the unique solution. Substituting y = 100 and t = 0 in the last equation gives 100 − 5000 = ce0 = c. Hence c = −4900. Hence the amount of salt in the tank at time t is
This function shows an exponential approach to the limit 5000 lb; see Fig. 11. Can you explain physically that should increase with time? That its limit is 5000 lb? Can you see the limit directly from the ODE?
The model discussed becomes more realistic in problems on pollutants in lakes (see Problem Set 1.5, Prob. 35) or drugs in organs. These types of problems are more difficult because the mixing may be imperfect and the flow rates (in and out) may be different and known only very roughly.
EXAMPLE 6 Heating an Office Building (Newton's Law of Cooling3)
Suppose that in winter the daytime temperature in a certain office building is maintained at 70°F. The heating is shut off at 10 P.M. and turned on again at 6 A.M. On a certain day the temperature inside the building at 2 A.M. was found to be 65°F. The outside temperature was 50°F at 10 P.M. and had dropped to 40°F by 6 A.M. What was the temperature inside the building when the heat was turned on at 6 A.M.?
Physical information. Experiments show that the time rate of change of the temperature T of a body B (which conducts heat well, for example, as a copper ball does) is proportional to the difference between T and the temperature of the surrounding medium (Newton's law of cooling).
Solution. Step 1. Setting up a model. Let T(t) be the temperature inside the building and TA the outside temperature (assumed to be constant in Newton's law). Then by Newton's law,
Such experimental laws are derived under idealized assumptions that rarely hold exactly. However, even if a model seems to fit the reality only poorly (as in the present case), it may still give valuable qualitative information. To see how good a model is, the engineer will collect experimental data and compare them with calculations from the model.
Step 2. General solution. We cannot solve (6) because we do not know TA, just that it varied between 50°F and 40°F, so we follow the Golden Rule: If you cannot solve your problem, try to solve a simpler one. We solve (6) with the unknown function TA replaced with the average of the two known values, or 45°F. For physical reasons we may expect that this will give us a reasonable approximate value of T in the building at 6 A.M.
For constant TA = 45 (or any other constant value) the ODE (6) is separable. Separation, integration, and taking exponents gives the general solution
Step 3. Particular solution. We choose 10 P.M. to be t = 0. Then the given initial condition is T(0) = 70 and yields a particular solution, call it Tp. By substitution,
Step 4. Determination of k. We use T(4) = 65, where t = 4 is 2 A.M. Solving algebraically for k and inserting k into Tp(t) gives (Fig. 12)
Step 5. Answer and interpretation. 6 A.M. is t = 8 (namely, 8 hours after 10 P.M.), and
Hence the temperature in the building dropped 9°F, a result that looks reasonable.
EXAMPLE 7 Leaking Tank. Outflow of Water Through a Hole (Torricelli's Law)
This is another prototype engineering problem that leads to an ODE. It concerns the outflow of water from a cylindrical tank with a hole at the bottom (Fig. 13). You are asked to find the height of the water in the tank at any time if the tank has diameter 2 m, the hole has diameter 1 cm, and the initial height of the water when the hole is opened is 2.25 m. When will the tank be empty?
Physical information. Under the influence of gravity the outflowing water has velocity
where h(t) is the height of the water above the hole at time t, and g = 980 cm/sec2 = 32.17 ft/sec2 is the acceleration of gravity at the surface of the earth.
Solution. Step 1. Setting up the model. To get an equation, we relate the decrease in water level h(t) to the outflow. The volume ΔV of the outflow during a short time Δt is
ΔV must equal the change ΔV* of the volume of the water in the tank. Now
where Δh (> 0) is the decrease of the height h(t) of the water. The minus sign appears because the volume of the water in the tank decreases. Equating ΔV and ΔV* gives
We now express ν according to Torricelli's law and then let Δt (the length of the time interval considered) approach 0—this is a standard way of obtaining an ODE as a model. That is, we have
and by letting Δt → 0 we obtain the ODE
where . This is our model, a first-order ODE.
Step 2. General solution. Our ODE is separable. A/B is constant. Separation and integration gives
Dividing by 2 and squaring gives h = (c − 13.28At/B)2. Inserting 13.28A/B = 13.28 · 0.52π/1002π = 0.000332 yields the general solution
Step 3. Particular solution. The initial height (the initial condition) is h(0) = 225 cm. Substitution of t = 0 and h = 225 gives from the general solution c2 = 225, c = 15.00 and thus the particular solution (Fig. 13)
Step 4. Tank empty. hp(t) = 0 if t = 15.00/0.000332 = 45,181 [sec] = 12.6 [hours].
Here you see distinctly the importance of the choice of units—we have been working with the cgs system, in which time is measured in seconds! We used g = 980 cm/sec2.
Step 5. Checking. Check the result.
Certain nonseparable ODEs can be made separable by transformations that introduce for y a new unknown function. We discuss this technique for a class of ODEs of practical importance, namely, for equations
Here, f is any (differentiable) function of y/x, such as sin(y/x), (y/x)4, and so on. (Such an ODE is sometimes called a homogeneous ODE, a term we shall not use but reserve for a more important purpose in Sec. 1.5.)
The form of such an ODE suggests that we set y/x = u; thus,
Substitution into y′ = f(y/x) then gives u′x + u = f(u) or u′x = f(u) − u. We see that if f(u) − u ≠ 0, this can be separated:
EXAMPLE 8 Reduction to Separable Form
Solve
Solution. To get the usual explicit form, divide the given equation by 2xy,
Now substitute y and y′ from (9) and then simplify by subtracting u on both sides,
You see that in the last equation you can now separate the variables,
Take exponents on both sides to get 1 + u2 = c/x or 1 + (y/x)2 = c/x. Multiply the last equation by x2 to obtain (Fig. 14)
This general solution represents a family of circles passing through the origin with centers on the x-axis.
2–10 GENERAL SOLUTION
Find a general solution. Show the steps of derivation. Check your answer by substitution.
11–17 INITIAL VALUE PROBLEMS (IVPS)
Solve the IVP. Show the steps of derivation, beginning with the general solution.
(a) If the birth rate and death rate of the number of bacteria are proportional to the number of bacteria present, what is the population as a function of time.
(b) What is the limiting situation for increasing time? Interpret it.
(a) Show that for the circles with center at the origin we get y′ = −x/y.
(b) Graph some of the hyperbolas xy = c. Find an ODE for them.
(c) Find an ODE for the straight lines through the origin.
(d) You will see that the product of the right sides of the ODEs in (a) and (c) equals −1. Do you recognize this as the condition for the two families to be orthogonal (i.e., to intersect at right angles)? Do your graphs confirm this?
(e) Sketch families of curves of your own choice and find their ODEs. Can every family of curves be given by an ODE?
(a) Show this for the five initial value problems, , y(0) = 0, ±1, ±2, graphing all five curves on the same axes.
(b) Graph approximate solution curves, using the first few terms of the Maclaurin series (obtained by termwise integration of that of y′) and compare with the exact curves.
(c) Repeat the work in (a) for another ODE and initial conditions of your own choice, leading to an integral that cannot be evaluated as indicated.
We recall from calculus that if a function u(x, y) has continuous partial derivatives, its differential (also called its total differential) is
From this it follows that if u(x, y) = c = const, then du = 0.
For example, if u = x + x2y3 = c, then
or
an ODE that we can solve by going backward. This idea leads to a powerful solution method as follows.
A first-order ODE M(x, y) + N(x, y)y′ = 0, written as (use dy = y′dx as in Sec. 1.3)
is called an exact differential equation if the differential form M(x, y)dx + N(x, y)dy is exact, that is, this form is the differential
of some function u(x, y). Then (1) can be written
By integration we immediately obtain the general solution of (1) in the form
This is called an implicit solution, in contrast to a solution y = h(x) as defined in Sec. 1.1, which is also called an explicit solution, for distinction. Sometimes an implicit solution can be converted to explicit form. (Do this for x2 + y2 = 1.) If this is not possible, your CAS may graph a figure of the contour lines (3) of the function u(x, y) and help you in understanding the solution.
Comparing (1) and (2), we see that (1) is an exact differential equation if there is some function u(x, y) such that
From this we can derive a formula for checking whether (1) is exact or not, as follows.
Let M and N be continuous and have continuous first partial derivatives in a region in the xy-plane whose boundary is a closed curve without self-intersections. Then by partial differentiation of (4) (see App. 3.2 for notation),
By the assumption of continuity the two second partial derivaties are equal. Thus
This condition is not only necessary but also sufficient for (1) to be an exact differential equation. (We shall prove this in Sec. 10.2 in another context. Some calculus books, for instance, [GenRef 12], also contain a proof.)
If (1) is exact, the function u(x, y) can be found by inspection or in the following systematic way. From (4a) we have by integration with respect to x
in this integration, y is to be regarded as a constant, and k(y) plays the role of a “constant” of integration. To determine k(y), we derive ∂u/∂y from (6), use (4b) to get dk/dy, and integrate dk/dy to get k. (See Example 1, below.)
Formula (6) was obtained from (4a). Instead of (4a) we may equally well use (4b). Then, instead of (6), we first have by integration with respect to y
To determine l(x), we derive ∂u/∂x from (6*), use (4a) to get dl/dx, and integrate. We illustrate all this by the following typical examples.
Solve
Solution. Step 1. Test for exactness. Our equation is of the form (1) with
Thus
From this and (5) we see that (7) is exact.
Step 2. Implicit general solution. From (6) we obtain by integration
To find k(y), we differentiate this formula with respect to y and use formula (4b), obtaining
Hence dk/dy = 3y2 + 2y. By integration, k = y3 + y2 + c*. Inserting this result into (8) and observing (3), we obtain the answer
Step 3. Checking an implicit solution. We can check by differentiating the implicit solution u(x, y) = c implicitly and see whether this leads to the given ODE (7):
This completes the check.
EXAMPLE 2 An Initial Value Problem
Solve the initial value problem
Solution. You may verify that the given ODE is exact. We find u. For a change, let us use (6*),
From this, ∂u/∂x = cos y sinh x + dl/dx = M = cos y sinh x + 1. Hence dl/dx = 1. By integration, l(x) = x + c*. This gives the general solution u(x, y) = cos y cosh x + x = c. From the initial condition, cos 2 cosh 1 + 1 = 0.358 = c. Hence the answer is cos y cosh x + x = 0.358. Figure 17 shows the particular solutions for c = 0, 0.358 (thicker curve), 1, 2, 3. Check that the answer satisfies the ODE. (Proceed as in Example 1.) Also check that the initial condition is satisfied.
EXAMPLE 3 WARNING! Breakdown in the Case of Nonexactness
The equation −y dx + x dy = 0 is not exact because M = −y and N = x, so that in (5), but ∂M/∂y = −1 but ∂N/∂x = 1. Let us show that in such a case the present method does not work. From (6),
Now, ∂u/∂y should equal N = x, by (4b). However, this is impossible because k(y) can depend only on y. Try (6*); it will also fail. Solve the equation by another method that we have discussed.
The ODE in Example 3 is −y dx + x dy = 0. It is not exact. However, if we multiply it by 1/x2, we get an exact equation [check exactness by (5)!],
Integration of (11) then gives the general solution y/x = c = const.
This example gives the idea. All we did was to multiply a given nonexact equation, say,
by a function F that, in general, will be a function of both x and y. The result was an equation
that is exact, so we can solve it as just discussed. Such a function F(x, y) is then called an integrating factor of (12).
The integrating factor in (11) is F = 1/x2. Hence in this case the exact equation (13) is
These are straight lines y = cx through the origin. (Note that x = 0 is also a solution of −y dx + x dy = 0.)
It is remarkable that we can readily find other integrating factors for the equation −y dx + x dy = 0, namely, 1/y2, 1/(xy), and 1/(x2 + y2), because
In simpler cases we may find integrating factors by inspection or perhaps after some trials, keeping (14) in mind. In the general case, the idea is the following.
For M dx + N dy = 0 the exactness condition (5) is ∂M/∂y = ∂N/∂x. Hence for (13), FP dx + FQ dy = 0, the exactness condition is
By the product rule, with subscripts denoting partial derivatives, this gives
In the general case, this would be complicated and useless. So we follow the Golden Rule: If you cannot solve your problem, try to solve a simpler one—the result may be useful (and may also help you later on). Hence we look for an integrating factor depending only on one variable: fortunately, in many practical cases, there are such factors, as we shall see. Thus, let F = F(x). Then Fy = 0, and Fx = F′ = dF/dx, so that (15) becomes
Dividing by FQ and reshuffling terms, we have
This proves the following theorem.
THEOREM 1 Integrating Factor F(x)
If (12) is such that the right side R of (16) depends only on x, then (12) has an integrating factor F = F(x), which is obtained by integrating (16) and taking exponents on both sides.
Similarly, if F* = F*(y) then instead of (16) we get
and we have the companion
THEOREM 2 Integrating Factor F*(y)
If (12) is such that the right side R* of (18) depends only on y, then (12) has an integrating factor F* = F*(y), which is obtained from (18) in the form
EXAMPLE 5 Application of Theorems 1 and 2. Initial Value Problem
Using Theorem 1 or 2, find an integrating factor and solve the initial value problem
Solution. Step 1. Nonexactness. The exactness check fails:
Step 2. Integrating factor. General solution. Theorem 1 fails because R [the right side of (16)] depends on both x and y.
Try Theorem 2. The right side of (18) is
Hence (19) gives the integrating factor F*(y) = e−y. From this result and (20) you get the exact equation
Test for exactness; you will get 1 on both sides of the exactness condition. By integration, using (4a),
Differentiate this with respect to y and use (4b) to get
Hence the general solution is
Setp 3. Particular solution. The initial condition y(0) = −1 gives u(0, −1) = 1 + 0 + e = 3.72. Hence the answer is ex + xy + e−y = 1 + e = 3.72. Figure 18 shows several particular solutions obtained as level curves of u(x, y) = c, obtained by a CAS, a convenient way in cases in which it is impossible or difficult to cast a solution into explicit form. Note the curve that (nearly) satisfies the initial condition.
Step 4. Checking. Check by substitution that the answer satisfies the given equation as well as the initial condition.
1–14 ODEs. INTEGRATING FACTORS
Test for exactness. If exact, solve. If not, use an integrating factor as given or obtained by inspection or by the theorems in the text. Also, if an initial condition is given, find the corresponding particular solution.
(a) ey(sinh x dx + cosh x dy) = 0 as an exact ODE and by separation.
(b) (1 + 2x)cos y dx + dy/cos y = 0 by Theorem 2 and by separation.
(c) (x2 + y2)dx − 2xy dy = 0 by Theorem 1 or 2 and by separation with ν = y/x.
(d) 3x2y dx + 4x3 dy = 0 by Theorems 1 and 2 and by separation.
(e) Search the text and the problems for further ODEs that can be solved by more than one of the methods discussed so far. Make a list of these ODEs. Find further cases of your own.
(a) Show that (21) is not exact. Find an integrating factor using either Theorem 1 or 2. Solve (21).
(b) Solve (21) by separating variables. Is this simpler than (a)?
(c) Graph the seven particular solutions satisfying the following initial conditions y(0) = 1, , , ±1 (see figure below).
(d) Which solution of (21) do we not get in (a) or (b)?
Linear ODEs or ODEs that can be transformed to linear form are models of various phenomena, for instance, in physics, biology, population dynamics, and ecology, as we shall see. A first-order ODE is said to be linear if it can be brought into the form
by algebra, and nonlinear if it cannot be brought into this form.
The defining feature of the linear ODE (1) is that it is linear in both the unknown function y and its derivative y′ = dy/dx, whereas p and r may be any given functions of x. If in an application the independent variable is time, we write t instead of x.
If the first term is f(x)y′ (instead of y′), divide the equation by f(x) to get the standard form (1), with y′ as the first term, which is practical.
For instance, y′ cos x + y sin x = x is a linear ODE, and its standard form is y′ + y tan x = x sec x.
The function r(x) on the right may be a force, and the solution y(x) a displacement in a motion or an electrical current or some other physical quantity. In engineering, r(x) is frequently called the input, and y(x) is called the output or the response to the input (and, if given, to the initial condition).
Homogeneous Linear ODE. We want to solve (1) in some interval a < x < b, call it J, and we begin with the simpler special case that r(x) is zero for all x in J. (This is sometimes written r(x) ≡ 0.) Then the ODE (1) becomes
and is called homogeneous. By separating variables and integrating we then obtain
Taking exponents on both sides, we obtain the general solution of the homogeneous ODE (2),
here we may also choose c = 0 and obtain the trivial solution y(x) = 0 for all x in that interval.
Nonhomogeneous Linear ODE. We now solve (1) in the case that r(x) in (1) is not everywhere zero in the interval J considered. Then the ODE (1) is called nonhomogeneous. It turns out that in this case, (1) has a pleasant property; namely, it has an integrating factor depending only on x. We can find this factor F(x) by Theorem 1 in the previous section or we can proceed directly, as follows. We multiply (1) by F(x), obtaining
The left side is the derivative (Fy)′ = F′ y + Fy′ of the product Fy if
By separating variables, dF/F = p dx. By integration, writing h = ∫p dx,
With this F and h′ = p, Eq. (1*) becomes
By integration,
Dividing by eh, we obtain the desired solution formula
This reduces solving (1) to the generally simpler task of evaluating integrals. For ODEs for which this is still difficult, you may have to use a numeric method for integrals from Sec. 19.5 or for the ODE itself from Sec. 21.1. We mention that h has nothing to do with in Sec. 1.1 and that the constant of integration in h does not matter; see Prob. 2.
The structure of (4) is interesting. The only quantity depending on a given initial condition is c. Accordingly, writing (4) as a sum of two terms,
we see the following:
EXAMPLE 1 First-Order ODE, General Solution, Initial Value Problem
Solve the initial value problem
Solution. Here p = tan x, r = sin 2x = 2 sin x cos x, and
From this we see that in (4),
and the general solution of our equation is
From this and the initial condition, 1 = c · 1 − 2 · 12; thus c = 3 and the solution of our initial value problem is y = 3 cos x − 2 cos2x. Here 3 cos x is the response to the initial data, and −2 cos2 x is the response to the input sin 2x.
Model the RL-circuit in Fig. 19 and solve the resulting ODE for the current I(t) A (amperes), where t is time. Assume that the circuit contains as an EMF E(t) (electromotive force) a battery of E = 48 V (volts), which is constant, a resistor of R = 11 Ω (ohms), and an inductor of L = 0.1 H (henrys), and that the current is initially zero.
Physical Laws. A current I in the circuit causes a voltage drop RI across the resistor (Ohm's law) and a voltage drop LI′ = L dI/dt across the conductor, and the sum of these two voltage drops equals the EMF (Kirchhoff's Voltage Law, KVL).
Remark. In general, KVL states that “The voltage (the electromotive force EMF) impressed on a closed loop is equal to the sum of the voltage drops across all the other elements of the loop.” For Kirchoff's Current Law (KCL) and historical information, see footnote 7 in Sec. 2.9.
Solution. According to these laws the model of the RL-circuit is LI′ + RI = E(t), in standard form
We can solve this linear ODE by (4) with x = t, y = I, p = R/L, h = (R/L)t, obtaining the general solution
By integration,
In our case, R/L = 11/0.1 = 110 and E(t) = 48/0.1 = 480 = const; thus,
In modeling, one often gets better insight into the nature of a solution (and smaller roundoff errors) by inserting given numeric data only near the end. Here, the general solution (7) shows that the current approaches the limit E/R = 48/11 faster the larger R/L is, in our case, R/L = 11/0.1 = 110, and the approach is very fast, from below if I(0) < 48/11 or from above if I(0) > 48/11. If I(0) = 48/11, the solution is constant (48/11 A). See Fig. 19.
The initial value I(0) = 0 gives I(0) = E/R + c = 0, c = −E/R and the particular solution
Assume that the level of a certain hormone in the blood of a patient varies with time. Suppose that the time rate of change is the difference between a sinusoidal input of a 24-hour period from the thyroid gland and a continuous removal rate proportional to the level present. Set up a model for the hormone level in the blood and find its general solution. Find the particular solution satisfying a suitable initial condition.
Solution. Step 1. Setting up a model. Let y(t) be the hormone level at time t. Then the removal rate is Ky(t). The input rate is A + B cos ωt, where ω = 2π/24 = π/12 and A is the average input rate; here A B to make the input rate nonnegative. The constants A, B, K can be determined from measurements. Hence the model is the linear ODE
The initial condition for a particular solution ypart is ypart(0) with t = 0 suitably chosen, for example, 6:00 A.M.
Step 2. General solution. In (4) we have p = K = const, h = Kt, and r = A + B cos ωt. Hence (4) gives the general solution (evaluate cos ωt dt by integration by parts)
The last term decreases to 0 as t increases, practically after a short time and regardless of c (that is, of the initial condition). The other part of y(t) is called the steady-state solution because it consists of constant and periodic terms. The entire solution is called the transient-state solution because it models the transition from rest to the steady state. These terms are used quite generally for physical and other systems whose behavior depends on time.
Step 3. Particular solution. Setting t = 0 in y(t) and choosing y0 = 0, we have
Inserting this result into y(t), we obtain the particular solution
with the steady-state part as before. To plot ypart we must specify values for the constants, say, A = B = 1 and K = 0.05. Figure 20 shows this solution. Notice that the transition period is relatively short (although K is small), and the curve soon looks sinusoidal; this is the response to the input .
Numerous applications can be modeled by ODEs that are nonlinear but can be transformed to linear ODEs. One of the most useful ones of these is the Bernoulli equation7
If a = 0 or a = 1, Equation (9) is linear. Otherwise it is nonlinear. Then we set
We differentiate this and substitute y′ from (9), obtaining
Simplification gives
where y1 − a = u on the right, so that we get the linear ODE
For further ODEs reducible to linear form, see lnce's classic [A11] listed in App. 1. See also Team Project 30 in Problem Set 1.5.
Solve the following Bernoulli equation, known as the logistic equation (or Verhulst equation8):
Solution. Write (11) in the form (9), that is,
to see that a = 2, so that u = y1−a = y−1. Differentiate this u and substitute y′ from (11),
The last term is −Ay−1 = Hence we have obtained the linear ODE
The general solution is [by (4)]
Since u = 1/y, this gives the general solution of (11),
Directly from (11) we see that y = ≡ 0 (y(t) = 0 for all t) is also a solution.
The logistic equation (11) plays an important role in population dynamics, a field that models the evolution of populations of plants, animals, or humans over time t. If B = 0, then (11) is y′ = dy/dt = Ay. In this case its solution (12) is y = (1/c)eAt and gives exponential growth, as for a small population in a large country (the United States in early times!). This is called Malthus's law. (See also Example 3 in Sec. 1.1.)
The term −By2 in (11) is a “braking term” that prevents the population from growing without bound. Indeed, if we write y′ = Ay[1 − (B/A)y], we see that if y < A/B, then y′ > 0, so that an initially small population keeps growing as long as y < A/B. But if y > A/B, then y′ < 0 and the population is decreasing as long as y > A/B. The limit is the same in both cases, namely, A/B. See Fig. 21.
We see that in the logistic equation (11) the independent variable t does not occur explicitly. An ODE y′ = f(t, y) in which t does not occur explicitly is of the form
and is called an autonomous ODE. Thus the logistic equation (11) is autonomous.
Equation (13) has constant solutions, called equilibrium solutions or equilibrium points. These are determined by the zeros of f(y), because f(y) = 0 gives y′ = 0 by (13); hence y = const. These zeros are known as critical points of (13). An equilibrium solution is called stable if solutions close to it for some t remain close to it for all further t. It is called unstable if solutions initially close to it do not remain close to it as t increases. For instance, y = 0 in Fig. 21 is an unstable equilibrium solution, and y = 4 is a stable one. Note that (11) has the critical points y = 0 and y = A/B.
EXAMPLE 5 Stable and Unstable Equilibrium Solutions. “Phase Line Plot”
The ODE y′ = (y − 1)(y − 2) has the stable equilibrium solution y1 = 1 and the unstable y2 = 2, as the direction field in Fig. 22 suggests. The values y1 and y2 are the zeros of the parabola f(y) = (y − 1)(y − 2) in the figure. Now, since the ODE is autonomous, we can “condense” the direction field to a “phase line plot” giving y1 and y2, and the direction (upward or downward) of the arrows in the field, and thus giving information about the stability or instability of the equilibrium solutions.
A few further population models will be discussed in the problem set. For some more details of population dynamics, see C. W. Clark. Mathematical Bioeconomics: The Mathematics of Conservation 3rd ed. Hoboken, NJ, Wiley, 2010.
Further applications of linear ODEs follow in the next section.
3–13 GENERAL SOLUTION. INITIAL VALUE PROBLEMS
Find the general solution. If an initial condition is given, find also the corresponding particular solution and graph or sketch it. (Show the details of your work.)
(b) Generalizing (a) from n = 1 to arbitrary n, solve the ODE y′ − ny/x = −xn−2 cos (1/x). Find an initial condition as in (a) and experiment with the graph.
15–20 GENERAL PROPERTIES OF LINEAR ODEs
These properties are of practical and theoretical importance because they enable us to obtain new solutions from given ones. Thus in modeling, whenever possible, we prefer linear ODEs over nonlinear ones, which have no similar properties.
Show that nonhomogeneous linear ODEs (1) and homogeneous linear ODEs (2) have the following properties. Illustrate each property by a calculation for two or three equations of your choice. Give proofs.
22–28 NONLINEAR ODEs
Using a method of this section or separating variables, find the general solution. If an initial condition is given, find also the particular solution and sketch or graph it.
A Riccati equation is of the form
A Clairaut equation is of the form
(a) Apply the transformation y = Y + 1/u to the Riccati equation (14), where Y is a solution of (14), and obtain for u the linear ODE u′ + (2Yg − p)u = −g. Explain the effect of the transformation by writing it as y = Y + ν, ν = 1/ν.
(b) Show that y = Y = x is a solution of the ODE y′ − (2x3 + 1) y = −x2y2 − x4 − x + 1 and slove this Riccati equation, showing the details.
(c) Solve the Clairaut equation y′2 − xy′ + y = 0 follows. Differentiate it with respect to x, obtaining y″(2y′ − x) = 0. Then solve (A) y″ = 0 and (B) 2y′ − x = 0 separately and substitute the two solutions (a) and (b) of (A) and (B) into the given ODE. Thus obtain (a) a general solution (straight lines) and (b) a parabola for which those lines (a) are tangents (Fig. 6 in Prob. Set 1.1); so (b) is the envelope of (a). Such a solution (b) that cannot be obtained from a general solution is called a singular solution.
(d) Show that the Clairaut equation (15) has as solutions a family of straight lines y = cx + g(c) and a singular solution determined by g′ (s) = −x, where s = y′, that forms the envelope of that family.
31–40 MODELING. FURTHER APPLICATIONS
where T = T(t) is the temperature in the building at time t, Ta the outside temperature, Tw the temperature wanted in the building, and P the rate of increase of T due to machines and people in the building, and k1 and k2 are (negative) constants. Solve this ODE, assuming andP = const, Tw = const, and Ta varying sinusoidally over 24 hours, say, Ta = A − C cos (2π/24)t. Discuss the effect of each term of the equation on the solution.
An important type of problem in physics or geometry is to find a family of curves that intersects a given family of curves at right angles. The new curves are called orthogonal trajectories of the given curves (and conversely). Examples are curves of equal temperature (isotherms) and curves of heat flow, curves of equal altitude (contour lines) on a map and curves of steepest descent on that map, curves of equal potential (equipotential curves, curves of equal voltage—the ellipses in Fig. 24) and curves of electric force (the parabolas in Fig. 24).
Here the angle of intersection between two curves is defined to be the angle between the tangents of the curves at the intersection point. Orthogonal is another word for perpendicular.
In many cases orthogonal trajectories can be found using ODEs. In general, if we consider G(x, y, c) = 0 to be a given family of curves in the xy-plane, then each value of c gives a particular curve. Since c is one parameter, such a family is called a one-parameter family of curves.
In detail, let us explain this method by a family of ellipses
and illustrated in Fig. 24. We assume that this family of ellipses represents electric equipotential curves between the two black ellipses (equipotential surfaces between two elliptic cylinders in space, of which Fig. 24 shows a cross-section). We seek the orthogonal trajectories, the curves of electric force. Equation (1) is a one-parameter family with parameter c. Each value of c (> 0) corresponds to one of these ellipses.
Step 1. Find an ODE for which the given family is a general solution. Of course, this ODE must no longer contain the parameter c. Differentiating (1), we have x + 2yy′ = 0. Hence the ODE of the given curves is
Step 2. Find an ODE for the orthogonal trajectories . This ODE is
with the same f as in (2). Why? Well, a given curve passing through a point (x0, y0) has slope f(x0, y0) at that point, by (2). The trajectory through (x0, y0) has slope −1/f(x0, y0) by (3). The product of these slopes is −1, as we see. From calculus it is known that this is the condition for orthogonality (perpendicularity) of two straight lines (the tangents at x0, y0)), hence of the curve and its orthogonal trajectory at (x0, y0).
Step 3. Solve (3) by separating variables, integrating, and taking exponents:
This is the family of orthogonal trajectories, the quadratic parabolas along which electrons or other charged particles (of very small mass) would move in the electric field between the black ellipses (elliptic cylinders).
1–3 FAMILIES OF CURVES
Represent the given family of curves in the form G(x, y; c) = 0 and sketch some of the curves.
4–10 ORTHOGONAL TRAJECTORIES (OTs)
Sketch or graph some of the given curves. Guess what their OTs may look like. Find these OTs.
11–16 APPLICATIONS, EXTENSIONS
The initial value problem
has no solution because y = 0 (that is, y(x) = 0 for all x) is the only solution of the ODE. The initial value problem
has precisely one solution, namely, y = x2 + 1. The initial value problem
has infinitely many solutions, namely, y = 1 + cx, where c is an arbitrary constant because y(0) = 1 for all c.
From these examples we see that an initial value problem
may have no solution, precisely one solution, or more than one solution. This fact leads to the following two fundamental questions.
Problem of Existence
Under what conditions does an initial value problem of the form (1) have at least one solution (hence one or several solutions)?
Problem of Uniqueness
Under what conditions does that problem have at most one solution (hence excluding the case that is has more than one solution)?
Theorems that state such conditions are called existence theorems and uniqueness theorems, respectively.
Of course, for our simple examples, we need no theorems because we can solve these examples by inspection; however, for complicated ODEs such theorems may be of considerable practical importance. Even when you are sure that your physical or other system behaves uniquely, occasionally your model may be oversimplified and may not give a faithful picture of reality.
Let the right side f(x, y) of the ODE in the initial value problem
be continuous at all points (x, y) in some rectangle
and bounded in R; that is, there is a number K such that
Then the initial value problem (1) has at least one solution y(x). This solution exists at least for all x in the subinterval |x − x0| < α of the interval |x − x0| < a; here, α is the smaller of the two numbers a and b/K.
(Example of Boundedness. The function f(x, y) = x2 + y2 is bounded (with K = 2) in the square |x| < 1, |y| < 1. The function f(x, y) = tan (x + y) is not bounded for |x + y| < π/2. Explain!)
Let f and its partial derivative fy = ∂f/∂y be continuous for all (x, y) in the rectangle R (Fig. 26) and bounded, say,
Then the initial value problem (1) has at most one solution y(x). Thus, by Theorem 1, the problem has precisely one solution. This solution exists at least for all x in that subinterval |x − x0| < α.
These two theorems take care of almost all practical cases. Theorem 1 says that if f(x, y) is continuous in some region in the xy-plane containing the point (x0, y0), then the initial value problem (1) has at least one solution.
Theorem 2 says that if, moreover, the partial derivative ∂f/∂y of f with respect to y exists and is continuous in that region, then (1) can have at most one solution; hence, by Theorem 1, it has precisely one solution.
Read again what you have just read—these are entirely new ideas in our discussion.
Proofs of these theorems are beyond the level of this book (see Ref. [A11] in App. 1); however, the following remarks and examples may help you to a good understanding of the theorems.
Since y′ = f(x, y), the condition (2) implies that |y′| K; that is, the slope of any solution curve y(x) in R is at least −K and at most K. Hence a solution curve that passes through the point (x0, y0) must lie in the colored region in Fig. 27 bounded by the lines l1 and l2 whose slopes are −L and K, respectively. Depending on the form of R, two different cases may arise. In the first case, shown in Fig. 27a, we have b/K a and therefore α = a in the existence theorem, which then asserts that the solution exists for all x between x0 − a and x0 + a. In the second case, shown in Fig. 27b, we have b/K < a. Therefore, α = b/K < a, and all we can conclude from the theorems is that the solution exists for all x between x0 − b/K and x0 + b/K. For larger or smaller x's the solution curve may leave the rectangle R, and since nothing is assumed about f outside R, nothing can be concluded about the solution for those larger or amaller x's; that is, for such x's the solution may or may not exist—we don't know.
Let us illustrate our discussion with a simple example. We shall see that our choice of a rectangle R with a large base (a long x-interval) will lead to the case in Fig. 27b.
EXAMPLE 1 Choice of a Rectangle
Consider the initial value problem
and take the rectangle R; |x| < 5, |y| < 3. Then a = 5, b = 3, and
Indeed, the solution of the problem is y = tan x (see Sec. 1.3, Example 1). This solution is discontinuous at ±π/2, and there is no continuous solution valid in the entire interval |x| < 5 from which we started.
The conditions in the two theorems are sufficient conditions rather than necessary ones, and can be lessened. In particular, by the mean value theorem of differential calculus we have
where (x, y1) and (x, y2) are assumed to be in R, and is a suitable value between y1 and y2. From this and (3b) it follows that
It can be shown that (3b) may be replaced by the weaker condition (4), which is known as a Lipschitz condition.9 However, continuity of f(x, y) is not enough to guarantee the uniqueness of the solution. This may be illustrated by the following example.
The initial value problem
has the two solutions
although is continuous for all y. The Lipschitz condition (4) is violated in any region that includes the line y = 0, because for y1 = 0 and positive y2 we have
and this can be made as large as we please by choosing y2 sufficiently small, whereas (4) requires that the quotient on the left side of (5) should not exceed a fixed constant M.
This form (6) of (1) suggests Picard's Iteration Method10 which is defined by
It gives approximations y1, y2, y3, … of the unknown solution y of (1). Indeed, you obtain y1 by substituting y = y0 on the right and integrating—this is the first step—then y2 by substituting y = y1 on the right and integrating—this is the second step—and so on. Write a program of the iteration that gives a printout of the first approximations y0, y1, …, yN as well as their graphs on common axes. Try your program on two initial value problems of your own choice.
(b) Apply the iteration to y′ = x + y, y(0) = 0. Also solve the problem exactly.
(c) Apply the iteration to y′ = 2y2, y(0) = 1. Also solve the problem exactly.
(d) Find all solutions of , y(1) = 0. Which of them does Picard's iteration approximate?
(e) Experiment with the conjecture that Picard's iteration converges to the solution of the problem for any initial choice of y in the integrand in (7) (leaving y0 outside the integral as it is). Begin with a simple ODE and see what happens. When you are reasonably sure, take a slightly more complicated ODE and give it a try.
11–16 DIRECTION FIELD: NUMERIC SOLUTION
Graph a direction field (by a CAS or by hand) and sketch some solution curves. Solve the ODE exactly and compare. In Prob. 16 use Euler's method.
17–21 GENERAL SOLUTION
Find the general solution. Indicate which method in this chapter you are using. Show the details of your work.
22–26 INITIAL VALUE PROBLEM (IVP)
Solve the IVP. Indicate the method used. Show the details of your work.
27–30 MODELING, APPLICATIONS
This chapter concerns ordinary differential equations (ODEs) of first order and their applications. These are equations of the form
involving the derivative y′ = dy/dx of an unknown function y, given functions of x, and, perhaps, y itself. If the independent variable x is time, we denote it by t.
In Sec. 1.1 we explained the basic concepts and the process of modeling, that is, of expressing a physical or other problem in some mathematical form and solving it. Then we discussed the method of direction fields (Sec. 1.2), solution methods and models (Secs. 1.3–1.6), and, finally, ideas on existence and uniqueness of solutions (Sec. 1.7).
A first-order ODE usually has a general solution, that is, a solution involving an arbitrary constant, which we denote by c. In applications we usually have to find a unique solution by determining a value of c from an initial condition y(x0) = y0. Together with the ODE this is called an initial value problem
and its solution is a particular solution of the ODE. Geometrically, a general solution represents a family of curves, which can be graphed by using direction fields (Sec. 1.2). And each particular solution corresponds to one of these curves.
A separable ODE is one that we can put into the form
by algebraic manipulations (possibly combined with transformations, such as y/x = u) and solve by integrating on both sides.
where M dx + N dy is the differential
of a function u(x, y), so that from du = 0 we immediately get the implicit general solution u(x, y) = c. This method extends to nonexact ODEs that can be made exact by multiplying them by some function F(x, y,), called an integrating factor (Sec. 1.4).
Linear ODEs
are very important. Their solutions are given by the integral formula (4), Sec. 1.5. Certain nonlinear ODEs can be transformed to linear form in terms of new variables. This holds for the Bernoulli equation
Applications and modeling are discussed throughout the chapter, in particular in Secs. 1.1, 1.3, 1.5 (population dynamics, etc.), and 1.6 (trajectories).
Picard's existence and uniqueness theorems are explained in Sec. 1.7 (and Picard's iteration in Problem Set 1.7).
Numeric methods for first-order ODEs can be studied in Secs. 21.1 and 21.2 immediately after this chapter, as indicated in the chapter opening.
1Named after the English pioneer in classic economics, THOMAS ROBERT MALTHUS (1766–1834).
2Method by WILLARD FRANK LIBBY (1908–1980), American chemist, who was awarded for this work the 1960 Nobel Prize in chemistry.
3Sir ISAAC NEWTON (1642–1727), great English physicist and mathematician, became a professor at Cambridge in 1669 and Master of the Mint in 1699. He and the German mathematician and philosopher GOTTFRIED WILHELM LEIBNIZ (1646–1716) invented (independently) the differential and integral calculus. Newton discovered many basic physical laws and created the method of investigating physical problems by means of calculus. His Philosophiae naturalis principia mathematica (Mathematical Principles of Natural Philosophy, 1687) contains the development of classical mechanics. His work is of greatest importance to both mathematics and physics.
4EVANGELISTA TORRICELLI (1608–1647), Italian physicist, pupil and successor of GALILEO GALILEI (1564–1642) at Florence. The “contraction factor” 0.600 was introduced by J. C. BORDA in 1766 because the stream has a smaller cross section than the area of the hole.
5ROBERT BOYLE (1627–1691), English physicist and chemist, one of the founders of the Royal Society. EDME MARIOTTE (about 1620–1684), French physicist and prior of a monastry near Dijon. They found the law experimentally in 1662 and 1676, respectively.
6CHARLES AUGUSTIN DE COULOMB (1736–1806), French physicist and engineer.
7JAKOB BERNOULLI (1654–1705), Swiss mathematician, professor at Basel, also known for his contribution to elasticity theory and mathematical probability. The method for solving Bernoulli's equation was discovered by Leibniz in 1696. Jakob Bernoulli's students included his nephew NIKLAUS BERNOULLI (1687–1759), who contributed to probability theory and infinite series, and his youngest brother JOHANN BERNOULLI (1667–1748), who had profound influence on the development of calculus, became Jakob's successor at Basel, and had among his students GABRIEL CRAMER (see Sec. 7.7) and LEONHARD EULER (see Sec. 2.5). His son DANIEL BERNOULLI (1700–1782) is known for his basic work in fluid flow and the kinetic theory of gases.
8PIERRE-FRANÇOIS VERHULST, Belgian statistician, who introduced Eq. (8) as a model for human population growth in 1838.
9RUDOLF LIPSCHITZ (1832–1903), German mathematician. Lipschitz and similar conditions are important in modern theories, for instance, in partial differential equations.
10EMILE PICARD (1856–1941). French mathematician, also known for his important contributions to complex analysis (see Sec. 16.2 for his famous theorem). Picard used his method to prove Theorems 1 and 2 as well as the convergence of the sequence (7) to the solution of (1). In precomputer times, the iteration was of little practical value because of the integrations.