III.16 Differential Forms and Integration

 Terence Tao


It goes without saying that integration is one of the fundamental concepts of single-variable calculus. However, there are in fact three concepts of integration that appear in the subject: the indefinite integralf (also known as the antiderivctive), the unsigned definite integral ∫[ab] f (x) dx (which one would use to find the area under a curve, or the mass of a one-dimensional object of varying density), and the signed definite integral Image f(x) dx (which one would use, for instance, to compute the work required to move a particle from a to b). For simplicity we shall restrict our attention here to functions f : ImageImage that are continuous on the entire real line (and similarly, when we come to differential forms, we shall discuss only forms that are continuous on the entire domain). We shall also informally use terminology such as “infinitesimal” in order to avoid having to discuss the (routine) “epsilon-delta” analytical issues that one must resolve in order to make these integration concepts fully rigorous.

These three concepts of integration are of course closely related to each other in single-variable calculus; indeed, THE FUNDAMENTAL THEOREM OF CALCULUS [I.3 §5.5] relates the signed definite integral Image f (x) dx to any one of the indefinite integrals F = ∫ f by the formula

Image

while the signed and unsigned integrals are related by the simple identity

Image

which is valid whenever a ≤ b.

When one moves from single-variable calculus to several-variable calculus, though, these three concepts begin to diverge significantly from each other. The indefinite integral generalizes to the notion of a solution to a differential equation, or to an integral of a connection, VECTOR FIELD [IV.6 §5], or BUNDLE [IV.6 §5]. The unsigned definite integral generalizes to the LEBESGUE INTEGRAL [III.55], or more generally to integration on a measure space. Finally, the signed definite integral generalizes to the integration of forms, which will be our focus here. While these three concepts are still related to each other, they are not as interchangeable as they are in the single-variable setting. The integration-offorms concept is of fundamental importance in differential topology, geometry, and physics, and also yields one of the most important examples of COHOMOL0GY [IV.6 §4], namely de Rham cohomology, which (roughly speaking) measures the extent to which the fundamental theorem of calculus fails in higher dimensions and on general manifolds.

To provide some motivation for the concept, let us informally revisit one of the basic applications of the signed definite integral from physics, namely computing the amount of work required to move a one-dimensional particle from point a to point b in the presence of an external field. (For example, one might be moving a charged particle in an electric field.) At the infinitesimal level, the amount of work required to move a particle from a point xiImage to a nearby point xi+1Image is (up to a small error) proportional to the displacement Δxi = xi+1 - xi, with the constant of proportionality f (xi) depending on the initial location xi of the particle. Thus, the total work required for this is approximately f (xi)Δxi. Note that we do not require xi+1 to be to the right of xi, so the displacement Δxi (or the infinitesimal work f (xi)Δxi) may well be negative. To return to the noninfinitesimal problem of computing the work required to move from a to b, we arbitrarily select a discrete path xo = a, x1, x2, . . . , xn = b from a to b, and approximate the work as

Image

Again, we do not require xi+1 to be to the right of xi; it is quite possible for the path to “backtrack” repeatedly: for instance, one might have xi < xi+1 > xi+2 for some i. However, it turns out that the effect of such backtracking eventually cancels itself out; regardless of what path we choose, the expression (3) above converges as the maximum step size tends to zero, and the limit is the signed definite integral

Image

provided only that the total length Image |Δxi| of the path (which controls the amount of backtracking involved) stays bounded. In particular, in the case when a = b, so that all paths are closed (i.e., xo = xn), we see that the signed definite integral is zero:

Image

From this informal definition of the signed definite integral it is obvious that we have the concatenation formula

Image

regardless of the relative position of the real numbers a, b, and c. In particular (setting a = c and using (5)) we conclude that

Image

Thus, if we reverse a path from a to b to form a path from b to a, then the sign of the integral changes. This contrasts with the unsigned definite integral[a, b] f (x) dx, since the set [a, b] of numbers between a and b is exactly the same as the set of numbers between b and a. Thus we see that paths are not quite the same as sets: they carry an orientation which can be reversed, whereas sets do not.

Now let us move from one-dimensional integration to higher-dimensional integration: that is, from single-variable calculus to several-variable calculus. It turns out that there are two objects whose dimensions may increase: the “ambient space,”1 which will now be Imagen instead of Image, and the path, which will now become an oriented k-dimensional manifold S, over which the integration will take place. For example, if n = 3 and k = 2, then one is integrating over a surface that lives in Image3.

Let us begin with the case n ≥ 1 and k = 1. Here, we will be integrating over a continuously differentiable path (or oriented rectifiable curve) γ in Imagen starting and ending at points a and b, respectively. (These points may or may not be distinct, depending on whether the path is open or closed.) From a physical point of view, we are still computing the work required to move from a to b, but now we are moving in several dimensions instead of one. In the one-dimensional case, we did not need to specify exactly which path we used to get from a to b, because all backtracking canceled itself out. However, in higher dimensions, the exact choice of the path γ becomes important.

Formally, a path from a to b can be described (or parametrized) as a continuously differentiable function γ from the unit interval [0,1] to Imagen such that γ(0) = a and γ(1) = b. For instance, the line segment from a to b can be parametrized as γ(t) = (1 - t)a + tb. This segment also has many other parametrizations, such as Image(t) = (1 - t2)a + t2b; however, as in the one-dimensional case, the exact choice of parametrization does not ultimately influence the integral. On the other hand, the reverse line segment (-γ) (t) = ta + (1- t) b from b to a is a genuinely different path; the integral along -γ will turn out to be the negative of the integral along γ.

As in the one-dimensional case, we will need to approximate the continuous path γ by a discrete path x0 = γ(t0), x1 = γ(t2, . . . , xn = γ(tn), where γ(t0) = a and γ(tn) = b. Again, we allow some backtracking: ti+1 is not necessarily larger than ti. The displacement Δxi = xi+1 - xiImagen from xi to xi+1 is now a vector rather than a scalar. (Indeed, with an eye on the generalization to manifolds, one should think of Δxi as an infinitesimal tangent vector to the ambient space Imagen at the point xi.) In the one-dimensional case, we converted the scalar displacement Δxi into a new number f (xi)Δxi, which was linearly related to the original displacement by a proportionality constant f (xi) that depended on the position xi. In higher dimensions, we again have a linear dependence, but this time, since the displacement is a vector, we must replace the simple constant of proportionality by a linear transformation ωxi from Imagen to Image. Thus, ωxi (Δxi) represents the infinitesimal “work” required to move from xi to xi+1. In technical terms, ωxi is a linear functional on the space of tangent vectors at xi, and is thus a cotangent vector at xi. By analogy with (3), the net work γ ω required to move from a to b along the path γ is approximated by

Image

As in the one-dimensional case, one can show that the right-hand side of (7) converges if the maximum step size sup0<i<n-1xi| of the path converges to zero and the total length Image |Δxi| of the path stays bounded. The limit is written as ∫γ ω. (Recall that we are restricting our attention to continuous functions. The existence of this limit uses the continuity of ω.)

The object ω, which continuously assigns2 a cotangent vector to each point in Imagen, is called a 1-form, and (7) leads to a recipe for integrating any 1-form ω on a path γ. That is, to shift the emphasis slightly, it allows us to integrate the path γ “against” the 1-form ω. Indeed, it is useful to think of this integration as a binary operation (similar in some ways to the dot product) that takes the curve γ and the form ω as inputs, and returns a scalar γ ω as output. There is in fact a “duality” between curves and forms; compare, for instance, the identity

Image

which expresses (part of) the fundamental fact that integration of forms is a linear operation, with the identity

Image

which generalizes (6) whenever the initial point of γ2 is the final point of γ1, where γ1 + γ2 is the concatenation of γ1 and γ2.3

Recall that if f is a differentiable function from Imagen to Image, then its derivative at a point x is a linear map from Imagen to Image (see [I.3 §5.3]). If f is continuously differentiable, then this linear map depends continuously on x, and can therefore be thought of as a 1-form, which we denote by d f, writing dfx for the derivative at x. This 1-form can be characterized as the unique 1-form such that one has the approximation

Image

for all infinitesimal v. (More rigorously, the condition is that|f(x+v) - f(v) -dfx(v)|/|v| → 0 as v → 0.)

The fundamental theorem of calculus (1) now generalizes to

Image

whenever γ is any oriented curve from a point a to a point b. In particular, if γ is closed, then ∫γ df = 0. Note that in order to interpret the left-hand side of the above equation, we are regarding it as a particular example of an integral of the form ∫γ ω: in this case, ω happens to be the form df. Note also that, with this interpretation, df has an independent meaning (it is a 1-form) even if it does not appear under an integral sign.

A 1-form whose integral against every sufficiently small4 closed curve vanishes is called closed, while a 1-form that can be written as df for some continuously differentiable function is called exact. Thus, the fundamental theorem implies that every exact form is closed. This turns out to be a general fact, valid for all manifolds. Is the converse true: that is, is every closed form exact? If the domain is a Euclidean space, or indeed any other simply connected manifold, then the answer is yes (this is a special case of the poincaré lemma), but it is not true for general domains. In modern terminology, this demonstrates that the de Rham cohomology of such domains can be nontrivial.

As we have just seen, a 1-form can be thought of as an object ω that associates with each path γ a scalar, which we denote by ∫γ ω. Of course, ω is not just any old function from paths to scalars: it must satisfy the concatenation and reversing rules discussed earlier, and this, together with our continuity assumptions, more or less forces it to be associated with some kind of continuously varying linear function that can be used, in combination with γ, to define an integral. Now let us see if we can generalize this basic idea from paths to k-dimensional sets with k > 1. For simplicity we shall stick to the two-dimensional case, that is, to integration of forms on (oriented) surfaces in Imagen, since this already illustrates many features of the general case.

Physically, such integrals arise when one is computing a flux of some field (e.g., a magnetic field) across a surface. We parametrized one-dimensional oriented curves as continuously differentiable functions γ from the interval [0,1] to Imagen. It is thus natural to parametrize two-dimensional oriented surfaces as continuously differentiable functions φ defined on the unit square [0, 1]2. This does not in fact cover all possible surfaces one wishes to integrate over, but it turns out that one can cut up more general surfaces into pieces that can be parametrized using “nice” domains such as [0,1]2.

In the one-dimensional case, we cut up the oriented interval [0,1] into infinitesimal oriented intervals from ti to ti+1 = ti = + Δt, which led to infinitesimal curves from xi = γ(ti) to xi+1 =γ(ti+1) =xixi. Note that Δxi and Δt are related by the approximation Δxi ≈ γ (titi. In the two-dimensional case, we will cut up the unit square [0,112 into infinitesimal squares in an obvious way.5 A typical one of these will have corners of the form (t1, t2), (t1 + Δt, t2), (t1, t2 + Δt), (t1 + Δt, t2 + Δt). The surface described by φ can then be partitioned into regions, with corners φ(t,1, t2), φ(t1t, t2, Δ(t1,t2t), Δ(t1t, t2t), each of which carries an orientation. Since φ is differentiable, it is approximately linear at small distance scales, so this region is approximately an oriented parallelogram in Imagen with corners x, x + Δlx, x + Δ2x, x + Δlx +Δ2x, where x = φ(t1, t2) and Δ1x and Δ2x are the infinitesimal vectors

Image

Let us refer to this object as the infinitesimal parallelogram with dimensions Δlx ∧ Δ2x and base point x. For now, we will think of the symbol “∧” as a mere notational convenience and not try to interpret it. In order to integrate in a manner analogous with integration on curves, we now need some sort of functional ωx at this base point that depends continuously on x. This functional should take the above infinitesimal parallelogram and return an infinitesimal number ωx1x ∧ Δ2x), which one can think of as the amount of “flux” passing through this parallelogram.

As in the one-dimensional case, we expect ωx to have certain properties. For instance, if you double Δlx, you double one of the sides of the infinitesimal parallelogram, so (by the continuity of ω) the “flux” passing through the parallelogram should double. More generally, ω1 x ∧ Δ2x) should depend linearly on each of Δlx and Δ2x: in other words, it is bilinear. (This generalizes the linear dependence in the one-dimensional case.)

Another important property is that

Image

That is, the bilinear form ωx is antisymmetric. Again, this has an intuitive explanation: the parallelogram represented by Δ2x ∧ Δlx is the same as that represented by Δlx ∧ Δ2x except that it has had its orientation reversed, so the “flux” now counts negatively where it used to count positively, and vice versa. Another way of seeing this is to note that if Δlx = Δ2x, then the parallelogram is degenerate and there should be no flux.

Antisymmetry follows from this and the bilinearity. A 2-form ω is a continuous assignment of a functional ωx with these properties to each point x.

If ω is a 2-form and φ : [0, 112 →, Imagen is a continuously differentiable function, we can now define the integral ∫φ ω of ω “against” φ (or, more precisely, the integral against the image under φ of the oriented square [0,1]2) by the approximation

Image

where the image of φ is (approximately) partitioned into parallelograms of dimensions Δx1, i ∧ Δx2, i based at points xi. We do not need to decide what order these parallelograms should be arranged in, because addition is both commutative and associative. One can show that the right-hand side of (10) converges to a unique limit as one makes the partition of parallelograms “increasingly fine,” though we will not make this precise here.

We have thus shown how to integrate 2-forms against oriented two-dimensional surfaces. More generally, one can define the concept of a k-form on an n-dimensional manifold (such as Imagen) for any 0 ≤ k ≤ n and integrate this against an oriented k-dimensional surface in that manifold. For instance, a 0-form on a manifold X is the same thing as a scalar function f : X → Image, whose integral on a positively oriented point x (which is zero dimensional) is - f(x), and on a negatively oriented point x is - f(x). A k-form tells us how to assign a value to an infinitesimal k-dimensional parallelepiped with dimensions Δxl ∧. . .∧Δxk, and hence to a portion of k-dimensional “surface,” in much the same way as we have seen when k = 2. By convention, if k ≠ k’, the integral of a k-dimensional form on a k’-dimensional surface is understood to be zero. We refer to 0-forms, 1-forms, 2-forms, etc. (and formal sums and differences thereof), collectively as differential forms.

There are three fundamental operations that one can perform on scalar functions: addition (f, g) → f + g, pointwise product (f, g) → fg, and differentiation f →df, although the last of these is not especially useful unless f is continuously differentiable. These operations have various relationships with each other. For instance, the product is distributive over addition,

f(g + h) = fg + fh,

and differentiation is a derivation with respect to the product:

d(fg) = (df)g + f(dg).

It turns out that one can generalize all three of these operations to differential forms. Adding a pair of forms is easy: if ω and η are two k-forms and Image : [0, 1]kImagen is a continuously differentiable function, then ∫φ (ω + η) is defined to be ∫φ ω + ∫ φ η. One multiplies forms using the so-called wedge product. If ω is a k-form and η is an l-form, then ω ∧ η is a (k + l)-form. Roughly speaking, given a (k +1)-dimensional infinitesimal parallelepiped with base point x and dimensions Δx1 ∧ . . . ∧ Δxk+l, one evaluates ω and η at the parallelepipeds with base point x and dimensions Δx1 ∧ . . . ∧ Δxk+l, respectively, and multiplies the results together.

As for differentiation, if ω is a continuously differentiable k-form, then its derivative dω is a (k +1)-form that measures something like the “rate of change” of ω. To see what this might mean, and in particular to see why dω is a (k +1)-form, let us think how we might answer a question of the following kind. We are given a spherical surface in Image3 and a flow, and we would like to know the net flux out of the surface: that is, the difference between the amount of flux coming out and the amount going in. One way to do this would be to approximate the surface of the sphere by a union of tiny parallelograms, to measure the flux through each one, and to take the sum of all these fluxes. Another would be to approximate the solid sphere by a union of tiny parallelepipeds, to measure the net flux out of each of these, and to add up the results. If a parallelepiped is small enough, then we can closely approximate the net flux out of it by looking at the difference, for each pair of opposite faces, between the amount coming out of the parallelepiped through one and the amount going into it through the other, and this will depend on the rate of change of the 2-form.

The process of summing up the net fluxes out of the parallelepipeds is more rigorously described as integrating a 3-form over the solid sphere. In this way, one can see that it is natural to expect that information about how a 2-form varies should be encapsulated in a 3-form.

The exact construction of these operations requires a little bit of algebra and is omitted here. However, we remark that they obey similar laws to their scalar counterparts, except that there are some sign changes that are ultimately due to the antisymmetry (9). For instance, if ω is a k-form and η is an l-form, the commutative law for multiplication becomes

ωη = (-1)kl ηω,

basically because kl swaps are needed to interchange k dimensions with 1 dimensions; and the derivation rule for differentiation becomes

d(ω ∧ η) = (dω) ∧ η + (-1)kω ∧ (dη).

Another rule is that the differentiation operator d is nilpotent:

Image

This may seem rather unintuitive, but it is fundamentally important. To see why it might be expected, let us think about differentiating a 1-form twice. The original 1-form associates a scalar with each small line segment. Its derivative is a 2-form that associates a scalar with each small parallelogram. This scalar essentially measures the sum of the scalars given by the 1-form as you go around the four edges of the parallelogram, though to get a sensible answer when you pass to the limit you have to divide by the area of the parallelogram. If we now repeat the process, we are looking at a sum of the six scalars associated with the six faces of a parallelepiped. But each of these scalars in turn comes from a sum of the scalars associated with the four directed edges around the corresponding face, and each edge is therefore counted twice (as it belongs to two faces), once in each direction. Therefore, the contributions from each edge cancel and the sum of all contributions is zero.

The description given earlier of the relationship between integrating a 2-form over the surface of a sphere and integrating its derivative over the solid sphere can be thought of as a generalization of the fundamental theorem of calculus, and can itself be generalized considerably: Stokes’s theorem is the assertion that

Image

for any oriented manifold S and form ω, where ∂ S is the oriented boundary of S (which we will not define here). Indeed one can view this theorem as a definition of the derivative operation ω → dω; thus, differentiation is the adjoint of the boundary operation. (For instance, the identity (11) is dual to the geometric observation that the boundary ∂s of an oriented manifold itself has no boundary: ∂(∂S) = ∅.) As a particular case of Stokes’s theorem, we see that ∫s dω = 0 whenever S is a closed manifold, i.e., one with no boundary. This observation lets one extend the notions of closed and exact forms to general differential forms, which (together with (11)) allows one to fully set up de Rham cohomology.

We have already seen that 0-forms can be identified with scalar functions. Also, in Euclidean spaces one can use the inner product to identify linear functionals with vectors, and therefore 1-forms can be identified with vector fields. In the special (but very physical) case of three-dimensional Euclidean space Image3, 2-forms can also be identified with vector fields via the famous right-hand rule,6 and 3-forms can be identified with scalar functions by a variant of this rule. (This is an example of a concept known as Hodge duality.) In this case, the differentiation operation ω → dω can be identified with the gradient operation ∫ → ∇ ∫ when ω is a 0-form, with the curl operation x → ∇ × x when ω is a 1-form, and with the divergence operation x → ∇. x when ω is a 2-form. Thus, for instance, the rule (11) implies that ∇ × ∫ = 0 and ∇ (∇ × x) for any suitably smooth scalar function f and vector field x, while various cases of Stokes’s theorem (12), with this interpretation, become the various theorems about integrals of curves and surfaces in three dimensions that you may have seen referred to as “the divergence theorem,” “Green’s theorem,” and “Stokes’s theorem” in a course on several-variable calculus.

Just as the signed definite integral is connected to the unsigned definite integral in one dimension via (2), there is a connection between integration of differential forms and the Lebesgue (or Riemann) integral. On the Euclidean space Imagen one has the n standard coordinate functions x1, x2, . . . , xn : ImagenImage. Their derivatives dx1, . . . , dxn are then 1-forms on Imagen. Taking their wedge product, one obtains an n-form dx1 ∧ . . . ∧ dxn. We can multiply this by any (continuous) scalar function : ImagenImage to obtain another n- form f (x) dxl ∧ . . . ∧ dxn. If Ω is any open bounded domain in Imagen, we then have the identity

Ω f(x) dx1 ∧ . . . ∧ dxn = ∫ Ω f(x) dx,

where on the left-hand side we have an integral of a differential form (with Ω viewed as a positively oriented n-dimensional manifold) and on the right-hand side we have the Riemann or Lebesgue integral of f on Ω. If we give Ω the negative orientation, we have to reverse the sign of the left-hand side. This correspondence generalizes (2).

There is one last operation on forms that is worth pointing out. Suppose we have a continuously differentiable map Φ : x → y from one manifold to another (we allow x and y to have different dimensions). Then of course every point x in X pushes forward to a point Φ(x) in y. Similarly, if we let v ∈ TxX be an infinitesimal tangent vector to X based at x, then this tangent vector also pushes forward to a tangent vector * V ∈ TΦ(X) Y based at Φ(x); informally speaking,Φ* v can be defined by requiring the infinitesimal approximation Φ(x+ v) = Φ(x) + Φ* v. One can write Φ*v = DΦ(x)(v), where DΦ : TXX → TΦ(X) Y is the derivative of the several-variable map Φ at x. Finally, any k-dimensional oriented manifold S in X also pushes forward to a k-dimensional oriented manifold Φ(S) in X, although in some cases (e.g., if the image of Φ has dimension less than k) this pushed-forward manifold may be degenerate.

We have seen that integration is a duality pairing between manifolds and forms. Since manifolds push forward under Φ from X to Y, we expect forms to pull back from y to X. Indeed, given any k-form ω on Y, we can define the pullback Φ* ω as the unique k-form on X such that we have the change-of-variables formula

Φ(s) ω = ∫s Φ*(ω).

In the case of 0-forms (i.e., scalar functions), the pullback Φ* f : x → Image of a scalar function f: y → Image is given explicitly by Φ* f (x) = f (Φ(x)), while the pullback of a 1-form ω is given explicitly by the formula

* ω) x(v) = ωΦ(x)*v).

Similar definitions can be given for other differential forms. The pullback operation enjoys several nice properties: for instance, it respects the wedge product,

Φ*(ω ∧ η) = (Φ*ω) ∧ (Φ*η),

and the derivative,

d(Φ*ω) = Φ*(dω).

By using these properties, one can recover rather painlessly the change-of-variables formulas in several-variable calculus. Moreover, the whole theory carries over effortlessly from Euclidean spaces to other manifolds. It is because of this that the theory of differential forms and integration is an indispensable tool in the modern study of manifolds, and especially in DIFFERENTIAL TOPOLOGY [IV.7].

1. We will start with integration on Euclidean spaces Imagen for simplicity, although the true power of the integration-of-forms concept is much more apparent when we integrate on more general spaces, such as abstract n-dimensional manifolds.

2. More precisely, one can think of ω as a section of the cotangent bundle.

3. This duality is best understood using the abstract, and much more general, formalism of homology and cohomology. In particular, one can remove the requirement that γ2 begins where γ1 leaves off by generalizing the notion of an integral to cover not just integration on paths, but also integration on formal sums or differences of paths. This makes the duality between curves and forms more symmetric.

4. The precise condition needed is that the curve should be contractible, which means that it can be continuously shrunk down to a point.

5. One could also use infinitesimal oriented rectangles, parallelograms, triangles, etc.; this leads to an equivalent concept of the integral.

6. This is an entirely arbitrary convention; one could just as easily have used the left-hand rule to provide this identification, and apart from some harmless sign changes here and there, one gets essentially the same theory as a consequence.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset