In mathematics, the covariant derivative is a way of specifying a derivative along tangent vectors of a manifold. Alternatively, the covariant derivative is a way of introducing and working with a connection on a manifold by means of a differential
operator, to be contrasted with the approach given by a principal
connection on the frame bundle — see Affine connection.
This article presents a traditional introduction, using a coordinate system, to the
covariant derivative of a vector field with respect to a vector. The covariant derivative
of a tensor field is presented as an extension of the same concept. The covariant
derivative generalizes straightforwardly to a notion of differentiation associated to a connection on a vector bundle, also known as a Koszul connection.
Introduction and history
Historically, at the turn of the 20th century, the covariant derivative was introduced
by Gregorio Ricci-Curbastro and Tullio
Levi-Civita[1] in the theory of Riemannian and pseudo-Riemannian geometry. Ricci
and Levi-Civita (following ideas of Elwin Bruno Christoffel) observed that the
Christoffel symbols used to define the curvature[2][3], could also provide a notion of differentiation which generalized the classical directional
derivative of vector fields on a manifold. This new derivative -- the
Levi-Civita connection -- was covariant in the sense that it satisfied Riemann's requirement that objects
in geometry should be independent of their description in a particular coordinate system.
It was soon noted by other mathematicians, prominent among these being Hermann Weyl,
Jan Arnoldus Schouten, and Elie
Cartan[4], that a covariant derivative could be
defined abstractly without the presence of a metric. The crucial feature was not a particular dependence on the metric, but that
the Christoffel symbols satisfied a certain precise second order transformation law. This transformation law could serve as a
starting point for defining the derivative in a covariant manner. Thus the theory of covariant differentiation forked away from
the strictly Riemannian context to include a wider range of possible geometries.
In the 1940s, practitioners of differential geometry began introducing other notions of covariant differentiation in
general vector bundles which were, in contrast to the classical bundles of interest to
geometers, not part of the tensor analysis of the manifold. By and large, these generalized
covariant derivatives had to be specified ad hoc by some version of the connection concept. In 1950, Jean-Louis Koszul unified these new ideas of covariant differentiation in a vector bundle by means of
what is known today as a Koszul connection [5] or a connection on a vector bundle. Using ideas from Lie algebra cohomology, Koszul successfully converted many of the analytic features of covariant
differentiation into algebraic ones. In particular, Koszul connections eliminated the need for awkward manipulations of
Christoffel symbols (and other analogous non-tensorial) objects in differential geometry. Thus they quickly supplanted the classical notion of covariant
derivative in many post-1950 treatments of the subject.
Motivation
The covariant derivative is a generalization of the directional
derivative from vector calculus. As with the directional derivative, the
covariant derivative is a rule
which takes as its inputs: (1) a vector u defined at a point P, and (2) a
vector field v defined in a neighborhood of P[6]. The output is then a vector
, also at the point
P. The primary difference with the usual directional derivative is that
must, in a certain precise
sense, be independent of the manner in which it is expressed in a coordinate
system.
A vector may be described as a list of numbers in terms of a basis, but
as a geometrical object a vector retains its own identity regardless of how one chooses to describe it in a basis. This
persistence of identity is reflected in the fact that when a vector is written in one basis, and then the basis is changed, the
vector transforms according to a change of basis formula. Such a transformation law is
known as a covariant transformation. The covariant derivative is required to
transform, under a change in coordinates, in the same way as a vector does: the covariant derivative must change by a covariant
transformation (hence the name).
In the case of Euclidean space, one tends to define the derivative of a vector field
in terms of the difference between two vectors at two nearby points. In such a system one translates one of the vectors to the origin of the other, keeping it parallel. With a Cartesian
(fixed orthonormal) coordinate system we thus obtain the simplest example: covariant
derivative which is obtained by taking the derivative of the components.
In the general case, however, one must take into account the change of the coordinate system. For example, if the same
covariant derivative is written in polar coordinates in a two dimensional
Euclidean plane, then it contains extra terms that describe how the coordinate grid itself "rotates". In other cases the extra
terms describe how the coordinate grid expands, contracts, twists, interweaves, etc.
Consider the example of moving along a curve γ(t) in the Euclidean plane. In polar coordinates, γ may be written in
terms of its radial and angular coordinates by γ(t) = (r(t), θ(t)). A vector at a particular time
t[7] (for instance, the acceleration of the curve)
is expressed in terms of
, where
and
are unit tangent vectors for the
polar coordinates, serving as a basis to decompose a vector in terms of radial and tangential components. At a slightly later time, the new basis in polar coordinates
appears slightly rotated with respect to the first set. The covariant derivative of the basis vectors (the Christoffel symbols) serve to express this change.
In a curved space, such as the surface of the Earth (regarded as a sphere), the translation is not well defined and its analog, parallel
transport, depends on the path along which the vector is translated.
A vector e on a globe on the equator in Q is directed to the north. Suppose we parallel transport the vector first along the equator until P and then (keeping it parallel to
itself) drag it along a meridian to the pole N and (keeping the direction there) subsequently transport it along another meridian
back to Q. Then we notice that the parallel-transported vector along a closed circuit does not return as the same vector;
instead, it has another orientation. This would not happen in Euclidean space and is caused by the curvature of the
surface of the globe. The same effect can be noticed if we drag the vector along an infinitesimally small closed surface
subsequently along two directions and then back. The infinitesimal change of the vector is a measure of the curvature.
Remarks
- The definition of the covariant derivative does not use the metric in space. However, a given metric uniquely defines a
special covariant derivative called the Levi-Civita connection.
- The properties of a derivative imply that
depends on the
surrounding of point p in the same way as e.g. the derivative of a scalar function along a curve at a given point p
depends on the surroundings of p.
- The covariant derivative can be described by a "tensor" in a fixed coordinate chart, but it
is not a true tensor in the sense that it is not invariant under coordinate changes.
- The information on the surroundings of a point p in the covariant derivative can be used to define parallel transport of a vector. Also the curvature, torsion and geodesics may be defined only in terms of the covariant derivative or other related variation on the idea of a
linear connection.
Formal definition
A covariant derivative is a (Koszul) connection on the tangent bundle and other tensor bundles. Thus it has a certain
behavior on functions, on vector fields, on the duals of vector fields (i.e., covector
fields), and most generally of all, on arbitrary tensor fields.
Functions
Given a function f, the covariant derivative
coincides with the normal
differentiation of a real function in the direction of the vector v, usually denoted by
and by
.
Vector fields
A covariant derivative
of a vector field
in the direction of the vector
denoted
is defined by the
following properties for any vector v, vector fields u, w and scalar functions f and g:
is algebraically linear in
so 
is additive in
so

obeys the product rule, i.e.
where
is defined above.
Note that
at point p depends on the value of v at p and on values of u in a
neighbourhood of p because of the last property, the product rule.
Covector fields
Given a field of covectors (or one-form)
α, its covariant derivative
can be defined using the
following identity which is satisfied for all vector fields u

The covariant derivative of a covector field along a vector field v is again a covector field.
Tensor fields
Once the covariant derivative is defined for fields of vectors and covectors it can be defined for arbitrary tensor fields using the following identities where
and ψ are any two
tensors:

and if
and ψ are tensor fields of the same tensor bundle then

The covariant derivative of a tensor field along a vector field v is again a tensor field of the same type.
Coordinate description
Given coordinate functions
, any tangent vector can be described by its components in the basis
.
The covariant derivative of a basis vector along a basis vector is again a vector and so can be expressed as a linear combination
Γkek. To specify the covariant derivative it is enough to specify the covariant derivative of
each basis vector field ej along ei.

the coefficients Γki j are called Christoffel
symbols. Then using the rules in the definition, we find that for general vector fields
and
we get

the first term in this formula is responsible for "twisting" the coordinate system with respect to the covariant derivative
and the second for changes of components of the vector field u. In particular

In words: the covariant derivative is the normal derivative along the coordinates with correction terms which tell how the
coordinates change.
The covariant derivative of a type (r,s) tensor field along ec is given
by the expression:


Or, in words: take the partial derivative of the tensor and add: a
for every upper index ai, and a
for every lower index bi.
If instead of a tensor, one is trying to differentiate a tensor density (of
weight +1), then you also add a term

If it is a tensor density of weight W, then multiply that term by W. For example,
is a scalar density (of weight +1), so we
get:

where semicolon ";" indicates covariant differentiation and comma "," indicates partial differentiation. By the way, this
particular expression is equal to zero, because the covariant derivative of a function solely of the metric is always zero.
Notation
In textbooks on physics, the covariant derivative is sometimes simply stated in terms of its components in this equation.
Often a notation is used in which the covariant derivative is given with a semicolon, while
a normal partial derivative is indicated by a comma.
In this notation we write the same as:
- Failed to parse (unknown function\stackrel): \nabla_{e_j} {\mathbf v} \ \stackrel{\mathrm{def}}{=}\ v^s
{}_{;j}e_s \;\;\;\;\;\; v^i {}_{;j} = v^i {}_{,j} + v^k\Gamma^i {}_{k j}
Once again this shows that the covariant derivative of a vector field is not just simply obtained by differentiating to the
coordinates vi,j, but also depends on the vector v
itself through vkΓikj.
In some older texts (notably Adler, Bazin & Schiffer, Introduction to General Relativity), the covariant derivative
is denoted by a double pipe:
- Failed to parse (unknown function\stackrel): \nabla_j {\mathbf v} \ \stackrel{\mathrm{def}}{=}\ v^i
{}_{||j} \;\;\;\;\;\;
Derivative along curve
Since the covariant derivative
of a tensor field T at a point p depends
only on value of the vector field X at p one can define
the covariant derivative along a smooth curve γ(t) in a manifold:

Note that the tensor field T only needs to be defined on the curve γ(t) for this definition to make sense.
In particular,
is a vector field along the curve γ itself. If
vanishes then the
curve is called a geodesic of the covariant derivative. If the covariant derivative is the Levi-Civita connection of a certain metric then the geodesics for the connection are precisely
the geodesics of the metric that are parametrised by arc
length.
The derivative along a curve is also used to define the parallel transport along
the curve.
Sometimes the covariant derivative along a curve is called absolute or intrinsic derivative.
Relation to Lie derivative
A covariant derivative introduces an extra geometric structure on a manifold which allows vectors in neighboring tangent
spaces to be compared. This extra structure is necessary because there is no canonical way to compare vectors from different
vector spaces, as is necessary for this generalization of the directional
derivative. There is however another generalization of directional derivatives which is canonical: the
Lie derivative. The Lie derivative evaluates the change of one vector field along the
flow of another vector field. Thus, one must know both vector fields in an open neighborhood. The covariant derivative on the
other hand introduces its own change for vectors in a given direction, and it only depends on the vector direction at a single
point, rather than a vector field in an open neighborhood of a point. In other words, the covariant derivative is linear (over
C∞(M)) in the direction argument, while the Lie derivative is linear in neither argument.
Note that the antisymmetrized covariant derivative ∇uv - ∇vu, and the Lie derivative
Luv differ by the torsion of the connection, so that if a
connection is symmetric, then its antisymmetrization is the Lie derivative.
Notes
- ^ Levi-Civita, T. and Ricci, G. "Méthodes de calcul différential absolu et
leurs applications", Math. Ann. B, 54 (1900) 125-201.
- ^ Riemann, G.F.B., "Über die
Hypothesen, welche der Geomtrie zu Grunde liegen", Gesammelte Mathematische Werke (1866); reprint, ed. Weber, H.: Dover,
New York, 1953.
- ^ Christoffel, E.B., "Über die Transformation der homogenen
Differentialausdrücke zweiten Grades," J. für die Reine und Angew. Math. 70 (1869), 46-70.
- ^ cf. with Cartan, E. "Sur les variétés à connexion affine
et la theorie de la relativité généralisée", Annales, Ecole Normale 40 (1923), 325-412.
- ^ Koszul, J. L. "Homologie et cohomologie des algebres de Lie", Bulletin
de la Société Mathématique 78 (1950) 65-127.
- ^ The covariant derivative is also denoted variously by ∂vu, Dvu, or other notations.
- ^ In many applications, it may be better not to think of t as
corresponding to time, at least for applications in general relativity. It is simply
regarded as an abstract parameter varying smoothly and monotonically along the path.
See also
This entry is from Wikipedia, the leading user-contributed encyclopedia. It may not have been reviewed by professional editors (see full disclaimer)