Jump to content

Intermediate treatment of tensors

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by LokiClock (talk | contribs) at 21:53, 29 December 2009. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

 Note
The following is an intermediate-level treatment of tensors. Read the article Tensor for a simple description of tensors, or see Tensor (intrinsic definition) for a more abstract treatment. For an even more traditional approach, see classical treatment of tensors.
Note that the word "tensor" is often used as a shorthand for "tensor field", a concept which defines a tensor value at every point in a manifold. To understand tensor fields, you need to first understand tensors.

In mathematics and physics, a tensor is an idealized geometric or physical quantity whose numerical description, relative to a particular frame of reference, consists of a multiple indexed array of numbers. A vector, for example, is a tensor with a single index; thus, tensors can be regarded as a multi-index generalization of the vector concept. Just as it is with vectors, a change of reference frame induces a transformation of the components.

This way of viewing tensors, called tensor analysis, was used by Einstein and is generally preferred by physicists[citation needed]. It is, very grossly, a generalization of the concept of vectors, matrices, and linear transformations, and allows the writing of equations independently of any given coordinate system.

Overview

Tensor quantities may be categorized by considering the number of indices inherent in their description. The scalar quantities are those that can be represented by a single number (indices are not needed) —speed, mass, temperature, for example. There are also vector-like quantities such as force that require a list of numbers for their description (one index is required , where i = x, y, or z, so that direction can be accounted for). Finally, quantities such as quadratic forms naturally require a multiply-indexed array for their representation. These latter quantities can only be conceived of as tensors. Some well known examples of tensors in geometry are quadratic forms, and the curvature tensor. Examples of physical tensors are the energy-momentum tensor and the polarization tensor.

Actually, the tensor notion is quite general and applies to all of the above examples; scalars and vectors are special kinds of tensors. The feature that distinguishes a scalar from a vector, and distinguishes both of those from a more general tensor quantity is the number of indices in the representing array. This number is called the rank (or the order) of a tensor. Thus, scalars are rank zero tensors (with no indices at all) and vectors are rank one tensors.

It should be noted that the array-of-numbers representation of a tensor is not the same thing as the tensor. A given numerical representation of a tensor only makes sense in a particular coordinate system, and the array of numbers are called the components of the tensor in a chosen basis.

It is also necessary to distinguish between two types of indices, depending on whether the corresponding numbers transform covariantly or contravariantly relative to a change in the frame of reference. Contravariant indices are written as superscripts, while the covariant indices are written as subscripts. The type (or valence) of a tensor is the pair , where is the number of contravariant and the number of covariant indices, respectively. Note that a tensor of type has a rank of p + q. It is customary to represent the actual tensor, as a standalone entity, by a bold-face symbol such as . The corresponding array of numbers for a type tensor is denoted by the symbol where the superscripts and subscripts are indices that vary from to . The number , the range of the indices, is called the dimension of the tensor; the total number of degrees of freedom required for the specification of a particular tensor is the dimension of the tensor raised to the power of the tensor's rank.

Again, it must be emphasized that the tensor and the representing array are not the same thing. The values of the representing array are given relative to some frame of reference, and undergo a linear transformation when the frame is changed.

Finally, it must be mentioned that most physical and geometric applications are concerned with tensor fields, that is to say tensor valued functions, rather than tensors themselves. Some care is required, because it is common to see a tensor field called simply a tensor. There is a difference, however; the entries of a tensor array are numbers, whereas the entries of a tensor field are functions. The present entry treats the purely algebraic aspect of tensors. Tensor field concepts, which typically involve derivatives of some kind, are discussed elsewhere.

Definition

The formal definition of a tensor quantity begins with a finite-dimensional vector space , which furnishes the uniform "building blocks" for tensors of all valences. In typical applications, is the tangent space at a point of a manifold; the elements of typically represent physical quantities such as velocities or forces. The space of -valent tensors, denoted here by is obtained by taking the tensor product of copies of and copies of the dual vector space . To wit,

In order to represent a tensor by a concrete array of numbers, we require a frame of reference, which is essentially a basis of , say Every vector in can be "measured" relative to this basis, meaning that for every there exist unique scalars , such that (note the use of the Einstein notation)

These scalars are called the components of relative to the frame in question.

Let be the corresponding dual basis, i.e.,

where the latter is the Kronecker delta array. For every covector there exists a unique array of components such that

More generally, every tensor has a unique representation in terms of components. That is to say, there exists a unique array of scalars such that

Transformation rules

Next, suppose that a change is made to a different frame of reference, say Any two frames are uniquely related by an invertible transition matrix , having the property that for all values of we have the frame transformation rule

Let be a vector, and let and denote the corresponding component arrays relative to the two frames. From

and from the frame transformation rule we infer the vector transformation rule

where is the matrix inverse of , i.e.,

Thus, the transformation rule for a vector's components is contravariant to the transformation rule for the frame of reference. It is for this reason that the superscript indices of a vector are called contravariant.

To establish the transformation rule for covectors, we note that the transformation rule for the dual basis takes the form

and that

while

The transformation rule for covector components is covariant. Let be a given covector, and let and be the corresponding component arrays. Then

The above relation is easily established. We need only remark that

and that

and then use the transformation rule for the frame of reference.

In light of the above discussion, we see that the transformation rule for a general type tensor takes the form

See also

Further reading

  • Bernard Schutz, Geometrical methods of mathematical physics, Cambridge University Press, 1980.
  • Schaum's Outline of Tensor Calculus
  • Synge and Schild, Tensor Calculus, Toronto Press: Toronto, 1949
  • James Munkres, Analysis on Manifolds, Westview Press, 1991. Chapter six gives a "from scratch" introduction to covariant tensors.

An earlier version of this article was adapted from the GFDL article on tensors at http://planetmath.org/encyclopedia/Tensor.html from PlanetMath, written by Robert Milson and others