Gauge covariant derivative

From HandWiki
Short description: Derivative used in gauge theories

In physics, the gauge covariant derivative is a means of expressing how fields vary from place to place, in a way that respects how the coordinate systems used to describe a physical phenomenon can themselves change from place to place. The gauge covariant derivative is used in many areas of physics, including quantum field theory, fluid dynamics and, in a very special way, general relativity.

If a physical theory is independent of the choice of local frames, the group of local frame changes, the gauge transformations, acts on the fields in the theory while leaving the physical content of the theory unchanged. Ordinary differentiation of field components is not invariant under such gauge transformations, because the components depend on the local frame. However, when gauge transformations act on fields and the gauge covariant derivative simultaneously, they preserve properties of theories that do not depend on frame choice and hence are valid descriptions of physics. Like the covariant derivative used in general relativity (which is a special case), the gauge covariant derivative is an expression for a connection in local coordinates after choosing a frame for the fields involved, often in the form of index notation.

Overview

There are many ways to understand the gauge covariant derivative. The approach taken in this article is based on the historically traditional notation used in many physics textbooks.[1][2][3] Another approach is to understand the gauge covariant derivative as a kind of connection, and more specifically, an affine connection.[4][5][6] The affine connection is interesting because it does not require any concept of a metric tensor to be defined; the curvature of an affine connection can be understood as the field strength of the gauge potential. When a metric is available, then one can go in a different direction, and define a connection on a frame bundle. This path leads directly to general relativity; however, it requires a metric, which particle physics gauge theories do not have.

Rather than being generalizations of one another, affine and metric geometry go off in different directions: the gauge group of (pseudo-)Riemannian geometry must be the indefinite orthogonal group O(s,r) in general, or the Lorentz group O(3,1) for space-time. This is because the fibers of the frame bundle must necessarily, by definition, connect the tangent and cotangent spaces of space-time.[7] In contrast, the gauge groups employed in particle physics could in principle be any Lie group at all, although in practice the Standard Model only uses U(1), SU(2) and SU(3). Note that Lie groups do not come equipped with a metric.

A yet more complicated, yet more accurate and geometrically enlightening, approach is to understand that the gauge covariant derivative is (exactly) the same thing as the exterior covariant derivative on a section of an associated bundle for the principal fiber bundle of the gauge theory;[8] and, for the case of spinors, the associated bundle would be a spin bundle of the spin structure.[9] Although conceptually the same, this approach uses a very different set of notation, and requires a far more advanced background in multiple areas of differential geometry.

The final step in the geometrization of gauge invariance is to recognize that, in quantum theory, one needs only to compare neighboring fibers of the principal fiber bundle, and that the fibers themselves provide a superfluous extra description. This leads to the idea of modding out the gauge group to obtain the gauge groupoid as the closest description of the gauge connection in quantum field theory.[6][10]

For ordinary Lie algebras, the gauge covariant derivative on the space symmetries (those of the pseudo-Riemannian manifold and general relativity) cannot be intertwined with the internal gauge symmetries; that is, metric geometry and affine geometry are necessarily distinct mathematical subjects: this is the content of the Coleman–Mandula theorem. However, a premise of this theorem is violated by the Lie superalgebras (which are not Lie algebras!) thus offering hope that a single unified symmetry can describe both spatial and internal symmetries: this is the foundation of supersymmetry.

The more mathematical approach uses an index-free notation, emphasizing the geometric and algebraic structure of the gauge theory and its relationship to Lie algebras and Riemannian manifolds; for example, treating gauge covariance as equivariance on fibers of a fiber bundle. The index notation used in physics is far more convenient for practical calculations, although it makes the overall geometric structure of the theory more opaque.[7] The physics approach also has a pedagogical advantage: the general structure of a gauge theory can be exposed after a minimal background in multivariate calculus, whereas the geometric approach requires a large investment of time in the general theory of differential geometry, Riemannian manifolds, Lie algebras, representations of Lie algebras and principal bundles before a general understanding can be developed. In more advanced discussions, both notations are commonly intermixed.

This article follows the notation and language commonly employed in the physics curriculum, touching only briefly on the more abstract connections.

Motivation of the covariant derivative through gauge covariance requirement

Consider a generic (possibly non-Abelian) gauge transformation acting on an $n$-component field $\phi = (\phi^a)_{a=1..n}$. The main examples in field theory have a compact gauge group and we write the symmetry operator as $U(x) = e^{i\alpha(x)}$, where $\alpha(x)$ is an element of the Lie algebra associated with the Lie group of symmetry transformations, and can be expressed in terms of the Hermitian generators of the Lie algebra (i.e. up to a factor $i$, the infinitesimal generators of the gauge group), $\{t_K\}_{K \in \mathcal{K}}$, as $\alpha(x) = \alpha^K(x) t_K$.

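It acts on the field $\phi(x)$ as

$$\phi(x) \to \phi'(x) = U(x)\phi(x) \equiv e^{i\alpha(x)}\phi(x),$$
$$\phi^\dagger(x) \to \phi'^\dagger(x) \equiv \phi^\dagger(x)U^\dagger(x) = \phi^\dagger(x)e^{-i\alpha(x)}, \qquad U^\dagger = U^{-1}.$$

Now the partial derivative $\partial_\mu$ transforms, accordingly, as

$$\partial_\mu\phi(x) \to \partial_\mu\phi'(x) = U(x)\partial_\mu\phi(x) + (\partial_\mu U)\phi(x) \equiv e^{i\alpha(x)}\partial_\mu\phi(x) + i(\partial_\mu\alpha)e^{i\alpha(x)}\phi(x).$$

Therefore, a kinetic term of the form $\phi^\dagger\partial_\mu\phi$ in a Lagrangian is not invariant under gauge transformations.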
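The failure of invariance can be checked numerically in the simplest case. The following is a minimal sketch, assuming a one-component field on a single coordinate and a U(1) transformation; the functions `phi` and `alpha` are hypothetical test choices, not taken from any reference. It compares the bilinear before and after the transformation with the predicted extra term $i(\partial\alpha)|\phi|^2$.

```python
import cmath

def phi(x):
    # Hypothetical one-component test field (illustrative choice only).
    return (1.0 + 0.5 * x) * cmath.exp(0.7j * x)

def alpha(x):
    # Hypothetical local gauge parameter alpha(x).
    return 0.3 * x * x

def phi_prime(x):
    # Gauge-transformed field phi'(x) = exp(i alpha(x)) phi(x).
    return cmath.exp(1j * alpha(x)) * phi(x)

def d(f, x, h=1e-5):
    # Central finite difference approximating the ordinary derivative.
    return (f(x + h) - f(x - h)) / (2 * h)

def kinetic(f, x):
    # The bilinear conj(f) * df/dx evaluated at x.
    return f(x).conjugate() * d(f, x)

x0 = 1.2
before = kinetic(phi, x0)
after = kinetic(phi_prime, x0)
# Extra term predicted by the transformation law of the partial derivative.
predicted = 1j * d(alpha, x0) * abs(phi(x0)) ** 2
print("change in kinetic term:", after - before)
print("residual against prediction:", abs(after - before - predicted))
```

The change is nonzero wherever `alpha` varies, which is the obstruction the covariant derivative is designed to remove.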

Definition of the gauge covariant derivative

The root cause of the lack of gauge invariance is that in writing the field $\phi = (\phi^1, \ldots, \phi^n)$ as a row vector, or in index notation $\phi^a$, we have implicitly made a choice of basis frame field, i.e. a set of fields $\varphi_1(x), \ldots, \varphi_n(x)$ such that every field can be uniquely expressed as $\phi = \phi^a\varphi_a$ for functions $\phi^a(x)$ (using Einstein summation), and assumed that the frame fields $\varphi_a$ are constant. Local (i.e. $x$-dependent) gauge invariance can be considered as invariance under the choice of frame. However, if one basis frame is as good as any other gauge-equivalent one, we cannot assume the frame fields to be constant without breaking local gauge symmetry.

We can introduce the gauge covariant derivative $D_\mu$ as a generalisation of the partial derivative $\partial_\mu$ that acts directly on the field $\phi$ rather than on its components $\phi^a$ with respect to a choice of frame. A gauge covariant derivative is defined as an operator satisfying a product rule

$$D_\mu(f\phi) = (\partial_\mu f)\phi + f(D_\mu\phi)$$

for every smooth function $f$ (this is the defining property of a connection).

To go back to index notation we use the product rule

$$D_\mu\phi = D_\mu(\phi^a\varphi_a) = (\partial_\mu\phi^a)\varphi_a + \phi^a(D_\mu\varphi_a).$$

For a fixed $a$, $D_\mu\varphi_a$ is a field, so it can be expanded with respect to the frame field. Hence a gauge covariant derivative and a frame field define a (possibly non-Abelian) gauge potential

$$D_\mu\varphi_a = -igA_\mu{}^b{}_a\varphi_b$$

(the factor $-ig$ is conventional for compact gauge groups, and $g$ is interpreted as a coupling constant). Conversely, given the frame $\varphi_1, \ldots, \varphi_n$ and a gauge potential $A_\mu{}^b{}_a$, this uniquely defines the gauge covariant derivative. We then get

$$D_\mu\phi = (D_\mu\phi)^a\varphi_a = (\partial_\mu\phi^a - igA_\mu{}^a{}_b\phi^b)\varphi_a,$$

and with suppressed frame fields this gives in index notation

$$(D_\mu\phi)^a = \partial_\mu\phi^a - igA_\mu{}^a{}_b\phi^b,$$

which by abuse of notation is often written as

$$D_\mu\phi^a = \partial_\mu\phi^a - igA_\mu{}^a{}_b\phi^b.$$

This is the definition of the gauge covariant derivative as usually presented in physics.[11]

The gauge covariant derivative is often assumed to satisfy additional conditions making additional structure "constant" in the sense that the covariant derivative vanishes. For example, if we have a Hermitian product $h$ on the fields (e.g. the Dirac conjugate inner product $\bar\phi\psi$ for spinors) reducing the gauge group to a unitary group, we can impose the further condition

$$\partial_\mu h(\phi, \psi) = h(D_\mu\phi, \psi) + h(\phi, D_\mu\psi)$$

making the Hermitian product "constant". Writing this out with respect to a local $h$-orthonormal frame field gives

$$\partial_\mu\Big(\sum_a \phi^{a*}\psi^a\Big) = \sum_a (D_\mu\phi)^{a*}\psi^a + \phi^{a*}(D_\mu\psi)^a,$$

and using the above we see that $A_\mu$ must be Hermitian, i.e. $A_\mu{}^a{}_b = (A_\mu{}^b{}_a)^*$ (motivating the extra factor $i$). The Hermitian matrices are (up to the factor $i$) the generators of the unitary group. More generally, if the gauge covariant derivative preserves a gauge group $G$ acting with representation $\rho$, the gauge covariant connection can be written as

$$(D_\mu\phi)^a = \partial_\mu\phi^a - igA_\mu^K\rho(t_K)^a{}_b\phi^b$$

where $\rho$ also denotes the representation of the Lie algebra associated to the group representation $\rho$ (loc. cit.).

Note that once the gauge covariant derivative (or its gauge potential) is included as a physical field, a "field with zero gauge covariant derivative along the tangent of a curve $\gamma$",

$$D_{\dot\gamma}\phi = \Big(\frac{d}{dt}\gamma^\mu\Big)D_\mu\phi = 0,$$

is a physically meaningful definition of a field $\phi$ that is constant along a (smooth) curve. Hence the gauge covariant derivative defines (and is defined by) parallel transport.
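As an illustration of parallel transport, here is a minimal sketch for the Abelian case, assuming a single coordinate, the curve $\gamma(t) = t$, and the convention $D = \partial - igA$. The coupling `g` and the potential `A` are hypothetical choices. The condition $D_{\dot\gamma}\phi = 0$ then reads $d\phi/dx = igA\phi$, so the transported field only picks up a phase.

```python
import cmath

g = 0.5  # hypothetical coupling constant

def A(x):
    # Hypothetical gauge potential component along the curve gamma(t) = t.
    return 0.3 * x + 1.0

def parallel_transport(phi0, x_start, x_end, steps=1000):
    # Integrate d(phi)/dx = i g A(x) phi in small steps. Each step multiplies
    # phi by the unit-modulus phase exp(i g A(x_mid) dx).
    dx = (x_end - x_start) / steps
    phi = phi0
    for k in range(steps):
        x_mid = x_start + (k + 0.5) * dx
        phi *= cmath.exp(1j * g * A(x_mid) * dx)
    return phi

phi_end = parallel_transport(1.0 + 0.0j, 0.0, 2.0)
# Closed form for this A: the integral of A from 0 to 2 is 0.15 * 4 + 2 = 2.6.
expected = cmath.exp(1j * g * 2.6)
print("transported field:", phi_end)
print("deviation from closed form:", abs(phi_end - expected))
```

The modulus of the field is preserved, and the accumulated phase is $g$ times the line integral of the potential along the curve.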

Gauge field strength

Unlike the partial derivatives, the gauge covariant derivatives do not commute. However they almost do in the sense that the commutator is not an operator of order 2 but of order 0, i.e. is linear over functions:

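$$[D_\mu, D_\nu](f\phi) = (\partial_\mu\partial_\nu f)\phi + \partial_\nu f\,D_\mu\phi + \partial_\mu f\,D_\nu\phi + f D_\mu D_\nu\phi - (\mu \leftrightarrow \nu) = f[D_\mu, D_\nu]\phi.$$

The linear map

$$F_{\mu\nu} = \frac{1}{-ig}[D_\mu, D_\nu]$$

is called the gauge field strength (loc. cit.). In index notation, using the gauge potential,

$$F_{\mu\nu}{}^a{}_b = \partial_\mu A_\nu{}^a{}_b - \partial_\nu A_\mu{}^a{}_b - ig\big(A_\mu{}^a{}_c A_\nu{}^c{}_b - A_\nu{}^a{}_c A_\mu{}^c{}_b\big).$$

If $D_\mu$ is a $G$-covariant derivative, one can interpret the latter term as a commutator in the Lie algebra of $G$, and $F_{\mu\nu}$ as Lie algebra valued (loc. cit.).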
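The commutator term can be made concrete with a small example. The sketch below assumes an SU(2) gauge group with constant potentials $A_1 = \sigma_1/2$ and $A_2 = \sigma_2/2$ (hypothetical choices) and the convention $D_\mu = \partial_\mu - igA_\mu$. For constant potentials the derivative terms vanish, so $F_{12} = -ig[A_1, A_2]$, which should equal $g\sigma_3/2$.

```python
# 2x2 complex matrices as nested lists; the Pauli matrices divided by 2
# serve as Hermitian generators of SU(2).
SIGMA1 = [[0, 1], [1, 0]]
SIGMA2 = [[0, -1j], [1j, 0]]
SIGMA3 = [[1, 0], [0, -1]]

def scale(c, m):
    # Multiply every matrix entry by the scalar c.
    return [[c * m[i][j] for j in range(2)] for i in range(2)]

def matmul(a, b):
    # Ordinary 2x2 matrix product.
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

g = 0.8  # hypothetical coupling constant

# Hypothetical constant potentials.
A1 = scale(0.5, SIGMA1)
A2 = scale(0.5, SIGMA2)

# With constant potentials only the commutator term survives.
F12 = scale(-1j * g, commutator(A1, A2))
expected = scale(g / 2, SIGMA3)
print("F_12 =", F12)
```

Even though both potentials are constant, the field strength is nonzero, which has no analogue in the Abelian case.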

Invariance under gauge transformations

The gauge covariant derivative transforms covariantly under gauge transformations, i.e. for all $\phi$

$$D_\mu\phi(x) \to D'_\mu\phi'(x) = D'_\mu U(x)\phi(x) = U(x)D_\mu\phi(x),$$

which in operator form reads

$$D'_\mu U(x) = U(x)D_\mu$$

or

$$D'_\mu = U(x)D_\mu U^{-1}(x).$$

In particular (suppressing dependence on $x$)

$$-igF'_{\mu\nu} = [D'_\mu, D'_\nu] = [UD_\mu U^{-1}, UD_\nu U^{-1}] = U[D_\mu, D_\nu]U^{-1} = -igUF_{\mu\nu}U^{-1}.$$

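Further (suppressing indices and replacing them by matrix multiplication), if $D_\mu = \partial_\mu - igA_\mu$ is of the form above, $D'_\mu$ is of the form

$$D'_\mu = \partial_\mu + U(\partial_\mu U^{-1}) - igUA_\mu U^{-1}$$

or, using $U(x) = e^{i\alpha(x)}$,

$$D'_\mu = \partial_\mu - i\partial_\mu\alpha - igUA_\mu U^{-1},$$

which is also of this form.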

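In the Hermitian case with a unitary gauge group, $U^{-1} = U^\dagger$, and we have found a first-order differential operator $D_\mu$ with $\partial_\mu$ as first-order term such that

$$\phi^\dagger D_\mu\phi \to \phi'^\dagger D'_\mu\phi' = \phi^\dagger D_\mu\phi.$$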
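These transformation rules can be checked numerically in the Abelian case. The following is a minimal sketch, assuming a one-component field on a single coordinate, the convention $D = \partial - igA$, and the transformed potential $A' = A + (1/g)\partial\alpha$ that follows from the operator form above. The functions `phi`, `A` and `alpha` and the coupling `g` are hypothetical test choices.

```python
import cmath

g = 0.5  # hypothetical coupling constant

def d(f, x, h=1e-5):
    # Central finite difference approximating the ordinary derivative.
    return (f(x + h) - f(x - h)) / (2 * h)

def phi(x):
    # Hypothetical test field.
    return (1.0 + 0.2 * x) * cmath.exp(0.4j * x)

def A(x):
    # Hypothetical gauge potential.
    return 0.3 * x + 1.0

def alpha(x):
    # Hypothetical local gauge parameter.
    return 0.3 * x * x

def phi_prime(x):
    # phi' = U phi with U = exp(i alpha).
    return cmath.exp(1j * alpha(x)) * phi(x)

def A_prime(x):
    # Transformed potential A' = A + (1/g) d(alpha)/dx.
    return A(x) + d(alpha, x) / g

def D(field, potential, x):
    # Covariant derivative D = d/dx - i g A acting on the field at x.
    return d(field, x) - 1j * g * potential(x) * field(x)

x0 = 0.9
U = cmath.exp(1j * alpha(x0))
covariance_error = abs(D(phi_prime, A_prime, x0) - U * D(phi, A, x0))
invariance_error = abs(
    phi_prime(x0).conjugate() * D(phi_prime, A_prime, x0)
    - phi(x0).conjugate() * D(phi, A, x0)
)
print("covariance error:", covariance_error)
print("invariance error:", invariance_error)
```

Both errors are at the level of the finite-difference accuracy, in contrast to the ordinary derivative, whose bilinear changes by a term proportional to the derivative of `alpha`.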

Gauge theory

Gauge theory studies a particular class of fields that are important in quantum field theory. Its Lagrangians are built from fields in such a way that they are invariant under local gauge transformations. Kinetic terms involve derivatives of the fields, which by the above arguments must be gauge covariant derivatives.

Abelian gauge theory

The gauge covariant derivative $D_\mu$ on a complex scalar field $\phi = \phi^1\varphi_1$ (i.e. $n = 1$) of charge $q$ is a U(1) connection. The gauge potential $A_\mu$ is a (1 × 1) matrix, i.e. a scalar:

$$(D_\mu\phi)^1 = (\partial_\mu\phi^1 - iqA_\mu\phi^1).$$

The gauge field strength is

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu.$$

The gauge potential can be interpreted as the electromagnetic four-potential and the gauge field strength as the electromagnetic field tensor. Since this only involves the charge of the field and not higher multipoles like the magnetic moment (and in a loose and non-unique way, because it replaces $\partial_\mu$ by $D_\mu$ [12]), this is called minimal coupling.
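A small numerical sketch may help make the Abelian field strength concrete. It assumes two spatial coordinates, a symmetric-gauge potential for a uniform field `B`, and a gauge shift of the potential by the gradient of a function `Lam` divided by the charge `q`; all of these names and values are hypothetical. The component $F_{xy}$ should equal `B` and should not change under the shift, because the mixed partial derivatives of `Lam` cancel.

```python
import math

q = 1.0  # hypothetical charge
B = 2.0  # hypothetical uniform field strength

def A(x, y):
    # Symmetric-gauge potential (A_x, A_y) for a uniform field B.
    return (-0.5 * B * y, 0.5 * B * x)

def Lam(x, y):
    # Hypothetical gauge function Lambda(x, y).
    return math.sin(x) * y + 0.3 * x * x

def partial(f, point, axis, h=1e-4):
    # Central finite difference of f along the given coordinate axis.
    up, down = list(point), list(point)
    up[axis] += h
    down[axis] -= h
    return (f(*up) - f(*down)) / (2 * h)

def A_prime(x, y):
    # Gauge-shifted potential A + (1/q) grad(Lambda).
    ax, ay = A(x, y)
    return (ax + partial(Lam, (x, y), 0) / q,
            ay + partial(Lam, (x, y), 1) / q)

def F_xy(potential, point):
    # F_xy = d_x A_y - d_y A_x.
    return (partial(lambda x, y: potential(x, y)[1], point, 0)
            - partial(lambda x, y: potential(x, y)[0], point, 1))

point = (0.4, -0.7)
print("F_xy before:", F_xy(A, point))
print("F_xy after: ", F_xy(A_prime, point))
```

The potential itself changes under the shift, while the field strength does not, which is why the latter can be identified with the observable electromagnetic field.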

For a Dirac spinor field $\psi$ of charge $q$ the covariant derivative is also a U(1) connection (because it has to commute with the gamma matrices) and is defined as

$$(D_\mu\psi)^\alpha := (\partial_\mu - iqA_\mu)\psi^\alpha$$

where again $A_\mu$ is interpreted as the electromagnetic four-potential and $F_{\mu\nu}$ as the electromagnetic field tensor. (The minus sign is a convention valid for a Minkowski metric signature (−, +, +, +), which is common in general relativity and used below. For the particle physics convention (+, −, −, −), it is $D_\mu := \partial_\mu + iqA_\mu$. The electron's charge is defined negative as $q_e = -|e|$, while the Dirac field is defined to transform positively as $\psi(x) \to e^{iq\alpha(x)}\psi(x)$.)

Quantum electrodynamics

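If a gauge transformation is given by

$$\psi \mapsto e^{i\Lambda}\psi$$

and for the gauge potential

$$A_\mu \mapsto A_\mu + \frac{1}{e}(\partial_\mu\Lambda)$$

then $D_\mu$ transforms as

$$D_\mu \mapsto \partial_\mu - ieA_\mu - i(\partial_\mu\Lambda),$$

and $D_\mu\psi$ transforms as

$$D_\mu\psi \mapsto e^{i\Lambda}D_\mu\psi$$

and $\bar\psi := \psi^\dagger\gamma^0$ transforms as

$$\bar\psi \mapsto \bar\psi e^{-i\Lambda}$$

so that

$$\bar\psi D_\mu\psi \mapsto \bar\psi D_\mu\psi$$

and $\bar\psi D_\mu\psi$ in the QED Lagrangian is therefore gauge invariant, and the gauge covariant derivative is thus named aptly.[citation needed]

On the other hand, the non-covariant derivative $\partial_\mu$ would not preserve the Lagrangian's gauge symmetry, since

$$\bar\psi\partial_\mu\psi \mapsto \bar\psi\partial_\mu\psi + i\bar\psi(\partial_\mu\Lambda)\psi.$$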

Quantum chromodynamics

In quantum chromodynamics, the gauge covariant derivative is[13]

$$D_\mu := \partial_\mu - ig_s G_\mu^\alpha\lambda_\alpha/2$$

where $g_s$ is the coupling constant of the strong interaction, $G$ is the gluon gauge field, for eight different gluons $\alpha = 1 \ldots 8$, and where $\lambda_\alpha$ is one of the eight Gell-Mann matrices. The Gell-Mann matrices give a representation of the color symmetry group SU(3). For quarks, the representation is the fundamental representation; for gluons, the representation is the adjoint representation.
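For readers who want to see the generators explicitly, here is a minimal sketch that writes out the eight Gell-Mann matrices in their standard form and checks three of their well-known properties: they are Hermitian, traceless, and normalised so that $\mathrm{Tr}(\lambda_\alpha\lambda_\beta) = 2\delta_{\alpha\beta}$. The helper names are hypothetical.

```python
import math

S = 1 / math.sqrt(3)

# The eight Gell-Mann matrices in the standard ordering lambda_1 .. lambda_8.
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[S, 0, 0], [0, S, 0], [0, 0, -2 * S]],
]

def matmul(a, b):
    # Ordinary 3x3 matrix product.
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def trace(m):
    return sum(m[i][i] for i in range(3))

def is_hermitian(m, tol=1e-12):
    # A matrix is Hermitian if it equals its conjugate transpose.
    return all(abs(m[i][j] - complex(m[j][i]).conjugate()) < tol
               for i in range(3) for j in range(3))

for index, lam in enumerate(GELL_MANN, start=1):
    print(index, is_hermitian(lam), abs(trace(lam)) < 1e-12)
```

Hermiticity of the generators is what makes the gluon term in the covariant derivative compatible with a unitary gauge group, in line with the general discussion of Hermitian gauge potentials.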

Standard Model

The covariant derivative in the Standard Model combines the electromagnetic, the weak and the strong interactions. It can be expressed in the following form:[14]

$$D_\mu := \partial_\mu - i\frac{g'}{2}Y B_\mu - i\frac{g}{2}\sigma_j W_\mu^j - i\frac{g_s}{2}\lambda_\alpha G_\mu^\alpha$$

The gauge fields here belong to the fundamental representations of the electroweak Lie group U(1)×SU(2) times the color symmetry Lie group SU(3). The coupling constant $g'$ provides the coupling of the hypercharge $Y$ to the $B$ boson, and $g$ the coupling via the three vector bosons $W^j$ ($j = 1, 2, 3$) to the weak isospin, whose components are written here as the Pauli matrices $\sigma_j$. Via the Higgs mechanism, these boson fields combine into the massless electromagnetic field $A_\mu$ and the fields for the three massive vector bosons $W^\pm$ and $Z$.

General relativity

The covariant derivative in general relativity is a special example of the gauge covariant derivative. It corresponds to the Levi-Civita connection (a special Riemannian connection) on the tangent bundle (or the frame bundle), i.e. it acts on tangent vector fields or, more generally, tensors. It is usually written as $\nabla$ instead of $D$. In this special case, a choice of (local) coordinates $x^1, \ldots, x^d$ not only gives partial derivatives $\partial_\mu$, but they double as a frame of tangent vectors $\partial_1, \ldots, \partial_d$ in which a vector field $v$ can be uniquely expressed as $v = v^\mu\partial_\mu$ (this uses the definition of a vector field as an operator on smooth functions that satisfies a product rule, i.e. a derivation). Therefore, in this case "the internal indices are also space time indices". Up to slightly different normalisation (and notation) the gauge potential $A_\mu{}^\nu{}_\lambda$ is the Christoffel symbol defined by

$$\nabla_\mu\partial_\nu = \Gamma^\lambda{}_{\mu\nu}\partial_\lambda.$$

It gives the covariant derivative

$$(\nabla_\mu v)^\nu = (\nabla_\mu(v^\lambda\partial_\lambda))^\nu = ((\partial_\mu v^\lambda)\partial_\lambda + v^\lambda(\nabla_\mu\partial_\lambda))^\nu = \partial_\mu v^\nu + \Gamma^\nu{}_{\mu\lambda}v^\lambda.$$

The formal similarity with the gauge covariant derivative is clearer when the choice of coordinates is decoupled from the choice of frame of vector fields $e_1 = e_1^\mu\partial_\mu, \ldots, e_d = e_d^\mu\partial_\mu$. Especially when the frame is orthonormal, such a frame is usually called a d-Bein. Then

$$(\nabla_\mu v)^n = (\nabla_\mu(v^\ell e_\ell))^n = ((\partial_\mu v^\ell)e_\ell + v^\ell(\nabla_\mu e_\ell))^n = \partial_\mu v^n + \Gamma^n{}_{\mu\ell}v^\ell$$

where $\nabla_\mu e_m = \Gamma^\ell{}_{\mu m}e_\ell$. The direct analogue of the "gauge freedom" of the gauge covariant derivative is the arbitrariness of the choice of an orthonormal d-Bein at each point in space-time: local Lorentz invariance [citation needed]. However, in this case the more general independence of the choice of coordinates for the definition of the Levi-Civita connection gives diffeomorphism or general coordinate invariance.
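To make the Christoffel symbols concrete, the following minimal sketch computes them numerically for the flat plane in polar coordinates $(r, \theta)$. It relies on the standard expression of the Levi-Civita connection in terms of the metric, $\Gamma^\lambda{}_{\mu\nu} = \tfrac{1}{2}g^{\lambda\sigma}(\partial_\mu g_{\sigma\nu} + \partial_\nu g_{\sigma\mu} - \partial_\sigma g_{\mu\nu})$, which is not derived in this article; the function names are hypothetical. The expected nonzero values are $\Gamma^r{}_{\theta\theta} = -r$ and $\Gamma^\theta{}_{r\theta} = \Gamma^\theta{}_{\theta r} = 1/r$.

```python
def metric(r, theta):
    # Flat plane in polar coordinates: ds^2 = dr^2 + r^2 dtheta^2.
    return [[1.0, 0.0], [0.0, r * r]]

def inverse_diagonal(gm):
    # Inverse of a diagonal 2x2 metric (sufficient for this example only).
    return [[1.0 / gm[0][0], 0.0], [0.0, 1.0 / gm[1][1]]]

def metric_derivative(point, axis, h=1e-6):
    # Central finite difference of every metric component along one axis.
    up, down = list(point), list(point)
    up[axis] += h
    down[axis] -= h
    g_up, g_down = metric(*up), metric(*down)
    return [[(g_up[i][j] - g_down[i][j]) / (2 * h) for j in range(2)]
            for i in range(2)]

def christoffel(point):
    # Returns gamma[lam][mu][nu] = Gamma^lam_{mu nu} at the given point.
    g_inv = inverse_diagonal(metric(*point))
    dg = [metric_derivative(point, s) for s in range(2)]  # dg[s][i][j] = d_s g_ij
    return [[[0.5 * sum(g_inv[lam][s]
                        * (dg[mu][s][nu] + dg[nu][s][mu] - dg[s][mu][nu])
                        for s in range(2))
              for nu in range(2)]
             for mu in range(2)]
            for lam in range(2)]

gamma = christoffel((2.0, 0.3))
print("Gamma^r_{theta theta} =", gamma[0][1][1])
print("Gamma^theta_{r theta} =", gamma[1][0][1])
```

The connection coefficients are nonzero even though the plane is flat: they encode the rotation of the coordinate frame from point to point, just as a pure-gauge potential encodes a position-dependent choice of frame.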

Fluid dynamics

In fluid dynamics, the gauge covariant derivative of a fluid may be defined as

$$\nabla_t\mathbf{v} := \partial_t\mathbf{v} + (\mathbf{v}\cdot\nabla)\mathbf{v}$$

where $\mathbf{v}$ is a velocity vector field of a fluid.[citation needed]
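A one-dimensional sketch illustrates this derivative. It assumes a single spatial coordinate, so that $(\mathbf{v}\cdot\nabla)\mathbf{v}$ reduces to $v\,\partial_x v$, and uses two hypothetical velocity fields: `v`, chosen so that every fluid particle moves at constant speed, and the steady field `w`.

```python
def v(x, t):
    # Hypothetical 1D velocity field; each fluid particle keeps its velocity.
    return x / (1.0 + t)

def w(x, t):
    # Hypothetical steady field: no explicit time dependence.
    return x

def material_derivative(field, x, t, h=1e-5):
    # d_t(field) + field * d_x(field), using central finite differences.
    d_t = (field(x, t + h) - field(x, t - h)) / (2 * h)
    d_x = (field(x + h, t) - field(x - h, t)) / (2 * h)
    return d_t + field(x, t) * d_x

print("field v:", material_derivative(v, 1.5, 0.5))
print("field w:", material_derivative(w, 1.5, 0.5))
```

For `v` the two terms cancel, so particles are unaccelerated although the field changes in time at every fixed point. For `w` the field is steady, yet particles accelerate as they move into regions of higher velocity.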

References

  1. L.D. Faddeev, A.A. Slavnov, Gauge Fields: Introduction to Gauge Theory, (1980) Benjamin Cummings, ISBN:0-8053-9016-2
  2. Claude Itzykson, Jean-Bernard Zuber, Quantum Field Theory (1980) McGraw-Hill ISBN:0-07-032071-3
  3. Warren Siegel, Fields (1999) ArXiv
  4. Richard S. Palais, The Geometrization of Physics (1981) Lecture Notes, Institute of Mathematics, National Tsing Hua University
  5. M. E. Mayer, "Review: David D. Bleecker, Gauge theory and variational principles", Bull. Amer. Math. Soc. (N.S.) 9 (1983), no. 1, 83–92
  6. Alexandre Guay, Geometrical aspects of local gauge symmetry (2004)
  7. Charles W. Misner, Kip S. Thorne, and John Archibald Wheeler, Gravitation, (1973) W. H. Freeman and Company
  8. David Bleecker, "Gauge Theory and Variational Principles" (1982) D. Reidel Publishing (See chapter 3)
  9. David Bleecker, op. cit. (See Chapter 6.)
  10. Meinhard E. Mayer, "Principal Bundles versus Lie Groupoids in Gauge Theory", (1990) in Differential Geometric Methods in Theoretical Physics, Volume 245 pp 793-802
  11. Peskin, Michael E.; Schroeder, Daniel V. (1995). An Introduction to Quantum Field Theory. Addison Wesley. pp. 78, 490.
  12. Jenkins, Elisabeth E.; Manohar, Aneesh V.; Trott, Michael (2013). "On Gauge Invariance and Minimal Coupling". Journal of High Energy Physics (Springer) 2013 (9). doi:10.1007/JHEP09(2013)063. https://link.springer.com/content/pdf/10.1007/JHEP09(2013)063.pdf. 
  13. "Quantum Chromodynamics (QCD)". http://www.fuw.edu.pl/~dobaczew/maub-42w/node9.html. 
  14. See e.g. eq. 3.116 in C. Tully, Elementary Particle Physics in a Nutshell, 2011, Princeton University Press.