Ward identities are one of the most fundamental tools for studying quantum field theory, and they're encountered in almost any quantum field theory course. You've almost certainly encountered them before, so why should I bother writing about them? Simply put: despite learning how to derive Ward identities for the first time more than 5 years ago (in my first quantum field theory class, as an undergraduate at UChicago), I didn't really understand why they were important until quite recently. This is a product of my own unique research path — I haven't ever done any research in pure QFT, working instead mostly in quantum information and classical geometry, which means I haven't ever had to really understand what's going on under the hood in field theory. I don't think this oversight is so uncommon, so I'm putting together some basic thoughts on Ward identities in this post.
So, what is a Ward identity? On its face, it's an equation that tells you how classical conservation laws break down under quantization. We have some classically conserved current, ∂μjμ=0, and the (non-anomalous) Ward identity associated with that current is
⟨∂μjμ(x)X1(x1)…Xn(xn)⟩=n∑k=1δ(x−xk)δk⟨X1(x1)…Xn(xn)⟩.
The symbol δk appearing on the right-hand side of this equality is supposed to mean "within the following correlation function, replace the operator Xk(xk) with its variation under the infinitesimal symmetry associated with the current jμ."
At a first glance, the only information in equation (1) seems to be a violation of current conservation by "contact terms." The delta functions on the right-hand side mean that as an operator, ∂μjμ(x) vanishes except in correlation functions that contain other operators at the point x. My first reaction to this was: so what? It seems intuitive that classically conserved currents should be quantumly conserved, and the fact that they fail to be conserved in correlators with coincident operators seems like a minor technical detail. But a Ward identity actually contains a great deal more information than that; in particular, the real power of a Ward identity comes out only once you take a volume integral of the point x over a region R. When you do this, you get the new identity
⟨∫∂Rjμ(x)X1(x1)…Xn(xn)⟩=∑xk∈R⟨X1(x1)…δXk(xk)…Xn(xn)⟩.
So what we really learn, from the ward identity, is a complete specification of the "surface operator" Q∂R=∫∂Rjμ(x), for any region R. Knowing the correlation functions of an operator Q∂R against an arbitrary set of operators X1(x1)…Xn(xn) is equivalent to knowing how the operator acts on any state in Hilbert space. This is the power of the Ward identity — it tells you anything you might like to know about a large class of operators Q∂R associated with the symmetry whose current is jμ.
The rest of this post will deal with these issues in more detail. In section 1, I will give a very general treatment of Noether's theorem in Lagrangian field theory using differential forms. (I will consider only tensor field theories without internal gauge groups; more on that later.) Section 1 is fairly mathy, because it is written in generality for field theories on arbitrary curved manifolds; you can skip it if you're happy with picking up a bit of notation for the presentation of Noether's theorem in the sequel. In section 2, I will give the standard derivation of Ward identities from the path integral for arbitrary quantum field theories; I will include potential contributions from anomalies, and comment on their importance. In section 3, I will explain how, in flat Minkowski or Euclidean space, non-anomalous Ward identities can be used to derive the commutators of associated charge operators with arbitrary fields.
For additional reading, I recommend section 4.2.2 of David Tong's notes on conformal field theory, and chapter 2.4 of the big yellow book on conformal field theory. I found this paper of Lee and Wald useful for thinking abstractly about the Lagrangian formulation of field theories. While preparing this post, I found Kartik Prabhu's paper on the first law of black hole mechanics for gauge fields very helpful for thinking about the proper mathematical formulation of gauge theory Lagrangians, but ultimately decided to remove all discussion of gauge theories from this post because it felt a bit too far afield of the main points. I also exchanged some emails with Kartik while writing this post that were helpful to my understanding of gauge fields and spinor fields.
Prerequisites: Path integrals in quantum field theory. Section 1 requires background in differential forms, and it would probably be helpful to know general relativity. Section 3 requires understanding what it means to prepare a state with the path integral in quantum field theory; I like Tom Hartman's notes for this.
Table of Contents
1. Noether's Theorem in Lagrangian Field Theory
OK; that's enough formalism. Let's get to Noether's theorem. For any point in M, and for any system of coordinates in a neighborhood of that point, there is a collection of local fields for which the Lagrangian takes the form
L(x)=L[ϕA(x),∂μ1ϕA(x),…,∂μ1…∂μkϕA(x)]ϵ(x),
where the volume form ϵ that appears in this expression is the flat volume form in our chosen coordinate system, and therefore has no dependence on the field configuration. We can always do this, once we've picked a particular coordinate system, by adding local fields that convert between covariant and coordinate derivatives as in the preceding paragraph. (One advantage of using coordinate derivatives is that they commute, which makes the following presentation more notationally simple.) We will also fix a local basis for the field configurations, which will allow us to replace the abstract index A with a concrete coordinate index α:
L(x)=L[ϕα(x),∂μ1ϕα(x),…,∂μ1…∂μkϕα(x)]ϵ(x).
Now, suppose we consider a smooth family of field configurations ϕα(x,λ), i.e., a smooth map from R into the space of field configurations. This is the natural setting for considering "infinitesimal perturbations" of field configurations, which are realized by the one-parameter family ϕα(x,λ)=ϕα(x)+λδϕα(x) in the limit λ→0. The Lagrangian then becomes a function of λ, and its derivative can be written using the chain rule as
∂∂λL(x,λ)=k∑j=0(∂L∂(∂μ1…∂μjϕα)(x,λ))(∂μ1…∂μj∂ϕα∂λ(x,λ))ϵ(x).
By successively applying the product rule to the derivatives acting on ∂ϕα/∂λ, we can rewrite this in terms of a term where no derivative acts on ∂ϕα/∂λ, and a term that is a total derivative; this is just like the usual story from classical mechanics, but generalized to arbitrary tensor field theories. We will label these pieces by Eα(x,λ) and ∂μθμ(x,λ); we then have
∂∂λL(x,λ)=[Eα(x,λ)∂ϕα∂λ(x,λ)+∂μθμ(x,λ)]ϵ(x).
It is a basic result in the theory of differential forms that (∂μθμ)ϵ, which is an n-form, is equivalent to fd(θ⋅ϵ), i.e., the exterior derivative of the (n-1)-form obtained by contracting θμ into the first index of ϵ. This follows by noting that d(θ⋅ϵ) and ϵ must be proportional, since they are both n-forms on an n-dimensional manifold, and directly computing the proportionality function as ∂μθμ.
Given these considerations, we will group Eα and ϵ as a single n-form Eα, and we will group ∂μθμ and ϵ as an exact n-form dθ. The resulting expression is
∂∂λL(x,λ)=Eα(x,λ)∂ϕα∂λ(x,λ)+dθ(x,λ).
We will now see that Eα and dθ are independent of all of the many coordinate choices we made in this section. First, if we integrate the above equation over M, then the θ-dependent term disappears assuming appropriate dropoff conditions on the field near any asymptotic boudnary of M. The left-hand side of the expression is independent of coordinate choices, and ∂ϕα/∂λ is arbitrary, so Eα must be independent of coordinate choices as well. Then, since dθ can be expressed as a difference of two coordinate-independent objects, it is also coordinate-independent. The differential form Eα is called the equation of motion of the theory. When it vanishes for a particular field configuration (i.e., at some fixed λ), the action is stationary — a complete integral of the Lagrangian over all of M is unchanged at leading order as λ is varied.
Now, the field θ is not independent of coordinate choice; only dθ is. So there is a whole equivalence class of allowable θ terms that differ from one another by closed forms, i.e., forms ω with dω=0. It was shown by Wald in this paper that this ambiguity can be restricted if one demands that θ be expressed locally in terms of the fields and their derivatives. The only closed forms that can be constructed from local fields are exact, i.e. they satisfy ω=dα for some α, and so the ambiguity in θ is only up to addition by exact forms.
Finally, we introduce the idea of an infinitesimal symmetry. An infinitesimal symmetry is a flow on the space of field configurations, i.e., a map from each ϕA(x) to a one-parameter family ϕA(x)+λδϕA(x), with the property that the Lagrangian changes at leading order in λ by an exact form that is a local function of the fields. That is, for this particular one-parameter family, we should have
∂L∂λ|λ=0=dW[ϕA,∂μ1ϕA,…].
Just like θ, the form W is ambiguous up to the addition of an exact form.
If we compare this expression for the symmetric variation of the Lagrangian to the general formula for an arbitrary variation, we obtain the identity
Eαδϕα+dθ=dW.
We stress that while θ is defined for arbitrary variations, we must evaluate it on the symmetric variation corresponding to W in order for this identity to hold. What we learn from this expression is that, for the particular symmetric variation δϕα, the form Eαδϕα must be an exact form d(W−θ). We denote this exact form by dJ, and call J the Noether current corresponding to the symmetry. It is defined on any field configuration. Because it is defined to satisfy the equality
dJ=Eαδϕα,
the form J is closed on any field configuration where the equations of motion are satisfied.
There is a trick that will be very useful in the following section, which is to use J to study how the action of a field configuration changes under a "smeared symmetry." To construct a smeared symmetry, we start with an infinitesimal symmetry ϕA(x,λ)=ϕA(x)+λδϕA(x), fix a smooth function ρ(x) on spacetime, and construct from these two pieces of data the one-parameter family
ϕA(x,λ)=ϕA(x)+λρ(x)δϕA(x).
Our general identity for Lagrangian variations tells us that the linear-in-λ variation in the Lagrangian within this family of field configurations is
δL=Eα(x)ρ(x)δϕα(x)+dθ.
But we already know that Eα(x)δϕα(x) is the differential of the Noether current, J(x). So if we integrate this expression to obtain the variation in the action, the dθ term drops out and we obtain
δS=∫Mρ(x)dJ.
As a final comment, let me put all of this back in the form you're probably familiar with from flat space QFT. If we fix a metric on our manifold, then we obtain a preferred volume form ϵ, which is an n-form. Since dJ is also an n-form, the two must be related by
dJ=Fϵ.
If we contract the volume form into both sides of the equation, and use the fact that ϵ⋅ϵ is normalized to be (−1)sn!, with s the number of minus signs in the metric signature, then we obtain
F=(−1)sn!ϵ⋅dJ.
If you actually write out the exterior derivative in terms of covariant derivatives, and do the index manipulations, you find:
ϵ⋅dJ=n∇a(ϵaa2…anJa2…an).
(If you don't know how to do these manipulations, I recommend Appendix B of Wald's textbook on general relativity.)
So if we define the vector ja by
ja=(−1)s(n−1)!ϵaa2…anJa2…an,
then we have
dJ=(∇aja)ϵ.
This is a completely equivalent description; we can talk about Noether currents as being vector fields instead of (n−1) forms, so long as we have a preferred volume form around to convert between the two. This is why in Minkowski spacetime, where the volume form is trivial in inertial coordinates, forms are never discussed at all.
2. Ward Identities
Now, suppose we want to quantize a field theory on a manifold M. We have a set of fields ϕ, which might include background fields like the metric, and a Lagrangian n-form that can be written as
L[ϕ(x),∇a1ϕ(x),…,∇ak…∇a1ϕ(x)].
(Since we've fixed a metric, I'm now writing everything in terms of the Levi-Civita derivative on tensor fields and using abstract indices; this changes notation from the previous section, where I considered completely general notions of covariant derivative, but only ever did calculations in local patches. I'm also removing all indices from fields for the sake of notational simplicity.)
We will now split our collection of fields into two sets, reserving the name ϕ for the fields we want to quantize, and γ for background fields like the metric. These background fields will not be included in the path integral measure. For notational convenience, I will also start writing the Lagrangian as L[ϕ,γ], with dependence on derivatives being implicit. According to the path integral formulation of quantum field theory, correlation functions of fields are defined by the formal expression
⟨ϕ(x1)…ϕ(xn)⟩=1Z∫Dϕϕ(x1)…ϕ(xn)eγ∫ML[ϕ,γ].
The coefficient γ appearing in the exponent is i for a Lorentzian field theory, and (−1) for a Euclidean field theory. Z is the standard normalization, corresponding to the path integral with no operator insertions.
We now consider a field transformed under a smeared symmetry ϕ∗(x)=ϕ(x)+λρ(x)δϕ(x). (Recall from the previous section that the term "smeared symmetry" indicates that if ρ(x) were constant, this would be an actual infinitesimal symmetry of the theory.) The path integral giving correlation functions of ϕ can be rewritten under the change of variables ϕ↦ϕ∗ as
⟨ϕ(x1)…ϕ(xn)⟩=1Z∫Dϕ∗ϕ∗(x1)…ϕ∗(xn)eγ∫ML[ϕ∗,γ].
As a purely formal expression, the path integral measure Dϕ∗ can be written to first order in λ as (1+λAρ[ϕ])Dϕ. The term Aρ[ϕ], which is a functional of the field and may depend on ρ(x), is an anomaly. A symmetry for which it vanishes, for all ρ(x), is called a non-anomalous symmetry. And we know, from the previous section, that there is a Noether current J associated to the symmetry for which the first-order-in-λ change in the action is ∫Mρ(x)dJ(x).
The correlation functions of ϕ are λ-independent, so the path integral over ϕ∗ must be λ-independent as well. If we collect all order-λ terms, and set them equal to zero, we obtain the expression
0=1Zn∑k=1∫Dϕϕ(x1)…ρ(xk)δϕ(xk)…ϕ(xn)eγ∫ML[ϕ,γ]+1Z∫Dϕϕ(x1)…ϕ(xn)eγ∫ML[ϕ,γ]∫Mγρ(x)dJ(x)+1Z∫DϕAρ[ϕ]ϕ(x1)…ϕ(xn)eγ∫ML[ϕ,γ].
We can rewrite the first term as
∫Mρ(x)n∑k=1δ(x−xk)⟨ϕ(x1)…δϕ(xk)…ϕ(xn)⟩,
the second term as
γ∫Mρ(x)⟨dJ(x)ϕ(x1)…ϕ(xn)⟩,
and the third term as
⟨Aρ[ϕ]ϕ(x1)…ϕ(xn)⟩.
If the symmetry is non-anomalous, meaning we set Aρ=0, then we obtain
∫Mρ(x)n∑k=1δ(x−xk)⟨ϕ(x1)…δϕ(xk)…ϕ(xn)⟩=−γ∫Mρ(x)⟨dJ(x)ϕ(x1)…ϕ(xn)⟩.
If this is to hold for arbitrary smearings ρ(x), then we must have the local identity
n∑k=1δ(x−xk)⟨ϕ(x1)…δϕ(xk)…ϕ(xn)⟩=−γ⟨dJ(x)ϕ(x1)…ϕ(xn)⟩.
This is the Ward identity. For γ=−1, which is Euclidean signature, it is identical to the Ward identity quoted in the introduction to this post.
3. Commutators of Charges
−γ⟨dJ(x)ϕ(x1)…ϕ(xn)⟩=n∑k=1δ(x−xk)δk⟨ϕ(x1)…ϕ(xn)⟩,
⟨0|Oj1(xj1)U(tj1,tj2)Oj2(xj2)…U(tjn−1,tjn)Ojn(xn)|0⟩,
⟨state prepared by t>0|Q(0)∏xj∈Σϕ(xk)|state prepared by t<0⟩,
−γ⟨t>0|[Q(0),∏xj∈Σϕ(xj)]|t<0⟩=∑xk∈Σδk⟨t>0|∏xj∈Σϕ(xj)|t<0⟩.
−iGϕ(x)=δϕ(x),
this matches my expression with γ=−1, which is the right convention for Euclidean signature.
Comments
Post a Comment