EXCLUSIVE QCD FACTORIZATION AND SINGLE TRANSVERSE
 POLARIZATION PHENOMENA AT HIGH-ENERGY COLLIDERS
                                By
                             Zhite Yu
                      A DISSERTATION
                          Submitted to
                  Michigan State University
           in partial fulfillment of the requirements
                        for the degree of
               Physics — Doctor of Philosophy
                              2023


                                       ABSTRACT
    This Ph.D. thesis is divided into two distinct parts. The first part focuses on hard
exclusive scattering processes in Quantum Chromodynamics (QCD) at high energies, while
the second part delves into spin phenomena at the Large Hadron Collider (LHC).
    Hard exclusive scattering processes play a crucial role in QCD at high energies, pro-
viding unique insights into the confined partonic dynamics within hadrons, complementing
inclusive processes. Studying these processes within the QCD factorization approach yields
the generalized parton distribution (GPD), a nonperturbative parton correlation function
that offers a three-dimensional tomographic parton image within a hadron. However, the
experimental measurement of these processes poses significant challenges. This thesis will
review the factorization formalism for related processes, examine the limitations of some
widely used processes, and introduce two novel processes that enhance the sensitivity to
GPD, particularly its dependence on the parton momentum fraction x.
    The second part of the thesis centers on spin phenomena, specifically single spin produc-
tion, at the LHC. Noting that a single transverse polarization can be generated even in an
unpolarized collision, this research proposes two new jet substructure observables: one for
boosted top quark jets and another for high-energy gluon jets. The observation of these phe-
nomena paves the way for innovative tools in LHC phenomenology, enabling both precision
measurements and the search for new physics.


Dedicated to Lijiang, C.-P., and Jianwei
                   iii


                                ACKNOWLEDGMENTS
Perhaps every Ph.D. student has a long and struggling story. It is hardly conceivable that I
could accomplish this thesis without the support from people around me.
    First and foremost, I owe my deepest gratitude to my Ph.D. advisor, Prof. C.-P. Yuan.
Throughout my graduate study, I navigated several research topics, and each time, C.-P. was
fully supportive and helped me in all possible ways he could. His guiding principle—that
I must teach him something before I could graduate—has shaped my approach and led to
immense personal growth. I am particularly appreciative of his willingness to make time for
me, despite his highly demanding schedule. Each interaction guided me in organizing my
thoughts, challenging my preconceived understanding, and probing deeper into my research
subjects. I owe him a profound debt of thanks for instructing me in the art of research,
for ceaselessly encouraging me to explore new horizons, and for his invaluable assistance in
expanding my network and connections. The warmth and hospitality extended by C.-P.’s
wife, Ping Ma, during the difficult times of Covid-19, deserve my heartfelt gratitude. Her
kindness, support, and the wonderful dinners she prepared provided a comforting sense of
home, even while I was far away from my family.
    I must also express my profound thanks to Prof. Jianwei Qiu, with whom I began collab-
orating during my visit to Jefferson Lab in 2020. This association has been nothing short
of enriching. Jianwei’s generous sharing of his knowledge, life philosophy, work attitudes,
research insights, and innovative ideas has had a transformative effect on my approach to my
studies. His guidance and encouragement, coupled with our deep and stimulating discussions,
enabled me to navigate the intricacies of QCD factorization with clarity and confidence. Ad-
ditionally, I owe him immense gratitude for building numerous academic connections on my
behalf and facilitating my attendance at several significant conferences. These opportunities
                                              iv


have not only expanded my horizons but have also considerably benefited my career. The
lessons I’ve learned from working closely with him continue to resonate with me, and I am
eager to carry these forward in my future endeavors.
    The professors in the MSU high-energy physics group have been instrumental in my
growth, assisting me in various aspects of my research and personal development. I extend
my heartfelt thanks to my Ph.D. committee members, Profs. Andreas von Manteuffel, Wade
Fisher, Andrea Shindler, Alexei Bazavov, and Dean Lee, for their continuous encourage-
ment, unwavering support, thoughtful suggestions, and genuine care for both my academic
progress and personal well-being. Andreas deserves my special thanks for the stimulating
discussions, considerate suggestions, and warm invitations to his home gatherings. I’m grate-
ful to Prof. Kirtimaan Mohan for enriching physics dialogues and his tireless assistance with
enhancing my programming skills. Prof. Carl Schmidt guided me through my first physics
project and opened up a world of fascinating physics topics that I had the privilege to explore.
I am deeply indebted to Prof. Wayne Repko, whose groundbreaking papers on polarization
phenomena inspired my works on the top quark, W , and gluon polarization. His encour-
agement and enlightening historical insights have been a constant source of motivation. My
thanks also go to Profs. Huey-Wen Lin, Joey Huston, and Reinhard Schwienhorst for their
multifaceted support, engaging discussions, and the sharing of crucial career insights. I must
also acknowledge Prof. Vladimir Zelevinsky for his excellent teaching in quantum mechanics
and the unique opportunity he provided me to teach a class under his guidance. Lastly,
I want to express my appreciation to Brenda Wenzlick and Kim Crosslan, who skillfully
handled numerous non-academic tasks on my behalf. Their efficiency and dedication saved
me much time, energy, and distraction, allowing me to focus on my academic pursuits.
    I would also like to take this opportunity to extend my sincere gratitude to several pro-
                                               v


fessors back in China who have been guiding me and watching over my academic progress
throughout my educational journey. My graduate study in the U.S. was made possible
because of the generous help from my undergraduate advisor, Prof. Qing-Hong Cao. His in-
valuable advice, constant encouragement, and unwavering support were instrumental during
the crucial decisions and preparation stages. I am also profoundly grateful to Profs. Jiang-
Hao Yu and Bin Yan. Their guidance, mentorship, and assistance, particularly in the early
stage of my Ph.D. study, not only helped me navigate complex challenges but also instilled
in me a strong foundation and passion for my research field.
    Despite the wealth of guidance and support I received, the pursuit of a Ph.D. was often
fraught with frustration and moments of despair. The challenges were made bearable, how-
ever, by the friendships I forged during my time in graduate school. I am profoundly grateful
for the companionship, encouragement, and joy brought into my life by the following friends:
Bakul Agarwal, Shohini Bhattacharya, Lisong Chen, Zhouyou Fan, Yao Fu, Syuhei Iguro,
Peter Kong, Lisa Kong, Zhen Li, Yang Ma, Matteo Marcoli, Xiaoyi Sun, Xudong Tian Tang,
Keping Xie, Shuyue Xue, Tongzhi Yang, Kang Yu, Rui Zhang, Fanyi Zhao, and Yiyu Zhou.
In particular, I must extend my most sincere thanks to my friend Boyao Zhu, who has been
accompanying and helping me on various important occasions.
    After all is said, words cannot fully express my gratitude to my family, especially my
parents and brother. They have been standing by me unconditionally, supporting me in
every way throughout my entire life. Their unwavering love has filled me with warmth and
strength, constantly reminding me that I am cherished. I cannot imagine myself reaching
this point and acquiring this Ph.D. degree without their understanding, companionship,
encouragement, and steadfast support. I thank them from the bottom of my heart for
backing me along this journey, sharing my burdens, and for always being my anchor and my
                                              vi


inspiration.
     Finally, and most specially, I want to express my heartfelt thanks to my wife, Lijiang Xu.
She came into my life at my most frustrated time, bringing colors and sunshine into my dark
sky. I thank her for always taking pride in me, supporting every decision of mine, and for
imbuing me with more confidence, courage, and determination. Her embrace has filled my
life with warmth, help, encouragement, and love. This thesis would not have been possible
without her, yet it still does not seem sufficient as a gift in return, to which, I would like to
devote the rest of my life.
     To all who have walked this journey with me, I extend my deepest thanks and dedicate
this work to you. Your faith, guidance, support, and love have shaped not just this thesis,
but the scholar and person I have become.
                                               vii


                               TABLE OF CONTENTS
Part I QCD Factorization for Exclusive Processes . . . . . . . . . . . . . . . . . . .       1
Chapter 1     Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   2
Chapter 2     Review of QCD Factorization: General principles . . . . . . . . . . . .       16
Chapter 3     QCD Factorization of exclusive processes . . . . . . . . . . . . . . . . 125
Chapter 4     Generalized parton distributions . . . . . . . . . . . . . . . . . . . . . 219
Chapter 5     Summary and Outlook . . . . . . . . . . . . . . . . . . . . . . . . . . . 326
Part II Single Transverse Polarization Phenomena at High-Energy Colliders . . . . . 329
Chapter 6     Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330
Chapter 7     Poincare group representation and little group transformation . . . . . 337
Chapter 8     Polarization of fermions at high-energy colliders . . . . . . . . . . . . . 355
Chapter 9     Linear polarization of vector bosons at high-energy colliders . . . . . . 374
Chapter 10    Summary and Outlook . . . . . . . . . . . . . . . . . . . . . . . . . . . 416
BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 418
                                            viii


           Part I
QCD Factorization for Exclusive
         Processes
              1


Chapter 1
Introduction
Within the Standard Model of particle physics, Quantum Chromodynamics (QCD), the
quantum theory describing the strong interaction, is the most special part. The Lagrangian
governing the dynamics of the theory has colored fields, quarks and gluons, as the funda-
mental degrees of freedom, while the physical spectrum consists of color-neutral particles,
hadrons, that are composite states of the former. The colored particles are never observed
in isolation, a property called color confinement, which is the defining feature of QCD but
which is still established as an experimental fact instead of having been derived from the first
principles of QCD. Although it is believed that QCD has all the components for confinement
to emerge, it has not been explicitly shown.
    As a result of confinement, the study of the strong interactions among quarks and gluons
is always directly involved with hadrons. The color interactions are so strong that the
quarks and gluons are strongly bound inside the hadrons. On the other hand, however, the
color interactions get weaker as the interacting scales become higher. This phenomenon is
called asymptotic freedom [Gross and Wilczek, 1973; Politzer, 1973], which was historically
sought first as a necessary condition for describing the strong interaction. In a hard collision
involving hadrons in the initial or final states, interaction happens at a short time and
distance scale so that the strength of the QCD interaction is very weak, and it is the quark
and gluon degrees of freedom that are directly involved. The latter now behave almost as
                                                2


free particles, so they are collectively called partons.1 As partons move away from the hard
interaction, the distance between them becomes larger and larger, and the color interaction
becomes stronger and stronger, which eventually turns the partons into hadrons, a process
called hadronization.
     Therefore, hard scattering processes of hadrons typically involve QCD in a full range of
scales, from the hard scale characterized by the hard probe, where the color interactions
are weak and perturbative, down to the low scale characterizing the hadronization, where
the color interactions become strong enough to confine the colored degrees of freedom into
hadrons. It is perhaps an ultimate goal to be able to describe such a full process at all scales
within QCD, especially the low scales dominated by the nonperturbative regime of the color
interaction. This task, however, is unprecedentedly difficult. Even a semi-complete solution
has not been achieved, yet. The only nonperturbative method so far is given by Lattice
QCD [Workman et al., 2022]; it, however, still suffers from limitations from computational
resources and timing and from the intrinsic Euclidean nature instead of the real Minkowski
nature. Furthermore, Lattice QCD is more of simulating the physical results with the QCD
Lagrangian rather than describing the physical mechanisms. A fully analytic solution is still
desirable. With that said, though, the results from Lattice QCD are still valuable inputs to
our endeavor to understand the nonperturbative QCD interaction.
     Another approach, given such a situation, to understanding how colors are confined in
hadrons is by probing the hadronic structures in a phenomenological way. This is done
   1
     In many contexts, the term “parton” loosely refers collectively to a quark or gluon, with no regard to
the interaction scale. Here we restrict its meaning to hard scattering regime because in the bound state of a
hadron, the interaction among the colored degrees of freedom is so strong that the latter do not have clear
particle properties. So when referring to them as partons, it is meant that we are working in the kinematic
regime where the hadron is probed at a short distance scale, so that the color interaction becomes weak
and the particle nature of quarks and gluons emerges. In this sense, the concept of “particles” is by itself a
concept for perturbative interactions, but not for a theory with strong interactions.
                                                       3


through hard scattering experimental processes in which the hadrons are hit by high-energy
beams of elementary or hadronic particles or large-momentum hadrons are produced in the
collision of elementary particles. First of all, such processes are probing the partonic degrees
of freedom, so are indeed sensitive to the color interactions. Second, the hard scale implies
that there is one stage in the process where the QCD interaction becomes weak and one can
utilize perturbative calculation method, by virtue of the asymptotic freedom.
    The QCD factorization theorem [Collins et al., 1989; Collins, 2013] has been developed to
make full use of the asymptotic freedom by factorizing the dynamics at the hard perturbative
momentum scale and that at the low nonperturbative scale. Effectively, it gets around the
nonperturbative region by identifying good cross sections (or good physical observables)
whose leading nonperturbative dynamics can be organized into some distribution functions
characterizing the full nonperturbative partonic dynamics within hadrons, while whose other
non-leading nonperturbative contributions are shown to be suppressed by inverse powers of
the large momentum scale of the collision. By neglecting the non-leading contributions,
the remaining part involves purely the hard momentum scales so as one can reliably use
perturbation theory on the weakly coupled on-shell partons. It comes as a result of the
factorization theorem that the nonperturbative dynamics associated with each explicitly
observed hadron is independent of each other and the resultant distribution function only
depends on each hadron itself, but not on the specific processes. Such property is termed
universality of the distribution functions, which, albeit not perturbatively calculable, can be
fitted from experimental data. In reality, these distribution functions can be represented as
correlation functions of quark or gluon fields between hadronic states. Such field-theoretic
definitions also allow them to be calculated using nonperturbative approaches like Lattice
QCD [Constantinou et al., 2021]. Once obtained, they can be used as inputs to make
                                                  4


predictions for different hard scattering processes at different energies. In this way, although
universality does not come as a prerequisite of establishing factorization in the first place, it
is the universality that equips factorization with a predictive power.
    On the other hand, the universality of the parton correlation functions, especially with
the field-theoretic operator definitions, enables them to be studied on their own, and thereby
uncover certain aspects of the confined partonic dynamics. This is how factorization serves as
a phenomenological way to probe the hadron structures. Since the colors are fully entangled
inside hadrons, the hadron structures in terms of partons are far more complicated than
the atomic structures in terms of electrons. The best one can do phenomenologically to
probe hadron structures is to study various parton correlation functions. Those correlation
functions are in turn embedded by factorization formalism into physical observables in hard
scattering processes. Different processes give probe to different correlation functions.
    The simplest process is the deeply inelastic scattering (DIS), in which a high-energy
electron beam is scattered off the hadronic target, be it a single hadron like a proton or
neutron or a nucleus, with a large momentum transfer q = l − l′ , where l and l′ are the
                                                                                        p
momenta of the electron before and after the scattering, respectively, and Q ≡ −q 2 is
much greater than the hadronic scale, ΛQCD ≃ 200 GeV. In the final state, only the scattered
electron is identified and measured, and everything else is inclusively summed over. At
leading power of ΛQCD /Q, the cross section is dominated by the scattering configuration
where the target enters the hard interaction via one single energetic parton (which can
also be accompanied by arbitrarily many gluons of scalar polarization in a gauge theory
with a covariant gauge). The inclusiveness of the hadronic final states causes the soft gluon
exchanges between the scattered parton and the beam remnants to be suppressed by 1/Q, and
thus makes the dynamics of the target evolution independent of the rest of the scattering. In
                                                5


this way, the DIS cross section is factorized into a set of parton distribution functions (PDFs)
fi/h (x), which, loosely speaking, count the parton number densities at a given longitudinal
momentum fraction x of a fast moving hadron h, for each parton flavor i being a quark q,
antiquark (q̄), or gluon (g).
    The factorization formula of the DIS cross section and the concept of PDF date back
to 1969 before the invent of QCD when Feynman first proposed the parton model [Feyn-
man, 1969]. Nevertheless, a carefully formulated factorization formalism based on the first
principles of QCD leads to many fruitful results. Below are a few relevant ones to this thesis.
  (1) It identifies the factorization formalism as being separating hard and low energy scales,
       and makes a consistent use of asymptotic freedom.
  (2) It provides a clear operator definition for the PDF, allowing it to be studied by itself
       within field theory, and an unambiguous procedure for perturbatively calculating the
       hard parton scattering cross sections σ̂(x), whose convolution with the PDFs gives the
       full cross section at leading power of 1/Q, allowing one to go to any perturbative orders
       in principle, which in turn allows the fitting of PDFs to data at any perturbative order.
  (3) By carefully separating hard and low scales with renormalization effects taken into
       account, the factorization formalism introduces a factorization scale µ to the PDF and
       hard scattering coefficients, so that they are now dependent on one more variable and
       shall be written as fi/h (x, µ) and σ̂(x, µ). Then the full hadronic cross section σ of the
       DIS is expressed as,
                                   X Z      1                                         
                                                                                  ΛQCD
                    σh (xB , Q) =             dx fi/h (x, µ) σ̂(xB /x, Q/µ) + O         ,    (1.1)
                                  i=q,q̄,g xB                                      Q
                                                    6


    where xB = Q2 /2P · q is the Bjorken variable with P being the hadron momentum.
(4) The physical requirement that the whole cross section σh not depend on µ leads to
    a set of evolution equations, called DGLAP equations [Dokshitzer, 1977; Gribov and
    Lipatov, 1972; Lipatov, 1974; Altarelli and Parisi, 1977], for the PDFs and hard co-
    efficients, controling their dependence on µ, whose solution gives an efficient way to
    resum all logarithms of Q/Λ, improving the precision of Eq. (1.1).
(5) The factorization procedure can be generalized to other processes besides DIS, includ-
    ing semi-inclusive DIS (SIDIS) in lepton-hadron collisions, and Drell-Yan processes in
    hadron-hadron collisions. With the clear operator definitions of the PDFs, one can
    show their exact universality, so as to maximize the predictive power of factorization.
(6) The full spin dependence of both the hadrons and partons can be consistently included,
    together with their evolution equations.
(7) The systematic formulation brings the power suppressed terms in Eq. (1.1) under con-
    trol, such that one can also include higher-power terms, with new parton distribution
    functions, if desired.
(8) It can be easily generalized (although with practical complications) to different kinds of
    inclusive processes involving more than one scales, especially when they are widely sep-
    arated. This leads (consistently) to, among others, a new kind of factorization, called
    transverse-momentum-dependent (TMD) factorization, giving rise to a new plethora
    of distribution functions for probing the hadron structure.
(9) The same factorization formalism can be applied to exclusive processes where one
    observes all final state particles. Such processes complement the inclusive ones by
                                             7


      probing hadron structures in further different aspects, which will form the focus of this
      thesis.
    QCD factorization formalism has been extremely successful in interpreting high energy
experimental data from all facilities around the world, covering many orders in kinematic
reach in both x and Q and as large as 15 orders of magnitude in difference in the size of
observed scattering cross sections, which is a great success story of QCD and the Standard
Model at high energy and has given us the confidence and the tools to discover the Higgs
particle in proton-proton collisions [Chatrchyan et al., 2012b; Aad et al., 2012], and to search
for new physics [Cid Vidal et al., 2019].
    However, the probe with a large momentum transfer Q is so localized in space, 1/Q ≪ R
with R ∼ 1/ΛQCD being the typical hadron size, that it is not very sensitive to the details of
confined three-dimensional (3D) structure of the probed hadron, in which a confined parton
should have a characteristic transverse momentum scale ⟨kT ⟩ ∼ 1/R ≪ Q and an uncertainty
in transverse position δbT ∼ R ≫ 1/Q. This calls for the need to go beyond the longitudinal
hadron structures described by PDFs, probed by the DIS. Recently, new and more precise
data are becoming available for the so-called two-scale observables, which have a hard scale
Q to localize the collision so as to probe the partonic nature of quarks and gluons, but at the
same time entail a controllable soft scale to give a handle for the dynamics taking place at
O(1/R). Such two-scale observables can be well described by generalizing the factorization
formalism for the fully inclusive DIS. Distinguished by their inclusive or exclusive nature, the
generalized factorization theorems enable quantitative matching between the measurements
of such two-scale observables and the 3D internal partonic structure of a colliding hadron.
    For inclusive two-scale observables, one well-studied example is the production of a mas-
sive boson that decays into a pair of measured leptons in hadron-hadron collisions, known as
                                                8


the Drell-Yan process, as a function of the pair’s invariant mass Q and transverse momentum
qT in the lab frame [Collins et al., 1985]. When Q ≫ 1/R, the production is dominated by
the annihilation of one active parton from one colliding hadron with another active parton
from the other colliding hadron, including quark-antiquark annihilation to a vector boson
(γ, W/Z) or gluon-gluon fusion to a Higgs particle. When Q ≫ qT ≳ 1/R, the measured
transverse momentum qT of the pair is sensitive to the transverse momenta of the two col-
liding partons before they annihilate into the massive boson, providing the opportunity to
extract the information on the active parton’s transverse motion inside the colliding hadron,
which is encoded in the TMD PDFs (or simply, TMDs), fi/h (x, kT , µ2 ) [Collins, 2013], whose
dependence on the factorization scale µ has been included. Like PDFs, TMDs are universal
distribution functions to find a parton i with a longitudinal momentum fraction x and trans-
verse momentum kT from a colliding hadron of momentum p with xp ∼ µ ∼ Q ≫ kT , and
describe the 3D motion of this active parton, its flavor dependence and its correlation with
the property of the colliding hadron, such as its spin [Bacchetta et al., 2007; Diehl, 2016].
For a spin-1/2 target, there are 8 different types of TMDs, for both quark and gluon partons,
categorized by the dependence on the parton and target spins, which greatly generalizes the
3 types of quark PDFs and 2 types of gluons PDFs. Although this poses new challenges for a
full extraction of TMDs from experimental data, it undoubtedly provides more opportunities
to probe multifaceted aspects of the hadronic structures.
    However, due to the inclusive nature, the probed transverse momentum kT of the active
parton in the hard collision is not the same as the intrinsic or confined transverse momen-
tum of the same parton inside a bound hadron [Qiu et al., 2020]. As the colliding hadron is
broken by the large momentum transfer Q, the fast-moving partons travel with bare colors,
whose strong interactions among each other trigger a complex series of parton emissions
                                                9


and evolutions. Such collision-induced partonic radiation, called parton shower, generates
an additional transverse momentum to the probed active parton, which is encoded in the
evolution equation of the TMDs and could be non-perturbative, depending on the hard scale
Q and the phase space available for the shower. With more data from current and future ex-
periments, including both lepton-hadron and hadron-hadron collisions, better understanding
of the scale dependence of TMDs could provide us with valuable information on the confined
motion of quarks and gluons inside a bound hadron [Accardi et al., 2016; Abdul Khalek
et al., 2021; Liu et al., 2021b,a].
    In contrast, in exclusive hadronic scattering processes, the colliding hadron(s) are not
broken, so the corresponding observables could be directly related to intrinsic properties of
hadrons, without being interfered by parton showers. In order to employ asymptotic free-
dom for perturbative calculation, it is necessary to have a hard scale Q ≫ 1/R for good
exclusive observables. Then, as will be discussed in detail in this thesis, the amplitudes
of such exclusive processes can also be factorized into nonperturbative parton correlation
functions, with coefficients capturing the hard scattering of the partons that can be calcu-
lated perturbatively. The resultant correlation functions also have field-theoretic definitions,
which can be studied on their own and encode information on the confined parton dynamics
complementary to inclusive processes.
    The simplest hard exclusive process is large-angle meson scattering [Lepage and Brodsky,
1980; Brodsky and Lepage, 1989], with the simplest example being the scattering of electron
e and neutral pion π 0 ,
                                       e + π 0 → e + γ,                                    (1.2)
which in the center-of-mass (c.m.) frame produces an electron and photon pair with a
                                               10


large scattering angle. At a high collision energy, this process contains a hard scale Q that is
characterized by the large transverse momentum of the final-state e or γ. Then at the leading
power of 1/Q, the annihilation of the π 0 happens through two collinear parton lines, which are
constrained to be a quark-antiquark pair by isospin symmetry. By slightly generalizing the
factorization of DIS, one can express the amplitude of Eq. (1.2) in terms of the convolution of
the distribution amplitude (DA), D(z), of the pion and a hard coefficient C(z). In contrast to
inclusive processes, whose factorization is at cross section level, the factorization of exclusive
processes works at amplitude level, and the resultant correlation functions correspond to
amplitudes. In this way, D(z) is the probability amplitude of turning the pion into a pair of
quark and antiquark, carrying longitudinal momentum fractions z and 1 − z, respectively, of
the pion, in a way analogous to a hadron wavefunction. The parton distribution function,
on the other hand, is analogous to square of the hadron wavefunction, (with certain degrees
of freedom traced over). The process in Eq. (1.2) can also be reversed, with the pion being
produced in the final state. The factorization equally applies and results in a DA D̄(z) that
gives the transition amplitude from a pair of collinear quark-antiquark pair into a pion. The
operator definitions of D(z) and D̄(z) simply related them by a complex conjugate.
    The single-meson process in Eq. (1.2) can be generalized to involve more mesons and also
to large-angle scattering involving baryons. The factorization formalism can be similarly
applied, with a DA associated with each hadron in the initial or final state. A detailed
knowledge of the z dependence of DAs entails how the hadron momentum is distributed
among the valence partons.
    Slightly more complicated exclusive processes involve diffracted hadrons. The simplest
example is given by replacing the pion in Eq. (1.2) by a h of any flavor and also adding
                                               11


another hadron h in the final state with a slightly diffracted momentum,
                               e(l) + h(p) → e(l′ ) + h(p′ ) + γ(k),                       (1.3)
where the same label p is used for both the protons and initial-state proton momentum.
In the c.m. frame of the scattering, we require the transverse momenta of the final-state
electron and photon to be much greater than that of the diffracted proton,
                                                           √
                                   lT′ ∼ kT ∼ qT ≫ p′T ∼     −t,                           (1.4)
with t = (p − p′ )2 . Consider the scattering channel when the photon γ(k) is not radiated
by the electron, the diffracted hadron is then connected by two collinear parton lines to
the hard scattering, which is characterized by a hard scale Q ∼ qT . By generalizing the
factorization for Eq. (1.2), we can factorize the amplitude of Eq. (1.3) into a new type of
parton correlation functions, generalized parton distributions (GPDs) Fhi (x, ξ, t), convoluted
with hard coefficients Ci (x, xi; Q) that can be perturbatively calculated. The GPD combines
the PDF and DA into a coherent picture, with x playing the role of the longitudinal parton
momentum fraction, which is like the x variable of the PDF, and ξ characterizing the longi-
tudinal momentum transfer from the hadron h to the hard interaction, which plays the role
of the pion momentum in the DA.
    More importantly, the GPD contains one more soft scale ∆T ≡ pT − p′T that is control-
lable, similar to the kT dependence in TMDs. Thus the diffractive hard exclusive processes
provide another type of good two-scale observables. Here, by Fourier transforming the GPD
with respect to ∆T to the position space bT in the forward limit (ξ → 0), the transformed
                                                12


GPD fi/h (x, bT ) as a function of bT provides a transverse spatial distribution of the parton
i inside the hadron h at a given value of the parton momentum fraction x [Burkardt, 2000,
2003]. That is, measuring GPDs could provide an opportunity to study QCD tomography to
obtain 3D parton images in the x and bT space, which complements the 3D images encoded
in TMDs in the x and kT space. The spatial bT dependence could allow us to define an
effective hadron radius in terms of its quark (or gluon) spatial distributions, rq (x) (or rg (x)),
as a function of x, in contrast to its electric charge radius [Hofstadter and McAllister, 1955;
Hofstadter, 1956; Simon et al., 1980; Bernauer et al., 2010, 2014; Zhan et al., 2011; Mi-
hovilovič et al., 2017, 2021; Xiong et al., 2019], allowing us to ask some interesting questions,
such as should rq (x) > rg (x) or vice versa, and could rg (x) saturate if x → 0, which could
reveal valuable information on how quarks and gluons are bounded inside a hadron. By
virtue of the exclusiveness, ∆T is measured in experiments and directly correspond to the
GPD variable, without contamination from any parton showers. The 3D pictures entailed
in GPDs can thus be unambiguously extracted.
    However, there are obstacles in the way to use exclusive processes to probe hadron struc-
tures. First, the exclusiveness dictates each hard scattered hadron or each diffracted hadron
to be connected to the hard scattering by two collinear parton lines. Compared to inclusive
processes, this causes a penalty of one power suppression of 1/Q. As a result, the cross sec-
tions become lower as one goes to higher energies. The accessible data are therefore limited
to a finite range of Q. Second, the intriguing parton pictures encoded in DAs and GPDs re-
quire a precision knowledge of them as functions of z or x. This is, however, hard to extract,
for two following reasons. (1) The exclusive factorization happens at the amplitude level,
and the convolution variable z or x is the parton loop momentum, flowing through the active
parton pair defining the DAs or GPDs, whose integration is always from 0 to 1 or −1 to 1,
                                                 13


and is never pinned down to a particular value. This is in sharp contrast to the factorization
of inclusive processes like the DIS, which happens at the cross section level. As shown in
Eq. (1.1), the probed x is constrained within the range [xB , 1]. At the leading perturbative
order, x is also equal to the xB , which is a direct experimental observable. (2) For most
of the known DA or GPD-related processes, the convolutions of the hard coefficients with
                                                                         R1
DA or GPDs only give “moment-type” information, like the integral 0 dz D(z)/z for the
        R1
DA or −1 dx F (x, ξ, t)/(x − ξ + iε) for the GPD. Extracting full details of DAs or GPDs
from such moments does not yield a unique solution. Third, for diffractive processes, there
is one extra channel where the diffracted hadron is connected to the hard scattering by one
virtual photon if its quantum state is allowed. As we will show, the single photon exchange
channel has one power enhancement compared to the GPD channel, so could dominate the
contribution to the total amplitude and also interfere with the GPD-sensitive channels. This
causes a large background for extracting the GPDs. It will be the main focus of this thesis
to try to improve the extraction of GPDs, especially their x dependence.
    The structure of this thesis is organized as the following. In Ch. 2, we will review the
factorization formalism, in the context of simple processes like the representative Sudakov
form factor and inclusive processes like the DIS. The key elements of factorization, including
the Libby-Sterman analysis, power counting, subtraction formalism, and Ward identity will
be explained in a fair amount of detail. The important use of unitarity in inclusive hadron
production and Drell-Yan processes will be left out for lack of direct relevance to the sub-
sequent content. In Ch. 3, we will apply the factorization formalism to exclusive processes.
First we will show the factorization for 2 → 2 large-angle meson scattering, starting with
the one-meson process in Eq. (1.2), gradually increasing the meson number, and ending at
the meson-meson scattering into two mesons. The extension to baryonic and 2 → n (n > 2)
                                              14


cases should be straightforward and will not be discussed. Then we will generalize the large-
angle factorization to the single-diffractive case. This will introduce a complication due to
a partial pinch in the Glauber region. Thanks to the single-diffractive constraint, this pinch
will be avoided by deforming contours of the soft gluon momenta. This will lead to a uni-
fied factorization for a general type of hard exclusive processes that only involves one single
diffraction. Our argument will also indicate that as one goes beyond single-diffractive cases,
the contour deformation will not be allowed to get away with the Glauber pinch, and thus
they will not be factorizable, to be discussed in Sec. 3.3.3. There is an intrinsic similarity be-
tween the exclusive process factorization at leading power and inclusive process factorization
at subleading powers, which will be briefly discussed in Sec. 3.3.4.
    After setting up the factorization formalism for the single-diffractive processes, Ch. 4 will
be devoted to the phenomenological study of GPDs, especially to their x dependence. we
will review the definitions of GPDs and the parton pictures encoded therein, and then briefly
discuss the most popular processes for probing GPDs, especially their drawbacks in the x
dependence. Then after a systematic discussion on the sensitivity to x dependence, we will
introduce two new processes that provide enhanced x-sensitivity. We will first give a detailed
description of their hard coefficient calculations, and then organize them into observables,
including both unpolarized differential cross sections and various polarization asymmetries.
Finally, we will demonstrate how enhanced x sensitivity can help us determine the GPDs.
    In Ch. 5, we conclude this thesis and present the outlook for the future.
                                               15


Chapter 2
Review of QCD Factorization:
General principles
QCD is a renormalizable non-Abelian SU(3) gauge theory describing the color interac-
tion among quarks and gluons [Workman et al., 2022]. The non-Abelian gauge interac-
tion structure leads to the following renormalization group equation for the strong coupling
αs = gs2 /(4π),
                                 dαs                                
                              µ2    2
                                      = β(αs ) = −b0 αs2 + O αs3 ,                     (2.1)
                                 dµ
where b0 = (11CA − 4nf TF )/(12π) > 0, with the Casimirs CA = 3, TF = 1/2 being the SU(3)
color factors, and nf the active quark flavor number at the scale µ. Truncating Eq. (2.1) at
leading order (LO) gives the simple solution,
                                                    1
                                   αs (µ) =                                           (2.2)
                                             b0 ln µ2 /Λ2QCD
with the Landau pole
                                                                 
                                                          1
                                 ΛQCD  = µ0 exp −                                      (2.3)
                                                     2b0 αs (µ0 )
being determined by an experimental measurement of αs (µ0 ) at a certain scale µ0 . Given
the input αs (mZ ) = 0.118 at the Z pole mZ = 91.18 GeV (where nf = 5), we get ΛQCD ≃
87.8 MeV. Evaluating Eq. (2.1) at a higher oder can modify this value. A more precise
                                                16


computation at O(αs5 ) gives ΛQCD ≃ 208 MeV [Chetyrkin et al., 2000].
    Eq. (2.1) establishes QCD with drastically distinct physics phenomena at the two opposite
ends of the energy spectrum [Gross and Wilczek, 1973; Politzer, 1973]. From Eq. (2.2) we can
infer that at high-energy (or short-distance) scale, the coupling αs decreases asymptotically
to zero such that quarks and gluons are asymptotically free particles, with their interactions
as perturbations on their free motions, whereas at the low-energy (or long-distance) side,
the coupling strength becomes increasingly larger and blows up at the Landau pole ΛQCD ,
and hence quarks and gluons strongly interact with each other and the perturbative picture
breaks down. Although this picture is obtained within perturbation theory, the RGE solution
has resummed a certain logarithmic order to all perturbative orders in αs and reflects some
aspects of the nonperturbative domain, which is also confirmed by the fact that ΛQCD is of
the same order of the hadron size R ∼ 1 fm ≃ 200 MeV. So while it is not likely to be
the case that the coupling αs indeed becomes infinite at ΛQCD , it certainly implies that the
perturbation description cannot be extrapolated beyond that scale. Some nonperturbative
mechanism must kick in at µ ≳ ΛQCD to resolve the perturbative singularity, and confine the
quarks and gluons within hadrons. In this way, elementary fields in the QCD Lagrangian
do not appear individually in nature, but it is their bound states, hadrons, that are directly
observed in reality.
    This is the basic intuitive picture of QCD. Its speciality in this aspect is that one cannot
separate the strong interaction from the particle constituents, like what one may consider
for the electroweak sector of the SM. The QCD by itself is a nonperturbatively strongly
interacting theory, as well as self-contained and self-explained. Therefore, any experiments
one may possibly conceive to probe the QCD dynamics are directly involved with hadrons,
and the associated nonperturbativity. QCD factorization is a method or formalism that
                                              17


applies to certain kinds of processes involving one (or more) hard scale Q by separating
the short-distance dynamics, which can be perturbatively treated with the quark and gluon
degrees of freedom, from the long-distance dynamics, which is proved to correspond to certain
nonperturbative universal process-independent parton correlation functions in the hadrons.
Those functions can be obtained by fitting various different processes to the experiments
and facilitate the predictive power of QCD. On the other hand, a precise knowledge of those
functions also uncovers valuable aspects of the hadron structures.
    In this section, I will review important concepts and technicalities of QCD factorization,
using the Sudakov form factor and some familiar inclusive processes as examples. This will
serve as a useful background and comparison for the exclusive processes to be treated in the
next section.
2.1       Factorization as a power expansion
2.1.1      Feynman’s intuition and parton model
Factorization is the rigorous mathematical formulation of the Feynman’s parton model [Feyn-
man, 1969] from the first principles of QCD. To motivate the starting point of factorization,
I briefly review the Feynman’s intuition leading to the parton model.
    In the deep inelastic scattering (DIS) of electron e and proton p,
                                    e(k) + p(P ) → e(k ′ ) + X,                          (2.4)
the electron exchanges a virtual photon γ ∗ of momentum q = k −k ′ that has a high virtuality
     p
Q = −q 2 to localize the interaction within the size δr ∼ 1/Q. When working in the center-
                                                18


of-mass (CM) frame, the fast-moving proton undergoes Lorentz contraction and dilation,
such that
  (1) from the perspective of the hard interaction, the proton appears as a flat plate of
       transverse width R ∼ 1/ΛQCD ,
  (2) the parton constituents, called partons by Feynman and later identified as quarks and
       gluons, are more or less evenly distributed in this “flat plate”, and
  (3) the interaction among the partons happens in a time scale τ ∼ 1/ΛQCD in the proton
       rest frame, and now becomes dilated to be τ ′ ∼ (Q/m)τ ∼ Q/Λ2QCD , where we used
       the fact that the proton mass m is of the same order as ΛQCD .
Now (2) implies that when the electron (or the virtual photon) hits one parton in the proton,
which happens in the distance scale δR ∼ 1/Q with a duration δt ∼ 1/Q, the probability
for a second parton to participate in the hard interaction is of the order δr/R ∼ 1/(QR) ∼
ΛQCD /Q, which is suppressed as Q ≫ ΛQCD . The partons confined in the hadrons are never
freely on shell, but engage in strong interactions with other partons and have virtualities of
order Λ2QCD . During the hard interaction, the interaction between partons is suppressed by
δt/τ ′ ∼ Λ2QCD /Q2 , by (3), so the role played by the parton virtualities in the hard interaction
is also suppressed.
    As a result, the hard interaction between the electron and proton is actually between the
electron and a single free on-shell parton. The whole DIS cross section can be approximated
by the product of the probability of finding one parton out of the proton and the cross section
of the electron scattering off a free parton. The factorization formula written down following
this intuition constitutes the Feynman’s parton model [Feynman, 1972]. The Feynman’s
parton model predates the establishment of QCD and will receive further corrections and
                                                 19


developments from QCD. But it makes clear the important principle of factorization that
we are approximating the full hadron cross section by expanding in terms of the power of
ΛQCD /Q. Factorization is valid up to a power correction.
2.1.2     Assumptions of factorization
In order to derive factorization from the first principles of QCD, we need to systematize the
power expansion of the full hadronic cross sections. Without being able to solve QCD non-
perturbatively, this cannot be constructed from zero, but has to rely on certain assumptions
on the nonperturbative nature of QCD. These assumptions need not be made very precise,
and should be moderate enough so as not to contradict our first intuitions and experiments.
For our purpose, we take the following assumptions:
  (1) A hadron entering the interaction is connected to a group of parton lines that make
      up a correct quantum number. The connection vertex can be thought of as some wave
      function which does not need to be made clear. All possible parton configurations
      should be included, with different probability amplitudes. This assumption makes
      concrete the discussion of the hadron scattering in terms of its parton constituents.
  (2) Inclusive processes involve a sum over final states, which in reality are all kinds
      of hadronic states. In perturbative picture, those hadronic states emerge from the
      hadronization of partonic states. We take the assumption that the sum over hadronic
      states is equivalent to the sum over partonic states,
                        X                            X
                            |Xh ; out⟩⟨Xh ; out| ⇐⇒      |Xq,g ; out⟩⟨Xq,g ; out|.      (2.5)
                         X                            X
                                                20


      Note that this is not an equal sign, because the partons can easily make up a color
      non-singlet state. Since initial states are usually color singlets, those non-singlet states
      necessarily give zero results. This assumption avoids dealing with the details of parton
      to hadron transitions in the final states, as the latter is not clearly understood.
  (3) The sum over all perturbative Feynman diagrams in terms of partons, whether it
      converges or not, represents the true nature. This assumption also underlies the per-
      turbation theory for the electroweak interaction, but has more significance for the
      nonperturbative theory, QCD. It is also the foundation of the previous two assump-
      tions.
  (4) The configuration in which all the partons connected to the hadrons are highly virtual
      is strongly suppressed. In the opposite limit, when the partons have low virtuality, we
      expect nonperturbative dynamics to kick in and slowly saturates the kinematic region
      with the parton virtuality k 2 ≲ Λ2QCD . This will be made more precise in Sec. 2.2.2 that
      if the low-virtuality region is not associated with a pinch singularity, one can deform
      the contour of the parton momentum to stay away from a low virtuality. It is only at
      pinched low-virtuality regions that genuine nonperturbative dynamics dominates.
2.2       Libby-Sterman analysis
2.2.1      Two examples of pinch singularity
To understand the significance and get some feelings of pinch singularity, let us first study
two simple toy examples.
                                                21


    The first example is given by the one-dimensional integral
                                             Z ∞
                                                  dx        1
                              I1 (m) = lim+                       ,                      (2.6)
                                       ϵ→0    −∞  2π x − m2 + iϵ
                                                       2
where the limit ϵ → 0+ is to remind that the same iϵ prescription as in Feynman integrals
applies here to shift the poles ±m on the integration contour to lower and upper half planes
respectively. We will suppress the limit ϵ → 0+ later if no confusion occurs. Because of the
iϵ prescription, the integration variable x should be considered as a complex variable whose
integration contour is on the whole real axis. By the normal trick of Wick rotation x → ix,
this integral can be easily evaluated,
                                                      i
                                         I1 (m) = −      .                               (2.7)
                                                     2m
                                     x                                            x
               −m + iε                                  −Q + iε −m + iε
                              m − iε                                       m − iε Q − iε
                       (a)                                             (b)
Fig. 2.1: Integration contours of x in (a) Eq. (2.6) and (b) Eq. (2.12) on the complex plane,
respectively. The poles ±m ∓ iϵ approach the origin as m = 0 and pinch the the contour in
both cases. When m ̸= 0, we can deform the contours up to m, as indicated by the dashed
blue lines.
    Why is the integral I1 (m) singular at m = 0? As m → 0, the two poles ±m ∓ iϵ approach
each other and coalesce at 0 as m = 0. These two coalescing poles lie on different sides
of the integration contour and pinch it at the origin such that no matter how we deform
the contour, it must pass through the poles as 0. Around the pole, we are dealing with
                                                22


            R
an integral   0
                 dx/x2 , which diverges in a power form, and hence I1 (m) ∝ 1/m has a power
singularity of m as m → 0. Such singularity due to a pair of coalescing poles pinching the
integration contour is called pinch singularity.
    When m ̸= 0, the pinch becomes inexact. And then we can deform the contour to stay
away from the two poles ±m ∓ iϵ, but only up to the extent of O(m), i.e., in the region near
the poles, the distance between the contour and the poles is at most m, |x − (±m)| ≲ m,
as shown in Fig. 2.1(a) by the dashed blue line. In this region, the integral has a power
counting
                                Z
                                          dx             1     1
                                                  ∼ m  ·    ∼    ,                      (2.8)
                                  ∼m x2 − m2 + iϵ        m2    m
where we count |dx| as m and |x2 − m2 | as m2 . This agrees with the exact solution in
Eq. (2.7). The region with |x| ≫ m does not suffer from any pinched poles, so we can
deform the contour to make x arbitrarily large, which gives infinitesimal contribution to the
integral,
                                Z                    Z
                                          dx              dx
                                                  ∼           ∼ 0.                      (2.9)
                                 ≫m  x − m2 + iϵ
                                      2
                                                      x≫1 x2
In this way, the main contribution to the integral I1 (m) comes from the region near the two
poles ±m that would become pinched as m → 0.
    We can understand this further by taking the two poles ±m to be on the same half plane,
which gives a modified integral
                         Z ∞                             Z  ∞
                                         dx                           dx
             I1′ (m) =                                 =                            .  (2.10)
                          −∞ (x − m + iϵ)(x + m + iϵ)      −∞ x2 − m2  + iϵ sgn [x]
Since both poles ±m−iϵ lie on the lower half plane, we can uniformly deform the integration
                                               23


contour to the upper half plane all the way to infinity, which kills the whole integral,
                                  Z ∞
                                                          dx
                 I1′ (m) = lim                                                = 0,       (2.11)
                           K→∞     −∞  (x + iK − m + iϵ)(x + iK + m + iϵ)
as can be easily verified from the direct evaluation of Eq. (2.10) by residue theorem.
    Therefore, the pinch singularity at m = 0 becomes the region that gives important
contribution to the integral when m ̸= 0.
    Now we consider the second toy example,
                                       Z  ∞
                                                           d2 x
                         I2 (m, Q) =                                      ,              (2.12)
                                        −∞   (x2 − m2 + iϵ)(x2 − Q2 + iϵ)
which is a two-dimensional integral with an extra hard scale Q ≫ m. A direct evaluation
gives
                                                                         2 
                                1           Q2         1       Q2            m
         I2 (m, Q) = −        2      2
                                         ln 2 ≃ −        2
                                                            ln 2 × 1 + O           ,     (2.13)
                         4π(Q − m ) m               4πQ        m             Q2
where in the second step we also gave the approximation to the leading power of m2 /Q2 .
    Similar to Eq. (2.7), I2 is also singular as m → 0, but logarithmically. This is also due
to the pinched poles ±m ∓ iϵ as m → 0. Around the poles, the integral now counts as
I2 ∼ d2 x/(x2 Q2 ) ∼ dx/(xQ2 ), which diverges logarithmically due to the two-dimensional
integration measure. When m ̸= 0, we can deform the contour to avoid the poles ±m but
only up to m, so that the contribution of the region near those poles is of the order
                       Z
                                           d2 x                   m2       1
                              2      2          2    2
                                                              ∼   2   2
                                                                        ∼ 2,             (2.14)
                        ∼m  (x − m + iϵ)(x − Q + iϵ)             m ·Q    Q
                                                  24


where we counted d2 x as m2 , and neglected m2 and x2 with respect to Q2 .
    Apart from the poles ±m, I2 has two additional poles ±Q ∓ iϵ. We take Q ≫ m to be
large, so these two poles do not give pinch singularity, but they still keep the contour from
being arbitrarily deformed. In the region with |x| ∼ Q, the distance between the contour
and the poles ±Q is at most of order Q, as shown in Fig. 2.1(b), so this region gives a power
counting
                     Z
                                      d2 x                   Q2        1
                           2     2         2      2
                                                        ∼   2    2
                                                                   ∼ 2,                  (2.15)
                      ∼Q (x − m + iϵ)(x − Q + iϵ)          Q ·Q       Q
where we have counted both d2 x and |x2 − Q2 | as Q2 .
    In the scenario with Q ≫ m, the region |x| ∼ m is usually called the soft region, and
|x| ∼ Q the hard region. This is an example of two regions, and we notice from Eqs. (2.14)
and (2.15) that both regions have the power counting 1/Q2 . The intermediate region should
also have the same power counting and give the result
                                      Z  ∼Q
                                   1         dx     1    Q
                                                 ∼ 2 ln .                                (2.16)
                                   Q2   ∼m    x     Q   m
The region with |x| ≫ Q is referred to as the UV region in a Feynman integral context. Here
because of the UV counting d2 x/x4 ∼ 1/x2 as |x| → ∞, it does not contribute to I2 .
    In this way, we can understand the result in Eq. (2.13). The integration of x runs over
the whole domain from −∞ to ∞, and the scales m and Q appear in the final integral
result through pinch singularities. Here we are using the term “pinch singularity” in a
more general sense, not necessarily related to a genuine singularity, but referring to the fact
that the contour is constrained by two poles on different sides so that it is forced to go
through the region set by those poles. The pinch singularity is a necessary condition for the
associated region to make an important contribution to the integral, true for both soft and
                                               25


hard regions. As we have seen from Eq. (2.16), if two regions have the same power counting,
their intermediate region leads to a logarithmic contribution interpolating the two regions.
2.2.2      Power expansion and pinch singularity
Physical amplitudes are represented by Feynman diagrams and are given by Feynman inte-
grals of the the parton loop momenta, which can be written as
                                        Z
                                                         X(k; p, m)
                             I(p, m) =      dd k Q N                            ,               (2.17)
                                                                             nj
                                                    j=1 (Dj (k; p, m) + iϵ)
where k, p, and m denotes the array of all the loop momenta, external momenta, and masses,
respectively, all being multidimensional. The denominator Dj (k; p, m) is at most a quadratic
form of its arguments, and the numerator X(k; p, m) is a polynomial. Typically, nj = 1 for
all j.
     The external momenta p = {p1 , p2 , · · · , pn } define n + Cn2 = n(n + 1)/2 scales, given by
their virtualities Q2i = p2i and scalar products Q2ij = pi · pj . We examine the simplest case
when there is a single hard scale Q, provided by one or more of the |Q2ij |’s, which is much
larger than all the other scales, i.e., mass, virtuality, and Q2ij scales1 , which are taken to be
of the same order, to be referred to as the soft scale and denoted as M . Such a two-scale
integral can be examined through the power expansion, which can be schematically written
as
                                             X∞         n "X   ∞            #
                                                      M                         Q
                      I(Q, M ) = Qdim I   ·                         In,i lni      ,             (2.18)
                                            n=−n
                                                      Q         i=0
                                                                                M
                                                  0
where n, n0 and i are integers, In,i are scaleless functions of M/Q, and the logarithmic
   1
     The case of a highly virtual photon as in DIS should be considered as being embedded into the full
diagram, with the electron and proton being the external particles.
                                                     26


dependences on Q/M have been explicitly separated out. In the kinematic regime Q ≫ M ,
it would be a good approximation to only keep the leading term (or first few terms) in the
power series [Eq. (2.18)]. The Feynman’s parton model described in Sec. 2.1 motivates the
conjecture that the leading-power approximation should give a factorization structure.
    How do we systematically obtain such a power expansion? First, we rescale all the
variables in Eq. (2.17) by Q,
                                  k → Q k̃,      p → Q p̃,    m → Q m̃.                       (2.19)
This separates a factor Qdim I and converts Eq. (2.17) into a scaleless integral,
                                                    Z
                 ˜ m) = I(p, m)/Q          dim I                   X(k̃; p̃, m̃)
                 I(p,                             =    dd k̃ QN                             . (2.20)
                                                                                         nj
                                                              j=1 (Dj (k̃; p̃, m̃) + iϵ)
The scenario Q ≫ M can be approached by taking the limit Q → ∞, under which all the
external particles become massless and on shell and all the mass scales vanish,
                              p̃2i = p2i /Q2 → 0,     m̃2i = m2i /Q2 → 0.                     (2.21)
The scalar products p̃i · p̃j becomes a scaleless constant of order 1 if i and j are separated
by a constant angle, or vanishing if they become collinear to each other. In this way, the the
high-Q limit is equivalent to taking all the mass scales to 0 and all the external particles to
be massless and on shell. This massless limit implies singularities for the terms in Eq. (2.18)
with n ≤ 0. More leading terms correspond to more severe mass divergences. Hence, the
problem of obtaining the power expansion in M/Q is converted to the problem of finding
mass divergences in the corresponding massless theory.
                                                     27


    Where does the mass divergence come from? The loop momentum k̃ in Eq. (2.20) is
integrated from −∞ and ∞, and the iϵ prescription implies that we should consider the
integral as a multidimensional integration of a complex variable k̃ on the real axis. There
are poles along the integration contour, given by the zeros of one or more Dj (k̃; p̃, m̃)’s. If
the contour is not trapped around the poles, we can deform the contour to stay away from
them such that on the deformed contour, |Dj | = O(1) instead of 0. This gives a contribution
of order 1 to the integral. However, if the contour deformation is forbidden by a pair of or
more pinched poles, the integration is forced to include the region where one or more Dj is
vanishing. Such regions give singular contribution to the integral, which may or may not be
remedied by the numerator X and integration measure.
    This is the case for the massless theory with m̃, p̃2 → 0. For finite m̃ and p̃2 , the pinch
is no longer exact, and we are allowed to deform the contour to stay away from the poles
that would become pinched as m̃, p̃2 → 0, but only up to the extent of order M , such that
in the region near those poles, the previously vanishing propagators Dj (k̃; p̃, m̃) are now of
order M 2 /Q2 or M/Q, depending on their dimensions. Without considering the numerator
and integration measure, this pinched region would give power enhanced contribution to the
integral with respect to the hard region where all propagators are of order 1. Therefore,
we conclude that the pinch singularities in the corresponding massless theory specify the
important integration regions in the massive theory.
    The above discussion only concerns the pure perturbative Feynman integrals and assumes
all particles are massive to regulate the mass divergences. In the real case of QCD, there is
indeed a massless gluon, which can cause exact pinch singularity to the Feynman integral.
However, around the pinched poles, there are some parton propagators with vanishing vir-
tualities. By the fourth assumption in Sec. 2.1.2, when partons have virtualities that are
                                              28


less than or of the same order of Λ2QCD , we should expect nonperturbative dynamics to come
in and rescue the perturbative singularities. In this way, the soft scale in the full theory is
not given by the masses of partons, but by the intrinsic QCD scale ΛQCD , or the hadron
mass scale. And we expect the nonperturbative effects not to change the power counting of
the perturbative theory, but only to smoothly regulate the singular behavior, playing a role
similar to a mass scale. Following this, the perturbative pinch singularities do not lead to
genuine divergences in the amplitudes or cross sections, but imply the regions in the parton
momentum integrations that are sensitive to nonperturbative QCD dynamics.
    This idea is an underlying (usually implicit) foundation of the applications of perturba-
tive QCD. Without the knowledge of the nonperturbative solution to QCD, it indicates two
approaches for predicting high-energy scattering experiments, (1) to separate the part of a
diagram containing the propagators that are pinched on shell from the propagators with high
virtualities (with or without contour deformation), and (2) to design suitable observables for
which the perturbative pinch singularities cancel, so that the infrared sensitivity is cancelled.
The first approach leads to factorization, in which the subdiagrams containing the pinched
propagators will be organized into universal parton correlation functions, and the subdia-
grams with highly virtual propagators have little sensitivity to the infrared QCD dynamics
and constitute a hard scattering coefficient. Following the second approach are defined
the so-called infrared-safe observables, which can be reliably calculated using perturbative
method. We will see that to obtain the factorization for most processes, both approaches
are needed; in particular, we need to show the cancellation of soft gluon connections.
                                               29


2.2.3      Landau criterion for pinch singularity
The first step to derive factorization is to identify the pinch singularities in the corresponding
massless theory, obtained by rescaling all the momentum and mass scales by the hard scale
Q, as in Eq. (2.19). This task is easy for simple low-dimensional integrals like Eqs. (2.6) and
(2.12), but not for multidimensional Feynman integrals; even the simple one-loop Sudakov
form factor [Fig. 2.2(a)] becomes not trivial. A systematic criterion for pinch singularity
is therefore needed, which is given by Landau equations [Landau, 1959; Collins, 2020]. For
the Feynman integral in Eq. (2.17) (or the corresponding massless one), a singular point kS
makes a subset of the denominators Dj ’s vanish. The sufficient and necessary condition for
this singularity to be pinched is that the first derivatives of those Dj ’s at kS are linearly
dependent, with non-negative coefficients, i.e.,
                Dj (kS ) = 0, for j ∈ SN ⊂ {1, 2, · · · , N },   and                        (2.22a)
        X      ∂Dj (kS )
            λj           = 0, with λj ≥ 0, and at least one λj is strictly positive.        (2.22b)
       j∈SN
                 ∂kSµ
Note that in the notation k, we have assembled all the loop momenta {k1 , k2 , · · · , kL } by a
direct sum, so the index µ in Eq. (2.22) actually runs over 4L components. Hence Eq. (2.22b)
is true for each loop momentum.
                                X            ∂Dj (kSl )
                                          λj      µl    = 0, l = 1, 2, · · · , L,            (2.23)
                                              ∂kSl
                         j∈SN , j∈ loop l
where kl is the loop momentum for the loop l, and L denotes the total loop number.
    Finding all solutions to the Landau equation is made simple by the Coleman-Norton
theorem [Coleman and Norton, 1965]. A single diagram may be associated with different
                                                     30


solutions in which different sets of propagators Dj go on shell. For those propagators that
are not on shell, we contract the corresponding lines in the Feynman diagram to points
and obtain a reduced diagram. In the reduced diagram, each loop gives an equation as
Eq. (2.23). Suppose a certain loop momentum k flows through n lines, we assign to each
vertex a spacetime coordinate, xµa , (a = 1, 2, · · · , n), and define the spacetime distance
between two adjacent vertices ja and jb that are connected by the line j,
                                                                    ∂Dj (k)
                                ∆xµja jb = ∆xµja − ∆xµjb = λj               .                (2.24)
                                                                        ∂kµ
Recall that we are dealing with massless theory, and the condition Dj (k) = 0 implies that
each propagating line is on shell. In most cases, Dj (k) takes a quadratic form like (k + p)2 ,
such that ∂Dj (k)/∂k µ gives the momentum of the internal line, whose direction is oriented
according to the direction of k. By interpreting λj as the ratio of the travel time ∆x0ja jb to
the propagating energy ∂Dj (k)/∂k0 ,
                                                       
                                                         ∂Dj (k)
                                         λj = ∆x0ja jb                ,                      (2.25)
                                                            ∂k0
we have
                                          ∆xµja jb = ∆x0ja jb · vjµ ,                        (2.26)
where vjµ = ∂ µ Dj (k)/∂ 0 Dj (k) = (1, pj /Ej ) = (1, vj ) is the four-velocity of the particle on
line j. In this way, ∆xµja jb becomes the spacetime elapse of a physically propagating on-shell
(massless) particle, whose velocity is set by its momentum ∂ µ Dj (k) ≡ ∂Dj (k)/∂kµ . And
                                                    31


then the condition in Eq. (2.23) is equivalent to
                                          X
                                                    ∆xµja jb = 0,                        (2.27)
                                   j∈SN , j∈ loop l
where the direction jb → ja is the same as the loop momentum k. Eq. (2.27) gives a
consistent condition for a physically realizable classical process of particle propagation: by
orienting all lines as going forward with positive energy, each line represents a on-shell
particle propagating with a certain velocity determined by its on-shell momentum, and they
can scatter, split, and merge at each vertex, subject to momentum conservation.
    The above conclusions apply to both massive and massless theories. But for the concern
of determining leading-power contributions, we are interested in the massless limit of the
massive theory. Then the task of determining the reduced diagrams is very easy. All the
internal lines in the reduced diagrams carry on-shell lightlike momenta and propagate in
certain directions at the speed of light, or they have zero momenta and can be attached
anywhere.
2.2.4     Example: Sudakov form factor
As a simple example, in Fig. 2.2(a) is shown the one-loop diagram for Sudakov form factor,
where a virtual photon with momentum q = (Q, 0, 0, 0) decays into a quark-antiquark (q q̄)
pair, which go to opposite directions along the z axis, with momenta
                                Q                        Q
                           p1 =    (1, 0, 0, 1),  p2 =       (1, 0, 0, −1).              (2.28)
                                 2                        2
                                                 32


To obtain the reduced diagram, we contract internal lines and identify the resultant dia-
gram as a physically realizable process. For the simplest example, we contract all the three
propagators into the reduced vertex, which is called the hard vertex, and obtain the reduced
diagram in Fig. 2.2(b). Since it reduces to a tree-level diagram, the meaning of physical
process is evident. Now since there is no internal line, this diagram does not contain a pinch.
Strictly speaking, this is not an example of pinch singularity, nor a solution to Landau equa-
tion because it requires all λj = 0 in Eq. (2.22b). But we will see that this diagram still
gives an important contribution to the integral.
                p1
             +k                                 +k                                p1
          p1                                 p1
     q                                                                 k
                k                                                                        k
                                                  −k
         p2                                                     p2                p2
            −k                                                     −
                                                                     k
                p2
            (a)             (b)               (c)                  (d)              (e)
Fig. 2.2: The one-loop diagram of Sudakov form factor in QCD (a) and its reduced diagrams
(b)-(e).
    For a less trivial example, we only contract the propagator p2 − k and obtain the reduced
diagram in Fig. 2.2(c). For it to correspond to a physical process, we need both p1 + k and
−k to be lightlike and propagating along the same direction,
                                     λq (p1 + k) = λg (−k),                              (2.29)
with λq,g > 0, which is just Eq. (2.22b) with λq̄ = 0 for the coefficient of the antiquark
propagator. In this case, the gluon is propagating in a direction collinear to the quark line.
Similarly, contracting the propagator p1 + k gives the reduced diagram in Fig. 2.2(d), in
which the gluon is propagating collinearly to the antiquark line.
                                                33


    If we do not contract any propagators, the diagram cannot correspond to a physical
process unless the gluon has a zero (or soft) momentum, k = 0. A zero-momentum particle
does not exist as a real particle, so has no meaning in the sense of “propagating with
a certain velocity”. To embed it into the picture depicted by Coleman and Norton, we
interpret the soft particle as having an infinite wavelength, which is not a local particle and
can instantaneously connect any two vertices in the reduced diagram. In terms of Landau
condition, a soft propagator gives ∂Ds (k)/∂k µ = 0 at the point k = 0, which automatically
satisfies Eq. (2.22b) if λs = 1 and all the other λj ’s are zero. This pinch singularity is given
by the reduced diagram in Fig. 2.2(e) in which the soft gluon is represented by the dashed
line. The soft pinch singularity is the endpoint of the collinear pinch singularity by taking
λq = 0, λg = 1 and k = 0 in Eq. (2.29).
    Such procedure can be easily generalized to an arbitrary diagram. In any reduced dia-
gram, the collinear lines coming out of the hard vertex move away from each other at the
speed of light. They can only split and combine in their moving directions, and lines of dif-
ferent collinear directions never meet again. Therefore, the collinear sectors are defined by
the external particles. Each external lightlike particle with a momentum of order Q defines
a collinear direction, along which there can be arbitrarily many collinear lines, as shown in
Fig. 2.3, where there are two collinear sectors Cq and Cq̄ , associated with the quark and
antiquark, respectively. The hard vertex H contains arbitrarily many propagators whose
virtualities are of order Q2 . On top of these, there can be arbitrarily many soft lines con-
necting onto Cq , Cq̄ , and H, as indicated by the blue dashed lines. They are collected by
the soft subdiagram S (which is not necessarily connected).
                                               34


                                      p1                                     p1
                             Cq                                          Cq
              q                                          q
                  H                   S                       H              S
                             Cq̄                                         Cq̄
                                      p2                                     p2
                            (a)                                        (b)
Fig. 2.3: (a) General reduced diagram for Sudakov form factor. H is the hard subgraph that
contains arbitrarily many propagators which are not pinched and whose virtualities are of
order Q2 after proper contour deformations. Cq and Cq̄ are collinear subgraphs, which are
connected to H by arbitrarily many collinear propagators. S is the soft subgraph, which
is connected to Cq , Cq̄ , and H by arbitrarily many soft propagators. S is not necessarily
connected. The dots refer to any arbitrary collinear or soft lines. (b) The leading region
for Sudakov form factor. The dots refer to any arbitrary collinear or soft longitudinally
polarized gluon lines that can be added.
2.3       Power counting of pinch singularity
2.3.1      Pinch surface: intrinsic and normal coordinates
For the example of one-loop Sudakov form factor discussed at the end of Sec. 2.2.3, the
soft pinch singularity is a point k µ = 0 in the 4-dimensional Minkowski space of the loop
momentum k, while the collinear pinch singularities are two straight lines,
                                              −
                                k = (α p+1 , 0 , 0T ) with α ∈ (0, 1)                (2.30)
for the quark-collinear region and
                                k = (0+ , β p−2 , 0T ) with β ∈ (0, 1)               (2.31)
                                                   35


for the antiquark-collinear region. In a general multi-loop diagram, the solutions to Landau
equation form a multidimensional pinch surface in the (real-valued) loop momentum space.
Illustrated in Fig. 2.3(a) is a generic pinch surface for the Sudakov form factor. This pinch
surface is characterized by a set of soft momenta, ksi = 0, a set of collinear momenta along the
                                                                               j            −
quark direction, kqi = (αi p+    −                                                   +
                            1 , 0 , 0T ), and along the antiquark direction, kq̄ = (0 , βj p2 , 0T ),
and a set of hard momenta in the hard subgraph H whose virtualities are of order Q2 .
     As discussed in Sec. 2.2, the Libby-Sterman analysis relates the power counting of a
diagram I(Q, M ) in the ratio M/Q of the hard scale to soft scale to the mass singularity of
the same diagram at the limit of M → 0. The mass singularity is in turn given by the pinch
singularity of the massless theory, as determined by the Landau criterion. In the case with
finite masses, the pinch singularity of the massless theory becomes inexact and regulated
by the masses, as visualized by the simple examples in Sec. 2.2.1. Even though the loop
momentum contour is no longer pinched by poles to give zero-virtuality lines, it is indeed
trapped by pairs of close poles separated by distances much smaller than Q, which forbids
it from being arbitrarily deformed. The maximum virtualities of the pinched propagators in
this region are still much smaller than Q2 , which lead to a large integrand. As a result, the
region around the pinch surface is likely to give an important contribution to the integral, in
the sense of having a leading power counting behavior in M/Q in Eq. (2.18). However, this is
not necessarily true, because the numerator and integration measure of the Feynman integral
may rescue the singular behavior and reduce the power counting. Therefore, locating the
pinch surfaces is only necessary but not sufficient to determine the mass singularities in the
massless integral, or the leading power counting contributions in the massive integral. We
need to further formulate a power counting rule for the divergence degree around the pinch
surface.
                                                 36


    Around the pinch surface, we define a set of intrinsic coordinates to describe the points
on that surface, and a set of normal coordinates to characterize the deviations from the
surface. As an example, for the quark-collinear pinch surface in Eq. (2.30), we can use α
as the intrinsic coordinate, and (k − , kT ) as the (three-dimensional) normal coordinates. In
contrast, for the soft pinch “surface” in Fig. 2.2(e), there is no intrinsic degree of freedom;
any nonzero component k µ is a normal coordinate. The integration of normal coordinates
leads to the singularity. Since it is the virtualities of pinched propagators that cause a large
integrand, we further redefine the normal coordinates as a radial coordinate λ and angular
coordinates, such that for a fixed λ, the virtualities stays constant.
    Around the pinch surface in Eq. (2.30), k − and kT are much smaller than k + , which is
of order Q, the pinched quark and gluon propagators have virtualities
                                                             −
                                (p1 + k)2 = 2(1 + α)p+             2
                                                          1 k − kT ,                        (2.32)
                                                       −
                                        k 2 = 2αp+           2
                                                    1 k − kT ,                              (2.33)
which are linear with k − but quadratic with kT . So we choose the radial coordinate λ such
that
                                  k − = λ2 k̄ − /p+
                                                  1,    kT = λk̄T ,                         (2.34)
where λ has the mass dimension, and k̄ − and k̄T are (two-dimensional) dimensionless angular
variables subject to the condition |k̄ − | + |k̄T |2 = 1 if we choose the radial coordinate λ as
                                            q
                                      λ = |p+        −         2
                                                 1 k | + |kT | .                            (2.35)
Note that Eqs. (2.35) and (2.34) are just a change of variables from a flat coordinate system
                                                  37


into an angular coordinate system. For a fixed nonzero α and λ, the integration of the
angular variables k̄ − and k̄T do not touch any singularities, so gives a regular result. The
two propagators in Eq. (2.32) together with the integration d4 k ∼ λ3 dλ (modulo the α and
angular integrations) gives an integral
                                      Z   ≲Q            Z   ≲Q
                                             λ3 dλ             dλ
                                                    =              .                          (2.36)
                                        0    (λ2 )2       0     λ
The collinear pinch means that the singularity at λ = 0 cannot be avoided. Eq. (2.36) then
has a divergence degree p = 0 for a logarithmic divergence.
    This analysis is for a massless theory, for the massive case with both the quark having
a mass m, the pinch becomes inexact, and we can avoid the pole by order of m, so that λ
never reaches 0. This smoothly cuts off the integral in Eq. (2.36) at λ → 0, and gives
                                       Z   ≲Q                 
                                              dλ             Q
                                                   ∼ ln          ,                            (2.37)
                                          ≳m   λ            m
which manifests the logarithmic divergence. This massive discussion can be used to analyze
the form factor in QED with a massive electron, but obviously not to the real QCD. In
QCD, partons never appear as external on-shell lines, which must be replaced by hadrons.
The quarks are indeed massive, but the light quark masses are much less than ΛQCD , so the
mass regulation to the pinch singularity does not come into play before the nonperturbative
dynamics kicks in. Hence we should regard the mass scale in Eq. (2.36) as of the same order
as ΛQCD , i.e., the scale λ ≲ ΛQCD should be controlled by nonperturbative QCD.
    Similarly, around the soft pinch in Fig. 2.2(e), we have |k µ | ≪ Q. The gluon propagator
k 2 = 2k + k − − kT2 is quadratic with both (k + , k − ) and kT . So we choose the radial coordinate
                                                   38


λS as
                                                                     X
                                     k µ = λS k̄ µ ,    with λS =         |k µ |,                 (2.38)
                                                                      µ
where the k̄ µ is a (three-dimensional) dimensionless angular coordinates subject to the con-
        P
straint    µ |k̄ µ | = 1. Then the three pinched propagators have the scaling
                                                                                          
             k 2 = λ2S (2k̄ + k̄ − − k̄T2 )                                       = O λ2S ,      (2.39a)
                           −                    + −            + −
     (p1 + k)2 = 2p+               2                      2
                        1 k + k = 2λS p1 k̄ + λS (2k̄ k̄ − k̄T )
                                                                        2
                                                                                  = O(λS Q),     (2.39b)
     (p2 − k)2 = −2p−        +       2               − +     2
                          2 k + k = −2λS p2 k̄ + λS (2k̄ k̄ − k̄T )
                                                                   + −        2
                                                                                  = O(λS Q).     (2.39c)
We note that now the collinear lines have higher virtualities than the soft line, being much
larger than λ2S but still much smaller than Q2 . This is because it is the large momentum
                                                      −
components of the collinear lines, p+         1 or p2 , that interact with the soft gluon. Together with
the integration measure d4 k ∼ λ3S dλS , Eq. (2.39) gives the scaling for the soft region,
                                         Z   ≲Q                 Z  ≲Q
                                                   λ3S dλS            dλS
                                                             ∝             ,                      (2.40)
                                           0    λ2S (λS Q)2      0     λS
which is logarithmically divergent, for a divergence degree p = 0, similarly to Eq. (2.36).
    Now in the massive theory, suppose both the quark and gluon carry masses, mq and mg ,
respectively. If the quark and antiquark are on shell, p21 = p22 = m2q , the same scalings in
Eqs. (2.39b)(2.39c) hold, but the gluon propagator becomes O(λ2S ) + m2g , which smoothly
cuts off the λS → 0 region and brings Eq. (2.40) to a form like Eq. (2.37) with m replaced
by mg .
    On the other hand, if the gluon is massless but the quark and antiquark are off shell
                                                                                         
by O Λ2QCD , their virtualities would become O(λ2S ) and O(λS Q) + O Λ2QCD for the gluon
                                                          39


and quark/antiquark, respectively. This situation resembles the real QCD more since the
partons are never on shell. But this brings a more intricate power counting. Compared to
the hard region in Fig. 2.2(b), which has the same power counting as the leading-order (LO)
diagram, the soft region has a power counting
                            Z  ≲Q                        Z  ≲Q
                          2              λ3S dλS                    λS dλS
                   IS = Q                              =                        .      (2.41)
                             0    λ2S (λS Q + Λ2QCD )2    0    (λS + Λ2QCD /Q)2
Now we examine three subregions in the soft region,
    • λS ≪ Λ2QCD /Q, where IS ≪ O(1) is power suppressed;
                         
    • λS ∼ O Λ2QCD /Q , which gives IS ∼ O(1);
                    
    • O Λ2QCD /Q ≪ λS ≲ O(ΛQCD ), which gives IS ∼ O(1),
where we stop at λS ≲ O(ΛQCD ) beyond which all the three propagators have virtualities
much greater than Λ2QCD , and start entering the hard region. We found that the whole
region Λ2QCD /Q ≲ λS ≲ ΛQCD gives a leading-power contribution. In the low end with
                     
λS ∼ O Λ2QCD /Q , the quark propagators have virtualities of order Λ2QCD , but the gluon
has Λ4QCD /Q2 ≪ Λ2QCD . In the high end with λS ∼ O(ΛQCD ), the gluon propagator has a
virtuality of order Λ2QCD , but the quarks have Q ΛQCD ≫ Λ2QCD . Given the fourth assumption
in Sec. 2.1.2, the whole soft region λS ≲ ΛQCD is in the nonperturbative regime. But for the
perturbative analysis, we usually make the second assumption in Sec. 2.1.2 to convert the
sum over final hadron states into a sum over parton states, so on-shell partons do appear in
the final states, for which the region λS ≪ Λ2QCD /Q also becomes important and contribute
to soft divergences. In such cases, we need to show that the whole soft region is cancelled.
Therefore, since factorization is rooted in a perturbative analysis, we need to consider the
                                                 40


whole soft region Λ2QCD /Q ≲ λS ≲ ΛQCD .
    Such complication arises because in the soft region, the collinear lines and soft lines have
different virtualities, which causes an extra scale Λ2QCD /Q. In contrast, the power counting
analysis of the collinear region is much simpler because all collinear lines have virtualities
                   
O(λ2 ) + O Λ2QCD , and hence only the region λ ∼ ΛQCD needs to be considered. In a more
complicated diagram, we can have (multiple) soft and collinear momenta at the same time.
Each collinear momentum scales as
                                                   λ2
                    kc = (kc+ , kc− , kcT ) ∼ (Q,     , λ),   with  λ ∼ O(ΛQCD ),         (2.42)
                                                   Q
and each soft momentum has the scaling
                                                                     λ2
                   ks = (ks+ , ks− , ksT ) ∼ (λS , λS , λS ),  with     ≲ λS ≲ λ.         (2.43)
                                                                      Q
Eqs. (2.42) and (2.43) constitute the canonical scaling for the pinched regions, with λ and
λS being the radial normal coordinates parametrizing the distance from the pinch surfaces.
2.3.2     Power counting
Now we derive the power counting around the pinch surface, based on the canonical scaling
in Eqs. (2.42) and (2.43), with λS = O(λ2 /Q). We will take a simpler approach than the
treatment in [Collins, 2013] by examining the power counting with respect to the leading-
order diagrams. The derivation is for a generic quantum field theory (QFT), and we work
in the Feynman gauge for a gauge theory involving a vector boson.
    Each pinched momentum belongs either to a collinear sector or the soft sector, so a
                                                      41


general pinch surface represented by a reduced diagram is decomposed into a hard subgraph
H, a set of collinear subgraphs Ci , and a soft subgraph S. Normally we work in the CM
frame of the hard subgraph H, a momentum kH in which has all its components of order
Q. Each collinear subgraph Ci is defined by one (or more collinear) external hard particle
pi and is connected to the hard subgraph H via a set of collinear lines {kiH }. We include
                                                     Q R
all the propagators of {kiH } and their integrations kiH d4 kiH in Ci . Within each Ci , the
collinear lines can interact with each other in all arbitrary ways under the constraints of
fixed {kiH } and pi . The soft subgraph S, which may contain one or more connected parts, is
connected to each collinear subgraph Ci and/or hard subgraph H by soft lines, {kiS } and/or
{kHS }, respectively. Similarly, all the propagators and integrations of {kiS } and/or {kHS }
are included in S. A concrete example is given by the Sudakov form factor in Fig. 2.3(a),
but the discussion in this section applies more generally.
2.3.2.1    Leading order and hard region
First, for a given process, the leading-order diagram can be easily worked out, whose power
counting in the scaling limit Q → ∞ is determined by its dimension. For example, the
Sudakov form factor Γµ has dimension one, so it simply scales as Q1 at leading order. The
purely hard region, as illustrated in Fig. 2.2(b) where all internal propagator denominators
are of order Q2 , has the same structure as the leading order and gives the same power
counting. This is the feature of a renormalizable quantum field theory, as is the case of
QCD, for which the coupling is dimensionless; otherwise, we would have a suppression from
a power of g/Qdim(g) .
                                               42


2.3.2.2    Collinear subgraph
For ease of notation, now we look at a particular collinear subgraph and denote it as C,
In the simplest case, C only comprises a single line of the external particle, which does not
cause additional power counting analysis beyond the previous discussion. This situation can
be trivially generalized to the case where C is connected to H by a single propagator but
with an arbitrary two-point function included in C. If an extra line of the field Φ(x) connects
C to H, their conlution will be modified to
              Z
                 d4 kCH
                     Φ
                        H α (kCH
                              Φ
                                 ) C α (kCH
                                         Φ
                                            )
                 (2π)4
                      Z 4 Φ                Z                                          
                        d kCH α Φ              4      Φ ·x
                                                   −ikCH                α
                   =            H (kCH )      d xe         ⟨C|T {· · · Φ (x) · · · }|0⟩ , (2.44)
                         (2π)4
                                                                                           Φ
where we have only explicitly indicated the extra dependence on the new particle Φ, kCH       is
its momentum, and α describes its spin quantum number. The extra dimensions of C and
H due to the appearance of Φ are
                          ∆C α = −4 + dim(Φ),       ∆H α = − dim(Φ),                      (2.45)
where dim(Φ) is the dimension of the field Φ(x), which is 1 for a scalar or vector field, and
3/2 for a fermion field. The dependence of C α on α can be easily worked out using a boost
analysis. If we choose the direction of C as the z axis, then each collinear momentum in C
scales as in Eq. (2.42). Now we consider boosting C back to its rest frame, which causes the
C-collinear momenta to scale as (λ, λ, λ). Hence each component of α should scale in the
                                                43


same way, and the power counting C α is simply given by its dimension,
                           C α ≃ λdim(C) for each α in C rest frame.                    (2.46)
Then we boost C α back to the lab frame, where it is highly boosted along the z direction.
This gives an enhancement (Q/λ)s to the largest component of C α , with s being the spin of
Φ. In contrast, each component of H α scales the same, being Qdim(H) . Including the power
counting d4 kCH
             Φ
                 ∼ λ4 , we then have the extra power counting due to Φ:
    • Φ = ϕ (scalar): dim(ϕ) = 1 and s = 0, leading to Q−1 · λ1 = λ/Q; (Note that this case
      also applies to ghost fields.)
                                                                               p
    • Φα = ψ α (fermion): dim(ψ) = 3/2 and s = 1/2, leading to Q−3/2 · λ3/2 ·    Q/λ = λ/Q;
    • Φα = Aα (vector boson): dim(A) = 1 and s = 1, leading to Q−1 · λ1 · (Q/λ) = 1. This
      only applies to the unphysical A+ component which is proportional to its momentum.
      The physical transverse polarization A⊥ receives no enhancement from the boost, so
      gives a power counting λ/Q, the same as the scalar case. The remaining component
      A− undergoes a suppression λ/Q by the boost, so gives the power counting (λ/Q)2 .
Therefore, attaching a collinear subgraph to the hard subgraph by a scalar, fermion, or trans-
versely polarized vector boson brings a power suppression by λ/Q, while by a longitudinally
polarized vector boson brings no power suppression.
2.3.2.3    Soft subgraph connection to a collinear subgraph
Now we consider the power counting due to soft lines. Adding an extra line of the field
Φα (x) between the soft subgraph S and some collinear subgraph C (taken to be along the z
                                               44


direction) changes their convolution to
   Z                                 Z                   Z                                            
      d4 kCS
           Φ
                                       d4 kCS
                                           Φ
                                                                     Φ ·x
                                                                  −ikCS
              C α (kCS
                    Φ
                       ) S α (kCS
                               Φ
                                  )=          C α (kCS
                                                    Φ
                                                       )      4
                                                            d xe                       α
                                                                          ⟨0|T {· · · Φ (x) · · · }|0⟩ ,
      (2π)4                            (2π)4
                                                                                                      (2.47)
         Φ
where kCS    is the soft momentum that scales as (λ2 /Q, λ2 /Q, λ2 /Q), and α is the spin index.
This new attachment changes the dimensions of S α and C α by
                               ∆S α = −4 + dim(Φ),      ∆C α = − dim(Φ).                              (2.48)
This is similar to Eq. (2.45) with H replaced by C and C by S, but now the collinear
subgraph C α has different scalings for different α components. So we count Eq. (2.47) as
          Z                              2 dim(Φ)
               d4 kCS
                   Φ
                                               λ                                                   
      ∆              4
                         α Φ       α Φ
                       C (kCS ) S (kCS ) ∼                 × λ− dim(Φ) · (spin enhancement)
               (2π)                            Q
                                              dim(Φ)
                                               λ
                                           ∼               × (spin enhancement),                      (2.49)
                                               Q
where in the first step, the first factor is from the soft subgraph and the integration measure,
and the second factor is from the collinear subgraph. By the same boost argument as in the
previous situation, C α receives a power enhancement for fermions and vector bosons.
    • Φ = ϕ (scalar) or A⊥ (transversely polarized vector boson), dim(Φ) = 1 without boost
      enhancement, leading to a λ/Q suppression; (Note that this case also applies to ghost
      fields.)
                                                                            p
    • Φ = ψ (fermion), dim(Φ) = 3/2 with a boost enhancement                  Q/λ, leading to a λ/Q
      suppression;
    • Φ = A+ (longitudinally polarized vector boson), dim(Φ) = 1 with a boost enhancement
                                                   45


      Q/λ, leading to a power counting of 1.
Therefore, attaching the soft subgraph to a collinear subgraph by a scalar, fermion, or trans-
versely polarized vector boson brings a power suppression by λ/Q, while by a longitudinally
polarized vector boson brings no power suppression.
2.3.2.4    Soft subgraph connection to the hard subgraph
Adding an extra soft line to the hard subgraph H works in a similar way and leads to the
power counting formula
                                       dim(Φ)               2 dim(Φ)
                                     λ2           − dim(Φ)    λ
                     ∆(S ⊗ H) ∼                 ·Q         =             .              (2.50)
                                     Q                        Q
Compared to Eq. (2.49), the power factor Q− dim(Φ) instead of λ− dim(Φ) generally suppresses
the soft connections to H, and there is power enhancement from the boost. Therefore,
attaching the soft subgraph to the hard subgraph by a scalar or vector boson brings a power
suppression by (λ/Q)2 , while by a fermion brings a power suppression (λ/Q)3 .
    To conclude, we list in Table 2.1 the power counting rules for adding an extra line in
a certain reduced diagram. The rules work in a fashion of construction so give the power
counting relative to a certain diagram, e.g., the leading-order diagram. As an example,
for the Sudakov form factor in Fig. 2.3(a), the external q q̄ lines dictate the two collinear
subgraphs to be connected to H by at least a fermion line separately. From our power
counting rules, having additional line connections generally brings power suppressions except
for longitudinally polarized gluons connecting Cq,q̄ to H or S. This leads to the reduced
diagram in Fig. 2.3(b) that has the same power counting as the leading-order diagram or
the pure hard region. Any other pinch surfaces give more suppressed power counting. So it
                                               46


Table 2.1: The counting of the (λ/Q) power associated with each extra line attachment
between a collinear subgraph C and the hard subgraph H, the soft subgraph S and C, or
S and H. In the second and third columns, we take S to refer to the soft region with
the momentum scaling (λ2 /Q, λ2 /Q, λ2 /Q), while in the last two columns, S ′ refers to the
soft region with the scaling (λ, λ, λ). For S ′ we denote ∆ncs = ncs − 1 as the number of
collinear propagators that the soft momentum flows through with respect to the minimal
configuration (ncs = 1).
                                C-H    S-C      S-H         S ′ -C      S ′ -H
                      ϕ, c, c̄    1       1       2       1 + ∆ncs         1
                         ψ        1       1       3      1/2 + ∆ncs      3/2
                        A+        0       0       2       0 + ∆ncs         1
                        A⊥        1       1       2       1 + ∆ncs         1
                        A−        2       2       2       1 + ∆ncs         1
is called the leading region.
2.3.2.5     Power counting for alternative soft scalings
The previous subsection assumes the soft gluon momenta scale by a uniform scaling λS ∼
λ2 /Q. Such soft momenta do not change the collinear propagator virtualities when flowing
through the latter. In general, we can have λS to vary between λ2 /Q and λ, for ΛQCD ≲
λ ≪ Q. This generic soft scaling does not affect the power counting for the collinear-to-hard
coupling, but alters that for the soft attachments to the collinear and hard subgraphs.
    When a soft momentum ks ∼ (λS , λS , λS ) flows along a collinear momentum kc ∼
(Q, λ2 /Q, λ), it changes the virtuality to
  (kc +ks )2 = kc2 +ks2 +2kc ·ks ∼ λ2 +λ2S +λS Q ∼ max(λ2 , λS Q) = λ2 ·max(1, λS Q/λ2 ). (2.51)
Thus if this soft momentum flows through ncs collinear propagators, the collinear subgraph
                                             ncs
gains an extra factor [1/ max(1, λS Q/λ2 )]       apart from the dimensional counting λ− dim(Φ)
times a boost enhancement factor in Eq. (2.49). Since we take λS ≳ λ2 /Q, the term λS Q/λ2
                                                 47


is at least of order 1, so we can simplify max(1, λS Q/λ2 ) to λS Q/λ2 . The power counting
of the soft subgraph should also be modified by λS . Therefore, an extra soft attachment
between S and a collinear subgraph leads to an additional power counting
                     dim(Φ)  s  2 ncs  dim(Φ)−s                       dim(Φ)−ncs
                   λS            Q          λ             λ                λS
   ∆(C ⊗ S) ∼                 ·       ·              =                                     (2.52)
                   λ             λ         λS Q           Q              λ2 /Q
where s is the spin of Φ. This introduces an extra factor (λS Q/λ2 )dim(Φ)−ncs with respect to
Eq. (2.49). For λS = λ2 /Q, it recovers the same counting. As λS increases, we gain a power
enhancement if dim(Φ) > ncs , which can only happen for fermion case with ncs = 1, but
otherwise a suppression. The minimal configuration ncs = 1 yields a power counting
                           dim(Φ)−s           dim(Φ)−1
                            λ                λS
                                                            ,   (ncs = 1)                  (2.53)
                            Q              λ2 /Q
which does not affect the power counting for scalar and vector bosons, but changes the
                  p
fermion case to λS /Q, enhancing the λ/Q counting in Table 2.1 if λS ∼ λ. For ncs ≥ 2,
a large λS ≫ λ2 /Q leads to a suppression for all cases. Hence, for the leading region, one
usually needs ncs = 1 for the large scaling.
    For the soft momentum ks flowing into H, we can still neglect it in H since λS ≪ Q.
Then Eq. (2.50) is modified to
                                                                  dim(Φ)
                                        dim(Φ)   − dim(Φ)       λS
                        ∆(S ⊗ H) ∼    λS       ·Q         =                ,               (2.54)
                                                                Q
which enhances the counting in Eq. (2.50) if λS ≫ λ2 /Q.
    To summarize, we include in the last two columns of Table 2.1 the power counting for
the high end of soft region S ′ with λS = O(λ), and with ncs = 1. Since the soft attachments
                                                48


still lead to power suppression except for longitudinally polarized gluons, the leading region
graph for Sudakov form factor takes the same form as Fig. 2.3, now for the whole soft region
with λ2 /Q ≲ λS ≲ λ.
2.4       Basic approximation for a single region
It is the regions around the pinch surfaces that give important power contributions to the
Feynman integrals. Having identified the pinch surfaces that are associated with leading
power contributions, we may then make certain approximations to extract those contribu-
tions based on the power counting of the related momenta in Eqs. (2.42) and (2.43).
     As shown in Table 2.1, the major complication from the gauge theory is that there is no
penalty from adding arbitrarily many vector boson lines attaching the collinear subgraphs
to the hard or soft subgraphs. But the polarizations must be proportional to the collinear
momenta. This is the key for the factorization of gauge theory as it will allow the use of
Ward identity.
     To present the approximators, it is helpful to confine ourselves to a particular process or
amplitude. So in the following discussions, we will mainly be using the Sudakov form factor
as an example, but other processes like DIS will also be referred to for completeness.
2.4.1      Approximation of collinear-to-hard connections
In the Sudakov form factor, each collinear subgraph Ci is connected to the hard subgraph
H by one quark line and a series of gluon lines. They are convoluted by the integrations of
                                               49


the loop momenta, which can be written as
                                Z                     " n            #" m                 #
                                      d4 kq d4 kq̄ Y d4 ki                Y d4 lj
              Cq ⊗ H ⊗ Cq̄ =
                                     (2π)4 (2π)4 i=1 (2π)4 j=1 (2π)4
                                       × Cq,α,µ1 ···µn (kq , k1 , · · · , kn ) · g µ1 ν1 · · · g µn νn
                                       × Hα,ν1 ···νn ;β,σ1 ···σm (kq , k1 , · · · , kn ; lq̄ , l1 , · · · , lm )
                                       × g σ1 ρ1 · · · g σm ρm · Cq̄,β,ρ1 ···ρm (lq̄ , l1 , · · · , lm )         (2.55)
for a certain region of some diagram that has n and m collinear gluons connecting Cq and
Cq̄ to H, respectively. In Eq. (2.55), α (kq ) and β (lq̄ ) are the spinor indices (loop momenta)
of the quark and antiquark respectively, and {µi } ({ki }) and {νj } ({kj }) are the Lorentz
indices (loop momenta) of the Cq - and Cq̄ -collinear gluons, respectively. All the collinear
(pinched) propagators have been included in Cq or Cq̄ , and we will eventually include the
loop integrations into them as well.
    We take the quark and antiquark to move along the ±z directions, respectively. Around
the pinch surface, the collinear momenta scale as
    ki ∼ (Q, λ2 /Q, λ), (i = q, 1, · · · , n),    and lj ∼ (λ2 /Q, Q, λ), (j = q̄, 1, · · · , m).                (2.56)
These momenta circulate between the hard subgraph H and collinear subgraphs. The prop-
agators inside H are not pinched, and proper deformations can be done to make them have
high virtualities of order Q2 . Then we may expand each of them with respect to the small
parameter λ without encountering any singularities. The leading-power contribution can be
                                                        50


simply obtained by neglecting λ in H, so we approximate H by
                                       H({ki }; {lj }) → H({k̂i }; {ˆlj }),                               (2.57)
with
                k̂iµ = (ki+ , 0− , 0T ) = (ki · n) n̄µ ,    ˆlµ = (0+ , l− , 0T ) = (lj · n̄) nµ ,        (2.58)
                                                              j           j
where i = q, 1, · · · , n, j = q̄, 1, · · · , m, and we have introduced two lightlike auxiliary vectors
                                              1                                        1
               nµ = (0+ , 1− , 0T ) = √ (1, −⃗z),            n̄µ = (1+ , 0− , 0T ) = √ (1, ⃗z).           (2.59)
                                               2                                         2
    For the spinor contraction between Cq and H, the boosted factor Cq has a large component
in the spinor space, which can be projected out by [Collins, 2013]
                                               "     P                   #                        
                                                 γ·n   s us (k̂)ūs (k̂)                   γ +γ −
    Cq,α (k) → Cq,δ (k) Pδα = Cq,δ (k)                                        = Cq,δ (k)                , (2.60)
                                                       2k̂ · n                                2      δα
                                                                           δα
where k stands for a generic collinear momentum along the z direction and us (k̂) is the
massless spinor with momentum k̂ = (k · n)n̄ and spin s. The projector (P)δα has the
property
                             P 2 = P, ūs (k̂)P = ūs (k̂), and P(γ · k̂) = 0,                            (2.61)
such that it projects Cq (k) onto a massless spinor along the z direction. Perturbatively, the
fermion propagator numerator k/ + m contracted with P keeps its large component k + γ −
intact, so P keeps the leading-power accuracy. P inserting between Cq and H contracts the
massless spinor ūs (k̂) with H to give the hard factor a physical interpretation of massless
quark interaction.
                                                       51


    Similarly, for the spinor contraction between H and Cq̄ , we insert a projector
                                      "P                          #                                    
                                                  ˆ       ˆ                                γ +γ −
                                            s us (l)ūs (l)γ · n̄
       Cq̄,β (l) → P̄βκ Cq̄,κ (l) =                                      Cq̄,κ (l) =                       Cq̄,κ (l),       (2.62)
                                                 2ˆl · n̄            βκ
                                                                                                2
which is the same as P. After these two approximations, Eq. (2.55) becomes
                                Z                  " n             #" m                 #
                                   d4 kq d4 kq̄ Y d4 ki                 Y d4 lj
         Cq ⊗ H ⊗ Cq̄ ≃
                                  (2π)4 (2π)4 i=1 (2π)4 j=1 (2π)4
                                    × Cq,α,µ1 ···µn (kq , k1 , · · · , kn ) · g µ1 ν1 · · · g µn νn
                                    × Pαδ Hδ,ν1 ···νn ;κ,σ1 ···σm (k̂q , k̂1 , · · · , k̂n ; ˆlq̄ , ˆl1 , · · · , ˆlm ) Pκβ
                                    × g σ1 ρ1 · · · g σm ρm · Cq̄,β,ρ1 ···ρm (lq̄ , l1 , · · · , lm ),                      (2.63)
where the hard factor H is surrounded by two P’s which amputate and put on-shell the two
quark lines connected to H. This fact will be important when applying Ward identity to the
collinear gluons.
    Based on our power counting rules, only the longitudinal polarizations of the collinear
gluons connecting Cq or Cq̄ to H are of leading power. This means that we shall have νi = +
and σj = − in Eq. (2.65), which extracts the g −+ components for all the metric tensors. So
we make the approximations
                                  nµi k̂iνi     nµi k̂iνi                    ˆlσi n̄ρj        ˆlσi n̄ρj
                       µ i νi                                    σj ρj          j               j
                     g        7 →            =             ,   g        7→               =                 .                (2.64)
                                   k̂i · n        ki · n                       ˆlj · n̄         lj · n̄
These are equivalent to g µν 7→ nµ n̄ν , but writing as Eq. (2.64) has the advantage that the
gluon connection to the hard factor H will be replaced by its momentum contracted with
                                                            52


H,
                         Z                      " n                #" m             #
                               d4 kq d4 kq̄ Y d4 ki                    Y d4 lj
     Cq ⊗ H ⊗ Cq̄ ≃
                              (2π)4 (2π)4 i=1 (2π)4 j=1 (2π)4
                                                         "                #
                                                           Y nµi
            × Cq,α,µ1 ···µn (kq , k1 , · · · , kn ) ·
                                                              i
                                                                   ki · n
                             !                                                                                                  !
                  Y            h                                                                                       i Y
            ×          k̂iνi     Pαδ Hδ,ν1 ···νn ;κ,σ1 ···σm (k̂q , k̂1 , · · · , k̂n ; ˆlq̄ , ˆl1 , · · · , ˆlm ) Pκβ     ˆlσj
                                                                                                                             j
                     i                                                                                                   j
                "              #
                  Y n̄ρj
            ×                    · Cq̄,β,ρ1 ···ρm (lq̄ , l1 , · · · , lm ),                                                     (2.65)
                   j
                       lj · n̄
which will in turn allow the use of Ward identity.
2.4.2      Approximation of soft-to-collinear connections
From the power counting rules in Table 2.1, soft connections are generally power suppressed
except for soft gluons that are attached to collinear subgraphs with polarizations proportional
to the collinear momenta. For the leading region graph in Fig. 2.3(b), a soft momentum ks
scales as (λS , λS , λS ) and can be taken to circulate from S to Cq , to H, to Cq̄ , and then back
to S. When ks flows through H, each of its component is much smaller than Q, so we can
neglect it in H to the leading-power accuracy. When it flows through Cq along a collinear
line with momentum ki , it modifies the momentum to ki + ks , which does not change the
leading component (ki + ks )+ = ki+ = O(Q) thus does not change the collinear propagator
numerator, but it modifies the propagator denominator by
                                (ki + ks )2 − m2 = (ki2 − m2 ) + 2ki · ks + ks2 .                                               (2.66)
                                                                  53


Since ks has a uniform scaling λS for all components, it is the term 2ki+ ks− = O(QλS ) that
is the most important among all ks -related terms. Therefore, we may only keep ks− when ks
flows through Cq , which gives the approximation,
                                      µ
                                    kqs   7→ k̂qs  µ
                                                       = (kqs · n̄)nµ ,                                         (2.67)
where kqs denotes the soft momentum flowing through Cq . This applies for the whole range
of λS ∈ (λ2 /Q, λ). Even though for λS ∼ λ, the whole quark propagator is dominated by
2ki+ ks− = O(Qλ), we do not modify the term (ki2 − m2 ) in order for a unified approximation.
Similarly, for a soft momentum kq̄s flowing in Cq̄ , we approximate it by
                                      µ            µ
                                    kq̄s   7→ k̂q̄s    = (kq̄s · n)n̄µ .                                        (2.68)
    Those soft momentum approximation decouples the soft momenta from the hard sub-
graph and simplifies the soft-collinear couplings to
                         Z "Y 4             #"                  #
                                 d kqs,i Y d4 kq̄s,j
          Cq ⊗ S ⊗ Cq̄ ≃                                           Cq;µ1 ···µn ({k̂qs,i })g µ1 ν1 · · · g µn νn
                              i
                                  (2π)4             j
                                                        (2π)4
                             × Sν1 ,··· ,νn ;σ1 ,··· ,σm ({kqs,i }, {kq̄s,j })
                             × g σ1 ρ1 · · · g σm ρm Cq̄;ρ1 ···ρm ({k̂q̄s,j }),                                 (2.69)
where we have suppressed the collinear momentum dependence, and k̂qs,i and k̂q̄s,j are de-
fined as Eqs. (2.67) and (2.68), respectively. The soft factor S includes all the soft gluon
propagators. Here we are separately examining the soft momenta attaching to Cq and Cq̄ ,
which are related by necessary delta functions included in S; eventually we will also include
                                                       54


the soft integrations into S. Similar to the collinear gluon coupling Ci to H, here only the
g −+ components of all the metric tensors give leading-power contributions. So we make the
approximations,
                                  µi                µi                                   ρ              ρ
                      µi νi
                                k̂qs,i n̄νi      k̂qs,i  n̄νi         σj ρj
                                                                                 nσj k̂q̄s,j
                                                                                           j
                                                                                                  nσj k̂q̄s,j
                                                                                                          j
                    g       7 →              =                ,     g        7→               =               .                (2.70)
                                k̂qs,i · n̄      kqs,i · n̄                       n · k̂q̄s,j     n · kq̄s,j
Similar to Eq. (2.64), this is equivalent to g µν 7→ nµ n̄ν . It simplifies Eq. (2.71) to
                              Z "Y 4              #"                     #
                                                                                                      Y µ
                                                                                                                   !
                                        d kqs,i Y d4 kq̄s,j
       Cq ⊗ S ⊗ Cq̄ ≃                          4                      4
                                                                            Cq;µ1 ···µn ({k̂qs,i })         k̂qs,i
                                                                                                                i
                                    i
                                         (2π)             j
                                                               (2π)                                     i
                                       "                   #                                            "                    #
                                         Y n̄νi                                                            Y nσj
                                   ×                          Sν1 ···νn ;σ1 ···σm ({kqs,i }, {kq̄s,j })
                                          i
                                              k qs,i  · n̄                                                   j
                                                                                                                  n · kq̄s,j
                                                      !
                                         Y ρ
                                                  j
                                   ×          k̂q̄s,j     Cq̄;ρ1 ···ρm ({k̂q̄s,j }).                                           (2.71)
                                           j
The soft gluon couplings to the collinear factors are replaced by their approximated momenta
in the collinear subgraphs, which will allow the use of Ward identity.
2.5       Glauber region and modified approximations
The previous discussion on the soft momenta all relies on the uniform scaling in Eq. (2.43),
assuming the integration of the angular variable k̄ µ in Eq. (2.38) has a uniform bound in its
whole range. This includes the power counting rules, determination of leading regions, and
the soft approximations. The missing regions surrounding the soft pinch surface concern two
types:
    • |ks+ ks− | ∼ ksT
                    2
                        ≪ Q2 but |ks+ | ≫ |ks− | or vice versa. An example is ks ∼ (λ, λ3 /Q2 , λ2 /Q).
      This is still a soft momentum but with a large rapidity y ∼ ln(Q/λ), so it is also
                                                                55


      collinear to the quark. We can call this scaling soft-collinear scaling. When it flows
      through Cq , it is no longer a good approximation to only keep ks− as in Eq. (2.67). As
      we will see in Secs. 2.6 and 2.8, a correct treatment needs to consider ks as a collinear
      momentum, and its overlap with the soft region will be taken care of by the subtraction
      formalism.
   • Q2 ≫ |ks+ ks− | ≫ ksT     2
                                 . This does not raise any new issue compared to the uniform
      scaling and soft-collinear scaling.
   • |ks+ ks− | ≪ ksT 2
                         ≪ Q2 . This transverse-component-dominated region is called Glauber
      region, to which we now turn our discussion.
2.5.1      Glauber region
The Glauber region |ks+ ks− | ≪ ksT    2
                                           ≪ Q2 is a subset of the soft region. A typical Glauber
momentum scaling is
                                       ksGlauber ∼ (λ2 /Q, λ2 /Q, λ),                          (2.72)
where the plus and minus components are taken of the same order. When it flows through
the collinear subgraphs Cq,q̄ , it does not change the virtualities of collinear lines,
                                                                                            
     (kq + ksGlauber )2 ≃ kq2 − (ksT        ) + 2kq+ (ksGlauber )− − kqT · ksT
                                    Glauber 2                                 Glauber
                                                                                      ≃ O λ2 ,
                                                                                            
     (kq̄ − ksGlauber )2 ≃ kq̄2 − (ksT      ) − 2kq̄− (ksGlauber )+ + kq̄T · ksT
                                    Glauber 2                                 Glauber
                                                                                      ≃ O λ2 , (2.73)
where we only retained the terms of the highest scaling. The soft propagator also has the
same scaling
                                                                         
                                   (ksGlauber )2 ∼ (ksT
                                                     Glauber 2
                                                            ) ∼ O λ2 .                         (2.74)
                                                     56


In this way, the Glauber region also makes a leading-power contribution. But it is clear from
Eq. (2.73) that the transverse component of the soft momentum becomes non-negligible in
the collinear subgraphs, so that the approximations in Eqs. (2.67) and (2.68) are no longer
valid. Even though we may still take the approximations in Eq. (2.70), the soft momentum
k̂s that couples to collinear subgraphs is not the same soft momentum that flows in them.
As a result, the Glauber region violates the soft approximations that allow the exact use of
Ward identities.
     As we will see, it is crucial for the use of Ward identities to factorize soft gluons from
collinear subgraphs in a gauge theory. So the presence of the Glauber region endangers
factorization, which we must deal with particularly.
2.5.2      Contour deformation
The Glauber region (or any other soft region) is in the neighborhood of the soft pinch surface,
but is not itself a pinch surface. When getting away from the pinch surface, the momentum
contour is no longer exactly pinched at a singular point to give zero virtualities. But if we
are close to the pinch surface, the contour deformation is normally still restricted to keep the
virtualities from getting too large. Therefore, we need to investigate whether the contour is
pinched in the Glauber region. If not, we may still deform the contour to avoid the Glauber
region.
     Since the characteristics of the Glauber region is that the longitudinal components ks± are
much smaller than the transverse component ksT , we would identify the poles of ks± around
0 given ksT . First, the denominators of Cq -collinear propagators
                                                      −
                                         +
                  (kqi + ks )2 + iϵ = 2(kqi + ks+ )(kqi + ks− ) − (kqi,T + ksT )2 + iϵ
                                                   57


                                          + −
                                    ≃ 2kqi  (kqi + ks− ) − (kqi,T + ksT )2 + iϵ               (2.75)
contribute to poles of ks− on the lower half plane, of the order λ2 /Q. Since all the Cq -collinear
lines propagate from H to the future with large positive plus momenta, and we can always
choose ks to flow along Cq -collinear lines in the same direction without detouring back and
forth inside Cq , all the Cq -collinear lines only contribute to small ks− on the lower half plane.
Similarly, all the denominators of Cq̄ -collinear propagators only contribute to ks+ poles on
the upper half plane,
                                                        −
                                           +
                 (kq̄j − ks )2 + iϵ = 2(kq̄j − ks+ )(kq̄j − ks− ) − (kq̄j,T + ksT )2 + iϵ
                                         −    +
                                    ≃ 2kq̄j (kq̄j − ks+ ) − (kq̄j,T + ksT )2 + iϵ,            (2.76)
of order λ2 /Q. Both these ks+ and ks− poles are in the Glauber region, but only on the same
half plane respectively, and not pinched. Around those poles, the gluon propagator also
contribute to poles for ks+ and ks− , but of order ksT    2
                                                              /ks± ∼ O(Q), which is far away.
    Therefore, we can deform the contour of ks± such that their magnitudes stay much greater
than λ2 /Q. Due to the ks− poles from Cq propagators, we deform the ks− contour to the upper
half plane,
                                         ks− 7→ ks− + i v(ks− ),                              (2.77)
where v(ks− ) > 0 kicks in when ks− ∼ O(λ) and keeps ks− on the deformed contour to be at
least of order λ. A simple choice can be, e.g.,
                                                           2 /2λ2
                                           v(k) = λ e−k           ,                           (2.78)
                                                    58


which only modifies the Glauber region. This deformation does not change the ks+ poles
from the Cq̄ propagators, but changes the ks+ pole from the gluon propagator to the order
  2
ksT /ks− ∼ O(λ). Hence, it is still compatible to deform the ks+ contour to the lower half
plane by O(λ),
                                           ks+ 7→ ks+ − i v(ks+ ),                               (2.79)
where we chose the same deformation function in Eq. (2.78), which is not necessary.
    Eqs. (2.77) and (2.79) deform the contours of ks+ and ks− by the same amount. This is
called symmetrical deformation. To avoid possible obstruction from the poles of the gluon
propagator, the maximum extent of the symmetrical deformation is O(λ). Symmetrical
deformation is not always necessary. We will also see that in certain cases, e.g., in Sec. 3.2.2.2,
symmetrical deformation is not allowed by a partial pinch in the Glauber region. There it
is sufficient to only deform ks+ or ks− .
    The deformations in Eqs. (2.77) and (2.79) are only applied to the Glauber region, but
not necessary to the other soft subregions. One may devise a uniform deformation formula
for the whole soft region, e.g.,
                                                                                   
                                 2ks+ ks−                                    2ks+ ks−
               ks− 7→ ks− + iρ      2
                                              v(ks− ),    ks+ 7→ ks+  − iρ      2
                                                                                        v(ks+ ), (2.80)
                                  ksT                                         ksT
where ρ(x) has the property that ρ(x) ≃ 1 as |x| ≪ 1 and ρ(x) → 0 as |x| ≳ 1. One simple
choice is
                                                        (λ/Q)2
                                          ρ(x) =                    .                            (2.81)
                                                    x2 + (λ/Q)2
After this deformation, the components (ks+ , ks− , ksT ) are of the same order, restoring the
uniform scaling [Eq. (2.43)]. Then the soft approximations in Eqs. (2.67) and (2.68) can be
                                                       59


applied.
2.5.3          Modified approximations
The soft approximations can be applied only after the contour deformation, so it is important
that they do not introduce any poles that obstruct the contour deformations in Eq. (2.80).
Therefore, for the approximations in Eq. (2.70), we need to carefully specify an iϵ prescription
for the soft poles introduced around 0. For soft gluon momenta k̂qs,i and k̂q̄s,j flowing into S
from Cq and Cq̄ , respectively, we modify Eq. (2.70) to
                      µi                µi                                  ρ                 ρ
       µi νi
                    k̂qs,i n̄νi       k̂qs,i n̄νi       σj ρj
                                                                   nσj k̂q̄s,jj
                                                                                      nσj k̂q̄s,j
                                                                                               j
     g       7 →                  =                 , g       7→                  =                 . (2.82)
                 k̂qs,i · n̄ + iϵ   kqs,i · n̄ + iϵ              n · k̂q̄s,j + iϵ   n · kq̄s,j + iϵ
Thus introduced poles for ks± are all on the same half plane as those from the Cq̄,q propagators.
    The collinear approximations in Eq. (2.64) also introduce poles of the gluon momenta at
0. Even though they are designed only for the collinear region, as we will see in Sec. 2.6,
after applying those approximations, we extend the loop momenta to all regions, including
the soft and hard regions as well. The overlap with the soft region will be subtracted to
avoid double counting. The subtraction term is obtained by first applying the soft approx-
imation in Eqs. (2.67)(2.68)(2.82), and then applying the collinear approximation. Since
the soft approximation is applied to the deformed contour, it is necessary that the collinear
approximation be compatible with such deformation when the same gluon momentum enters
the soft region. The same collinear gluon momentum ki entering Cq from H can also enter
the soft region, where it flows from Cq̄ into S and has soft poles on the lower half plane for
its plus momentum component. Similarly, for the collinear gluon momentum lj entering Cq̄
from H, we need to avoid the soft pole on the lower half plane for its minus momentum
                                                      60


component. Therefore, we need to modify Eq. (2.64) to
                          nµi k̂iνi     nµi k̂iνi                   ˆlσi n̄ρj      ˆlσi n̄ρj
            µ i νi                                     σj ρj          j              j
          g        7 →              =             ,  g       7 →               =              . (2.83)
                       k̂i · n + iϵ   ki · n + iϵ                ˆlj · n̄ + iϵ   lj · n̄ + iϵ
    In this way, the necessity of contour deformation to get out of the Glauber region dictates
the iϵ prescriptions in the soft and collinear approximations. The direction of the deformation
is determined only by the causal structure of the scattering process, so are the iϵ prescriptions,
which will give correct causal properties for each factor in the factorization result.
2.6       Subtraction formalism and factorization
The leading power contribution of a certain amplitude or cross section does not a priori
correspond to a factorized expression. The latter is motivated by the fact that the leading
regions have distinct momentum scales, classified as the hard, the collinear, and the soft
momenta, as given by the Libby-Sterman analysis. By choosing proper approximations as
in Sec. 2.4, the different momentum scales detach so as to imply a factorized expression.
    As a necessary condition for factorization, the simplest nontrivial diagram that only has
a single leading region with two distinct momentum scales should simply factorize as a result
of the approximations, e.g., the leading-order DIS diagram as in Fig. 2.4(a). If the whole
amplitude or cross section only had one single region, then factorization would come as a
direct result of proper approximations. However, as a renormalizable gauge theory, each
process in QCD has an infinite number of diagrams with arbitrarily complicated leading
regions. A proper treatment must take into account all possible regions of all diagrams, with
careful avoidance of double counting between neighboring regions. The factorization is then
                                                    61


a highly nontrivial result, being very intricate and also fragile.
    In this section, we briefly review the subtraction formalism used in the treatment of
multiple regions in the derivation of factorization. First, we will consider the two low-order
DIS diagrams in Fig. 2.4 and examine their factorization. Starting with the smallest region
and building up larger and larger regions while avoiding double counting of overlapping
regions naturally motivate the subtraction method. Then after a formal discussion of the
subtraction formalism, we demonstrate its application to the simplest case, in a real physical
situation, the DIS factorization in the light-cone gauge.
                                                          q
                         q
                                                            k1
                            k
                        p                                   k2
                                                         p
                              (a)                              (b)
Fig. 2.4: Examples of DIS diagrams for an elementary target. (a) is the LO diagram, with
one single leading region indicated by the red hooked line. (b) is an NLO diagram, with two
leading regions indicated by the red and blue hooked lines, respectively.
2.6.1     A particular DIS diagram as an example
Each leading region of a DIS diagram contains only two subgraphs, one hard subgraph H
connected to the virtual photon lines, and one collinear subgraph C attaching to the external
target. The two subgraphs are joined by a set of collinear lines whose propagators we include
in the collinear subgraph. Fig. 2.4 shows two diagrams for an elementary target, which we
take as an on-shell quark with a small mass m to cut off collinear divergences, with no
                                               62


concern for confinement issues. Taking the kinematics
                                                                       
                              +  m2                        +     Q2
                      p=     p , + , 0T    ,   q=     −xp ,          , 0T   ,                (2.84)
                                 2p                             2xp+
for the target and the photon, respectively. With Q ≫ m, the lines in the hard subgraph
have virtualities of order Q2 , and those in the collinear subgraph have virtualities of order
m2 . The leading regions are represented by the massless reduced diagrams, obtained by
taking m → 0, which yields p → p̂ = (p+ , 0− , 0T ).
    In this section, we work in the light-cone gauge, where the gluon propagator numerator
is
                                              nµ k ν + nν k µ
                                     −g µν +                  ,                              (2.85)
                                                    k·n
with n is defined in Eq. (2.59). This suppresses the longitudinal polarization so that the
leading regions only have two quark or transversely polarized gluon lines joining C to H.
The approximation can be easily devised, following the spirit in Sec. 2.4, as
    • for each collinear momentum ki flowing into the hard subgraph H, we approximate it
      by only keeping its plus component,
                                         ki 7→ k̂i = (ki · n)n̄;                             (2.86)
    • for each quark line entering (leaving) H, insert the spinor projector
                                   Pn = γ − γ + /2 (P̄n = γ + γ − /2);                       (2.87)
                                                                              µν
    • for each gluon line connecting C to H, insert the Lorentz tensor g⊥        to project out the
                                                 63


       transverse polarization.
Denoting the effect of such approximation by
                                 Z                     Z
                      H ⊗C ≡       [dk]H(k) C(k) 7→       [dk]H(k) T̂ C(k),                 (2.88)
where k collectively denotes all the collinear momenta, and T̂ acts on the integrand, inserting
certain projectors between H and C and neglecting certain momenta in H to its left.
    The LO diagram Γ0 in Fig. 2.4(a) has two leading regions, (1) one with k = α p̂ (0 <
α < 1) collinear to the target, so that the reduced diagram has one hard subgraph and one
collinear subgraph, separated by the red hooked line, which inserts necessary projectors and
approximates on shell the collinear momenta passing it; (2) and the other with k 2 ∼ O(Q2 ),
so that the whole diagram is the hard subgraph, to which the external photon and quark
lines attach. To simplify the discussion in this section, we assume the second region is power
suppressed by some nonperturbative effect, so that we only have the first leading region.
Factorization then follows trivially from the fact that only k + flows through H, and this
leads to
                   Z                            Z             Z                         
                        d4 k                         +           dk − d2 kT
  CR Γ0 = TR Γ0 =            H0 (k) T̂ C0 (k) =    dk H0 (k̂)               Pn C0 (k) P̄n , (2.89)
                       (2π)4                                      (2π)4
up to power suppressed contribution. Here we use the compound notation “CR Γ0 ” to refer
to the leading contribution of the diagram Γ0 from the region R. TR is the approximator
designed for this region; it acts on Γ0 to extract the leading contribution. Since Γ0 only
has one leading region R, CR Γ0 is just equal to TR Γ0 . We will deal with the spin and color
projectors in more details later in Sec. 2.7.1.
                                                 64


    The NLO diagram Γ1 in Fig. 2.4(b) is more complicated. We have two leading regions:
(1) region R1 : k1 is in H with a large virtuality, while k2 = β p̂ (0 < β < 1) is collinear, as
indicated by the blue hooked line; and (2) region R2 : both k1 and k2 are collinear to the
target, with
                              k1 = αk2 ,     k2 = β p̂,    0 < α, β < 1,                             (2.90)
which is indicated by the red hooked line in Fig. 2.4(b). The topology of the region R1 is
defined in the momentum space as k12 ̸= 0 and k2 = β p̂. Apparently, its closure contains R2
as a subset. This relation between the two leading regions is denoted as R1 > R2 . R1 is
greater than R2 in the sense that it has more lines with hard virtualities, while R2 has more
lines in the collinear subgraph.
    To move forward, we first define the diagram Fig. 2.4(b) as
                            Z
                               d4 k1 d4 k2
                      Γ1 =                  [H0 (q; k1 ) · K0 (k1 , k2 ) · C0 (p; k2 )] .            (2.91)
                              (2π)4 (2π)4
The factor H0 (q; k1 ) includes the quark line on the top and the two photon vertices, K0 (k1 , k2 )
includes the propagators of k1 , the gluon line of (k2 − k1 ), and their vertices, and C0 (p; k2 )
includes the rest of the diagram. These three factors are convoluted in momenta k1 and k2 ,
which has been explicitly written, and spinor and color indices, which are implicitly implied
by the dot notation.
    To extract the leading contribution from R2 , we can insert the approximator T̂ between
H0 and K0 , formally written as
                                  Z
                                      d4 k1 d4 k2 h                                              i
              CR2 Γ1 = TR2 Γ1 =                     H  0 (q; k1 )T̂ K 0 (k1 , k2 ) · C 0 (p; k2 )  , (2.92)
                                     (2π)4 (2π)4
                                                  65


which projects k1 in H0 by k̂1 according to Eq. (2.86) and inserts the spinor projectors Pn
and P̄n . This expression simply factorizes into
                    Z               Z                                                        
                        +              dk − d2 kT d4 k2                                  
       CR2 Γ 1 =      dk H0 (q; k̂)                          Pn K0 (k, k2 ) · C0 (p; k2 ) P̄n ,     (2.93)
                                          (2π)4 (2π)4
which adds to Eq. (2.89) with the same hard coefficient, but as an NLO correction to the
collinear factor.
    Now we consider the contribution from R1 . Naively, CR1 Γ1 is obtained by
                          Z
                               d4 k1 d4 k2 h                                          i
      CR1 Γ1 ∼ TR1 Γ1 =                     H0 (q; k1 ) · K0 (k1 , k2 ) T̂ C0 (p; k2 )
                              (2π)4 (2π)4
                          Z 4
                               d k1 d4 k2 h                                                    i
                        =                   H  0 (q; k 1 ) · K  (k ,
                                                               0 1 2 k̂ ) · P  C
                                                                              n 0 (p;  k2 )P̄n    , (2.94)
                              (2π)4 (2π)4
which seems to factorize into a hard factor H0 (q; k1 ) · K0 (k1 , k̂2 ) and a collinear factor
Pn C0 (p; k2 )P̄n . Now the hard factor includes the integration of k1 , which should be con-
strained to the hard region. However, this is technically hard to define, given also the need
to deform contours when unpinched propagators become close to the on-shell poles. It would
be ideal to have the k1 integration in the hard factor to extend to all regions. Then it can
unavoidably reach the collinear region where k1 = αk̂2 . This is still a leading region in the
hard subgraph. But such contribution has been included in the region R1 . This reflects a
general fact that a larger region R1 has overlap with smaller regions R2 < R1 , such that the
approximator TR1 alone is not sufficiently exclusive to extract only the contribution from R1
when acting on the graph Γ alone. Therefore, when applying TR1 , one should first subtract
                                                   66


the contribution from smaller regions, such that
  CR1 Γ1 = TR1 (Γ1 − CR2 Γ1 ) = TR1 (1 − TR2 )Γ1
              Z 4
                  d k1 d4 k2 h                                                       i
           =                      H0 (q; k1 ) (1 − T̂ ) K0 (k1 , k2 ) T̂ C0 (p; k2 )
                 (2π)4 (2π)4
              Z       Z 4                                            Z                                
                    +        d k1                                            dk − d2 kT
           = dk                    H0 (q; k1 ) (1 − T̂ ) K0 (k1 , k̂)                   Pn C0 (p; k)P̄n , (2.95)
                            (2π)4                                              (2π)4
where in the third line we gave the factorization expression, which adds onto Eq. (2.89) with
the same collinear factor, but as an NLO correction to the hard factor, defined as
  Z
       d4 k1
             H0 (q; k1 ) (1 − T̂ ) K0 (k1 , k̂)
      (2π)4
       Z 4                                      Z                    Z                                  
            d k1                                     +                    dk1− d2 k1T
    =            H0 (q; k1 ) · K0 (k1 , k̂) − dk1 H0 (q; k̂1 ) ·                       Pn K0 (k1 , k̂)P̄n . (2.96)
           (2π)4                                                             (2π)4
    Adding the leading contribution from R1 and R2 , we have the total leading-power con-
tribution of Γ1 ,
                       X
                            CR Γ1 = CR2 Γ1 + CR1 Γ1 = TR2 Γ1 + TR1 (1 − TR2 )Γ1
                         R
                                    = TR2 Γ1 + TR1 Γ1 − TR1 TR2 Γ1 ,                                        (2.97)
where in the second line, we have separated all terms of different approximator applications.
By construction, TR2 gives a good approximation to Γ1 when both k1 and k2 have low virtu-
alities, but gives a poor description when either of them is hard. (Recalling our assumption
that the region with both momenta having hard virtualities is suppressed, so TR2 is bad
when k1 is hard but k2 is collinear.) Also, TR1 gives a good approximation to Γ1 only when
k1 is hard and k2 is collinear. Now, in the region R2 , among the three terms in the second
                                                         67


line of Eq. (2.97), the first term TR2 Γ1 gives a good approximation of Γ1 . The other two
terms combine into TR1 (1 − TR2 )Γ1 , which is suppressed in this region due to the (1 − TR2 )
factor that suppresses the low virtuality region of k1 . Since TR1 keeps k1 unchanged, it does
not affect such suppression. In the region R1 , the second term gives a good approximation
of Γ1 . The other two terms combine into (1 − TR1 )TR2 Γ1 , which is suppressed due to the
factor (1 − TR1 ) that suppresses the hard virtuality region of k1 .
    Now we examine how well Eq. (2.97) can approximate the graph Γ1 by constructing the
error term,
                                     X
                         r1 ≡ Γ1 −      CR Γ1 = (1 − TR1 )(1 − TR2 )Γ1 .                (2.98)
                                      R
The factor (1 − TR2 ) accounts for the error introduced by neglecting k1T with respect to Q,
so gives
                                                           
                                                       k1T
                                   (1 − TR2 )Γ1 = O           Γ1 .                      (2.99)
                                                        Q
The other factor (1 − TR1 ) accounts for the error introduced by neglecting k2T with respect
to k1T and Q, so
                                                                     
                              k2T k2T        k1T              k2T k1T k2T
                    r1 = O       ,      O          Γ1 = O          ,       Γ1 .        (2.100)
                              k1T Q           Q                Q     Q2
Since we always have k2T ≃ m ≪ Q and k1T ≲ Q, the error is power suppressed,
                                                    
                                                   m
                                        r1 = O         Γ1 .                            (2.101)
                                                   Q
    In fact, the argument of the subtraction formalism [Eq. (2.97)] can start with the error
term Eq. (2.98). Successively applying (1 − TR ) on Γ1 must yield a power suppressed result,
                                                68


since it successively suppresses all the leading regions. Therefore, we have, right from the
beginning,
                                                               
                                                              m
                            r1 ≡ (1 − TR1 )(1 − TR2 )Γ1 = O       Γ1 .                 (2.102)
                                                              Q
This can be reorganized as
          r1 = (1 − TR2 )Γ1 − TR1 (1 − TR2 )Γ1 = Γ1 − TR2 Γ1 − TR1 (1 − TR2 )Γ1 ,      (2.103)
so immediately gives the subtraction formula,
                                                                  
                                                                 m
                            Γ1 = TR2 Γ1 + TR1 (1 − TR2 )Γ1 + O        ,                (2.104)
                                                                 Q
where the power suppressed term is from r1 . Such subtraction formalism systematically
extracts the leading-power contributions from all regions, with loop momenta extending to
all regions and without double counting. Our analysis of the DIS diagrams up to NLO should
motivate that summing over all regions and diagrams can lead to a factorization result, which
we will discuss in Sec. 2.6.3.
2.6.2      Subtraction formalism
Generally, a diagram Γ can contain multiple leading regions, Ri . In the Sudakov form factor
example Fig. 2.3(b), different regions of a particular diagram differ by having different lines
or different numbers of lines in the hard, A-collinear, B-collinear, or soft subgraphs. In the
DIS example, different regions of the diagram Γ1 in Fig. 2.4(b) differ by having different
numbers of ladders in the hard of collinear subgraphs. In each region Ri , we design suitable
approximator TRi that acts on the integrand and gives a proper approximation for the integral
                                                69


in that region. The leading power contribution of each region R is iteratively defined as
             
             
             
             TR Γ,                              if no region Ri of Γ is smaller than R;
     CR Γ =                                                                               (2.105)
             
                        X
             TR Γ −               CR ′ Γ ,      otherwise.
                             R′ <R
And then summing over all regions gives an approximation to the original diagram Γ up to
power suppressed corrections,
                                               X
                                          Γ=        CR Γ + p.s.                             (2.106)
                                                 R
This is the general subtraction formalism of extracting the leading-power contribution from
a diagram Γ.
     No attempt will be given here to prove Eq. (2.106).2 We simply motivate it by examining
a simple case where the leading regions are strictly nested, i.e., all the leading regions Ri of
any graph Γ have the strict ordering R1 > R2 > · · · > Rn . This is true for diagrams with only
two kinds of subgraphs, such as DIS diagrams which only have collinear and hard subgraphs
(but not for Sudakov form factor which has four subgraphs). Similar to Eq. (2.102), by
successively subtracting all the leading region contributions, the remainder
                               r = (1 − TR1 )(1 − TR2 ) · · · (1 − TRn )Γ                   (2.107)
is power suppressed. Then by reorganizing Eq. (2.107), we have
            Γ = Γ − r + p.s.
               = Γ − (1 − TR2 ) · · · (1 − TRn )Γ + TR1 (1 − TR2 ) · · · (1 − TRn )Γ + p.s.
   2
     A formal argument can be found in [Collins, 2013].
                                                    70


             = Γ − (1 − TR3 ) · · · (1 − TRn )Γ + TR2 (1 − TR3 ) · · · (1 − TRn )Γ
                  + TR1 (1 − TR2 ) · · · (1 − TRn )Γ + p.s.
             = TRn Γ + TRn−1 (1 − TRn )Γ + · · · + TR1 (1 − TR2 ) · · · (1 − TRn )Γ + p.s.
             = CRn Γ + CRn−1 Γ + · · · + CR1 Γ + p.s.,                                          (2.108)
where in the last step we defined
                             CRi = TRi (1 − TRi+1 ) · · · (1 − TRn )Γ,                          (2.109)
which agrees with the general definition in Eq. (2.105).
    In a general Feynman diagram, the relations among all the leading regions form an ordered
graph, starting from the largest region where all loop momenta have hard virtualities, and
ending at the smallest region where as many loop momenta as possible are in the soft (or
collinear, when there is no soft subgraph) region. Then the contribution CR Γ of any region
R has the formal structure,
                            X
             CR Γ = TR Γ −         TR CR′ Γ
                            R′ <R
                            X                     X
                   = TR Γ −        TR TR′ (Γ −          CR′′ Γ)
                            R′ <R               R′′ <R′
                                 X                      X              X
                   = TR Γ + TR         (−TR′ )Γ + TR          (−TR′ )         (−TR′′ )Γ + · · ·
                                R′ <R                   R′ <R         R′′ <R′
                                XY
                   = TR Γ + TR            (−TRi′ )Γ,                                            (2.110)
                                {Ri′ }  i
where in the last line the sum is over all possible nestings of regions smaller than R: R >
                                                   71


R1′ > R2′ > · · · > Rn′ , and
                              Y
                                (−TRi′ ) = (−TR1′ )(−TR2′ ) · · · (−TRn′ ),             (2.111)
                              i
with the approximators for larger regions on the left. Each nesting can terminate at any
region Rn′ < R, not necessarily the smallest region.
    Eq. (2.110) gives a general formula for the subtraction terms in CR Γ. They are given
by successively applying −TR′ for smaller regions R′ < R, and then applying TR . When
applying any TR′ , we treat the loop momenta as classified in the same way as the region R′ ,
and the approximator TR′ works by deforming certain momentum contours, neglecting certain
momentum components, and inserting certain factors. That is, any subtraction term in CR Γ
is treated to have the same subgraph decomposition as R, and it is to be approximated in
the same way as TR Γ. Since R′ < R has more lines in the collinear or soft subgraphs and/or
fewer in the hard subgraph, the approximator TR′ inserts certain approximation factors and
truncates some momenta in the hard and/or collinear subgraphs of R′ . When applying TR
on top of TR′ , some soft lines are upgraded to collinear lines and some collinear lines to
hard lines, so then the approximator factors of TR′ will reside within the new hard and
collinear subgraphs specified by R. Therefore, graphically, TR Γ and the subtraction terms
P
   R′ <R TR (−CR′ Γ) correspond to the same region graph, with the only difference being that
the subtraction terms already have some “sub-approximators” applied within.
    In order for the presence of subtraction not to affect the argument of the approximation
TR , especially the application of Ward identities that follows the soft and collinear approxi-
mators defined in Eqs. (2.82) and (2.83), the same simplifications following TR Γ should also
                                  P
apply to the subtraction terms      R′ <R TR (−CR′ Γ). So we require the approximator factors
                                               72


as in Eqs. (2.82) and (2.83) not to cause gauge-violating structures. Applying TR on Γ allows
to factorize soft or collinear gluons onto Wilson lines for any region R, so does the TR′ for
smaller regions R′ < R in the subtraction terms of CR Γ. Then further applying TR on the
Wilson lines resulted from TR′ should also lead to a Wilson line structure. To guarantee
this, it is sufficient to require the use of Ward identities on a Wilson line to give back the
same Wilson line. On the other hand, since TR′ is likely to involve contour deformations for
some soft momenta, it is necessary that the approximator TR does not introduce any factors
contradicting such deformations when those soft momenta are upgraded to collinear or hard
momenta. This explains the iϵ choices in Eq. (2.83).
                                                          q
                                                              H0
                           q
                                                             k1
                               H
                             k                                K0
                          P    C                             k2
                                                         P     C0
                                (a)                             (b)
Fig. 2.5: (a) A general leading region for a DIS cut diagram in light-cone gauge is divided
into a hard subgraph H and a collinear subgraph C, joined by two collinear quark or gluon
lines. (b) The ladder expansion for a certain DIS cut diagram is decomposed into a series of
2PI subdiagrams connected by two quark or gluon lines. The thick external lines represent
hadron targets.
2.6.3       Factorization of DIS in light-cone gauge
Now we extend to all orders the discussion in Sec. 2.6.1 of the DIS factorization in light-cone
gauge, as a simple application and illustration of the subtraction formalism in Sec. 2.6.2.
We will replace the elementary quark target by a physical on-shell hadron target, such as a
proton, of momentum P and mass M .
                                               73


    In the light-cone gauge, the general leading region for DIS contains a hard subgraph and
a collinear subgraph, which are joined by two collinear quark or gluon lines, as shown in
Fig. 2.5(a). For a given graph, a larger region has more lines in H and fewer in C. This
motivates the ladder expansion of a general DIS diagram, as shown in Fig. 2.5(b), where each
unit of H0 , K0 , and C0 is two-particle-irreducible (2PI), which means that they cannot be
divided into two parts by only cutting two propagators, such that they cannot have further
ladder expansion. We denote H0 , K0 , and C0 as sums of all possible 2PI subgraphs for given
external lines, each being a function of external momenta, spin indices, and color indices.
Then the sum of all DIS diagrams is given by
                   W = D0 + H0 · C0 + H0 · K0 · C0 + H0 · K0 · K0 · C0 + · · ·
                               X∞
                      = D0 +          H0 · K0n · C0 ,                                       (2.112)
                               n=0
where D0 is the minimal graph which is itself 2PI and has no ladder decomposition. We stress
that each factor represents an all-order sum of 2PI perturbative diagrams, by the assumption
(1) in Sec. 2.1.2, with the hadron-parton vertex described by some hadron wavefunction.
    By directly coupling the hadron to the virtual photon, D0 has all lines being highly
virtual, so it is power suppressed, by the same assumption (1) in Sec. 2.1.2.
    A graph Γn with n ladders (n is the number of K0 factor in Eq. (2.112)) has n + 1 leading
regions, R0 > R1 > · · · Rn , with Ri referring to the region with i lower K0 ladders belonging
to the collinear subgraph. Then following the discussion in Eqs. (2.107) - (2.109), the leading
contribution of each region Ri is
                                                         h            in−i
      CRi Γn = TRi (1 − TRi+1 ) · · · (1 − TRn )Γn = H0 · (1 − T̂ )K0      T̂ [K0 ]i · C0 , (2.113)
                                                     74


where T̂ is defined in the same way as Eq. (2.88). This factorizes into a hard factor H0 ·
[(1 − T̂ )K0 ]n−i and a collinear factor Pn [K0 ]i · C0 P̄n , similar to the low-order examples in
Eqs. (2.89)(2.93) and (2.95). Summing over i from 0 to n and then over n from 0 to ∞
amounts to summing over the ladders in the hard and collinear factors separately,
                               X ∞ X n
                         Ŵ =           CRi [H0 · K0n · C0 ]
                               n=0 i=0
                               X ∞ X n       h              in−i
                             =          H0 · (1 − T̂ )K0         T̂ [K0 ]i · C0
                               n=0 i=0
                               X ∞ X ∞       h              ii
                             =          H0 · (1 − T̂ )K0 T̂ [K0 ]j · C0
                                i=0 j=0
                                            1                 1
                             = H0 ·                    T̂          · C0 .                  (2.114)
                                     1 − (1 − T̂ )K0      1 − K0
This factorizes into a hard factor
                                                           1
                                  H(q; k̂) = H0 ·                    ,                     (2.115)
                                                   1 − (1 − T̂ )K0
and a collinear factor
                                                                
                                                      1
                                 C(P, k) = Pn               · C0 P̄n .                     (2.116)
                                                  1 − K0
They are convoluted in the momentum k and in color and spin indices. Eq. (2.114) approx-
imates W in Eq. (2.112) with the remainder term
                                           X∞        h              in
                    r = W − Ŵ = D0 +          H0 · (1 − T̂ )K0 · (1 − T̂ )C0 ,            (2.117)
                                           n=0
where both terms are power suppressed. Therefore, we get the factorized result of the DIS
                                                 75


cross section
                           Z              Z                          
                                 +            dk − d2 kT                M
                     W =      dk H(q; k̂)                C(P, k)  + O        .           (2.118)
                                                (2π)4                    Q
We will deal with the spin and color connections later in Sec. 2.7.1.
    Note that the sum over regions (i) and graphs (n) has been converted to two indepen-
dent sums over each subgraph, which leads to the factorized expression. However, it is the
subtraction of smaller regions from larger regions that separates different momentum scales.
In this way, the hard factor defined in Eq. (2.115) has removed all contribution from the
regions where any of the loop momenta become collinear. In terms of perturbative Feynman
amplitudes, it is free from collinear singularity, and the corresponding Feynman integrals are
only sensitive to the hard scale Q, so we are allowed to use perturbative descriptions due to
the asymptotic freedom, and it is safe to neglect the parton masses and virtualities therein,
as is encoded in the approximator T̂ . The collinear factor defined in Eq. (2.116) collects
all the pinch singularities in perturbative diagrams, so parton momenta in it are trapped in
the low-virtuality regions. It is thus not perturbatively tractable, but the all-order sum in
Eq. (2.116) can be formally defined nonperturbatively. Then the perturbative pinch singu-
larities are interpreted to be reflecting the sensitivity to nonperturbative dynamics. Even
though the result is obtained by perturbative diagram expansion, the overall sum, regardless
of its convergence issue, is still assumed to reflect the correct reality, by the assumption (3)
in Sec. 2.1.2. Its actual value can be obtained by fitting to experimental data, by virtue of
its universality.
    In this way, the subtraction formalism together with the sum over regions and graphs
factorizes the DIS cross section into hard and collinear factors, with the former only sensitive
to dynamics at a hard scale Q, and the latter only to the nonperturbative soft scale m. The
                                                76


separation of distinct scales is the essence of factorization.
2.7       Ward identity and Wilson lines: DIS factorization
          in Feynman gauge
In the light-cone gauge, the gluon propagators are constrained to be physically polarized, and
the leading region has a simple structure, as in Fig. 2.112 for the DIS. A simple application of
the subtraction formalism directly yields the factorization formalism. However, as indicated
by Eq. (2.85), the light-cone gauge introduces extra poles at k · n = k + = 0. The leading
regions of DIS have k + ≃ O(Q), without any pinch singularity at k + = 0, so such poles
do not pose severe trouble for the factorization argument. However, in more complicated
situations with also soft subdiagrams, this gives extra poles in the soft region. Moreover,
the 1/k · n should be interpreted in the principle-value sense as
                                                                
                                1     1        1           1
                                   =                +              ,                    (2.119)
                              k·n     2   k · n + iϵ k · n − iϵ
with both iϵ prescriptions being present. Such poles severely forbid the factorization ar-
guments in cases where a deformation of the k + contour is needed, such as the Sudakov
form factor and Drell-Yan process. Therefore, in most general cases, one prefers to use
Feynman gauge for the proof of factorization. This avoids issues of unphysical poles, but
introduces extra complications from unsuppressed longitudinally polarized gluons, as dis-
cussed in Sec. 2.3.2 and shown in Fig. 2.3(b) for the leading region of the Sudakov form
factor.
    While the unphysical longitudinal polarization arises due to gluons being massless vector
                                                77


bosons as a result of QCD being a gauge theory, they can be dealt with by gauge invariance
arguments. As we shall see in this section, such gauge redundancy will be decoupled by
Ward identities. However, contrary to what one might expect, such longitudinally polarized
gluons are intermediate lines so gauge invariance does not mean that they result in zero.
Instead, they have specific irreducibility properties and will be collected into Wilson line
structures.
    In the following, we will first discuss the basic use of Ward identities for a certain region R
of the DIS diagram Γ with subtraction, i.e., the term TR Γ. The simple Wilson line structure
shall make it evident that the use of Ward identities on a Wilson line gives back the same
Wilson line. This will be used to take subtractions into account in approximating the region
R, i.e., the term CR Γ. Finally, we will use the Ward identity argument on the Sudakov form
factor to factorize the latter into soft, collinear, and hard factors.
2.7.1      DIS factorization at the lowest order: PDF and quark po-
           larization
        q                       q                        q                      q
      ν             µ         ν                µ       ν        b    a   µ    ν     a     b    µ
           k               k + kg      kg    k           kq kg′   kg   k        kq kg′    kg k
      P      C                P     C                  P       C              P     C
              (a)                    (b)                        (c)                    (d)
Fig. 2.6: Leading-power leading-order sample DIS diagrams, initiated by a quark parton.
The quark parton can be accompanied by an arbitrary number of gluons with longitudinal
polarizations. We take the convention that all the gluon momenta flow into the collinear
subgraph.
    We first examine the LO DIS diagram Γ0 , given in Fig. 2.6(a). It has no difference from
the diagram in the light-cone gauge discussed in Sec. 2.6.1, with one single leading region
                                                 78


R, and the same approximation leads to a factorization formula similar to Eq. (2.89). As
a useful startup, in this section we continue the discussion in Eq. (2.89) to disentangle the
quark spinor indices.
    Note that in Fig. 2.6(a), the collinear subgraph C includes all possible diagrams, so can
be written as a Green function,
                                        Z
                          Cαβ (P, k) =     d4 y e−ik·y ⟨P |ψ̄β,j (y)ψα,i (0)|P ⟩,                       (2.120)
where α and β are spinor indices and i and j are color indices. The normal ordering of the
fermion fields, instead of time ordering, is fixed because we are dealing with a cut diagram.
With the integration of k − and kT in Eq. (2.89), the coordinate separations y + and yT are
set to zero, so the collinear factor becomes
                     Z                          Z
                       dk − d2 kT                   dy − −ik+ y−
       0
     Fαβ (P, k + ) =              Cαβ (P, k) =            e         ⟨P |ψ̄β,j (0+ , y − , 0T )ψα,i (0)|P ⟩.
                         (2π)4                       2π
                                                                                                        (2.121)
Throughout this section, we use the superscript to refer to the number of collinear gluons
attaching C to H. The spinor matrix in Eq. (2.121) can be expanded on the complete Dirac
basis,
                                                                          1
            0
          Fαβ  = S 1αβ + Se (γ5 )αβ + Vµ (γ µ )αβ + Aµ (γ5 γ µ )αβ + Tµν (σ µν )αβ .                    (2.122)
                                                                          2
Being sandwiched between Pn and P̄n , only three of the spinor structures are nonzero,
                                                                    X
                         Pn (Fαβ0
                                  )P̄n = V + γ − + A+ γ5 γ − −           T +i σ −i .                    (2.123)
                                                                   i=1,2
                                                  79


The coefficients can be projected from Eq. (2.122) by tracing with certain gamma matrices.
This defines the three parton distribution functions (PDFs),
                                        Z
                      γ+ 0                  dy − −ik+ y−
       0
      f (x) = Tr          F            =            e      ⟨P |ψ̄j (0+ , y − , 0T )γ + ψi (0)|P ⟩,            (2.124a)
                       2                     4π
                     +                  Z
                      γ γ5 0                dy − −ik+ y−
       0
   ∆f (x) = Tr              F          =            e      ⟨P |ψ̄j (0+ , y − , 0T )γ + γ5 ψi (0)|P ⟩,         (2.124b)
                        2                    4π
                     + i                Z
                      γ γ γ5 0              dy − −ik+ y−
     i 0
  δT f (x) = Tr                F       =            e      ⟨P |ψ̄j (0+ , y − , 0T )γ + γ i γ5 ψi (0)|P ⟩, (2.124c)
                          2                  4π
called unpolarized, polarized, and transversity PDFs, respectively, which are dimensionless
and boost invariant so only depends on x = k + /P + . The collinear factor in Eq. (2.89) then
has the form
                                      −                                               − i 
         0
                              0       γ              0      γ5 γ −            i 0          γ γ γ5
    Pn F (P, k)P̄n αβ = f (x)                 + ∆f (x)                    + δT f (x)                        . (2.125)
                                        2 αβ                   2 αβ                            2       αβ
Contracting with the hard part then converts Eq. (2.89) to the factorization formula in terms
of the PDFs,
                 Z                                                                                          
             0        dk + µν            0   k+γ −          0      γ5 γ − k +       i 0       k + γ − γ i γ5
       CR Γ =               H (k̂) f (x)              + ∆f (x)                 + δT f (x)
                       k + 0,βα                   2                    2                            2         αβ
                 Z                  "                                                #
                      dx 0                    k̂/                                  
               =          f (x) Tr H0µν (k̂) 1 − λ0 (x)γ5 − b0,i     T (x)γ5 γ
                                                                                 i
                                                                                       ,                       (2.126)
                       x                      2
                                                                         0
where k̂/ = k + γ − , and (λ0 (x), b0,i                 0     i 0
                                       T (x)) = ∆f (x), δT f (x) /f (x) quantify the quark polar-
ization state.
     The uncontracted hard part H0 (k̂) can be explicitly obtained from Fig. 2.6(a),
                                                                                 h                       i
    H0µν (k̂) = γ µ δ + (k̂ + q)2 (2π)γ · (k̂ + q)γ ν = (2π)δ(1 − x/xB ) γ µ γ · (k̂ + q)γ ν                   (2.127)
                                                        80


where q = (−xB P + , Q2 /2xB P + , 0T ) is the photon momentum. This is trivially color diago-
nal, so its contraction with Eq. (2.125) in color space, which has been implicitly left open and
entangled, can be reduced into a δij contracting with Eq. (2.125), giving an unweighted sum
of the color indices in Eq. (2.124). Even so, the mere color sum does not make Eq. (2.124)
gauge invariant because the fermion fields are at different positions. As we will see later, the
inclusion of longitudinally polarized gluons make render the PDFs gauge invariant.
    Tracing Eq. (2.127) with the given Dirac matrices in Eq. (2.126), we have
                "             #                        !               !
                           k̂
                           /                  k̂ · q            k̂ · q                     
       C µν = Tr H0µν (k̂)        = 4 k̂ µ − 2 q µ        k̂ ν − 2 q ν + q 2 g µν − q µ q ν , (2.128a)
                           2                    q                 q
                "                 #
                           γ5 k̂/
   ∆C µν    = Tr H0µν (k̂)           = −2i ϵµναβ k̂α qβ ,                                     (2.128b)
                             2
                "                    #
                                i
                           k̂
                           /γ     γ5
   δTi C µν = Tr H0µν (k̂)             = 0,                                                   (2.128c)
                              2
with an overall coefficient (2π)δ(1 − x/xB )δij omitted.
2.7.2       Low-order DIS factorization example in Feynman gauge:
            one extra collinear gluon
In Feynman gauge, even without loop effects, the LO diagram in Fig. 2.6(a) can receive
contribution from diagrams like Fig. 2.6(b)–(d), where the quark enters the hard scattering
together with an arbitrary number of gluons. To the leading power, only the plus components
of these gluons’ polarizations contribute, which we have termed longitudinal polarization.
Following Eq. (2.83), we include in the definition of the approximator T̂ that the contraction
of a longitudinally polarized gluon of momentum kg to the hard part to the left of the cut is
                                                       81


to be approximated as
                                                                         k̂gρ nσ
                              Hρ (kg ) g ρσ Cσ (kg ) → Hρ (k̂g )                      Cσ (kg ),                    (2.129)
                                                                      kg · n + iϵ
where we only left explicit the dependence on the gluon variables, kg is taken to flow into the
collinear subgraph C and k̂g is defined in Eq. (2.86). To the right of the cut, we take gluon
momenta to flow out of C and reverse the iϵ signs. The sign of the iϵ is in fact not crucial
in DIS since there is only one collinear subgraph. We fix it to be the same as the SIDIS.
    Apparently, after applying T̂ , the structure H(k̂g ) · k̂g allows to use Ward identity. In the
diagram Fig. 2.6(b), denoted as Γ1 , this gives
                                                                                               i
  H1µν (k̂g ) · k̂g = γ µ (2π)δ + (k̂ + q)2 γ · (k̂ + q) (−igta γ · k̂g )                                    γ ν , (2.130)
                                                                                    γ · (k̂ + k̂g + q) + iϵ
where ta = (taji ) is the SU(3) generator and carries the color dependence, with i and j
corresponding to the color indices in Eq. (2.124). Using the familiar identity,
                                                     1
                γ · (k̂ + q) (γ · k̂g )
                                        γ · (k̂ + k̂g + q) + iϵ
                                        h                                        i               1
                       = γ · (k̂ + q) γ · (k̂ + k̂g + q) − γ · (k̂ + q)
                                                                                     γ · (k̂ + k̂g + q) + iϵ
                                                                    1
                       = γ · (k̂ + q) − (k̂ + q)2                                  ,                               (2.131)
                                                        γ · (k̂ + k̂g + q) + iϵ
the second term vanishes due to the on-shell condition set by the cut line (or the δ-function
in Eq. (2.130)), Eq. (2.130) becomes
                                      H1µν (k̂g ) · k̂g = H0µν (k̂) × [i (−igta )] ,                               (2.132)
                                                              82


which differs from Eq. (2.127) only by an extra color factor. Combining with the remaining
factors in Eq. (2.129), the first i factor that comes from the numerator of the cancelled quark
propagator combines with 1/(kg · n + iϵ) to form a new eikonal propagator, and the color
factor (−igta ) detaches from the fermion line and forms a new eikonal vertex with nσ . So
we have
           Z                    Z                                                                     
                                     dk − d2 kT d4 kg             i
  CR Γ1 =      dk +
                    H0µν (k̂) ·                                                 σ a            a
                                                                         (−ign t ) Pn Cσ (P ; k, kg ) P̄n ,
                                       (2π)4 (2π)4          kg · n + iϵ
                                                                                                      (2.133)
where both spinor and color indices are traced over, and the a and σ refer to the color and
Lorentz indices of the gluon, respectively. Since the dependence of H on the gluon has been
detached by Ward identity, we include the integration of kg in the collinear part, together
with the eikonal factors.
    The collinear subgraph Cσ (P ; k, kg ) in Fig. 2.6(b) contains all possible disgrams so can
be converted to a Green function similar to Eq. (2.120),
                            Z
         Cσa (P ; k, kg ) =     d4 y d4 y1 e−ik·y−ikg ·y1 ⟨P |ψ̄β,j (y)T [Aaσ (y1 )ψα,i (0)] |P ⟩     (2.134)
where we suppressed the spinor indices of C, and the time ordering T is added because both
the quark and gluon are to the left of the cut in the amplitude. Now the color factor in
Eq. (2.133) combines with the gluon field, with the colors summed over. By converting the
eikonal propagator to
                                                       Z   ∞
                                            i
                                                  =          dλ eiλ(kg ·n+iϵ) ,                       (2.135)
                                      kg · n + iϵ        0
                                                        83


the kg integration can be directly carried out and sets y1 = λn,
  Z                                   
       d4 kg        i
                            (−ign t ) Cσa (P ; k, kg )
                                  σ a
      (2π)4 kg · n + iϵ
         Z ∞ Z                       Z 4
                      4   4    −ik·y     d kg −ikg ·(y1 −λn)
      =        dλ d y d y1 e                   4
                                                 e           ⟨P |ψ̄β (y)T [−ig n · Aa (y1 ) ta ψα (0)] |P ⟩
           0                            (2π)
         Z ∞ Z
      =        dλ d4 y e−ik·y ⟨P |ψ̄β (y) [−ig n · Aa (λn) ta ψα (0)] |P ⟩,                         (2.136)
           0
where the time ordering has been trivially removed since λ > 0. Further integrating out k −
and kT also sets y on the light cone, and gives the collinear factor,
                 Z                             Z    ∞                         
                   dy − −ik+ y−
           1
         Fαβ  =         e       ⟨P |ψ̄β (y − )                       a       a
                                                       dλ (−ig n · A (λn) t ) ψα (0)|P ⟩.           (2.137)
                    2π                             0
The spinor matrix reduction is to be done in the same way as Eq. (2.125) after projecting
with Pn and P̄n so will not be repeated. Since Eqs. (2.126) and (2.133) have the same hard
coefficient, we can combine them to define a whole collinear factor,
                     Z                                     Z ∞                    
      0      1          dy − −ik+ y−             −                         a     a
   (F + F )αβ =              e        ⟨P |ψ̄β (y ) 1 − ig         dλ n · A (λn) t ψα (0)|P ⟩.       (2.138)
                         2π                                   0
    Of course, at this order, one can also have the gluon to the right of the cut, which gives
an eikonal line associated with the field ψ̄ at y − . The result of the eikonal propagators and
vertices can be represented graphically by the double lines in Fig. 2.7. The arrows on the
double lines indicate the color flows in the fundamental representation, which is dictated by
the fields ψ and ψ̄ on the ends.
                                                       84


               q                                            q                               q
             ν                                            ν                   µ           ν                µ
                                 µ
                                                             k                                           k
          k + kg     kg        k       + c.c.     =                                +
                                                                   kg       k                k     kg
             P    C                                       P                               P
                                                               C                                    C
Fig. 2.7: Results of applying Ward identity on Fig. 2.6(b) and its complex conjugate. The
gluon momentum kg flows along the double line, which has the eikonal propagator the vertex
as given in Eq. (2.133).
2.7.3       Low-order DIS factorization example in Feynman gauge:
            two extra collinear gluons
Now we consider the two diagrams in Fig. 2.6(c) and (d), which have two extra longitudinally
polarized gluons attaching the collinear subgraph to the hard part. The same approximation
as Eq. (2.129) applies to both gluons so allows the use of Ward identity. The corresponding
hard part is
                                               
    2,ab ρ ′σ
  Hρσ   k̂g k̂g = γ µ (2π)δ + (k̂ + q)2 γ · (k̂ + q)
               "
                                            i                                          i
           × (−igta k̂/g )                               (−igtb k̂/′g )
                               γ · (k̂ + k̂g + q) + iϵ                  γ · (k̂ + k̂g + k̂g′ + q) + iϵ
                                                                                                         #
                                             i                                          i
               +(−igtb k̂/′g )                           (−igta k̂/g )                                     γ ν , (2.139)
                               γ · (k̂ +  k̂g′ + q) + iϵ                γ · (k̂ + k̂g +   k̂g′ + q) + iϵ
where the two terms are for Fig. 2.6(c) and (d), respectively, a and b are the color indices of
the gluons of momenta kg and kg′ , respectively, and both momenta are taken to flow into the
collinear subgraph C. We first perform Ward identity for the gluon kg and use the identities
                                 k̂/g = γ · (k̂ + k̂g + q) − γ · (k̂ + q),                                      (2.140a)
                                                           85


                                     = γ · (k̂ + k̂g + k̂g′ + q) − γ · (k̂ + k̂g′ + q),                     (2.140b)
for the two terms in Eq. (2.139), respectively. The cut line similarly sets the second term in
Eq. (2.140a) to zero, so we have
                                                                            i
                                    (−ig)2 (ta tb )i k̂/′g                                                   (2.141)
                                                             γ · (k̂ + k̂g + k̂g′ + q) + iϵ
for the first term, and
                                   "                                                                  #
                                                  i                                     i
            (−ig)2 (tb ta )i k̂/′g                                   −                                       (2.142)
                                     γ · (k̂ + k̂g′ + q) + iϵ          γ · (k̂ + k̂g + k̂g′ + q) + iϵ
for the second term. Now Eq. (2.141) combines with the second term in Eq. (2.142) into
                                                                            i
                                   (−ig)2 [ta , tb ]i k̂/′g                                  ,               (2.143)
                                                             γ · (k̂ + k̂g + k̂g′ + q) + iϵ
while the first term in Eq. (2.142) allows a further use of Ward identity by noticing k̂/′g =
γ · (k̂ + k̂g′ + q) − γ · (k̂ + q), and gives
                                                       (−ig)2 (tb ta )i2 .                                   (2.144)
Combining Eqs. (2.143) and (2.144) and inserting back to Eq. (2.139) gives
                                                         
           2,ab ρ ′σ
         Hρσ    k̂g k̂g = γ µ (2π)δ + (k̂ + q)2 γ · (k̂ + q)
                                            "                                                        #
                                                                              1
                           × (−ig) i   2 2      a
                                              [t , tb
                                                      ] k̂/′g                                    b a
                                                                                               +t t    γν .  (2.145)
                                                              γ · (k̂ + k̂g +   k̂g′ + q) + iϵ
                                                                 86


     In an Abelian gauge theory, the commutator [ta , tb ] vanishes, and Eq. (2.145) already
detaches the two gluons out of the hard part. In the non-Abelian gauge theory, [ta , tb ] =
if abc tc relates this term to tri-gluon coupling, which we will further explore in Sec. 2.7.4.
Effectively, the use of Ward identity for the gluon of kg detaches it from the hard part and
attaches it to the gluon of kg′ , but the Ward-identity vertex is still k̂/′g , which cannot relate
the two neighboring propagators of momenta (k̂ + q) and (k̂ + k̂g + k̂g′ + q) by any identity
similar to Eq. (2.140), so Ward identity cannot be simply applied. The way out is to notice
that k̂g′ has been projected on shell, so we rewrite it as
                                                                                                             
                                                    kg′+                            ′
                                                                                                kg′ · n
    k̂/′g =  γ − kg′+        −
                      = γ (kg +       kg′ )+                       = γ · (k̂g + k̂g )                           .   (2.146)
                                              (kg + kg′ )+ + iϵ                          n · (kg + kg′ ) + iϵ
The first factor allows the use of Ward identity, whereas the second factor modifies the
eikonal identity associated with kg′ . Then we can proceed with Eq. (2.145) as
                                                                                                     
                                                                              kg′ · n
               Hρσ2,ab
                       k̂gρ k̂g′σ =  H0µν (k̂) · (−ig) i2 2     a   b
                                                              [t , t ]                            b a
                                                                                              +t t ,                (2.147)
                                                                       n · (kg + kg′ ) + iϵ
which again factorizes the dependence on the collinear gluons out of H and gives the same
hard part factor as Eqs. (2.127) and (2.132). Combined with the remaining approximator
factors in Eq. (2.129) for both gluons, we have the overall eikonal factor as
                                                                                                               
            ρ          σ         a  b        i                  i                b a       i             i
  (−ign )(−ign ) [t , t ]                                                   +t t                                  . (2.148)
                                      n · kg + iϵ n · (kg + kg′ ) + iϵ                n · kg + iϵ n · kg′ + iϵ
This can be further simplified by writing [ta , tb ] = ta tb − tb ta , and using the simple eikonal
                                                                 87


identity,
       i             i                i              i                     i                i
                             =                                    +                                    , (2.149)
  n · kg + iϵ n ·  kg′  + iϵ   n · kg + iϵ n · (kg + kg ) + iϵ n · kg + iϵ n · (kg + kg′ ) + iϵ
                                                        ′                  ′
which converts Eq. (2.148) into a more symmetric form,
                                                                                                          
                                     i              i                          i                i
    (−ign )(−ign ) ta tb
             ρ         σ                                            b a
                                                                  +t t                                       .
                               n · kg + iϵ n · (kg + kg′ ) + iϵ          n · kg′ + iϵ n · (kg + kg′ ) + iϵ
                                                                                                         (2.150)
This is graphically represented by Fig. 2.8, where each double line has two propagators
corresponding to the two eikonal factors in each term in Eq. (2.150).
     q                            q                            q                             q
   ν        b    a              ν        a  b                ν                  µ          ν                 µ
                       µ                          µ
                                                                k                             k
     kq kg′   kg    k       +     kq    kg′ kg  k      =                             +
                                                                     kg′ kg   k                 kg′   kg   k
   P       C                    P        C                   P                             P
                                                                    C                             C
Fig. 2.8: Results of applying Ward identity on Fig. 2.6(c) and (d). Both momenta kg and kg′
flow into C and the momentum flows on the double line satisfy momentum conservation.
    Eq. (2.150) is to be multiplied on the collinear subgraph Cρσ (P ; k, kg , kg′ ) and defines the
collinear factor. But here for fixed momentum assignments of kg and kg′ , we do not include
diagrams in C that differ only by exchanging the two gluons. Otherwise, one would have
double counting in Fig. 2.6(c) and (d). After the use of Ward identity, the integrals of kg
and kg′ are completely within the collinear factor, so we can reverse the color and momentum
labels in the second term of Eq. (2.150) (or the second diagram on the right-hand side of
Fig. 2.8). This reduces Eq. (2.150) to only the first term, but now with the subgraph C to
                                                       88


include all possible diagrams. Then we can write it as a Green function,
                             Z
                                                                    ′                                            
    ab
  Cρσ  (P ; k, kg , kg′ ) =     d4 y d4 y1 d4 y2 e−ik·y−ikg ·y1 −ikg ·y2 ⟨P |ψ̄β,j (y)T Aaρ (y1 )Abσ (y2 )ψα,i (0) |P ⟩
                                                                                                                 (2.151)
    The first term of Eq. (2.150) can be converted to integrals by applying Eq. (2.135) to
both propagators,
                                                  Z   ∞      Z    ∞
                i                  i                                                                   ′
                                                =       dλ1          dλ2 θ(λ1 − λ2 ) eiλ1 n·kg eiλ2 n·kg .       (2.152)
          n · kg + iϵ n · (kg + kg′ ) + iϵ          0           0
Combining this with Eq. (2.151) and integrating over kg and kg′ set y1 = λ1 n and y2 = λ2 n.
Further integrating over k − and kT also sets y on the light cone. Finally, we have the form
of the collinear factor,
               Z
        2          dk − d2 kT d4 kg d4 kg′
     Fαβ    =                                   (−ignρ ta )(−ignσ tb )
                     (2π)4 (2π)4 (2π)4
                                                                    
                                    i                  i
                          ×                  ·            ′
                                                                       Cρσab
                                                                             (P ; k, kg , kg′ )
                              n · kg + iϵ n · (kg + kg ) + iϵ
               Z ∞         Z ∞                      Z
                                                       dy − −ik+ y−
            =       dλ1         dλ2 θ(λ1 − λ2 )              e
                 0          0                            2π
                                                                                               
                          × ⟨P |ψ̄β,j (y − ) [−ig n · Aa (λ1 n)ta ] −ig n · Ab (λ2 n)tb ) ψα,i (0)|P ⟩,          (2.153)
where we have removed the time ordering as in Eq. (2.136). Note that both gluon fields are
integrated along the light cone, and their order cannot be reversed due to the non-Abelian
nature. The θ(λ1 − λ2 ) function dictates the gluon field to the left to be also in front in the
light-cone path along n, so is equivalent to a path ordering, which is defined as
         Z   ∞      Z    ∞
     P         dλ1         dλ2 O1 (λ1 )O2 (λ2 )
           0           0
                                                            89


             Z   ∞      Z  ∞
          =        dλ1        dλ2 [θ(λ1 − λ2 )O1 (λ1 )O2 (λ2 ) + θ(λ2 − λ1 )O2 (λ2 )O1 (λ1 )] ,          (2.154)
               0         0
where P refers to path ordering operator. Therefore, we write the collinear factor as
             Z                                 (          Z ∞                    2 )
           1     dy − −ik+ y−
     2
   Fαβ =             e         ⟨P |ψ̄β,j (y − ) P −ig            dλ n · Aa (λn)ta         ψα,i (0)|P ⟩. (2.155)
           2      2π                                          0
                                                                                       ji
    Eq. (2.155) combines with Eq. (2.138) to give
                         Z                                X 2            Z ∞                       n
   0     1       2           dy − −ik+ y−              −        1                         a       a
(F + F + F )αβ =                  e          ⟨P |ψ̄β (y )          P −ig       dλ n · A (λn) t         ψα (0)|P ⟩.
                              2π                          n=0
                                                                n!          0
                                                                                                         (2.156)
The same analysis can be done for the diagrams with the two gluons to the right of the cut,
or with one gluon on both sides of the cut. The structure of the result in Eq. (2.156) easily
motivates one to conjecture it to all orders,
                   Z
                       dy − −ik+ y−
           Fαβ =            e        ⟨P |ψ̄β (y − )W † (∞, y − ; n) W (∞, 0; n)ψα (0)|P ⟩,               (2.157)
                        2π
with the colors summed over. Here W (∞, y; n) = (Wij )(∞, y; n) is the straight Wilson line
from y to ∞ along the light-cone direction n,
                                                         Z   ∞                        
                                                                        a            a
                        W (∞, y; n) = P exp −ig                 dλ n · A (y + λn) t      .               (2.158)
                                                            0
It points to the future because of the iϵ choice in the approximator Eq. (2.129), as easily
seen from Eq. (2.135). The Wilson lines associated with the fermion fields render the parton
distribution gauge invariant. Using the unitarity property of the Wilson line allows us to
                                                         90


join the two infinitely long Wilson lines to convert to a finite one,
                             Z
                                 dy − −ik+ y−
                      Fαβ =            e      ⟨P |ψ̄β (y − )W (y − , 0; n)ψα (0)|P ⟩,        (2.159)
                                  2π
with the gauge invariance being obvious. We will show the all-order derivation in the fol-
lowing.
2.7.4     Elements of Ward identity: perturbative line identities
                            a                        a                     a
                              k                 k                               k
                       j             i
                                          =                 i
                                                               +      j
                          p     p+k                 p+k                     p
Fig. 2.9: Ward identity element for the quark-gluon vertex. The dashed lines refer to ghosts,
but the vertex on the left-hand side is a regular quark-gluon vertex, with the arrow on the
end of the ghost line being the ghost momentum k contracted with the vertex.
    As we have seen from the low-order examples in Secs. 2.7.2 and 2.7.3, Ward identity is
mainly using the simple identity like Eqs. (2.131) and (2.140) and the successive cancellation.
For the quark process we have examined, the relevant line identity is shown in Fig. 2.9, which
reads
                                                                                    
                       i           a
                                         i          i          a
                                                                             a
                                                                                i
                             −igtij k/       =i            +igtij + −igtij             .     (2.160)
                    p/ + k/               p/      p/ + k/                         p/
Apart from the i factor, the identity contains two special vertices, denoted by the thick
diagonal lines in Fig. 2.9, with (+igtaij ) for the vertex corresponding to a field ψ̄, and (−igtaij )
for the one corresponding to ψ. The reason for denoting the arrowed gluon line in Fig. 2.9
as a ghost line is that the reduced vertices on the right-hand side are related to the BRST
                                                    91


variations of the quark fields.
        a                             a                      a                             a                      a
          k
 c, σ         b, ρ =    c, σ              b, ρ    +  c, σ              b, ρ    +  c, σ             b, ρ + c, σ         b, ρ
      p     p+k                    p+k                        p                         p      p+k              p    p+k
                    Fig. 2.10: Ward identity element for the tri-gluon vertex.
     Similarly, we can derive the Ward identity element for the tri-gluon vertex, as shown in
Fig. 2.10, reading
      −i           abc                     σ µρ                   µ ρσ                ρ µσ       −i
               (−gf     )k  µ  [(2k   + p)    g    − (k   + 2p)     g     +   (p −  k)   g   ]
  (p + k)2                                                                                        p2
          
                                   −i                           −i                                         i    −i
      = i −gf acb g ρσ ·                 2
                                            + −gf abc g ρσ · 2 + −gf bca (−p)ρ · (−p)σ · 2
                                (p + k)                              p                                       p (p + k)2
                                                                                  
                                                                −i        i
                   + −gf cba (p + k)σ · (p + k)ρ · 2                                ,                               (2.161)
                                                                  p (p + k)2
term by term. We have written each term to make the special vertices clear. For the last
two terms, a ghost line with an arrow on the end is multiplied by a factor of its momentum;
apart from this, it couples to other particles on this end in the same way as a gluon. In
this way, Ward identity relates gluon lines to ghost lines, as a special feature of non-Abelian
gauge theory. The resulted ghost lines with arrows allow iterative use of Ward identities
until we reach the external legs of the diagram.
                                 a                          a                             a
                                    k                           k                               k
                        c                 b
                                                =    c                 b
                                                                             +    c                   b
                             p        p+k                p        p+k                      p
                   Fig. 2.11: Ward identity element for the ghost-gluon vertex.
     A special case occurs for the gluon-ghost vertex. It is only proportional to the momentum
                                                             92


of the outgoing ghost, so does not have the form like a scalar QED vertex nor a regular vertex
identity like Figs. 2.9 and 2.10. A yet useful identity is shown in Fig. 2.11, which reads
       i         abc
                                   i        i           cba
                                                                           i                 i
          2
             (−gf     )k · (p + k)   2
                                       =         2
                                                     (−gf     )p · (p + k)     2
                                                                                 + (−gf abc ) 2 . (2.162)
  (p + k)                          p     (p + k)                             p               p
In a physical amplitude, ghosts only appear in closed loops (both clockwise and counter-
clockwise) with a minus sign, accompanied by the same gluon loops. When applying Ward
identities to a gluon loop, the successive use of the last two terms in Fig. 2.10 converts the
gluon loop to two ghost loops, each with the final end of the ghost line pointing back to
itself. Such ghost loops do not have a minus signs, and they cancel the regular ghost loops
by use of the first term in Fig. 2.11.
                                       +                  +                      =0
                                                  (a)
                              +                 +                     +                  =0
                                                  (b)
                         3                    3                       3
                   2        (−)          2       (−)              2      (+)
                                 n
                                      +                n
                                                            +                    n
                                                                                    =0
                                                                    1
                              1                    1
                                                  (c)
Fig. 2.12: Vertex identities for the use of Ward identity. In (c), the first two diagrams arise
from ghost loops so are multiplied by a (−1) while the third one arises from a gluon loop so
does not have this factor.
                                                  93


    In using Ward identities, we need to sum over all possible attachments of the same gluon
whose vertex is multiplied by its momentum. This results in a whole set of special vertices
in Figs. 2.9 and 2.10. The essence of Ward identity is the chain of cancellations among the
neighboring vertices. These are shown in Fig. 2.12(a) and (b). For diagrams involving gluon
loops, iterative use of Ward identities can convert the gluon loops to ghost loops, without the
minus signs. Then special vertices associated with ghost lines arise, like the third diagram
in Fig. 2.12(c). This is cancelled by Ward identity diagrams for accompanied ghost loops,
with one example given in Fig. 2.12(c). The identity in Fig. 2.12(a) is due to [ta , tb ] = if abc ,
which is what we encountered in Eq. (2.143) when moving one gluon across another one.
The remaining [ta , tb ] term there reflects the missing third diagram in Fig. 2.12(a). Both the
identities in Fig. 2.12(b) and (c) are results of the Jacobi identity.
    For a full Green function with only quark and gluon external lines, attaching a gluon line
in all possible ways with the vertex contracted with its momentum gives [Collins, 2013],
                                X                             X
                             =                            +                           ,
                                                                                          (2.163)
after successive use of the line identities. On the right-hand side of Eq. (2.163), we sum
over the special vertex for each external line in the first term, which are given in Fig. 2.9
for quarks and Fig. 2.10 for gluons (without the last two terms). When the gluon attaches
to gluon lines, iterative use of the last two terms in Fig. 2.10 replaces the gluon line by a
ghost line, which traverses the whole graph and join the external lines at special vertices,
as given by the second term on the right-hand side of Eq. (2.163). This line is anchored in
the internal graph, as indicated by the cross label. For an external gluon line, we note that
it can be completely converted to a ghost line with a vertex proportional to its momentum,
                                                 94


like the last two terms in Fig. 2.10. In this case, the special vertex on the right-hand side of
Eq. (2.163) is only one single ghost line with an arrow.
     When the external lines are amputated on shell by the LSZ reduction formula, all the
special vertices vanish because they do not give the needed mass poles, so Eq. (2.163) gives
zero. For the special gluon vertex that is purely a ghost line times its momentum, it becomes
zero when contracting with a physical polarization vector. Therefore, applying Ward iden-
tities on a physical amplitude, with all possible diagrams and attachments included, gives
zero.
2.7.5      All-order derivation of the Wilson line structure in DIS
Now we can come back to the low-order examples in Fig. 2.6 that we dealt with in Sec. 2.7.1–
2.7.3. Even though we have projected on shell the collinear quark and gluons lines as external
legs of the hard subgraph, the use of Ward identity for the longitudinally polarized gluons
does not give 0 because we are missing the diagrams with the gluons directly attaching
to the collinear quark or gluon lines. The hard subgraph is required to be one-particle-
irreducible (1PI) with respect to the collinear lines. Two merged collinear lines still have a
low virtuality so belongs to the collinear subgraph before entering the hard subgraph. Such
missing diagrams cause the remaining terms in Eqs. (2.132) and (2.145), which eventually
turn into the Wilson line structure.
     With the idea of identifying the missing diagrams, we can give an all-order derivation of
                                                               s,m
the Wilson line. We will consider a particular region Rℓ,r         of a certain DIS diagram Γ, shown
in Fig. 2.13(a), which is characterized by a hard subgraph H (s) and a collinear subgraph
C (m) that are of perturbative orders at, respectively, O((αs )s ) and O((αs )m ) in QCD.3 They
   3
     One shall not confuse the label “s” for the (integer) order of H (s) from the subscript in the strong
                                                    95


are joined by two collinear quark lines and (ℓ + r) longitudinally polarized gluons, with ℓ to
the left and r to the right of the cut. We will show that by summing all possible diagrams of
H (s) and all possible attachments of the collinear gluons, Ward identity factorizes the gluons
out of the hard part and recollects them onto two gauge links along the light-cone direction
n, as shown in Fig. 2.13(b).
                        q                                      q
                               (s)                                     P
                            H                                             H (s)
                                                                k    ℓ        r
                        kq
                       P    C (m)                            P          C (m)
                                (a)                                     (b)
Fig. 2.13: (a) A leading region of a DIS diagram. The two subgraphs H (s) and C (m) are
joined by two quark lines, ℓ gluon lines to the left of the cut, and r gluon lines to the right.
The hooks on the quark lines refer to the on-shell projections by Eqs. (2.86) and (2.87), and
the arrows on the gluon lines denote the approximators in Eq. (2.129). (b) The result of
Ward identity for the gluons and the Wilson structures. The upper blob contains all possible
diagrams of H (s) at the same order, and the Wilson line contains all permutations of the
gluons, while the collinear subdiagram C (m) is fixed.
                                                        s,m
    To start with, let us first consider the region R1,0     with (ℓ, r) = (1, 0), for which we show
the relevant subdiagram in Fig. 2.14(a), in the form of a scattering amplitude. Only physical
particles are included in the final state. They do not contribute to the Ward identity. The
quark parton line has been projected on shell when entering the hard part. Its full propagator
has been amputated in H, with an on-shell polarization vector effectively set by the inserted
spinor projector Pn ,
                                                                 "                #
                                γ γ− +
                                         k̂/q γ +   X              ūλ (k̂q ) γ +
                         Pn =          =          =    uλ (k̂q )                    ,         (2.164)
                                    2      2kq+      λ
                                                                       2kq+
coupling αs .
                                                   96


which vanishes when multiplied by k̂/q on the left. The gluon approximator in Eq. (2.129)
allows to use Ward identity. Summing over all possible diagrams in H and all possible attach-
ments of the gluon, including the one in Fig. 2.14(b), would render the physical amplitude
to zero, by the result following Eq. (2.163).
                   H                          H                           H
                       phys.                       phys.                      phys.
                   (a)                         (b)                        (c)
Fig. 2.14: Relevant diagrams for one collinear gluon attachment. Both the gluon and ghost
lines are amputated.
                                                             s,m
    However, Fig. 2.14(b) is not included in the region R1,0     . Because both the quark and
gluon are collinear, the merged quark line directly attaching to H is also collinear. All the
three lines are to be included in the collinear subgraph, so Fig. 2.14(b) belongs to the region
  s,m
R0,0  . This diagram represents the missing term in the use of Ward identity. It alone gives
Fig. 2.14(c) by the identity in Fig. 2.9, with the special vertex being −igtaij (accompanied
by an extra (−i) factor), where a is the color index for the gluon, i the color index of the
quark directly connecting to H, and j of the one to C. Together with the eikonal factor in
Eq. (2.129), we can convert the factorized gluon to a Wilson line structure like Eq. (2.136).
But the discussion here applies to all orders of both H and C, since we are using the general
theorem of Ward identity, not restricted to the special LO diagram as for Eq. (2.136).
    Note that we have been taking the convention of all gluon momenta flowing back to C
for simple eikonal propagators in Eq. (2.129), as explicitly done in Secs. 2.7.2 and 2.7.3. But
in the explicit derivation of the Ward identity in Sec. 2.7.4, the gluon momenta flow into
the graphs. Therefore, we have an extra minus sign, apart from the overall i factor for each
                                               97


gluon.
     In the literal consideration of Fig. 2.14(b), both the quark and gluon momenta are pro-
jected on shell, causing a formal singularity,
                                           γ · (k̂q + k̂g )
                                                            (γ · k̂g )Pn ,                             (2.165)
                                            (k̂q + k̂g )2
which is indefinite. For the use of Ward identity, we therefore first treat k̂g as off the light
cone, and then take the light-cone limit after cancelling the quark propagator, as
           γ · (k̂q + kg )                      γ · (k̂q + kg ) h                           i
    lim                    (γ · kg )Pn = lim                       γ · (k̂q + kg ) − γ · k̂q Pn = Pn . (2.166)
   kg →k̂g  (k̂q + kg )2                kg →k̂g  (k̂q + kg )2
     The same analysis can be performed for the region with any arbitrary ℓ, but the number
of missing terms increases very rapidly as ℓ grows, along with a more careful successive use of
Eq. (2.166). Before taking on this journey, we note that all the missing terms have some (or
all) of the gluon lines attach to the quark line or other gluon lines, so they have fewer gluon
lines attach to the hard subgraph H. While the gluons attaching directly to the quark line
can be made to the end of the quark line like Fig. 2.14(c), those attaching to H of a smaller
number can be related to regions with smaller ℓ by induction. In the end, all the gluons are
detached from H and attached to the end of the quark line, in a complicated tower. In this
process, the details of H are not so important. All it matters is that it is the external quark
line that attaches to H along with the gluons. So the result would be the same if we replace
H by a single quark line, exactly like the LO examples in Secs. 2.7.2 and 2.7.3.
     One can go further by noting that the quark line can even be replaced by a gauge link that
attaches to the incoming quark-photon vertex on one end and extends to ∞ on the other end.
                                                         98


Because the Ward identity element for a gluon attaching to the gauge link works in the same
way as it attaches to a quark line, with the ∞ end corresponding to the on-shell external leg
in the final state, summing all possible attachments of the longitudinally polarized gluons
to the on-shell quark and gauge link yields zero. Then, when requiring the gluons to only
attach to the gauge link, we can apply the same argument of missing terms, which will give
the same result as attaching the gluons to H. By choosing the gauge link along n, it has
propagators and vertices like i/(n · k) and −ignµ ta , respectively. So automatically, only the
plus momenta flow through it, and the approximator in Eq. (2.129) leaves it unchanged in a
trivial way,
                                                      k̂gµ nν
                                     (−ig nµ ta ) ·             = −ig nν ta .             (2.167)
                                                    kg · n + iϵ
That is, applying Ward identity for all the gluons attaching to the gauge link returns the
same gauge link.
    In conclusion, after summing over all possible gluon attachments to the hard subgraph
(which itself has included all possible diagrams for use of Ward identity), we can factorize
out the gluons and reorganize them in a gauge link structure,
                       X                  H                       X                H
                                                         =                              . (2.168)
                   H, {i1 ,··· ,il }                          H, {i1 ,··· ,il }
                                       i1      il                               i1   il
Since Ward identity receives no contribution from physical external legs, the sum over gluon
attachments in Fig. 2.13(a) does not cross the cut, and so we get the same gauge link
structure on both sides of the cut, as shown in Fig. 2.13(b), with ℓ gluons on the left and r
on the right. On either side, the gluons are summed over all possible permutations at give
momentum assignments for a particular diagram C (m) . Then as for Eq. (2.151), by including
                                                         99


the gluon momentum integrations into the collinear factor, we can relabel the gluon momenta
and colors to sum over the diagrams in C (m) , with a fixed gluon attachment configuration.
                                                                                        s,m
     To put it formally, we have examined a particular region Rl,r                           (Γ) of a particular diagram
Γ, which can be written as
                              Z             " ℓ                 r
                                                                           #
        (s)         (m)           d4 k Y d4 kiL Y d4 kjR                        (s)                         L  R 
    Hℓ,r [Γ]  ⊗  Cℓ,r [Γ]  =                                                  H ℓ,r [Γ]{µ   },{ν   } q; k,   ki , kj
                                 (2π)4 i=1 (2π)4 j=1 (2π)4                                i      j
                                  " ℓ              r
                                                               #
                                    Y            Y                  (m)                                        
                               ×         g µi ρi       g νj σj Cℓ,r [Γ]{ρi },{σj } P ; k, kiL , kjR , (2.169)
                                     i=1         j=1
where the spinor and color indices are suppressed, k is the total collinear momentum on
                                  ℓ                    ℓ
either side of the cut, kiL i=1 and kjR j=1 are the two sets of collinear gluon momenta to
the left and right of the cut, respectively. Summing over all possible gluon attachments for
all diagrams in the hard part factorizes the gluons out and recollects them onto two gauge
links,
                                            Z          "                     #Z                     Z Y
  X          n                      o                     X                          dk − d2 kT
                                                                                                        ℓ          r
                                                                                                            d4 kiL Y d4 kjR
                 (s)         (m)                     +             (s)
         TR    Hℓ,r [Γ] ⊗  Cℓ,r [Γ]     =       dk               H0,0 (q; k̂)
     s,m                                                    H
                                                                                      (2π)4           i=1
                                                                                                            (2π)4 j=1 (2π)4
 Γ/Rℓ,r
             Yℓ                                                       Y r                                                  
                           ρi ai                     i                                σj bj                   −i
           ×       (−ig n t )              L                 L
                                                                               (ig n t )
             i=1
                                  n   · (k 1 +   · · · +   k i ) +  iϵ  j=1
                                                                                                n · (k1R + · · · + kjR ) − iϵ
             "                  #{ai },{bj }
               X        (m)                                            
           ×        Pn Cℓ,r P̄n                P ; k, kiL , kjR ,                                                      (2.170)
                C                {ρi },{σj }
where the spinor and color indices are traced over, and in the second line, the product over
i puts smaller i’s to the left, while that over j puts smaller j’s to the right. On the left-hand
                                                                                                                           s,m
side of Eq. (2.170), we sum over graphs Γ with respect to the same region specification Rℓ,r                                   .
On the right-hand side, the hard subgraph is summed over with no extra gluon attachments,
(ℓ, r) = (0, 0), while the collinear subgraph is summed over at the given order m and with
                                                               100


the given numbers of gluons, ℓ and r.
    Then summing over m converts the collinear factor into a Green function, by use of
generalized Eqs. (2.135) and (2.152),
                 Z
         (ℓ,r)     dy − −ik+ y−
       Fαβ     =        e       ⟨P |ψ̄β (y − )W (r)† (∞, y − ; n) W (ℓ) (∞, 0; n)ψα (0)|P ⟩,  (2.171)
                    2π
where the gauge links exactly reproduce the ℓ- and r-th orders of the two Wilson lines in
Eq. (2.157), respectively,
                                                      Z ∞                      ℓ
                         (ℓ)               1                          a       a
                      W (∞, 0; n) = P −ig                   dλ n · A (λn) t ,                (2.172a)
                                          ℓ!            0
                                                 Z ∞                                r
                      (r)†     −           1                       a −             a
                   W (∞, y ; n) =             P̄ ig       dλ n · A (y + λn) t ,              (2.172b)
                                          r!          0
with P and P̄ standing for the path-ordering [Eq. (2.154)] and anti-path-ordering (obviously
modified from Eq. (2.154)) operations, respectively. Notably, the color flow of the partons is
completely detached from the hard subgraph onto the gauge links. Thus in Eq. (2.171), the
colors are traced over. More subtleties about the color indices arise for exclusive processes
and will be discussed in Ch. 3.
    Further summing over ℓ and r would convert Eq. (2.171) into the gauge-invariant form
in Eq. (2.157), with the full Wilson line structures. However, doing so requires a sum over
different regions and graphs. Without a careful treatment of the overlaps among regions,
such a sum gives unphysical results. So in the following, we include the subtraction of
overlapping regions, and then derive the full factorization result for the DIS.
                                                   101


2.7.6     All-order factorization of DIS in Feynman gauge: with sub-
          traction
We have seen in Sec. 2.7.5 how the sum over graphs with respect to a given region allows use
of Ward identity to factorize the hard and collinear subgraphs, as summarized in Eq. (2.170).
While this applies to any regions, one cannot simply proceed to sum over all regions and
obtain factorization. Instead, the contribution from each region R of a graph Γ must contain
subtractions of smaller regions.
    As shown in Eq. (2.110), each of the subtraction terms is obtained for a nesting of regions
R > R1′ > R2′ > · · · > Rn′ , given by first applying the approximators TR′ for smaller regions
R′ < R, and then applying the approximator TR , i.e.,
                                         X
         subtraction terms =                            TR (−TR1′ )(−TR2′ ) · · · (−TRn′ )Γ. (2.173)
                                 {R>R1′ >R2′ >···>Rn′ }
In any region R, we can classify each loop momentum ki as belonging to either the hard
region HR or the collinear region CR . Then a larger region R′ > R would upgrade some
momenta from CR into HR , and a smaller region R′′ < R would take some momenta from
HR into CR . That is, R′ > R if and only if HR′ ⊃ HR and CR′ ⊂ CR . Therefore, in the
successive applications of the region approximators in Eq. (2.173), an approximator that
applies later only assigns new collinear lines into the hard subgraph, but never takes lines
out from the hard subgraph.
    For the smallest region Rn′ in the nesting R > R1′ > R2′ > · · · > Rn′ , the graph Γ can
be decomposed into a hard subgraph HRn′ and a collinear subgraph CRn′ , joined by a set
of longitudinally polarized gluon and two quark lines. This HRn′ is necessarily a subset of
                                                     102


HR , so as we sum over the hard subgraph HR in TR Γ with respect to the specified region
decomposition of R, we automatically sum over HRn′ for each fixed region decomposition of
R′ . As argued in Sec. 2.7.5, the approximator TRn′ then allow to factorize these collinear
lines out of HRn′ and collect them onto two gauge links. The factorized hard part HRn′ only
depends on the approximated momenta of the quark lines. And in the factorized collinear
part CRn′ , all momenta are as if they are unapproximated.
    Now we consider all graphs with the same region decomposition as R and the same
                        (m)                                   (s)
collinear subgraph as Cl,r but different hard subgraphs Hl,r at the same order s. Among all
the region nestings of these graphs that are of the same length as R > R1′ > R2′ > · · · > Rn′ ,
collectively denoted as {R > R1i > R2i > · · · > Rni }, there are more than one of the smallest
regions Rni of those graphs with the same hard and collinear subgraph decompositions as
Rn′ and the same collinear subgraphs CRn′ , which has allowed us to sum over all graphs in
HRn′ and gluon attachments to factorize the latter. There are even more graphs that after
factorizing the smallest regions have the same factorized hard subgraph as HR0 n′ . Working
with this set of diagrams, their collinear factors CRni all have two gauge links, but which
may collect arbitrary numbers of gluons as allowed by the given perturbative order. When
                          i
going to next regions Rn−1   , some of the collinear lines become hard, and some collinear
lines get into or connected to the new hard subgraphs, but they must all stay within the
factorized collinear subgraphs CRni , not attached to the previously factorized hard part HR0 ni ;
otherwise, they would have stayed in or attached to HRni when considering the region Rni .
                             i
Therefore, in the regions Rn−1  , the hard subgraphs HRn−1  i    are connected to the two gauge
links by arbitrary numbers of gluons, and the collinear subgraphs CRn−1   i   are joined to them
by collinear gluon lines and two quark lines.
    We only consider those of the diagrams that have the same collinear subgraph as CRn−1     ′  ,
                                               103


with the same number of collinear gluons attaching to the hard subgraphs HRn−1                     i , which can
be arbitrary. Again, by summing these diagrams, allowing the collinear gluons to also attach
to the gauge links, we can factorize them out of the hard subgraphs onto two new gauge
links. The factorized hard factor HR0 ′           is summed over all possible graphs and only depends
                                            n−1
on two external amputated quark lines, convoluted through their plus momentum with the
collinear factor CRn−1
                     ′    , which has certain number of gluons attached to its gauge links.
    Following the same analysis, we now consider all of those diagrams which are factorized
into the same hard factors HR0 n′ and HR0 ′               (both being summed over all possible graphs
                                                     n−1
                                                                                         i
at particular orders). This can be similarly factorized for the region Rn−2                     . Continuing this
analysis iteratively until we reach the region R, which can be similarly factorized by summing
over the graphs. In this way, we showed that
                  X
                               TR TR1i TR2i · · · TRni Γ
            Γ/{R>R1i >···>Rni}
                             (m)
                                           X
                        = Cl,r ⊗                          HR0 0 ⊗ HR0 1 ⊗ HR0 2 · · · ⊗ HR0 n ,           (2.174)
                                   {HR 0 ,H 0 ,··· ,H 0 }
                                        0   R1        Rn
which is also true if multiplied by (−1)n as required by Eq. (2.173). On the left-hand side, we
sum over all the region nestings of length (n + 1) of the graphs Γ that are of the same order
                                                   (m)
and have the same collinear subgraph Cl,r for the region specified by R. On the right-hand
side, each hard factor HR0 i has only two gauge links and two external amputated quark lines.
They are convoluted only via the plus momenta, and color and spinor indices are summed
over within each factor. As a fixed overall perturbative order s, we sum over all possible
convolutions of the hard factors {HR0 1 , HR0 2 , · · · , HR0 n }.
    The main reason why Eq. (2.174) holds is that applying Ward identity on a gauge link
                                                         104


works in the same way as on a single quark line. So the Ward identity for TR applies equally
with or without subtractions, and yields the same collinear factor with gauge links. Then
                               s,m
we have, for the region Rl,r        , summing over graphs factorizes CR Γ into the same form as
                                                                (s)
Eq. (2.170), just with TR replaced by CR , and H0,0 replaced by the subtracted hard factor,
                           Xs                    X
            (s)      (s)
          Hsub  =  H0,0  +     (−1)n                           HR0 0 ⊗ HR0 1 ⊗ HR0 2 · · · ⊗ HR0 n ,        (2.175)
                           n=1          {HR 0 ,H 0 ,··· ,H 0 }
                                             0   R 1       Rn
with all possible subregion contribution removed at a fixed order s, for which one at most
has s nested regions so n ≤ s.
    Note that in the subtracted version of Eq. (2.170), we sum over all possible diagrams
                 (m)
in H (s) and Cℓ,r , with no contribution from smaller regions. The same thing can be done
for any other regions of any other graphs, and they yield the same factorization structure.
Then we sum over all the regions of all graphs, which can be in turn converted to a sum over
                             (m)
s, m, ℓ, r and H (s) and Cℓ,r ,
                       X              XXXX                      h               i
                                                                     (s)    (m)
                           CR Γ =                            CR Hℓ,r ⊗ Cℓ,r
                       R,Γ            ℓ,r s,m Cℓ,rm Hs
                                                         ℓ,r
                                      XXX                       h                             i
                                                          (s)                             (m)
                                 =                    Hsub ⊗ (gauge links)ℓ,r · Cℓ,r
                                      ℓ,r s,m Cℓ,rm
                                           "              #
                                      X X            (s)
                                 =                Hsub ⊗ F (ℓ,r)
                                      ℓ,r      s
                                 = Hsub ⊗ F,                                                                (2.176)
                                                                                           P     (s)
where in the last step we defined the all-order hard coefficient Hsub =                       s Hsub , and all-order
                             P
parton distribution F =         ℓ,r  F (ℓ,r) in Eq. (2.157), with F (ℓ,r) given in Eq. (2.171).
    In Eq. (2.176), the hard and collinear factors are only convoluted in the plus momentum.
                                                           105


The spinor indices are still entangled, and can be separated in the same way as Eq. (2.125).
In this way, we proved to all orders that the DIS cross section can be factorized into parton
distribution functions with hard coefficients. The result takes the same form as Eq. (2.126),
just with the “0” indices deleted. The all-order parton distribution functions take the same
forms as Eq. (2.124), but with the “0” indices deleted and the Wilson lines inserted, as in
Eq. (2.157).
    The parton distribution functions have collected all the pinched propagators, so they
have sensitive dependence on low-momentum-scale QCD dynamics and are nonperturbative.
The hard coefficients, on the other hand, have subtracted all the low-scale dependence and
are free from pinch singularities. The loop momenta therein always have (or can be freely
deformed to have) high virtualities. They are therefore only sensitive to the hard-scale
dynamics, which is purely perturbative in QCD. In this way, not only does factorization give
a factorized form for the DIS cross section, but also each factor has a difference momentum
scale dependence.
    Moreover, the systematic factorization formalism gives field-theoretic operator definitions
to the nonperturbative parton distribution functions in Eq. (2.157). Similar factorization
theorems can be derived for other processes, including SIDIS and Drell-Yan, and arrive
at the same set of parton distribution functions. The operator definition thus allows to
prove the universality of the nonperturbative set of functions. This important fact equips
factorization with predictive power, which follows by comparing those different processes
with the same set of PDFs but different perturbatively calculable hard coefficients. Also,
the operator definition implies that PDFs can be studied on their own, independent of the
physical processes from which they are derived. It is an ongoing and prospering effort in the
literature to use nonperturbative methods like Lattice QCD to compute the PDFs, which
                                              106


are soon to reach an era when they can be in comparison with globally fit PDF results.
See [Constantinou et al., 2021] for a review and references therein.
    Throughout our discussion, we only considered the single-flavor case with the active
parton being a quark. For the real case with several quark and antiquarks, each of our
derivation steps like Eqs. (2.170)(2.174) and (2.176) needs to contain a sum over quark
flavors, so the resultant factorization formula in Eq. (2.126) needs to be supplemented by a
sum over quark and antiquark flavors, with a corresponding hard coefficient for each channel.
The same discussion holds for the most general case that also involve gluons. For the gluon
case, one can arrive at the same factorization result by a similar but more complicated
derivation of the Wilson lines. A thorough discussion with full details to all orders is still
missing in the literature [Collins and Rogers, 2008]. We leave it to a future study and will
not delve into it in this thesis.
2.7.7      UV renormalization and factorization scale
Now in the whole discussion of region analysis, we have explicitly considered only two regions:
collinear (with kT ≪ Q) and hard (with kT ∼ O(Q)). The original DIS diagram corresponds
to a four-point (cut) Green function and has a superficial ultraviolet (UV) divergence degree
D = −1. This means that the region with kT ≫ O(Q) is power suppressed, so it is justified
to only consider those two regions. However, the factorization procedure that we laid out
in the previous sections has implicitly introduced some artificial UV divergences, as we are
going to treat now.
    The definition of the PDF in Eq. (2.157) (and similarly the low-order ones in Eqs. (2.121)
(2.137) and (2.155)) as a parton correlation function on the light cone results from including
the integrations of k − and kT within the collinear factor and extending them to infinity,
                                              107


which is beyond the collinear region that it is supposed to capture. By only having a
parton spinor vertex γ + (or γ + γ5 and γ + γ i γ5 ) in Fig. 2.13(b), the PDF is essentially a
three-point Green function, with a superficial divergence degree D = 0, Hence, the UV
region with kT ≫ O(Q) in the PDF is not power suppressed but gives logarithmically
divergent contribution. The formally defined collinear and hard factors in Eq. (2.176) are
thus both UV divergent. However, by the subtraction formalism, the same UV divergences
in the PDF are formally subtracted in the hard factor, so the overall convolution is free of
UV divergences. Even so, it is still physically significant to use UV finite definitions for
both factors. Therefore, we need to subtract the UV divergence in the PDF to define a
renormalized PDF.
                                                                  k               k
           k                 k        k                   k
                                                                    qi
             qi                         qi
                                                                         UV
                   UV                        UV                     ˆl
             l                          li
           P               P          P                 P         P             P
                   CO                        CO                          CO
                   (a)                       (b)                          (c)
Fig. 2.15: Separation of ultraviolet (UV) and collinear (CO) regions in a general diagram of
PDF. The thick blue lines have UV momenta flowing through, with the scaling in Eq. (2.177).
    The renormalization procedure is very similar to the factorization analysis, and works in
the following steps:
  (1) Treat the PDF as an independent amplitude, and separate its internal momenta as
      either UV or collinear. This yields the same region analysis and the same subgraph
      decomposition. But the hard subgraph is replaced by a UV subgraph, as shown in
      Fig. 2.15(a), and the hard scale Q is replaced by a UV scale ΛUV that is to be taken
                                             108


    to infinity. A momentum in the UV region has the scaling,
                            µ         +    −
                                                                          
                           kUV  = kUV   , kUV , kUV,T ∼ Q, Λ2UV /Q, ΛUV .                 (2.177)
(2) In Fig. 2.15(a), the collinear subgraph (CO) is connected to the UV subgraph (UV) by
    two quark lines and arbitrarily many gluon lines that are of longitudinal polarization.
    Any other configuration is power suppressed by 1/ΛUV that will eventually vanish.
(3) For each such region, the same approximations as Eqs. (2.86)(2.87) and (2.129) together
    with Ward identity factorize the collinear subgraph from the UV subgraph, as shown
    in Fig. 2.15(b)(c).
(4) Still, subtractions of smaller region are needed for each region to avoid double count-
    ing. (In the language of renormalization, smaller regions yield subdivergences.) After
    summing over all regions of all graphs, one would reach the same factorization struc-
    ture as Eq. (2.176), but now the hard factor is replaced by the UV factor and has a
    similar expression as Eq. (2.175). In dimensional regularization with d = 4 − 2ϵ, the
    UV factor is a function of 1/ϵ, depending on the subtraction schemes, and the power
    suppressed term 1/ΛUV is not present.
  Schematically, Fig. 2.15(c) together with subtractions leads to the factorization structure,
       Z
            d4 ℓ      
                 4
                   Tr UV(k + , ℓ) · CO(ℓ, p)
           (2π)
                 Z p+                         Z                   +          
                         +            +    γ−         dℓ− d2 ℓT      γ
             =         dℓ Tr UV(k , ℓ̂)          ·               Tr      CO(ℓ, p) + · · ·
                   k+                       2           (2π)4         2
                 Z 1                           Z                  +          
                      dz            +   ℓ+ γ −         dℓ− d2 ℓT     γ
             =           Tr UV(k , ℓ̂)            ·              Tr      CO(ℓ, p) + · · ·
                   x z                     2            (2π)4          2
                                                 109


                    Z  1                 Z                    +             
                         dz                    dℓ− d2 ℓT         γ
               =             UV(x/z) ·                     Tr       CO(ℓ, p) + · · · ,        (2.178)
                     x    z                       (2π)4           2
where · · · refers to other spinor structures and flavor channels, and Tr stands for the trace
of Dirac spinor indices. The square bracket in the last line is the renormalized PDF with
UV divergence removed.
    So, similar to how the region analysis of the cross section diagram in Fig. 2.13(a) leads
to the factorization result in Eq. (2.176), the region analysis on the “bare” PDF results in a
multiplicative renormalization, Using the unpolarized PDF as an example, we have
                                        XZ      1
                                                   dz        
                     fibare (x, 1/ϵ) =                  Z −1 ij (x/z, 1/ϵ; αs (µ)) fj (z, µ), (2.179)
                                        j     x    z
where (Z −1 )ij = UVij collects all the UV divergences and i and j stand for the parton
flavors. Here we have explicitly indicated the UV divergence in the bare PDF, which is a
polynomial of ϵ−1 in dimensional regularization. Inverting Eq. (2.179) gives the renormalized
PDF fi (x, µ),
                                     XZ     1
                                              dz
                        fi (x, µ) =                 Zij (x/z, 1/ϵ; αs (µ)) fjbare (z, 1/ϵ),   (2.180)
                                      j   x    z
where Zij (x/z, 1/ϵ; µ) is the renormalization coefficient, being the inverse of UV divergence,
          XZ      1
                    dy                                   
                         Zij (x/y, 1/ϵ; αs (µ)) Z −1 jk (y, 1/ϵ; αs (µ)) = δik δ(1 − x).      (2.181)
            j   x    y
    The renormalized PDF in Eq. (2.180) introduces a dependence on the untraviolet renor-
malization scale µ through αs (µ). This scale is different from the UV renormalization scale
in the QCD Lagrangian, but is more of an artifact of the factorization. It is called the
factorization scale. The dependence on the factorization scale µ leads to a multiplicative
                                                         110


evolution equation for the PDFs,
                                                  Z
                            d fi (x, µ) X             1
                                                         dz
                                          =                  Pij (x/z, αs (µ)) fj (z, µ),           (2.182)
                              d ln µ2          j    x     z
where the evolution kernel Pij (x/z) can be obtained by
                                           X     Z  1
            d                                          dy
                 Zik (z, 1/ϵ, α s (µ))   =                  Pij (z/y, αs (µ)) Zjk (y, 1/ϵ, αs (µ)), (2.183)
         d ln µ2                            j     z     y
which can be solved perturbatively order by order.
    Restoring the “bare” notation in Eq. (2.176) and substituting the renormalization ex-
pressions of PDFs like Eq. (2.179) for the bare ones, we get the same factorization formula
for the DIS cross section, but now in terms of the UV renormalized PDF and infrared (and
UV) finite hard coefficient,
                     XZ     1                                        
                               dx                   xB Q
   σDIS (xB , Q) =                 fi (x, µ) Ci          , ; αs (µ) + polarized + O(ΛQCD /Q),       (2.184)
                      i    xB   x                     x µ
which is similar to Eq. (2.176), but has an extra dependence on the factorization scale µ. In
Eq. (2.184) we only explicitly write the term for the unpolarized PDF, whose hard coefficient
has been relabeled as C; the polarized PDF terms can be easily obtained like in Eq. (2.126).
The renormalized hard coefficient Ci is related to the bare one in Eq. (2.176) through
                                    XZ       1
                                                dy −1
                      Ci (x, µ) =                   Z (x/y, 1/ϵ; αs (µ)) Cjbare (y, 1/ϵ).           (2.185)
                                       j   x     y ij
    This ends our discussion of the DIS factorization. The UV finite PDFs now can be studied
both theoretically and phenomenologically. Parametrized at a base scale µ0 , the PDFs can
                                                           111


be evolved to any arbitrary scale via Eq. (2.182), which allows them to be used in Eq. (2.184)
at the scale µ ∼ O(Q). By comparing the experimentally measured cross section with the
right-hand side in Eq. (2.184), one can obtain the PDFs that are best fit to experiments and
then employ them to give predictions for the same experiment at a different energy or for
other experiments.
2.8       Factorization of Sudakov form factor
Now we deal with the Sudakov form factor, whose leading region graph is shown in Fig. 2.3(b).
This is more complicated than the DIS factorization in Sec. 2.7 by having two collinear
subgraphs and one extra soft subgraph connected to them. We will show in this section that
by summing over all regions and graphs, the Sudakov form factor can be factorized into a
hard factor, two collinear factors, and a soft factor. As in Sec. 2.7, we will first see how
the region approximator TR alone can factorize each subdiagrams, and then examine how
subtractions of smaller regions will modify the factorization structure.
2.8.1      Factorization without subtraction
For any graph with the region decomposition in Fig. 2.3(b), we route each q (q̄) collinear
loop momentum to flow between the hard subgraph H and the collinear subgraph Cq (Cq̄ )
or completely within the latter, and each soft momentum to circulate between H, Cq , Cq̄ ,
and the soft subgraph S, or completely within S. After the approximation [Eq. (2.58)]
for collinear momenta flowing into H, and that [Eqs. (2.67) and (2.68)] for soft momenta
flowing into Cq or Cq̄ , the soft momenta are completely zero in H. So then the collinear
approximation in Eq. (2.65), with the necessary modification in Eq. (2.83), allows an exact
                                              112


use of Ward identity for collinear gluons attaching to H.
    This works in a way similar to the DIS factorization in Sec. 2.7, for both collinear sub-
graphs. For each collinear sector, the external lines of H connected to the other collinear
subgraph are projected on shell, so do not contribute to Ward identity. Then by the same
argument as Sec. 2.7.5, after summing over the diagrams in H and all possible collinear gluon
attachments with respect to the same region specification, the collinear gluons are factorized
onto gauge links, as shown in Fig. 2.16(a). The q-collinear gluons are collected by a gauge
link along the n direction, and the q̄-collinear ones by a gauge link along the n̄ direction.
By the iϵ prescriptions in Eq. (2.83) with the collinear momenta flowing into Cq or Cq̄ , both
gauge links point to the future.
                                   p1                                              p1
                           Cq                                             Cq
                                                                                    n̄
            p̂1                                            p̂1
    q               n                               q              n
         H                        S                      H                             S
                    n̄                                             n̄
            p̂2                                            p̂2
                                                                                    n
                           Cq̄                                            Cq̄
                                   p2                                              p2
                      (a)                                            (b)
Fig. 2.16: Factorization of the Sudakov form factor. (a) Factorization of collinear gluons out
of the hard subgraph. (b) Factorization of soft gluons out of the collinear subgraphs.
    The advantage of first factorizing collinear gluons is that they are organized onto Wil-
son line structures, which makes the further factorization of soft gluons much simpler than
the original graph in Fig. 2.3(b), where the colors are more entangled among all the sub-
graphs. The factorized collinear subgraphs Cq and Cq̄ are connected Green functions with
                                              113


one external amputated on-shell quark or antiquark line, one un-amputated fermion line
corresponding the field ψ̄ or ψ, and an arbitrary number of gluon lines. They are attached
by soft gluons in any arbitrary way. Those gluons are also required to be 1PI before entering
the collinear subgraphs.
    As given in Eq. (2.71), the region approximator is designed such that the soft gluon
momenta are light-like when flowing into the collinear subgraphs. So exact use of Ward
identity can be made by treating them as all on shell. Since the Wilson lines organize the
color flows such that all the off-shell collinear quark (antiquark) and gluon lines in Cq (Cq̄ )
together behave as a single quark (antiquark) field, applying Ward identity on the soft gluons
yields the same result as if they are only attached to the end of the quark (antiquark) line.
Then we get a similar result as Eq. (2.168). Since it is the field ψ̄ that corresponds to the Cq
subgraph, and ψ to Cq̄ , the resultant Wilson lines collecting the soft gluons are respectively,
                   W (s) (∞, 0; n̄) on Cq side;   W (s̄)† (∞, 0; n) on Cq̄ side,         (2.186)
whose explicit forms are the same as Eq. (2.172), at orders given by the number of the soft
gluons, s and s̄, connecting to Cq and Cq̄ , respectively. In doing so, we are working at a
particular order of H, Cq , Cq̄ , and S, and have summed over all possible graphs in the first
three subgraphs.
    In this way, for a certain leading region R, specified by the specific order of each sub-
graph and the numbers of collinear and soft gluons connecting these subgraphs, applying
the approximator TR and summing over the graphs factorizes these subgraphs, as shown in
                                                114


Fig. 2.16(b). The color flows in the following way:
       antiquark in Cq̄ → out from the collinear Wilson line along n̄ on the ∞ end
                         → into S from the ∞ end of the soft Wilson line along n
                         → out from S on the 0 end of the soft Wilson line along n
                         → into H at the antiquark leg → out from H at the quark leg
                         → into S from the 0 end of the soft Wilson line along n̄
                         → out from S on the ∞ end of the soft Wilson line along n̄
                         → into the collinear Wilson line in Cq along n on the ∞ end
                         → out of Cq at the quark line.
The hard subgraph only depends on the external two amputated on-shell quark legs, so is
automatically a color singlet. This helps contracts the two soft Wilson lines on their 0 ends.
By summing over the graphs in S, we can convert it into a Green function,
                                    1
                         S (s,s̄) =    tr ⟨0|W (s) (∞, 0; n̄) W (s̄)† (∞, 0; n)|0⟩.             (2.187)
                                    Nc
As for Eqs. (2.170) and (2.171), this entails a relabeling of the soft gluon colors and momenta,
avoiding a double counting of the soft gluon permutations. Apparently, the unitarity of
Wilson lines has helped us to identify S (s,s̄) also as a color singlet, so the color flows are
disentangled between the collinear and soft subgraphs. Similarly, by summing over the
graphs in Cq and Cq̄ , we can write down their Green function definition,
      Cqr = ⟨q(p1 )|ψ̄(0)W (r)† (∞, 0; n)|0⟩P,      Cq̄r̄ = P̄⟨q̄(p2 )|W (r̄) (∞, 0; n)ψ(0)|0⟩, (2.188)
                                                    115


whose color dependence is also trivial, and where r and r̄ are the numbers of collinear gluons
attaching to the Wilson lines. Here P = P̄ are the spinor projectors defined in Eqs. (2.60)
and (2.62).
    We note that the conversions into Green functions are possible in Eqs. (2.187) and (2.188)
because we have extended the collinear and the soft momenta into infinity after factorizing
them out. This, of course, goes out of the momentum regions where the approximator is
designed for. But as argued in Sec. 2.6, the imperfections of the approximated integrals at
larger regions are taken care of by the subtractions when we consider those larger regions.
Hence, to sum over all regions and graphs, we need to carefully subtract double-counting
contribution from smaller regions.
2.8.2     Rapidity divergence and modifications
There is, however, a flaw in the derived factorization result in Eqs. (2.187) and (2.188). As
an explicit low-order calculation can show, the are divergences associated with Wilson lines
along light-like directions. These are because a gluon attached to such a Wilson line has no
bound on its rapidity, whose integration thus extends to infinity and leads to divergence. So
they are called rapidity divergence. In the factorization argument, light-like Wilson lines arise
because we use the light-like vectors n and n̄ in the approximations Eqs. (2.82) and (2.83).
    To cure this issue, we modify the vectors n and n̄ off the light cone,
                           n → u = n − e2y n̄,   n̄ → ū = n̄ − e−2ȳ n,                 (2.189)
where ȳ and y have large values and are approximating the rapidities of n̄ and n, with ȳ > 0
and y < 0. The u and ū are to replace the n and n̄ in Eq. (2.82). The minus signs are in
                                               116


order not to introduce extra poles for the soft momenta in Eq. (2.82). This makes u and ū
space-like.
    To further require the soft approximator on a Wilson line to give back the same Wilson
line, we require
                                       k̂qs · ū = kqs · ū,     k̂q̄s · u = kq̄s · u,                        (2.190)
for the soft momenta kqs and kq̄s flowing into Cq and Cq̄ , respectively. This can be done by
choosing
              (kqs · ū) n           −                                 (kq̄s · u) n̄
       k̂qs =               = (kqs     − e−2ȳ kqs+
                                                    ) n,    k̂q̄s =                      +
                                                                                     = (kq̄s        −
                                                                                             − e2y kq̄s ) n̄, (2.191)
                 ū · n                                                   u · n̄
which are still along the lightcone directions n and n̄, and keep the important minus or plus
components intact. The modified approximations in Eqs. (2.189) and (2.191) differ from the
original ones only by power-suppressed effects due to the parameters y and ȳ. They still
keep the soft gluons lightlike and on-shell when flowing into the collinear subgraphs, and
vanishing in the hard subgraph. So Ward identity can work equally and exactly, leading to
modified soft factor in Eq. (2.192),
                                              1
                        S (s,s̄) (ȳ, y) =       tr ⟨0|W (s) (∞, 0; ū) W (s̄)† (∞, 0; u)|0⟩.                 (2.192)
                                             Nc
    Similar modification should also be done for the collinear factors in Eq. (2.188). But
as we shall see shortly, subtractions of soft subregions from the collinear region cancel the
rapidity divergences.
                                                           117


2.8.3       Factorization with subtraction
                                 r,r̄
We examine some region Rs,s̄          that has r (r̄) collinear gluons connecting H to Cq (Cq̄ ) and
s (s̄) soft gluons connecting S to Cq (Cq̄ ), at a particular order for all these subgraphs, and
                                                                  r,r̄
we include all such possible diagrams, denoted as Γ/Rs,s̄              .
    For a particular region R of a certain graph Γ, a smaller region R′ < R can have some
collinear lines of R in S, or some hard lines of R in the collinear subgraphs. Identifying the
subgraphs for each region, together with subtractions of smaller regions, each subgraph can
include the same subgraph but with sub-approximators applied within. After summing over
                        r,r̄
all the graphs in Γ/Rs,s̄    , each subgraph is summed over all possible diagrams of its order,
together with all possible (nested) subtractions. Since at each stage of the approximator,
Ward identity is allowed to factorize collinear gluons out of the hard subgraph, and soft gluons
out of collinear subgraphs, each of the hard and the collinear factors involves factors of gauge
links structures like Eqs. (2.192) and (2.188). Thus even for the graph in Fig. 2.3(b) with
subtractions in each subgraph, Ward identity can be applied equally to factorize different
subgraphs. The resultant hard, collinear, and soft factors are at the corresponding orders,
but now with subtractions for smaller regions.
                                                             r,r̄
    Since the soft subgraphs of any diagram in Γ/Rs,s̄            do not contain smaller regions, there
is no further subtractions, and Eq. (2.192) is the final soft factor.
    The collinear subgraphs, however, contain subtractions of smaller regions where some
of the lines become soft. For example, let us consider the factorized subgraph Cq without
subtraction. Extending its loop momenta to all regions has converted it to the Green function
expressions in Eq. (2.188). Now let us confine the loop momenta to the q-collinear region,
and only allow them to reach the soft subregions. Then this collinear subgraph Cq can be
                                                    118


further decomposed into two subgraphs, as shown in Fig. 2.17(a), one collinear subgraph CA
whose lines are collinear to q, and one soft subgraph SA collecting all the soft lines.
                                    p1                                p1
                           CA                                CA
                                  rSA                              ū     rSA
                       rA        SA                      rA              SA
                                                                    u     rS
                                   rS
                          n                                 n
                            (a)                               (b)
Fig. 2.17: Factorization of the collinear subgraph Cq . (a) is one of its leading regions that
has a collinear and a soft subgraphs within Cq . (b) is the result of factorizing the soft gluons
onto Wilson lines.
    Each particular graph in Cq can be decomposed as Eq. (2.17)(a) in many ways, defining
a whole set of regions. For each such region can be defined an approximator that inherits
from the approximator for the whole graph in Fig. 2.3(b). Since we only consider the two
momentum regions, this approximator is denoted in Fig. 2.17(a) by the thick green hooked
line. To its left are the collinear momenta flowing along the Wilson line along n with large
plus momentum components. The soft momenta are approximated as Eq. (2.191) when
flowing into CA or across the hooked line, beyond which they are exactly zero on the Wilson
line. Then, the same approximation allows us to factorize the soft gluons attached to CA to a
Wilson line along ū, while those attached to the Wilson line are directly separated from those
collinear lines onto a separate Wilson line whose rapidity can be modified to y without loss
of the leading-power accuracy. The result is shown in Fig. 2.17(b), for which we considered
the specific case when SA is connected to CA via rS A soft gluon lines and to the Wilson line
via rS ones.
    In a certain nesting of regions that constitutes one subtraction term in the collinear
                                              119


subgraph Cq , a larger region would take some of the soft lines, which have already been
factorized into the SA factor in Fig. 2.17(b), to collinear to CA . This further decomposes SA
into two subgraphs, CA′ and SA′ , as shown in Fig. 2.18(a). The region approximator works in
the same way, and we can factorize the soft gluons out of CA′ onto Wilson lines of the same
structure, one along u and the other along ū.
                               CA′                                           CA′
                     ū                  rSA′                    ū                   ū          ′
                                                                                                 rSA
                         rA′            SA′                            rA′                    SA′
                                                                                       u        rS′
                             u             rS′
                                                                            u
                               (a)                                             (b)
Fig. 2.18: Factorization of the soft subgraph SA within the collinear subgraph Cq . (a) is
one of its leading regions that has a collinear and a soft subgraphs within SA , and (b) is the
result of factorizing the soft gluons onto Wilson lines.
    This procedure can be carried on iteratively, with one minus sign for each iteration. For
the Cq at a certain order, summed over all graphs with the same number of gluons r, the
subtractions are on all possible such iterations, so we have
                        Xr     X               X
       Cqr,sub = Cqr +                                   (−1)n Cqr0 · S s1 ,r1 (ȳ, y) · · · S sn ,rn (ȳ, y), (2.193)
                        n=1 s1 ,··· ,sn r0 +r1 +···rn =r
where si , ri ≥ 1 for i ≥ 1, and the sum of perturbative orders of all factors on the right-hand
side is the same as Cqr . The Cqr is the unsubtracted collinear factor given in Eq. (2.188).
    The same subtraction procedure can be done for the other collinear factor Cq̄ , following
the same idea of “refactorization”, which leads a result similar to Eq. (2.193). Then for the
                                                           120


                                    r,r̄
sum over diagrams in Γ/Rs,s̄             , we achieve a factorization result,
                                X
                                      Γ/Rs,s̄ r,r̄
                                                   = H sub · Cqr,sub · Cq̄r̄,sub · S s,s̄ (ȳ, y),                            (2.194)
where the hard factor with subtractions of smaller regions can be simply obtained by match-
ing the left-hand side onto the right-hand side with the definitions of the last three factors.
    Clearly, with careful subtraction included in Eq. (2.194), we can sum over graphs to all
orders for each subgraph independently, and over r, r̄, s, and s̄. This is equivalent to a sum
over all graphs and all regions, so reproduces the complete leading-power contribution of the
Sudakov form factor. This converts the soft factor to a complete expression of Wilson lines,
                                  X∞
                                                             1
                 S(ȳ, y) =               S s,s̄ (ȳ, y) =      tr ⟨0|W (∞, 0; ū) W † (∞, 0; u)|0⟩,                          (2.195)
                                 s,s̄=0
                                                            Nc
where s and s̄ are either both 0 or both nonzero. The sum over collinear subgraphs Cq and
r in Eq. (2.193) can be converted to independent sums over all the indices, n, si , and ri ,
                 X                    X                X∞             X
         Cqsub =        Cqr,sub =            Cqr +         (−1)n ·             Cqr0 · S s1 ,r1 (ȳ, y) · · · S sn ,rn (ȳ, y)
                 Cq , r               Cq , r           n=1          si , ri ≥1
                               "          ∞
                                                                            #
                 X                       X                               n       Cqunsub
               =        Cqr · 1 +             (−1)n S(ȳ, y) − 1               =             ,                                (2.196)
                 Cq ,r                   n=1
                                                                                  S(ȳ, y)
                  P
where Cqunsub ≡         Cq , r Cqr sums over the graphs and r in the unsubtracted collinear factor
in Eq. (2.188). In the second-to-last step we have used S 0,0 (ȳ, y) = 1. To deal with ra-
pidity divergences, we should have taken n in Fig. 2.17(a) to be off light cone, so that the
collinear factor also has a rapidity cutoff y. Then we have the all-order collinear factors with
                                                               121


subtractions,
                                        Cqunsub (yq , y)                            Cq̄unsub (ȳ, yq̄ )
                   Cqsub (yq , y) =                      ,    Cq̄sub (ȳ, yq̄ ) =                       ,   (2.197)
                                           S(ȳ, y)                                     S(ȳ, y)
where yq > 0 and yq̄ < 0 are the rapidities of external on-shell quark and antiquark, taken
to be massive and physical. Notably, the rapidity divergences associated with y → −∞ or
ȳ → ∞ are cancelled in both factors with subtraction. Combined with the soft factor, we
then have the all-order factorization result for the Sudakov form factor,
                                                 Cqunsub (yq , y) · Cq̄unsub (ȳ, yq̄ )
                       ΓSudakov = H sub ·                                                 + p.s.            (2.198)
                                                               S(ȳ, y)
Evidently, in this combination, the dependence on the artificially introduced rapidity cutoffs
y and ȳ are also cancelled. Now we see how including subtraction converts the soft factor
into the denominator, contrary to what one might naively expected from the structure of
Fig. 2.16(b).
    To move forward, it helps to takes the limit y → −∞ and ȳ → ∞. But this makes
each factor in Eq. (2.198) ill-defined so forbids a study of each of them alone. Therefore,
as introduced in [Collins, 2013], one can define two reorganized collinear factors by noticing
the property S(ȳ, y) = S(ȳ − y) by Lorentz symmetry,
                                                                  s
                  eq = C eq (yq , yn ) = C unsub (yq , −∞)                      S(∞, yn )
                 C                            q                                                           ,
                                                                      S(∞, −∞) S(yn , −∞)
                                                               s
                  eq̄ = Ceq̄ (yn , yq̄ ) = C unsub (∞, yq̄ )               S(yn , −∞)
                 C                            q̄                                                   ,        (2.199)
                                                                   S(∞, −∞) S(∞, yn )
with a new rapidity separator parameter yn . Then Eq. (2.198) becomes
                           ΓSudakov = H sub · C      eq (yq , yn ) · Ceq̄ (yn , yq̄ ) + p.s.                (2.200)
                                                         122


Requiring invariance of Eq. (2.200) with respect to yn leads to an evolution equation that
allows to resum large logarithms. We will not cover this topic in this thesis.
    Contrary to DIS, the Sudakov form factor is only a perturbative amplitude and does
not directly correspond to a physical process. In QCD, quarks do not appear as observed
particles, and massless external states cause infrared divergences; in our analysis, we have
implicitly added nonzero masses to the quarks as infrared regulators. The infrared diver-
gences due to the massless gluon are also implicitly regulated by methods like dimensional
regularization which we have not specified carefully.
    Because of this, the collinear factors in Eq. (2.199), albeit as nonperturbative as the
PDFs, are not to be obtained from fitting to experimental data. The factorization analysis,
however, serves as a very useful prototype for discussing factorization in more complicated
and physical situations, including the single-hadron inclusive process in leptonic collisions,
the SIDIS, and the Drell-Yan processes. Another important ingredient needs to be developed
there. Because any observed particles in the final state induce pinch singularities, we may
only talk about inclusive observables, for which summing over all other unobserved particles
allows us to employ the unitarity of QCD to show that all those associated pinch singularities
are cancelled. This topic is not of direct relevance to the rest of this thesis, so will not be
further reviewed.
    The focus of the rest of this thesis is on exclusive processes, which are similar to the
exclusive Sudakov form factor amplitude but differ by concerning with only hadronic external
states. Similar decomposition of a region into hard, collinear, and soft subgraphs is to be
carried out, with slightly different structures for each factor. As we will show, while working
with exclusive processes with a single hard scattering H, each hadron is connected to H via
a collinear set of parton lines that are very close to each other so behave as color singlet as a
                                               123


whole, to the leading-power accuracy. Soft gluons attached to them are therefore cancelled.
This mechanism of cancellation differs from those inclusive processes that use unitarity, and
is a direct result of color confinement.
                                            124


Chapter 3
QCD Factorization of exclusive
processes
We have reviewed in Ch. 2 the main principles and methodology of QCD factorization, which
applies to hadronic scattering processes with one hard scale Q much greater than ΛQCD .
Normally, this would localize the interaction to become sensitive to the partonic degrees of
freedom in the hadron(s). To the leading power in ΛQCD /Q, only one parton enters1 the
hard interaction. This breaks the incoming hadron into colored objects, which exchange
soft gluons to neutralize the colors. The final state is then a series of hard jets surrounded
by soft hadrons. Any query about a specific soft hadron would touch the long-distance
nonperturbative QCD dynamics and go beyond the control of a perturbative method. Hence
in such situations it is more sensible to study inclusive observables, which “inclusively sums
over” (a fancier way of saying “neglecting”) anything else besides the directly observed hard
particles or quantities. Such processes are called inclusive processes, where the unitarity
sum cancels the soft gluon exchanges between different collinear sectors and establishes
independent parton density functions or fragmentation functions. The universality of those
nonperturbative functions gives QCD factorization predictive power, and allows them to be
measured to reveal certain aspects of the hadron structures.
   1
     A similar story holds for the inclusive hadron production process where one parton leaves the hard
interaction and initializes a jet of hadrons.
                                                   125


    It should be noted that inclusiveness is not the absolutely necessary condition for well-
defined observables in hadronic scattering, but more of a practically convenient choice against
our inability to deal with the nonperturbative soft regime. This is in contrast to the soft
divergences in QED: There the massless photons pose infrared divergences in both virtual
and real processes, and it is only the inclusive observables that are well defined, given that
arbitrarily soft photons can never be detected by an equipment of a finite size. In QCD,
however, due to the color confinement, all final-state particles can in principle be captured
by detectors — no soft gluons elude the observation. Hence, it is in principle sensible to talk
about the amplitude or cross section for producing a certain number of particles of given
types and momenta. Such processes are termed exclusive processes.
    In practice, including the context of this thesis, exclusive processes usually refer to a nar-
rower class of processes in which hadrons are unbroken, given that the multiple soft radiations
triggered by broken hadrons are easily intractable, both theoretically and experimentally. In
this sense, we divide the exclusive processes to be discussed in this thesis into three types:
    • large-angle scattering, referring generally to a hadronic 2 → n process in which the
       final-state hadronic particles are hard and well separated, and no hadrons are found in
       the direction(s) of the hadron beam(s),
    • single-diffractive scattering, which is similar to the previous case, but has one diffracted
       hadron in one of the hadron beam directions, and
    • double-diffractive scattering, which has one diffracted hadron in both hadron beam
       directions.
We only discuss at most 2 → 2 processes for the large-angle scattering, with n > 2 a triv-
ial generalization. The minimal configuration involves only one hadron, and the maximal
                                                126


one is a scattering of two hadrons into two other hadrons. At a high collision energy, a
large-angle scattering happens at such a local region, characterized by the inverse of the
transverse momentum scale of the final-state particles, that soft gluons communicating dif-
ferent collinear sectors at a long-distance scale are cancelled, leading the amplitudes to be
factorized into hadron distribution amplitudes (DAs). For the single-diffractive scattering,
we can similarly show the soft cancellation, and then the diffraction subprocess is factorized
into generalized parton distributions (GPDs). For the double-diffractive scattering and be-
yond, however, we will show that soft gluons can be pinched in the Glauber region, which
prohibits a factorization theorem from being derived.
    By the exclusive nature of such processes, each hadron is connected to the hard scattering
by at least two partons. This makes them more power suppressed than inclusive processes, by
the counting rules in Table 2.1. Hence, exclusive processes are more suitably studied at low
energy scattering, while as the colliding energy increases, hadrons are more likely to break,
leading to the inclusive regime. Nevertheless, the universal parton correlation functions,
the DAs and GPDs, obtained from the factorization of exclusive processes, provide valuable
information on the hadron structures complementary to the correlation functions obtained
from inclusive processes, as will be discussed in more details in Ch. 4.
3.1      Large-angle exclusive meson scattering
We confine our discussion within large-angle 2 → 2 exclusive meson scatterings,
                               A(p1 ) + B(p2 ) → C(q1 ) + D(q2 ),                        (3.1)
                                               127


in which we always take A as a meson. These processes can be categorized according to
the beam particle B, and we look at three types of processes: (1) electron-meson scattering,
with B = e− , (2) photon-meson scattering, with B = γ, and (3) meson-meson scattering,
with B = meson. The factorization discussion can be trivially adapted to other processes
with no mesons in the initial states or processes involving baryons.
3.1.1      Large-angle electron-meson scattering
To the leading order (LO) in QED, the beam particle B = e− is scattered into the final
state, so we take C = e− as well. The other particle D can be either a photon or a meson,
which we now discuss sequentially.
3.1.1.1    Single-meson process: D = γ
First, electric charge conservation constrains the meson A to be neutral, so for simplicity,
we take A = π 0 to be the charge-neutral pion. The scattering
                                π 0 (p1 ) + e(p2 ) → e(q1 ) + γ(q2 )                   (3.2)
thus gives the π-γ transition form factor [Lepage and Brodsky, 1980]. As usual, we define
the Mandelstam variables
                       s = (p1 + p2 )2 ,   t = (p1 − q1 )2 , u = (p1 − q2 )2 .         (3.3)
We work in the c.m. frame, where A always moves along the +ẑ direction, and e(q1 ) has a
transverse momentum ⃗qT . In the limit s → ∞ while t/s and u/s stay constant, i.e., s → ∞
                                                 128


         √
with qT / s constant, the pion is connected to the hard scattering via a set of collinear lines,
as shown by the reduced diagram in Fig. 3.1(a). Following the Landau criterion in Sec. 2.2.3,
it represents a general pinch surface in the parton loop momentum space that possibly gives
mass divergences. The most general pinch surface here can also have arbitrarily many soft
lines connecting A and H, but they are power suppressed by the general power counting rule
in Sec. 2.3, so are neglected.
    The power counting rules for the collinear lines, as discussed in Sec. 2.3, can be summa-
rized as (1) one collinear fermion line or transversely polarized gluon line is associated with a
                                      √
power λ/Q, where Q = O(qT ) = O( s) and λ = O(ΛQCD ) = O(mπ , fπ ), with mπ and fπ the
pion mass and decay constant, respectively, and (2) a longitudinally polarized gluon line is
associated with a power (λ/Q)0 . Hence, the leading region should have two collinear quark
or transversely polarized gluon lines connecting A to H, together with arbitrarily many lon-
gitudinally polarized gluons. The pure gluon channel violates isospin symmetry. So we only
have one type of leading regions, with one collinear subgraph and a hard subgraph, joined by
a pair of quark lines and arbitrarily many gluon lines of longitudinal polarization, similar to
the DIS shown in Fig. 2.13. One example of the LO diagrams is shown in Fig. 3.1(c), where
                                                               ∗
the scattered electron exchanges a highly-virtual photon γee     with the quark. The latter is
then excited to a high virtuality. After a short lifetime, it annihilates with the antiquark
to emit a real photon. Exchanging the roles of quark and antiquark gives the other LO
diagram.
    One can immediately notice the difference of exclusive processes from inclusive ones like
Figs. 2.13 and 2.14 that now a collinear subgraph in the amplitude is connected to the
hard subgraph by at least two parton lines. This is because the hadron participating in
the exclusive process must be a color singlet. To stay intact, they must only exchange a
                                               129


                                                                             p2               q1
                  p2                  q1        p2                q1
                                                                                    ∗     q
                                                                                   γee
                          H                              H                              qe
                 p1 A                 q2       p1 A               q2         p1 A             q2
                         (a)                            (b)                          (c)
Fig. 3.1: (a) Reduced diagram for a general pinch surface of the exclusive scattering process
π 0 (p1 ) + e(p2 ) → e(q1 ) + γ(q2 ). The dots represent an arbitrary number of collinear lines. (b)
is the leading region, where the dots alongside the gluon line represent an arbitrary number
of collinear longitudinally polarized gluons. (c) is one LO diagram. Reversing the fermion
arrow gives the other LO diagram.
color-singlet state with the hard interaction. As a result, the leading power for the exclusive
amplitude in Eq. (3.2) already counts as (λ/Q)1 .
     The factorization works in a way similar to DIS treated in Sec. 2.7. We include all the
collinear propagators in the collinear subgraph C. For each ki flowing in H, which scales as
                                              ki ∼ (Q, λ2 /Q, λ),                                (3.4)
we approximate it by only retaining the plus component,
                                             ki → k̂i = (ki · n)n̄,                              (3.5)
where we used the same light-like auxiliary vectors defined in Eq. (2.59). We project on shell
the quark and antiquark lines external to H by inserting the Dirac matrices, respectively,
                            γ · n̄γ · n    γ −γ +                    γ · nγ · n̄    γ +γ −
                     PA =                =           and     P̄A =               =          .    (3.6)
                                 2            2                           2            2
Each gluon has its polarization dominantly proportional to its momentum, so we approximate
                                                      130


its connection to H by
                                                                    k̂iµi nνi
                         Hµi (ki )g µi νi Cνi (ki ) 7→ Hµi (k̂i )               Cν (ki ),            (3.7)
                                                                  ki · n − iϵ i
for a particular gluon of momentum ki flowing into H. Since there is no soft region here,
the iϵ is not important; we added it only for convention.
          H                   H                        H                        H                  H
                                                                          k̂q           k̂q̄   k̂q    k̂q̄
   kq   k   kq̄ =−                kq̄     −    kq    k
                                                                 =                           +
                      kq    k                            kq̄         kq + k   k        kq̄     kq  k kq̄ − k
          C                   C                        C                        C                  C
Fig. 3.2: Graphic representation of the two steps to detach a longitudinally polarized collinear
gluon from the collinear subgraph C to the hard subgraph H, and reconnect it to correspond-
ing gauge links of the C. The red thin dashed lines represent the color flows.
      The approximator defined in Eqs. (3.5)–(3.7) can be collectively denoted as T̂ . It acts
on one leading region R, which has the decomposition into a hard subgraph Hn and and a
collinear subgraph Cn as in Fig. 3.1(b), of a certain diagram Γ, where n is the number of
gluons connecting Hn and Cn alongside the quark and antiquark lines. The leading-power
contribution from the region R can be obtained by applying T̂ , with the contribution from
smaller regions subtracted,
                                           CR Γ = Hnsub · T̂ · Cn .                                  (3.8)
Since a smaller region with respect to R necessarily has some lines in Hn belong to the
collinear region, the subtraction in CR Γ only affects Hn . In Eq. (3.8), Hnsub is the hard
subgraph with subtraction for smaller regions, and T̂ acts to the left on Hnsub by neglecting
small components of collinear momenta (as specified in Eq. (3.5)) and inserting spinor and
Lorentz projectors (as specified in Eqs. (3.6) and (3.7)). Hnsub can be further written in a
                                                       131


form like Eq. (2.175) after summing over graphs.
     By our assumptions (1) and (3) in Sec. 2.1.2, the amplitude of Eq. (3.2) is given by the
sum over all graphs. Each region R of a graph Γ is uniquely specified by the hard subgraph
H and collinear subgraph C. Varying H with a given C or vice versa corresponds to a
different graph. Summing over all regions of all diagrams is equivalent to summing over the
subgraphs H and C individually. And for each given subgraph H, the associated subtraction
for smaller regions is also uniquely determined. Now for a given Cn , we sum over all possible
diagrams in Hn at a given n and perturbative order and over all possible attachments of the
collinear gluons onto the hard subgraph. This, together with the H(k̂) · k̂ structure after the
approximation in Eq. (3.7), allows the use of Ward identity for the collinear gluons, which
also equally applies to each subtracted term in Hnsub (as illustrated in Sec. 2.7.6).
     Due to the presence of two quark lines, the Ward identity results in two gauge links that
collect all the collinear gluons, different from the Sudakov form factor in Sec. 2.8. This can
be easily demonstrated for n = 1, as shown in Fig. 3.2, by the same method of finding the
missing terms as in Fig. 2.14. Each quark is connected by a gauge link that goes along the
lightcone direction n to ∞ in the future, and the gluon can be connected to either one. The
same procedure can be applied inductively to an arbitrary n, and the result only depends
on the colors of the external quark legs. Therefore, as in Sec. 2.7.5, we would reach the
same Ward identity result by attaching all the gluons to two gauge links. The vertices and
propagators along the gauge links are the same as those that can be obtained from the
operator2
                       Z              n                                          on
                             4   ik·x           †
                                                          
                           d ye         ψ̄q (0)W (∞, 0; n) j [W (∞, y; n)ψq (y)]i ,                  (3.9)
   2
     Eq. (3.9) only specifies the momentum k going out from the quark-to-Wilson-line vertex. The momentum
from the antiquark-to-Wilson-line vertex is p1 − k, determined by momentum conservation.
                                                    132


which has been expanded to the n-th order in g, and where ψq is the quark field of flavor q,
W (∞, y; n) is the Wilson line from y to ∞ along n, as defined in Eq. (2.158), and i, j are
color indices in the fundamental representation. As Eq. (2.171), going from the diagrammatic
expressions to Eq. (3.9) entails a relabelling of gluon momenta and colors. Different from
Eq. (2.171) though, the expansion of Eq. (3.9) to the n-th order allows arbitrary numbers of
gluons attached to either gauge link, whereas Eq. (2.171) contains a cut line so the number
of gluons assigned to each side is fixed.
    The hard subgraph then only has two external quark lines, with momenta k and p1 − k,
respectively. So we can write Eq. (3.8) as
                                                      h          i
                               H sub (k̂, p̂1 − k̂) ⊗ T̂w · Cn (k; p1 ),                   (3.10)
where T̂w acting on Cn is not different from T̂ , but just refers to the fact that Ward identity
has been used to attach all collinear gluons onto gauge links that only belong to Cn . Then
summing over all possible diagrams for Hn and Cn gives a factorized result,
                        X           Z
                                           d4 k
                            CR Γ =              Hβα;ji (k̂, p̂1 − k̂) Cαβ;ij (k; p1 ),     (3.11)
                        R,Γ
                                         (2π)4
where we have left explicit the momentum convolution, and the dependence on colors and
                                         P
spinor indices. In Eq. (3.11), H =           H  H sub is the subtracted hard subgraph with gluons
                              P
factored out, and C = T̂w        C C is the collinear subgraph, both being summed over all
diagrams. Since there is no any subtraction involved in the collinear subgraph, we can write
                                                    133


it as a matrix element form, by extending Eq. (3.9) to all orders,
                                Z                  n                         
        Cαβ;ij (k; p1 ) =PA,αα′   d4 yeik·y ⟨0|T      ψ̄q,β ′ (0)W † (∞, 0; n) j
                                               × [W (∞, y; n)ψq,α′ (y)]i |π 0 (p1 )⟩P̄A,β ′ β , (3.12)
where we have left explicit the dependence on colors and spinor indices, and have included
the spinor projectors in Eq. (3.6). By construction, Eq. (3.11) approximates the amplitude
of Eq. (3.2) at the leading power.
    Now H is only convoluted with C by (1) color indices i and j, (2) spinor indices α and
β, and (3) the plus component of the quark (or antiquark) momentum k. Since the pion is
color neutral, only the color singlet component of Cαβ;ij is nonzero, so we can define a gauge
invariant factor Cαβ by summing over the color diagonal elements,
                                                        1
                                   Cαβ;ij (k; p1 ) =       δij Cαβ (k; p1 ).                    (3.13)
                                                      Nc
Similar to Sec. 2.7.1, by expanding Cαβ (k; p1 ) in terms of the 16 independent Dirac matrices,
we can see that only the γ − , γ5 γ − , and σ −⊥ components survive under the projection of PA
and P̄A . The pseudoscalar nature of π 0 further kills the γ − and σ −⊥ components, ending up
with only one spinor structure,
                                                    γ γ · n̄ 
                                                      5
                                  Cαβ (k; p1 ) =                    C(k; p1 )                   (3.14)
                                                         2       αβ
                                                     134


with the coefficient being a singlet in both color and spinor space,
               Z                 n                                                               o
                                                                γ · nγ5
   C(k; p1 ) =    d4 yeik·y ⟨0|T     ψ̄q,α (0)W † (∞, 0; n) i             [W (∞, y; n)ψq,α (y)]i |π 0 (p1 )⟩.
                                                                     2
                                                                                                           (3.15)
Finally, for the momentum convolution, since the hard part H only depends on k · n, we
insert into Eq. (3.11) the factor
     Z                                   Z                                         Z      Z
                    k·n                                                                        dλ iλ(x p1 ·n−k·n)
1=      dx δ x −              = (p1 · n) dx δ (x p1 · n − k · n) = (p1 · n) dx                    e               .
                    p1 · n                                                                     2π
                                                                                                           (3.16)
Together with the color and spinor factors in Eqs. (3.13) and (3.14), the convolution in
Eq. (3.11) becomes
                   Z                          γ γ · n̄                                
                                      1          5
                       dx (p1 · n) δij                        Hβα;ji (xp̂1 , (1 − x)p̂1 )
                                     Nc             2      αβ
                                               Z        Z                                  
                                                     dλ       d4 k iλ(x p1 ·n−k·n)
                                            ×                      e               C(k; p1 ) ,             (3.17)
                                                     2π      (2π)4
This completes the derivation of factorization for the amplitude,
                                  XZ
                   Mπ0 e→eγ =                dx Dq/π0 (x) Hq (x; ⃗qT , s) + O(ΛQCD /qT ),                  (3.18)
                                    q
where we have left explicit the sum over the quark flavor q, and changed the notation C to
define the DA for π 0 ,
                Z       Z
                   dλ        d4 k iλ(x p1 ·n−k·n)
   Dq/π0 (x) =                    e                 C(k; p1 )
                   2π       (2π)4
            Z  ∞                    n                                                            o
                 dλ iλx p1 ·n                      †
                                                                γ · nγ5
         =           e        ⟨0|T      ψ̄q (0)W (∞, 0; n)               [W (∞, λn; n)ψq (λn)] |π 0 (p1 )⟩
              −∞ 2π                                                 2
                                                       135


          Z  ∞                      n                                     o
                  dλ iλx p1 ·n              γ · nγ5
        =            e         ⟨0|T ψ̄q (0)         W (0, λn; n)ψq (λn) |π 0 (p1 )⟩.           (3.19)
            −∞    2π                            2
    The integration of k − and ⃗kT sets the operator on the light cone n, along which the
operators have canonical commutation relations. Then we can equivalently remove the time
ordering in Eq. (3.19). It can also be shown by the analyticity properties of C(k; p1 ) as
a scattering amplitude under the integration of k − [Diehl and Gousset, 1998], following
the assumption (3) in Sec. 2.1.2 and that the analyticity properties are the same as the
corresponding perturbative Feynman diagrams. This would allow the insertion of physical
states,
                       XZ        ∞
                                   dλ iλx p1 ·n                            γ · nγ5
         D q/π 0 (x) =                 e       ⟨0| ψ̄q (0)W † (∞, 0; n) |X⟩
                         X      −∞ 2π                                           2
                                   × ⟨X| [W (∞, λn; n)ψq (λn)] |π 0 (p1 )⟩
                       X                                                             γ · nγ5
                     =        δ (pX · n − (1 − x)p1 · n) ⟨0| ψ̄q (0)W † (∞, 0; n) |X⟩
                         X
                                                                                           2
                                   × ⟨X| [W (∞, 0; n)ψq (0)] |π 0 (p1 )⟩,                      (3.20)
where momentum conservation constrains the total plus momentum of the state X,
                                             p+              +
                                               X = (1 − x)p1 .                                 (3.21)
For X to be a physical state, we must require p+        X ≥ 0, so that x ≤ 1. On the other hand, by
using the canonical commutation relation for ψ and ψ̄, Eq. (3.20) can also be written as
                           XZ      ∞
                                      dλ iλx p1 ·n n γ · nγ5
         Dq/π0 (x) = −                   e        Tr            ⟨0| [W (∞, λn; n)ψq (λn)] |X⟩
                             X    −∞  2π                  2
                                                    136


                                  ×⟨X| ψ̄q (0)W † (∞, 0; n) |π 0 (p1 )⟩
                         X                           n γ · nγ
                                                             5
                    =−      δ (pX · n − x p1 · n) Tr           ⟨0| [W (∞, 0; n)ψq (0)] |X⟩
                          X
                                                          2
                                                             
                                  ×⟨X| ψ̄q (0)W † (∞, 0; n) |π 0 (p1 )⟩ ,                  (3.22)
where Tr takes the spinor and traces. Now we have
                                        p+         +
                                          X = x p1 ≥ 0,                                    (3.23)
which requires x ≥ 0. Together, we must have x ∈ [0, 1] for the DA to be nonzero. Therefore,
we should constrain the x integration in Eq. (3.18) to be from 0 to 1.
    Such constraint is not a mandatory condition inherent from factorization, but as a result
of having the operator on light cone in collinear factorization and setting x = k + /p+     1 on
the real axis. This then causes a problem of endpoint singularity. We note that the above
approximations T̂ defined in Eqs. (3.5)–(3.7) is true only for the scaling in Eq. (3.4), which
corresponds to the pinch surface whose surrounding region gives the leading-power contribu-
tion to the amplitude. In principle, one should keep the scaling ki+ ∼ O(Q) throughout the
factorization analysis. Nevertheless, in the result of factorization, Eq. (3.18), the variable
x is integrated from 0 to 1, so that we have to include the region where one of the active
partons has momentum ki+ ≪ Q. Perturbatively, this does not lead to a pinch, so we should
have deformed the contour of ki+ by O(Q) to make the associated propagator in the hard
subgraph to have high virtuality. For example, as shown later in Eq. (4.118), the LO hard
coefficient contains a term that is proportional to 1/(x − iϵ)Q2 which becomes soft as x → 0,
and we should deform the contour of x to the lower half complex plane to make Im x ∼ O(1).
                                                137


Similar issue arises as x → 1. However, since the DA only has support in x ∈ [0, 1], such
deformation is forbidden by the end points of the z integration. Therefore, the validity of
the DA factorization in Eq. (3.18) needs to be supplemented with an additional assumption
that the end point region should be strongly suppressed by the DA, which we refer to as
soft-end suppression. This situation could be improved by the Sudakov suppression factor
introduced in [Li and Sterman, 1992]. Further work is needed on this issue.
    So far, we have been working with the bare DA and hard coefficient, without caring
for the possible UV divergences introduced by the approximator T̂ . The original amplitude
Mπ0 e→eγ contains no UV divergence. But the approximator T̂ short-circuits the integration
of k − and ⃗kT into C in Eq. (3.17), and extends the integration to infinity. This introduces
an (artificial) UV divergence. Since the hard coefficient H is defined with subtraction of
smaller regions, which are themselves factorized in the same way the DA is factorized (see
Eq. (2.175)), whatever UV divergences introduced by T̂ in the DA have been compensated
by the subtraction in H. In this way, both the DA and H contain UV divergences, which
cancel each other and make up a finite convolution result in Eq. (3.18). Nevertheless, it
would be nice to define a renormalized DA by taking off the UV divergences therein.
    Similar to the PDF renormalization in Sec. 2.7.7, the DA is also renormalized multiplica-
tively,
                                      Z  1
                     Dq/π0 (x, µ) =        dz Z(x, z; αs (µ); ϵ−1 )Dq/πbare
                                                                          0 (z; ϵ
                                                                                 −1
                                                                                    ),  (3.24)
                                       0
with an invertible renormalization coefficient,
                                       Z  1
                      bare
                    Dq/π 0 (x; ϵ
                                −1
                                   ) =      dz Z −1 (x, z; αs (µ); ϵ−1 )Dq/π0 (z, µ).   (3.25)
                                        0
This introduce a factorization scale µ dependence in the renormalized DA, which will lead
                                                  138


to an evolution equation (not to be discussed in this thesis). We note that due to the lack
of gluon channel, the DA renormalization has no mixing between the quark and gluon or
between different quark flavors. Restoring the “bare” notation in Eq. (3.18) and substituting
Eq. (3.25) for the bare DA, we get the same factorization formula for the amplitude, but
now in terms of the UV renormalized DA and infrared (and UV) finite hard coefficient,
                            XZ     1                                  
                                                              ⃗qT qT
              Mπ0 e→eγ =             dx Dq/π0 (x, µ) Hq    x; √ ,        + O(ΛQCD /qT ),        (3.26)
                             q   0                               s µ
which looks like Eq. (3.18) but has an extra factorization scale dependence. The renormalized
hard coefficient is related to the bare one by
                               Z 1
                        ⃗qT qT
               Hq x; √ ,          =      dz Z −1 (x, z; αs (µ); ϵ−1 ) Hbare (z; ⃗qT , s; ϵ−1 ). (3.27)
                           s µ         0
    In this way, we finished proving the factorization for the amplitude of Eq. (3.2). We
have not only obtained the operator definition for the pion DA, given in Eqs. (3.19) and
renormalized in (3.24), but also provided a practical procedure for calculating the hard
coefficient to all perturbative orders. By projecting the pion state in Eq. (3.26) to an on-shell
parton-pair state, we can expand both sides order by order and obtain the hard coefficient
at each order by an iterative matching.
3.1.1.2    Double-meson process: D = meson
Now we discuss the electron induced meson production. Similarly, the electromagnetic cur-
rent does not change the flavor of the meson, so for concreteness we take A = D = π + , with
                                                  139


the scattering of other mesons generalized in a trivial way. The process
                                π + (p1 ) + e(p2 ) → e(q1 ) + π + (q2 )                (3.28)
probes the electromagnetic pion form factor. The kinematics are also defined as in Eq. (3.3).
We work in the c.m. frame with the π + (p1 ) along +ẑ direction, under the limit qT ≫ mπ
         √
and qT / s = O(1).
                           p2                q1       p2                  q1
                                    H                           H
                           p1 A              q2       p1                  q2
                                           D             A              D
                                    S                            S
                                   (a)                         (b)
          Fig. 3.3: Leading regions of the exclusive scattering process in Eq. (3.28).
                           p2                q1       p2                  q1
                                   H                             H
                           p1                q2       p1                  q2
                                    S                            S
                                   (a)                          (b)
                 Fig. 3.4: Two examples for the leading region (b) in Fig. 3.3.
    Following the same procedure, we can list the leading region diagrams for the meson
production amplitude, shown in Fig. 3.3. Immediately, one can notice the differences from
the real photon production process discussed above:
  (1) there are two collinear subgraphs now, which are connected by an extra soft subgraph,
      and
  (2) there are two kinds of leading regions, shown in Figs. 3.3(a) and 3.3(b), which we
                                                  140


      denote as region (a) and region (b). For region (b), only one active quark parton
      enters the hard interaction, and the other one is soft and only transmits the needed
      quantum number.
Region (b) raises some theoretical difficulty for factorization argument. However, we note
that such leading regions are obtained based on the soft scaling in Eq. (2.43) with λS =
O(λ2 /Q). In this region, a soft parton has virtuality of order λ4 /Q2 , well below the non-
perturbative threshold, so we consider such ultrasoft region to be cut off by nonperturbative
dynamics. As argued in Sec. 2.3.1, considering λS ≲ λ2 /Q is more important for inclusive
processes where we replace the sum over final-state hadrons by the sum over final-state
on-shell partons. Here we are dealing with exclusive processes, where all the partons are
directly connected to hadrons, so we confine our discussion within λS ≳ λ, for which the
power counting rules are given by the last two columns of Table 2.1. We note the suppression
from having soft momenta flowing through more than one collinear lines. This constrains
the diagrams for region (b) to be at very low order due to the continuity of fermion lines,
while for region (a), we must require the soft gluons to attach to the collinear lines right
before they enter the hard part.
    While it is likely not well defined, the lowest-order diagram for region (b) can be conceived
as in Fig. 3.4(a), where the two quark lines directly attach to the “pion wavefunction”. In
this case, the two collinear lines have virtualities λQ, while the soft subgraph has a power
counting λ3 , so it gives the power counting λ/Q in total, which is one power higher than the
counting (λ/Q)2 for region (a). However, this assumes the bare quark-pion coupling to scale
as 1. In the kinematical regime when the pion is highly boosted, it is hardly conceivable
that all the pion momentum is carried by one of the two valence partons. So we add
into the soft-end suppression assumption made for the eπ 0 → eγ process in Sec. 3.1.1.1 that
                                                141


diagrams like Fig. 3.3(b) receive a high enough suppression from the nonperturbative hadron
wavefunction such that they are power suppressed compared to the region in Fig. 3.3(a). This
assumption is supported by high-order QCD corrections. As shown in Fig. 3.4(b), when there
are gluon connections between the soft and collinear partons, the whole diagram becomes
power suppressed compared to Fig. 3.3(a), by the counting rule in Table 2.1. We leave a
detailed study to future work. For now, we simply note that the soft-end suppression brings
the leading regions down to the one in Fig. 3.3(a).
    To simplify the following discussion, we note that by virtue of the large qT , one can
always boost to the frame where A is moving along +z direction and D is moving along −z
direction, as was done in [Collins and Sterman, 1981; Nayak et al., 2005], which brings the
discussion similar to the Sudakov form factor in Sec. 2.8. This can be achieved in a covariant
way by defining two sets of light-cone vectors
      µ     1              µ      1                 µ     1               µ     1
    wA  = √ (1, ẑ) ,    w̄A = √ (1, −ẑ) ,       wD  = √ (1, ŵ) ,     w̄D = √ (1, −ŵ) ,   (3.29)
             2                     2                       2                     2
where ŵ = (sin θ cos ϕ, sin θ sin ϕ, cos θ) is the direction of the final-state meson D. Then any
momentum four-vector r can be expanded in the wA -wD frame as
                                                 µ
                                      r µ = r + wA + r− wD µ
                                                             + rTµ ,                         (3.30)
where r± = (r · wD,A )/(wA · wD ) are the longitudinal components, and wA · wD ∼ O(1) does
not affect the power counting. Under this notation, we have
                                      r2 = 2 r+ r− wA · wD − rT2 ,                           (3.31)
                                                   142


where rT2 = −gµν rTµ rTν . The A-collinear momentum kA and D-collinear momentum kD have
dominant components along wA and wD , respectively,
                                                     
                             kAµ = kA+ , kA− , kA,T    AD
                                                          ∼ (Q, λ2 /Q, λ),
                              µ       +    −
                                                     
                             kD  = kD   , kD , kD,T    AD
                                                          ∼ (λ2 /Q, Q, λ),              (3.32)
where the subscript “AD” refers to light-front coordinates in the wA -wD frame. A soft
momentum ks exchanged between the A- and D-collinear subgraphs is in the central rapidity
region with respect to the wA -wD frame, so we have
                                                     
                              ksµ = ks+ , ks− , ks,T   AD
                                                          ∼ (λS , λS , λS ),            (3.33)
with λS varying between λ2 /Q and λ. In the following discussion of this section, we will stay
in this frame and omit the subscripts “AD”.
    As noted in Sec. 2.5, however, the Glauber region of the soft gluons requires special care,
where the soft momentum ks has the scaling
                                   ksGlauber ∼ (λ2 /Q, λ2 /Q, λ).                       (3.34)
Similar to the Sudakov form factor case detailed in Sec. 2.5.2, there is no pinch that traps
the soft momentum in the Glauber region. So we can deform the contour to stay away from
the Glauber region. For a soft momentum ks flowing from A into S and then into D, its
minus component only receives poles from the A-collinear lines, which all lie on the upper
half plane, whereas its plus component only has poles from the D-collinear lines which also
                                                   143


lie on the upper half plane. Hence, in this region, we deform the contour as
                             ks+ 7→ ks+ − i v(ks+ ),    ks− 7→ ks− − i v(ks− ),                (3.35)
where v(ks± ) is a positive real function defined in Sec. 2.5.2. Such deformation deforms the
Glauber momenta back to the uniform soft scaling in Eq. (3.33). Then we can define the
approximator T̂ for a leading region R:
  (a) For a soft momentum kSA (kSD ) flowing in A (D), we approximate it by
                                          kSA · wA                            kSD · wD
                       kSA 7→ k̂SA =               wD ,    kSD 7→ k̂SD =                wA .   (3.36)
                                          wA · wD                             wA · wD
  (b) For a soft momentum kSA flowing from A into S, we include its propagator in S and
      approximate its coupling with A by
                                                                          µ   ν
                                                                       k̂SA wA
               CA,µ (kA ; kSA ) g µν Sν (kSA ) 7→ CA,µ (kA ; k̂SA )                 Sν (kSA ), (3.37)
                                                                    kSA · wA − iϵ
      where kA stands for some A-collinear momentum, and the iϵ prescription makes the
                                            −
      artificially introduced pole at kSA        = 0 on the upper half plane, compatible with the
      needed deformation in Eq. (3.35). In Eq. (3.37), we used CA and S to refer to the
      collinear subgraph A and the soft subgraph, respectively.
  (c) For a soft momentum kSD flowing from D into S, we include its propagator in S and
      approximate its coupling with D by
                                                                          µ    ν
                                                                        k̂SD wD
              CD,µ (kD ; kSD ) g µν Sν (kSD ) 7→ CD,µ (kD ; k̂SD )                  Sν (kSD ), (3.38)
                                                                    kSD · wD + iϵ
                                                    144


    Note that we flipped the soft momentum flow relative to that in Eq. (3.35). Similarly
    to Eq. (3.37), here CD refers to the collinear subgraph D.
(d) For an A (D) collinear momentum kAH (kDH ) flowing in H, we approximate it by
                  kAH 7→ k̂AH = (kAH · w̄A )wA ,       kDH 7→ k̂DH = (kDH · w̄D )wD .     (3.39)
    Here we project kAH (kDH ) against w̄A (w̄D ), instead of wD (wA ), such that after
    factoring the collinear subgraphs out of H, each collinear subgraph is independent of
    one another. Such replacement keeps the leading momentum components, so does not
    affect the leading-power accuracy.
(e) For a collinear gluon attaching A to H, its polarization is dominantly longitudinal. We
    include its propagator in CA and approximate its coupling with H by
                                                                  µ     ν
                                                                k̂AH  w̄A
           Hµ (kH ; kAH ) g µν CA,ν (kAH ) 7→ Hµ (kH ; k̂AH )                CA,ν (kAH ), (3.40)
                                                              kAH · w̄A + iϵ
    where kH is some hard momentum in H, and we take kAH to flow from H into A. This
    introduces a pole at kAH · w̄A = 0. The iϵ is introduced to make it compatible with
    the deformation in Eq. (3.35), as explained in Sec. 2.5.3. The same momentum kAH
    can reach the soft region, where it flows from S into A, through H, into B, and back
    to S. The deformation in Eq. (3.35) is then adapted to
                                         S
                                      ∆kAH   = +i O(λ)(wA + wD ),                         (3.41)
                                               145


      which deforms the denominator kAH · w̄A by
                                                S
                                            ∆kAH   · w̄A = +i O(λ),                          (3.42)
      into the upper half plane. So we need the +iϵ prescription in Eq. (3.40). This will lead
      to a future-pointing Wilson line along w̄A .
  (f) For a collinear gluon attaching D to H, we include its propagator in CD and approxi-
      mate its coupling with H by
                                                                    µ     ν
                                                                  k̂DH  w̄D
            Hµ (kH ; kDH ) g µν CD,ν (kDH ) 7→ Hµ (kH ; k̂DH )                  CD,ν (kDH ), (3.43)
                                                               kDH · w̄D − iϵ
      where we take kDH to flow from H into D. The iϵ is introduced in a similar way to
      Eq. (3.40). This will lead to a past-pointing Wilson line along w̄D .
 (g) For the quark and antiquark lines entering H from A, we insert the spinor projectors
                                       γ · wA γ · w̄A           γ · w̄A γ · wA
                                PA =                   ,  P̄A =                ,             (3.44)
                                              2                        2
      respectively. For the quark and antiquark lines leaving H to D, we insert the spinor
      projectors
                                       γ · w̄D γ · wD           γ · wD γ · w̄D
                               P̄D =                   ,  PD =                  ,            (3.45)
                                              2                        2
      respectively.
    A region R for a graph Γ is specified by the set of collinear and soft gluons (and the two
pairs of collinear quark lines by default); any other lines belong to the hard subgraph H.
                                                   146


We denote the graph contribution in such a region as
                             Hn1 ,n2 ⊗ CA,n1 ;m1 ⊗ CB,n2 ;m2 ⊗ Sm1 ,m2 ,                  (3.46)
where n1 and n2 are the number of collinear gluons connecting H to A and B, respectively,
and m1 and m2 are the number of soft gluons connecting S to A and B, respectively. The
symbol ⊗ refers collectively to the momentum convolutions and color and spinor contractions.
For the same graph Γ, there may be smaller regions R′ than R, which have fewer lines in H
and/or CA,D , and/or more lines in S. The contribution from the region R is then extracted
by applying T̂ after the subtraction of smaller region contributions,
                                                                !
                                                     X
                                   CR Γ = T̂   Γ−         CR ′ Γ ,                        (3.47)
                                                    R′ <R
where the contribution CR′ Γ is obtained by iterative use of Eq. (3.47). The subtraction terms
in Eq. (3.47) also have T̂ acted in front, just as in Eq. (2.110). They are obtained by treating
the lines in the same way as in R, with certain lines belonging to H, certain lines to A, etc.,
even though the approximators for R′ have been applied that treat those lines in some other
(smaller) regions. Therefore, the subtraction terms in Eq. (3.47) have the same structure as
Eq. (3.46), with different factors H, CA , and CD , but the same S. Those subtraction terms
can be uniquely determined once R is specified.
    The approximator T̂ modifies certain momenta and inserts some Lorentz and spinor
projectors in a way that the use of Ward identity for soft and collinear gluons is exact.
This applies to both T̂ Γ and the subtracted terms CR′ Γ (which is obtained after applying
their approximators) in Eq. (3.47). Acting T̂ on the latter further modifies the momenta
                                                 147


and introduces projectors that makes a further use of Ward identity exact. Therefore, after
applying T̂ in Eq. (3.47), we can use Ward identity for both Γ and the subtracted terms in
an exact way. Then we sum over all possible diagrams with the same region specification as
Eq. (3.46), with H having a fixed order Nh of αs . Among them, the sum of those with the
same subgraphs A, D and S but different H and collinear gluon attachments allows the use
of Ward identity for the A and D collinear gluons. This factorizes the collinear gluons out
of H, and simplifies Eq. (3.46) to
                          H (Nh ) ⊗ T̂w [CA,n1 ;m1 ⊗ Sm1 ,m2 ⊗ CD,n2 ;m2 ] ,           (3.48)
where T̂w is the same as T̂ but just refers to the fact that the collinear gluons are now
collected by two pairs gauge links. This is shown graphically in Fig. 3.5(a). Eq. (3.48)
applies to both terms in Eq. (3.47), so the factorized result also extends to the subtracted
factors,
                           (N )
                        Hsubh ⊗ T̂w [CA,n1 ;m1 ⊗ Sm1 ,m2 ⊗ CD,n2 ;m2 ]sub ,            (3.49)
Now H (Nh ) is only specified by two pairs of external amputated collinear quarks, at a given
order Nh of αs . The sum over H is then independent of the other factors, and each given
H determines uniquely the subtracted terms within. So the sum over H yields the partially
factorized result,
                         Hsub ⊗ T̂w [CA,n1 ;m1 ⊗ Sm1 ,m2 ⊗ CD,n2 ;m2 ]sub ,            (3.50)
which applies for any values of n1 and n2 . Here
                         XXh          (Nh )
                                                                               i
                 Hsub =            Hi       − (subtraction for smaller regions) ,      (3.51)
                          Nh   i
                                                 148


        (Nh )
with Hi       denoting the i-th graph at Nh -th order for the two pairs of external collinear
quarks. Since smaller regions in H refer to some of the lines becoming collinear or soft, their
contributions are obtained by iterative use of the approximator T̂w , which can be refactorized
as soft and collinear factors.
                  e            e                                       e                     e
                        H                                                 i′       H      n′
                                                                      i        j′    m′      n
                                                                        j                  m
                                                                            i′          n′
         A                             D                   A                   j ′   m′            D
     π                                       π         π                                               π
                                                               i                                 n
                         S
                                                                   j                           m
                                                                                   S
                        (a)                                                       (b)
Fig. 3.5: Factorization of collinear gluons out of the hard subgraph (a) and also soft gluons
out of the collinear subgraphs (b) for the process in Eq. (3.28). The red and blue thin dashed
lines refer to the color flows. The external thick quark lines of the hard subgraph H are
amputated. The numbers of gluons in all sectors can be arbitrary.
    Then we sum over all subdiagrams for A and D at a given order Na and Nd of αs ,
respectively, and for fixed n1 and n2 . This allows the use of Ward identities for the soft
gluons to factorize them out of the collinear subgraphs. Again, this applies to both A and
D themselves and the subtracted terms therein, so we have
                                   h                                  i
                                      (N )                  (N )
                            Hsub ⊗ CA,na1 ;sub ⊗ Sm1 ,m2 ⊗ CD,nd2 ;sub ,                            (3.52)
which is graphically shown in Fig. 3.5(b), where the soft gluons are collected by two pairs of
gauge links, one along wA and the other along wD . Then we can sum over n1 , n2 , Na , and
                                               149


Nd independently, which converts the two collinear factors into matrix element definitions,
                                Z
                                                              
          unsub
      CA,αβ,ij      (k)    =        d4 z eik·z PA,αα′ ⟨0|T      [W (∞, z; w̄A )ψ1,α′ (z)]i
                                                                                        o
                                                           × ψ̄2,β ′ (0)W † (∞, 0; w̄A ) j |π + ⟩P̄A,β ′ β , (3.53a)
                                Z
                                                                 
         unsub
      CD,γδ,mn       (l)   =        d4 z e−il·z PD,γγ ′ ⟨π + |T    [W (−∞, 0; w̄D )ψ2,γ ′ (0)]m
                                                                                          
                                                           × ψ̄1,δ′ (z)W † (−∞, z; w̄D ) n |0⟩P̄D,δ′ δ ,     (3.53b)
where the subscript “unsub” means that these factors have not included the subtraction for
smaller regions (where some of the lines go soft), (α, β, γ, δ) are spinor indices, (i, j, m, n)
are color indices in the fundamental representation, and we keep the general notations ψ1
and ψ2 , which are u and d quark fields for π + .
   The sum over all possible soft subdigrams and over m1 and m2 can be done independently
and converts it into a matrix element definition,
                                                    n
                   Si′ i,j ′ j;m′ m,n′ n = ⟨0|T Wi′ i (0, −∞; wA )Wjj† ′ (0, −∞; wA )
                                                                                                   o
                                                           ×Wm† ′ m (∞, 0; wD )Wnn′ (∞, 0; wD ) |0⟩,          (3.54)
where the color indices (i, j, m, n) match the ones for the collinear factors in Eq. (3.53),
and (i′ , j ′ , m′ , n′ ) are to contract with those of the hard factor Hi′ j ′ ,m′ n′ . The soft subgraph
contains no subtraction for smaller regions, so Eq. (3.54) is the final result.
   Now for the same reason as Eq. (3.13), the collinear factors in Eq. (3.53) are color singlets,
such that
                         unsub               1        unsub           unsub           1       unsub
                      CA,αβ,ij     (k) =        δij CA,αβ   (k),    CD,γδ,mn (l) =      δmn CD,γδ   (l),      (3.55)
                                           Nc                                       Nc
                                                                 150


and their contraction with the soft factor renders the latter into an identity matrix,
                                δij Si′ i,j ′ j;m′ m,n′ n δmn = δi′ j ′ δm′ n′ ,           (3.56)
by unitarity of the Wilson lines. Hence all exchanges of soft gluons are canceled. This also
applies to the subtraction terms in A and D subgraphs. Those smaller regions where some of
the gluon lines turn soft are canceled order by order after summing over graphs. Therefore,
the unsubtracted collinear factors in Eq. (3.53) are the same as the subtracted ones.
    We note that the choices of lightlike vectors for the soft approximations in Eqs. (3.37) and
(3.38) could have introduced rapidity divergences if the soft factor does not reduce to unity.
In that case, one needs to choose some non-lightlike vectors to use in the soft approximations,
as in Sec. 2.8.2, which does not affect the result that S = 1. The cancellation of soft gluons
for the exclusive processes is a direct result of the scattered particles being color singlets,
which itself is the consequence of color confinement. This is in contrast to inclusive processes,
where soft cancellation is a result of unitarity, due to the sum over final states. If we also
compare QCD to non-confined gauge theories like QED, the latter have bare charges that
can emit and absorb soft (and/or collinear) gauge bosons (photons for QED), which can
introduce corresponding divergences to the amplitudes. A finite cross section is achieved
only after a proper sum over the final (and/or initial) states, by virtue of unitarity. Hence,
exclusive processes are only well defined for a confined gauge theory like QCD, but not for
non-confined ones, where only inclusive processes are sensibly defined.
    After a similar spinor and momentum decomposition as in Eqs. (3.14) and (3.16), we get
                                                      151


the factorized expression for the amplitude of Eq. (3.28),
                                 Z  1                                                   
                                                                                  ⃗qT qT
                 Mπ+ e→eπ+ =          dx dy Du/π+ (x, µ)D̄u/π+ (y, µ)H x, y; √ ,            ,       (3.57)
                                  0                                                  s µ
where we have used the multiplicative renormalization to convert each factor into the renor-
malized one. Here the two bare DAs are defined as
                 Z
    bare            dλ iλxp1 ·w̄A       
  Du/π + (x)  =        e          ⟨0|T ψ̄2 (0) γ · w̄A γ5 W (0, λw̄A ; w̄A )ψ1 (λw̄A ) |π + (p1 )⟩,
                    4π
                 Z
    bare            dλ −iλyq2 ·w̄D +            
  D̄u/π + (y) =        e            ⟨π (q2 )|T ψ̄1 (λw̄D ) γ · w̄D γ5 W (λw̄D , 0; w̄D )ψ2 (0) |0⟩, (3.58)
                    4π
where the time ordering can be deleted, as in Eq. (3.19). It can be easily shown that the
DA value does not depend on the momentum direction of the hadron, and the DA for the
final-state pion differs from the initial-state one by a complex conjugate,
                                                                ∗
                                        D̄u/π+ (x) = Du/π+ (x) ,                                    (3.59)
which applies to both the bare and renormalized DAs. The hard coefficient is defined as the
scattering of two pairs of collinear [q1 q̄2 ] states that form color singlets,
                                                                                           
                ⃗qT qT         1  γ5 γ · wA                       ⃗qT qT  γ5 γ · wD 
     H x, y; √ ,         = 2                         Hβα,δγ x, y; √ ,                             , (3.60)
                   s µ       Nc           2       αβ                   s µ            2      γδ
with subtraction for smaller region contributions.
    Eq. (3.57) can be readily extended to other mesons and baryons, just with a proper
change of the DA and hard coefficients [Lepage and Brodsky, 1980].
                                                     152


3.1.1.3      Choice of Glauber deformation for the double-meson process
In the discussion of Sec. 3.1.1.2, the contour deformation to get the soft gluon momentum
ks out of the Glauber region is symmetric with ks+ and ks− , as was employed for the Sudakov
form factor in Sec. 2.5.2. This is, nevertheless, not the unique choice [Collins and Metz,
2004], as it is sufficient to get rid of the Glauber region as long as |ks+ ks− | ≳ |ksT
                                                                                      2
                                                                                         |. By
examining the contour of ks+ , we note that while all the ks+ poles from the D-collinear lines
are of O(λ2 /Q) and lie on the same half plane, the poles from the A-collinear lines and soft
lines are of order Q in the Glauber region. Hence one may choose to only deform the contour
of ks+ , but now by a magnitude of O(Q),
                                              ks+ 7→ ks+ + i O(Q),                       (3.61)
when ks flows from D into S. This deforms a Glauber gluon momentum into the A-collinear
region with the scaling (Q, λ2 /Q, λ), and then one can perform usual approximations and
apply Ward identities for the rest of the soft gluon momenta. In this way, although the
Glauber region will not be treated accurately by the soft approximation, it will be by the
collinear approximation.
      The soft gluons factorized from D are attached to two Wilson lines along wD , and the
A-collinear longitudinally polarized gluons are collected by two Wilson lines along w̄A ; both
of the two sets of Wilson lines point to the future. Since we do not deform the contour of
ks− , it does not matter what iϵ prescription we assign to the approximator 1/ks− ; the +iϵ
choice leads to same result3 as the symmetric deformation in Sec. 3.1.1.2, with soft Wilson
lines along wA and collinear Wilson lines along w̄D both pointing from/to the past, but the
    3
      Here ks is the same as in Eq. (3.61), flowing from D to S and then to A.
                                                       153


−iϵ choice would have both point to the future.
    Similarly, one may also choose to only deform ks− as ks− 7→ ks− − iO(Q) when it flows out
of A-collinear lines into S, and then the iϵ prescription for k + is not important as long as
every ks− is associated with the same prescription as in 1/(ks− + iϵ).
    This gives some freedom in choosing the suitable iϵ prescriptions to achieve universal def-
initions for the soft factor and collinear factors when compared with other processes [Collins
and Metz, 2004]. Within collinear factorization framework, the soft factor cancels no matter
what prescription is used, and the Wilson lines associated with the collinear factors also
become straight lines on the light cone due to unitarity of the Wilson lines, so that univer-
sality is a trivial property in the collinear factorization for exclusive processes. However,
such freedom as in Eq. (3.61) is necessary for the factorization of diffractive processes, as we
will discuss in Sec. 3.2.2.2.
3.1.2      Large-angle photon-meson scattering
For photon-meson scattering, the beam particle B stands for a photon. The final-state
particles CD can take (1) (CD) = (l+ l− ), (2) (CD) = (γγ), (3) (CD) = (γ, meson), or (4)
(CD) = (meson, meson). The first three cases do not raise new issues in factorization, which
we will briefly address. The last case, however, requires a generalization of our factorization
argument for the electron-meson scattering in Sec. 3.1.1.
3.1.2.1    Single-meson process: (CD) = (l+ l− ) or (γγ).
For the dilepton or diphoton production, the color structure does not differ from the pion-
photon transition in Sec. 3.1.1.1. The leading region in QCD thus takes the same form as
Fig. 3.1(b) with a mere change of external lines.
                                                154


                       p2                 q1             p2            q1
                                      q
                       p1 A               q2             p1 A          q2
                                 (a)                             (b)
       Fig. 3.6: LO diagrams for photoproduction of dileptons (a) and diphotons (b).
    At LO in QED, the dilepton production happens via the decay of a timelike virtual
photon of invariant mass Q = mll , as shown in Fig. 3.6(a), so this process is probing the
timelike meson-photon transition form factor. This can only happen for a charge-neutral
meson with an even charge-conjugation parity (C-even), such as π 0 . In this case, the hard
scale for factorization is provided by the high virtuality Q, which is ensured by the condition
of a large qT . However, the factorization holds as long as Q ≫ ΛQCD , even when qT is small.
Therefore, we have a factorization formula as Eq. (3.26), with the same DA definition and a
proper change of the hard coefficient. The power suppressed correction is now O(ΛQCD /mll ).
    In contrast, for the diphoton production, all the three photons directly couple to the
quark line, as shown by the LO diagram in Fig. 3.6(b). In this case, the hard scale is
necessarily provided by the large qT . The same factorization cannot extend to the forward
kinematic region. Now that the meson is coupled to three photons, such processes can only
happen to charge-neutral C-odd mesons, such as the ρ vector meson. Since we neglect the
quark masses in the hard part, the collinear q q̄ state from the meson must have zero helicity,
so the vector meson must be longitudinally polarized. The factorization formula therefore is
extended from Eq. (3.26) to
                            XZ     1                             
                                                           ⃗qT qT
              MρL γ→γγ =             dx Dq/ρL (x, µ) Hq x; √ ,       + O(ΛQCD /qT ).     (3.62)
                             q   0                            s µ
                                                  155


Here the bare DA is defined as
                      Z  ∞                    n                                  o
          bare              dλ iλx p1 ·n              γ·n
        Dq/ρ   (x) =            e        ⟨0|T ψ̄q (0)      W (0, λn; n)ψq (λn) |ρL (p1 )⟩,     (3.63)
             L
                        −∞  2π                         2
and the bare hard coefficient is the scattering [q q̄](p̂1 ) + γ(p2 ) → γ(q1 ) + γ(q2 ), defined as
                                                    γ · n̄ 
                             ⃗qT                1
                 Hqbare   x; √       = (p1 · n) δij             Hβα;ji (xp̂1 , (1 − x)p̂1 ),   (3.64)
                                s              Nc        2 αβ
up to subtractions of smaller regions.
3.1.2.2    Double-meson process: (CD) = (γ, meson).
The photon-meson pair production has the same color structure as the elastic electron-
meson scattering in Sec. 3.1.1.2, so the leading region differs from Fig. 3.3 only by changing
the external electron lines by photon lines. By the same argument (including the soft-end
suppression assumption), we can obtain a factorization formula like Eq. (3.57). However,
we need to note that now the LO diagrams have both external photons attach to the quark
lines, resulting in three propagators in the hard part, as shown in Fig. 3.7. The calculation
of the hard coefficient thus becomes more involved, but it also reveals a richer structure.
                       p2                      q1          p2                     q1
                       p1 A                  D q2         p1 A                  D q2
                                    (a)                              (b)
            Fig. 3.7: LO diagrams for the photoproduction of photon-meson pairs.
    While the two photon lines can attach to the same quark line as in Fig. 3.7(a), they may
                                                    156


also attach to different quark lines as in Fig. 3.7(b). In the first case, all the propagators in
the hard part are either connected to two external on-shell lines or amputated parton lines
on one of its ends, whereas in the second case the gluon propagator is not. As a result,
the hard coefficient has an (unpinched) pole in the middle of the (x, y) integration in the
DA convolution. This introduces an imaginary part to the amplitude even at LO. More
importantly, as will be elaborated in Ch. 4, the (x, y) and qT dependencies within the hard
coefficient cannot be separated. Their entanglement will lead to a nontrivial sensitivity to
the x dependence of the DA.
3.1.2.3    Triple-meson process: (CD) = (meson, meson). Symmetric deformation.
      p2                  q1         p2                   q1           p2                q1
              S                                                                         C
                        C                               C
                                                                                 H
                H                              H           S
    p1 A                                                             p1 A               D q2
                        D q2        p1 A               D q2
                                                                                 S
               (a)                             (b)                              (c)
Fig. 3.8: (a) Leading-region graph for the photoproduction of light meson pairs. There can
be any numbers of soft gluons connecting S to each collinear subgraph. The regions with
S connecting to one or more collinear subgraphs via quark lines or transversely polarized
gluon lines are omitted. Depending on the quantum numbers, the collinear quark lines may
be replaced by transversely polarized gluon lines. The dots represent arbitrary numbers of
longitudinally polarized collinear gluons. (b) The result of factorizing the collinear subgraph
A out of the hard subgraph H, with the soft gluons coupled to A canceled. (c) The result
of factorizing the collinear subgraph C out of the hard subgraph H, with the soft gluons
coupled to C canceled.
    The process
                             MA (p1 ) + γ(p2 ) → MC (q1 ) + MD (q2 )                       (3.65)
has three hadrons in three different directions, among which arbitrary soft gluon exchanges
                                               157


can happen. The leading region is shown in Fig. 3.8(a). The regions where the soft subgraph
is connected to any two collinear subgraphs via quark lines or transversely polarized gluon
lines are omitted, which are power suppressed by the soft-end suppression.
    In such a three-meson process, each of the final-state mesons exchanges soft gluons with
both initial-state and final-state mesons. This makes it difficult to uniformly deform the
contour to move the soft gluons away from the Glauber region. To put forward the for-
mal discussion, we work in the c.m. frame and define some auxiliary vectors by extending
Eq. (3.29),
            1           µ     1                   1                           1
   wAµ
       = √ (1, ẑ),   w̄A = √ (1, −ẑ),   wCµ = √ (1, n̂) = w̄Dµ
                                                                 ,   w̄Cµ = √ (1, −n̂) = wD   µ
                                                                                                ,
             2                 2                    2                           2
                                                                                            (3.66)
where ẑ and n̂ are normalized three-vectors along the directions of the initial-state meson MA
and final-state meson MC . Basically, wA,C,D are the light-cone vectors along the directions
of meson A, C and D, respectively, and the corresponding vectors with bars refer to the
conjugate light-cone vectors along the opposite directions.
    The essential point is that any soft gluon momentum ks can be routed to only flow
                                                                           (ij)
through two collinear subgraphs. For this, we introduce the notation ks         to be a soft gluon
momentum that flows from the collinear subgraph i into S, then proceeds into the collinear
subgraph j, passes through the hard subgraph H, and finally returns to i. Apparently, we
       (ij)      (ji)
have ks     = −ks , with i, j = A, C, D and i ̸= j.
                                                      (ij)
    When considering the soft gluon momentum ks , we expand it in the wi -wj frame as
                                              158


defined in Eq. (3.30),4
                                                     (ij)                    (ij)
                                                   ks · wj                ks · wi              (ij)
                                   ks(ij)  = wi                   + wj                    + ksT ,                        (3.67)
                                                    wi · wj                 wi · wj
where all the three terms on the right-hand side are of the same size, O(λS ). When it
                                                                                                                   (ij)
flows in the collinear subgraph i, whose momenta are dominantly along wi , the ks                                       can be
approximated by only retaining the wj component,
                                                                             (ij)
                                                                           ks · wi
                                              ks(ij)  ≃   k̂s(ij) = wj                    .                              (3.68)
                                                                            wi · wj
Moreover, the coupling of this soft gluon to the collinear subgraph J i can be approximated
as
                                                                                        (ij)µ
                                                                                      k̂s     wiν
                      Jµi (ki , ks(ij) ) g µν Sν (ks(ij) )    ≃   Jµi (ki , k̂s(ij) )   (ij)
                                                                                                    Sν (ks(ij) ) ,       (3.69)
                                                                                      ks     · wi
because it is the specific component of g µν given by wjµ wiν /wi · wj that provides the dominant
contribution. In Eq. (3.69), ki stands for some collinear momentum in the subgraph i. This
approximation will allow the use of Ward identity to factorize the soft gluons out of the
collinear subgraphs.5
     While this is a good approximation for the central soft region, it is not for the Glauber
region in which
                                                                                (ij)
                                     |ks(ij) · wi | |ks(ij) · wj | ≪ |ksT |2 wi · wj .                                   (3.70)
   4
     While we may define the plus and minus components in each wi -wj frame like Eqs. (3.30)–(3.32), having
multiple such frames makes the notation cumbersome, so we stick to the covariant notations.
   5
     We should note that the argument given here is equivalent to [Collins and Sterman, 1981; Nayak et al.,
2005] that boost into the rest frame of two collinear subgraphs. The underlying reason is that any two
distinct collinear subgraphs are well separated in rapidity; in the language here, it is wi · wj ≃ O(1).
                                                                  159


                                                                                                     (ij)
Now because all the collinear lines in the subgraph i or j only give poles for ks                         · wi or
 (ij)                                                                              (ij)
ks · wj on the same half complex plane, the integration contour of ks                   is not pinched in the
Glauber region, and a proper deformation can get it out of the Glauber region. However, if
we take the symmetric deformation as in Eq. (2.80), we need to deform in opposite ways the
                                                                                              (AC)
soft momenta coupling A to C and those coupling D to C. Specifically, for ks                       , it receives
                                                                    (AC)
poles on the upper half plane for both the component ks                    · wA from A-collinear lines, and
 (AC)
ks     · wC from C-collinear lines. So we need to deform its contour as
                               ks(AC) 7→ ks(AC) − iO(λ)wC − iO(λ)wA ,                                      (3.71)
                                                                                                        (DC)
following the expansion in Eq. (3.67). On the other hand, the soft momentum ks                                has
             (DC)
poles for ks      · wD on the lower half plane, so the deformation is
                              ks(DC) 7→ ks(DC) + iO(λ)wC − iO(λ)wD .                                       (3.72)
While such sign difference is for different soft momentum attachments so does not pose any
difficulty like a Glauber pinch, it does imply that a C-collinear longitudinally polarized gluon
kC has soft subtraction terms with different contour deformation.
    If we take kC to flow from H to C, then we approximate kC by
                                         kC → k̂C = (kC · w̄C )wC                                          (3.73)
in H, and its coupling to H by
                                      µν                              k̂Cµ w̄Cν C
                      Hµ (kH , kC ) g    JνC (kC ) ≃ Hµ (kH , k̂C )             J (kC ) .                  (3.74)
                                                                     kC · w̄C ν
                                                    160


This introduces a pole at kC · w̄C = 0, which can potentially obstruct the deformations in
Eqs. (3.71) and (3.72) in the soft subtraction terms, as explained in Sec. 2.5.3. Even though
we are approximating the collinear region, which does not suffer from the Glauber region
problem, Eq. (3.74) is applied to the whole diagram with deformed contours. Furthermore,
the same gluon kC considered in Eq. (3.74) can also go into the soft region, attaching to
                                                                             (A)     (D)
A- or D-collinear subgraph, for which we will change the notation kC to kC or kC ,whose
contribution has already been included in the soft approximations defined in Eq. (3.69). A
subtraction is needed from Eq. (3.74) to avoid such double counting, which is obtained by first
applying the soft approximation [Eq. (3.69)] and then applying the collinear approximation
[Eq. (3.74)]. Since the subtraction mixes the collinear and soft approximations for the same
gluons, and the latter require deformation of contours, we do need the iϵ prescription in
Eq. (3.74) not to obstruct the contour deformations in Eqs. (3.71) and (3.72). The latter
would need to deform the denominator in Eq. (3.74) by
                      (A)
                 ∆(kC · w̄C ) = −iO(λ)wC · w̄C − iO(λ)wA · w̄C = −iO(λ),                 (3.75)
when kC reaches the soft region and attaches to A, or by
                      (D)
                 ∆(kC · w̄C ) = +iO(λ)wC · w̄C − iO(λ)wD · w̄C = +iO(λ),                 (3.76)
when kC reaches the soft region and attaches to D. Therefore, there is not a uniform iϵ
choice for the collinear gluon approximation [Eq. (3.74)] to respect the soft deformations in
the corresponding subtraction terms.
   To avoid this difficulty, we note that the collinear subgraph A only couples to final-state
                                              161


mesons by the soft gluons. So we can first factorize A out of the the hard part. To do that,
we approximate all A-collinear momenta kA by
                                       kA → k̂A = (kA · w̄A )wA                         (3.77)
when they flow in H. We insert proper spinor or Lorentz projectors for the collinear quark or
transversely polarized gluon lines. The coupling of each A-collinear longitudinally polarized
gluon to H is approximated by
                                                                   k̂Aµ w̄A
                                                                          ν
                    Hµ (kH , kA ) g µν
                                       JνA (kA ) ≃ Hµ (kH , k̂A )           JνA (kA ) . (3.78)
                                                                  kA · w̄A
By taking kA to flow from H into A, the soft subtraction terms contain the regions kA =
 (C)     (CA)             (D)      (DA)
kA = ks       and kA = kA = ks          . These require the deformations
                     (C)                                (D)
                 ∆kA = +iO(λ)(wA + wC ),             ∆kA = +iO(λ)(wA + wD ),            (3.79)
which change the denominator kA · w̄A in the collinear approximation [Eq. (3.78)] by
                           (C)                           (D)
                      ∆(kA · w̄A ) = +iO(λ),         ∆(kA · w̄A ) = +iO(λ),             (3.80)
respectively. Therefore, it is possible to introduce a +iϵ prescription to Eq. (3.78) to be
compatible with such deformations. This leads to future-pointing Wilson lines along w̄A to
collect the A-collinear longitudinal gluons.
    In contrast, it is easy to choose the iϵ prescriptions for all soft gluons to make the
approximation in Eq. (3.69) compatible with the deformations. We choose −iϵ when i = A
                                                  162


and +iϵ when i = C or D. As a result, the soft gluons attached to A will be collected by a
pair of Wilson lines along wA that come from the past infinity, and those attached to C (D)
by a pair of Wilson lines along wC (wD ) that go to the future infinity.
    Then following the same procedure as Sec. 3.1.1.2, we can factorize the A subgraph out
of H, and soft gluons out of A. This Ward-identity argument applies equally to the approx-
imated region itself and to the subtracted smaller regions, to which the same approximator
applied. Then because the meson MA is a color singlet state, the same soft cancellation
happens as Eqs. (3.55) and (3.56). That is, the two infinitely long Wilson lines associated
with the A subgraph are joined to a connected one with a finite length, and the soft gluons
coupling to A are canceled. Although the argument in Eqs. (3.55) and (3.56) is for the whole
Wilson lines to all orders, it applies to each finite perturbative order as well.
    The result is shown in Fig. 3.8(b). The remaining gluons only couple to C and D, which
are both in the final state. Then the symmetric deformation to get the gluons out of Glauber
region works in the same way as the Sudakov form factor in Eq. (2.80), namely,
                               ks(CD) 7→ ks(CD) + iO(λ)(wD − wC ).                     (3.81)
The following factorizations of collinear subgraphs and soft gluons work a similar way to
the Sudakov form factor, so will not be repeated here. The resultant soft Wilson lines are
canceled in the same way as those coupling to A, as a result of MC and MD being color
neutral mesons.
    Therefore, we end up with the factorization result of the amplitude,
                               XZ      1                                            
                                                                              ⃗qT qT
            MMA γ→MC MD =                dx dy dz Di/A (x, µ) Hiγ→jk x, y, z; √ ,
                               i,j,k 0                                           s µ
                                                 163


                                               × D̄j/C (y, µ) D̄k/D (z, µ),               (3.82)
where the sum is over all possible parton flavors, and we have used the multiplicative renor-
malization of DAs to write each factor as the renormalized one. The hard coefficient H is
the scattering of the photon off a collinear pair of on-shell massless partons i, into two pairs
of partons j and k. Note that the soft cancellation applies also to the subtraction terms in
A, C, D, and H, so the unsubtracted DA factors are the same as the subtracted ones, and
the H only contains collinear subtractions.
3.1.2.4    Triple-meson process: (CD) = (meson, meson). Asymmetric deforma-
           tion.
The factorization procedure for the process in Eq. (3.65) is based on the symmetric defor-
mation. Its feasibility relies on there being only one collinear subgraph A in the initial state,
which only exchanges soft gluons with final-state particles. The strategy does not apply to
the process in Eq. (3.100) that involves two widely separated mesons in both initial and
final states. On the other hand, we cannot extend the proof to the single-diffractive case
where the meson MA is replaced by a diffracted hadron h, which enters the hard interac-
tion with γ, but also produces another hadron h′ in the nearly forward direction. As we
will see in Sec. 3.2.2.2, there exists a kinematic region where the momentum component
ks · wA is pinched in the Glauber region when the soft gluon ks is exchanged between the
diffracted hadron and final-state mesons. It then forbids the symmetric deformation such
as Eq. (3.71). Therefore, a different approach is needed for extending the proof. Now, we
explore the possibility of asymmetric deformation.
    Given the need to allow generalization of the factorization proof to the single-diffractive
                                               164


                                                                    (Aj)                                  (Aj)
process, we choose not to deform the contour of ks                         · wA when a soft momentum ks
flows in the A-collinear subgraph, and will instead try to factorize soft interactions from the
collinear subgraphs C and D.
    The needed deformations can be motivated by examining a single soft gluon exchange
between different collinear subgraphs. We first consider the collinear subgraph C that has
                      (CA)          (CD)
one soft gluon ks            and ks         exchange with the A-collinear subgraph and D-collinear
                                        (CA)
subgraph, respectively. Since ks              flows in C in the same direction as the C-collinear lines,
                (CA)                                                                                  (CA)
the poles of ks        · wC are all on the lower half plane, so we deform the contour of ks                by
                                         ks(CA) → ks(CA) + i wA O(Q) ,                                 (3.83)
when it is in the Glauber region, similar to Eq. (3.61). Similarly, we deform the contour of
  (CD)
ks     by
                                         ks(CD) → ks(CD) + i wD O(Q) .                                 (3.84)
Such deformations move the soft gluon momenta from the Glauber region all the way into
the A or D collinear region, which will be properly treated by collinear approximations.
    In order for the approximator in Eq. (3.69) not to obstruct such deformations, we modify
it to
                                                                        (CA)µ
                                                                      k̂s       wCν
        JµC (kC , ks(CA) ) g µν Sν (ks(CA) ) ≃ JµC (kC , k̂s(CA) )  (CA)
                                                                                      Sν (ks(CA) ) ,  (3.85a)
                                                                   ks       · wC + iϵ
                                                                         (CD)µ ν
                                                                       k̂s      wC
        JµC (kC , ks(CD) ) g µν Sν (ks(CD) ) ≃  JµC (kC , k̂s(CD) ) (CD)               Sν (ks(CD) ) , (3.85b)
                                                                   ks        · wC + iϵ
where only the relevant arguments are written explicitly. Both approximations in Eq. (3.85)
                                                         165


have the structure
                                                                     k̂sµ wCν
                   JµC (kC , ks ) g µν Sν (ks ) ≃ JµC (kC , k̂s )                Sν (ks ) ,   (3.86)
                                                                  ks · wC + iϵ
where the structure k̂sµ JµC (kC , k̂s ) allows the use of Ward identity in a uniform way, no matter
which other collinear subgraph ks flows through. The +iϵ choice will lead to future-pointing
soft Wilson lines.
    Now we consider the collinear longitudinally polarized gluons attaching C to H. Similarly,
the approximation can be obtained by examining a single gluon, whose momentum kC flows
from H into C and can be expanded in the wC -w̄C frame,
                            kC = wC (kC · w̄C ) + w̄C (kC · wC ) + kC,T ,                     (3.87)
where among the three terms on the right, the wC component dominates and scales as O(Q).
Then we approximate kC in H by
                                        kC → k̂C = wC (kC · w̄C ) ,                           (3.88)
and the coupling of the collinear gluon to H by
                                                                       k̂Cµ w̄Cν
                 Hµ (kH , kC ) g µν JνC (kC ) ≃ Hµ (kH , k̂C )                    J C (kC ) , (3.89)
                                                                  kC · w̄C − iϵ ν
where only the relevant argument dependence is written explicitly and kH stands for some
hard momentum in H.
    When applying Eq. (3.89) to the whole graph with deformed contours, the same gluon
                                                    166


                                                            (A)    (D)
kC can go into the soft region, which we notate as kC or kC             when it attaches to the A- or
D-collinear subgraph. Then the deformations in Eqs. (3.83) and (3.84) are adapted to6
                               (A)                        (D)
                           ∆kC = −i wA O(Q),          ∆kC = −i wD O(Q),                           (3.90)
implying that the denominator in Eq. (3.89) needs to be compatible with the deformations
                                (A)
                            ∆kC · w̄C = −i (wA · w̄C ) O(Q) = −i O(Q) ,
                                (D)
                            ∆kC · w̄C = −i (wD · w̄C ) O(Q) = 0 .                                 (3.91)
This explains the −iϵ choice in Eq. (3.89). After applying Ward identity, it leads to collinear
Wilson lines pointing to the past.
     Eqs. (3.85) and (3.89) constitute the needed approximations related to the collinear
subgraph C. Even though we only considered a single soft or collinear gluon connection,
they generalize to multiple gluon connections in an obvious way: one just applies Eq. (3.85)
to every soft gluon connecting C to A or D, and (3.89) to every collinear longitudinally
polarized gluon connecting H to C. Then by applying suitable on-shell projections to the
C-collinear quark lines or transversely polarized gluon lines, and summing over all possible
attachments of the collinear gluons, we can factorize the collinear longitudinally polarized
gluons out of the hard part H onto two Wilson lines along w̄C pointing to the past, and the
soft gluons out of C onto two Wilson lines along wC pointing to the future.
     Similar to the discussion in Sec. 3.1.1.2, although choosing the lightlike wC in the soft
approximation Eq. (3.85) causes rapidity divergences, it does not affect our conclusion that
   6                                                                                      (CA)      (CD)
     Note that now the soft momentum direction is reversed compared to the convention of ks    and ks    ,
which are used in Eqs. (3.83) and (3.84).
                                                   167


the soft gluons eventually cancel as a result of exclusiveness. Remedying this superficial flaw
with a non-lightlike vector nC is straightforward and shall lead to the same result.
    The subsequent argument follows the same line as Secs. 3.1.1.2 and 3.1.2.3. By the color
neutrality of MC , the soft gluons factorized out of C are canceled order by order, which
is proved by identifying the Wilson line structure that they form. This reduces the graph
in Fig. 3.8(a) to the partially factorized one in Fig. 3.8(c), in which only the two collinear
subgraphs A and D are coupled to the hard subgraph H, and the soft subgraph S is only
coupled to A and D subgraphs.
    With the C-collinear subgraph factorized out, the leading-region graph in Fig. 3.8(c)
is similar to that in Fig. 3.3(a), whose factorization is discussed in Sec. 3.1.1.2, with the
asymmetric deformation in Sec. 3.1.1.3. Again, in the treatment of the soft region, one only
                                               (DA)
needs to deform the contour of soft gluon ks        by
                                  ks(DA) → ks(DA) + i wA O(Q),                             (3.92)
                            (DA)
regardless of the poles of ks    · wA provided by the A-collinear propagators. By the same
argument as for the C subgraph, the soft gluons coupling to D are canceled, and the D
subgraph is factorized out of H into the DA for MD . Then the soft gluons are only coupled
to the A subgraph and no longer pinched. They can then be deformed into the A-collinear
region and grouped into a part of A-collinear subgraph, which can be further factorized from
H into the DA of MA .
    The soft cancellation applies equally to the subtracted terms of smaller regions, so this
procedure leads to the same factorization in Eq. (3.82). Even though the Wilson lines
associated with the collinear factors point to different directions from the ones in Sec. 3.1.2.3
                                                168


with symmetric deformations, due to the cancellation of soft gluons, the Wilson line pair for
each collinear factor join together into a finite-length Wilson line, with the segments pointing
to infinity canceled. The resultant DA definitions are therefore universal and do not depend
on the specific deformation ways. This is a property of collinear factorization.
3.1.3      Large-angle meson-meson scattering
For meson-meson scattering, the beam particle B is also a meson. The final-state particles
C and D can take (1) (CD) = (l+ l− ), (2) (CD) = (γγ), (3) (CD) = (γ, meson), or (4)
(CD) = (meson, meson). The first three cases do not raise new issues in factorization, which
we will briefly remark on, and the last case only requires a simple generalization of our
factorization argument for the photon-meson scattering in Sec. 3.1.2.4.
                    p2                    q1           p2                   q1
                       B                                  B
                                       C                                 C
                    S         H                        S           H
                                       D                                 D
                   p1 A                    q2         p1 A                   q2
                              (a)                                 (b)
Fig. 3.9: Leading regions of the processes in Eqs. (3.93) and (3.94), with the final-state lines
being electrons or photons. The quark lines can be replaced by transversely polarized gluons,
and the dots refer to arbitrary numbers of gluon lines with longitudinally polarized gluons.
3.1.3.1    Double-meson process: (CD) = (l+ l− ) or (γγ).
The processes
                              MA (p1 ) + MB (p2 ) → l+ (q1 ) + l− (q2 )                   (3.93)
                                                169


and
                               MA (p1 ) + MB (p2 ) → γ(q1 ) + γ(q2 )                    (3.94)
have similar color structures as the meson electroproduction (discussed in Sec. 3.1.1.2) and
photoproduction (discussed in Sec. 3.1.2.2), respectively, except only that both mesons are
now in the initial state. They thus have similar leading regions, as shown in Fig. 3.9. As
usual, the region (b) is assumed to be power suppressed by the soft-end suppression.
    At LO in QED, the dilepton production happens via the production and decay of a time-
like virtual photon of invariant mass Q = mll . An example is π + π − (→ γ ∗ ) → l+ l− . This
property means that it is the invariant mass Q that provides the hard scale for factorization,
regardless of the transverse momentum qT of the leptons, similar to the dilepton photopro-
duction in Sec. 3.1.2.1. On the other hand, in the diphoton production, both photons directly
attach to the quark parton lines, and the large qT is necessary for factorization. Examples
are π + π − → γγ or π 0 π 0 → γγ.
    Factorization for the region (a) works in a similar way to the meson electroproduction
and photoproduction discussed before. One can use either symmetric or asymmetric defor-
mation to avoid the Glauber region. They give different soft and collinear Wilson lines in
intermediate steps, but result in the same soft cancellation and the same collinear factor
definitions, as a property of collinear factorization. The asymmetric deformation is partic-
ularly important for later generalization to single-diffractive scattering. For a soft gluon
momentum ks flowing from B to A, we expand it as
                                   ks · wA         ks · wB
                              ks =         wB +            wA + ksT ,                   (3.95)
                                   wA · wB        wA · wB
                                               170


with wA defined as in Eq. (3.66) and wB = w̄A in the c.m. frame, and deform its contour by
                                       ks 7→ ks − iO(Q)wA .                                  (3.96)
This then determines all necessary iϵ prescriptions for the soft and collinear approximations.
   In the end, we get a factorization formula for the scattering amplitude,
                       XZ     1                                                         
                                                                                  ⃗qT qT
  MMA MB →l+ l− /γγ =           dx dy Di/A (x, µ) Dj/B (y, µ) Hij→l+ l− /γγ x, y; √ ,      , (3.97)
                        i,j 0                                                        s µ
where the DAs are for initial-state meson annihilations, and the hard coefficient H is for
the scattering of two pairs of collinear partons i and j into l+ l− or γγ. We have used their
multiplicative renormalization to convert each factor to renormalized ones, which introduces
the factorization scale µ.
                   p2                      q1          p2                      q1
                                S                                  S
                      B                 C                 B                  C
                                                                  H
                                H
                                                                   H
                  p1 A                  D q2          p1 A                   D q2
                               (a)                                (b)
Fig. 3.10: Leading regions of the process in Eq. (3.100). (a) has one single hard scattering,
and (b) has two hard scatterings.
3.1.3.2    Triple-meson process: (CD) = (γ, meson).
The triple-meson process
                              MA (p1 ) + MB (p2 ) → γ(q1 ) + MD (q2 )                        (3.98)
                                                171


exactly resembles the meson pair photoproduction in Eq. (3.65), treated in Secs. 3.1.2.3 and
3.1.2.4, just with the exchange of the photon and one meson. As there, both symmetric and
asymmetric deformations are applicable to deal with the Glauber region. The symmetric
one starts with the factorization of D-collinear subgraph, which reduces the leading region
to Fig. 3.9, discussed in Sec. 3.1.3.1. The asymmetric deformation keeps intact the soft
gluon momentum components flowing along the A-collinear subgraph, but deforms the other
components in B or D collinear subgraphs by an order of Q. The soft gluons cancel in both
cases as a result of the mesons being color neutral. Finally, the amplitude is factorized into
                          XZ       1
          MMA MB →γMD =              dx dy dz Di/A (x, µ)Dj/B (y, µ)
                           i,j,k 0
                                                                      
                                                                ⃗qT qT
                                            × Hij→γk x, y, z; √ ,        D̄k/D (z, µ).  (3.99)
                                                                   s µ
3.1.3.3    Quadruple-meson process: (CD) = (meson, meson).
The quadruple-meson process
                             MA (p1 ) + MB (p2 ) → MC (q1 ) + MD (q2 )                 (3.100)
has two collinear sectors in both initial and final states. The leading region is shown in
Fig. 3.10(a). The symmetric deformation out of Glauber region does not trivially apply,
as explained in Sec. 3.1.2.3. So we will simply employ the asymmetric deformation in
Sec. 3.1.2.4.
                                      (Cj)
    For a soft gluon momentum ks           flowing from the collinear subgraph C to some other
                                                 172


one j, we expand it as
                                            (Cj)                  (Cj)
                                          ks · wC               ks · wj
                                ks(Cj) =               wj +                  wC + ksT .              (3.101)
                                            wC · wj              wC · wj
When it flows through the C-collinear subgraph, we approximate it by
                                                                 (Cj)
                                                                ks · wC
                                        ks(Cj) 7→ k̂s(Cj) =                  wj .                    (3.102)
                                                                 wC · wj
This component receives poles from the C-collinear lines, which are all on the lower half
plane. So we choose to deform its contour by
                                           ks(Cj) 7→ ks(Cj) + iO(Q)wj .                              (3.103)
This determines the iϵ prescription in the approximation of its coupling to the C-collinear
subgraph,
                                                                           (Cj) µ
                                                                         k̂s      wCν
         JµC (kC , ks(Cj) ) g µν Sν (ks(Cj) ) 7→  JµC (kC , k̂s(Cj) )  (Cj)
                                                                                       Sν (ks(Cj) ). (3.104)
                                                                      ks     · wC + iϵ
This will allow the use of Ward identity to factorize soft gluons out of J C onto a pair of
Wilson lines along wC pointing into the future.
   For a C-collinear momentum kC , we expand it as
                                 kC = (kC · wC ) w̄C + (kC · w̄C ) wC + kC,T ,                       (3.105)
                                                         173


and only keep the large component kC · w̄C in the hard part H,
                                        kC 7→ k̂C = (kC · w̄C ) wC .                                 (3.106)
The coupling of a collinear longitudinally polarized gluon kC to H is approximated by
                                                                     k̂Cµ w̄Cν
                   Hµ (kH ; kC ) g µν JνC (kC ) 7→ Hµ (kH ; k̂C )                 J C (kC ),         (3.107)
                                                                  kC · w̄C − iϵ ν
for kC to flow from H into J C . The −iϵ is uniquely determined to be compatible with
the deformation in Eq. (3.103), given the need of soft subtraction. This will lead to a pair
of past-pointing Wilson lines along w̄C to collect all C-collinear gluons with longitudinal
polarization.
     After factorizing the C-collinear subgraph from H, and soft gluons out of C, one can
easily identify the soft Wilson lines as an identity by the color neutrality of the meson MC .
This soft cancellation applies to both the approximated region and the subtracted smaller
regions. Thus we have a factorized DA for MC , whose unsubtracted version is the same
as the soft-subtracted one, convoluted with the rest of the graph. It then has the same
color structure as the triple-meson process in Eq. (3.98) and factorizes in the same manner.
Eventually, we have the factorized expression for the amplitude,7
                        XZ       1
  MMA MB →MC MD =                  dx dy dz dw Di/A (x, µ)Dj/B (y, µ)
                       i,j,k,l 0
                                                                       
                                                               ⃗qT qT
                                      × Hij→kl x, y, z, w; √ ,             D̄k/C (z, µ)D̄l/D (w, µ), (3.108)
                                                                  s µ
   7
     Note the symbol D has been used to denote both the DA and the particle D in the 2 → 2 scattering,
which should not cause confusion.
                                                    174


where the sum is over all parton flavors and their spin structures, and the hard coefficient
H is the scattering of two pairs of collinear partons i and j into another two pairs k and
l. Again, the hard coefficient contains subtraction of collinear regions for each of the four
mesons, and we have used the multiplicative renormalization of DAs to convert all factors
into renormalized ones, which introduced the factorization scale µ.
    The leading regions that contain soft quark or physically polarized gluon lines to di-
rectly couple any of the collinear subgraphs to the soft subgraph are assumed to be power
suppressed, by the same soft-end suppression assumption that applies to all the processes
discussed before. However, for the quadruple-meson scattering process, there is one different
type of regions that count at a more leading power. This is given by the reduced diagram in
Fig. 3.10(b) that has two separated hard scattering subgraphs. Discussion of such multiple
hard scattering case is beyond the scope of this thesis, for which we refer to [Landshoff, 1974;
Botts and Sterman, 1989]. In this thesis, we assume that all processes are dominated by one
single hard scattering.
3.2      Single-diffractive hard exclusive processes
Now we generalize the 2 → 2 large-angle meson scattering processes by allowing one extra
hadron h′ in the final state along the direction of one of the initial-state hadrons h. The extra
hadron h′ is the diffraction of the initial-state hadron h. To allow perturbative QCD study,
we further require a hard scale in the scattering process, so we take the two particles C and
D in the final state to have hard transverse momenta with respect to the collision axis. Thus
the minimal configuration we study is a generic 2 → 3 process that we call single-diffractive
                                               175


hard exclusive process (SDHEP),
                             h(p) + B(p2 ) → h′ (p′ ) + C(q1 ) + D(q2 ),                     (3.109)
where h of momentum p is the hadron we would like to study, B of momentum p2 is a
colliding lepton, photon or meson, and C and D of momentum q1 and q2 , respectively, are
two final-state particles, which can be a lepton, photon or meson, with large transverse
momenta,
                                                       √
                                        q1T ∼ q2T ≫      −t ,                                (3.110)
with t ≡ (p − p′ )2 . In the lab frame with h along +ẑ and B along −ẑ, the scattering
configuration is illustrated in Fig. 3.11(a).
                       C(q1)                                          C(q1)
                                ~qT
                                                                               ′
         h(p)              θ      B(p2)                              = p  −p) H
                                                              A (p 1
                                                                ∗
                                                                                     B(p2)
                                  ′ ′                          F
                               h (p )
                                                      h(p)            h′ (p′ )      D(q2 )
                  D(q2 )
                      (a)                                              (b)
Fig. 3.11: (a) Illustration of the kinematic configuration of the SDHEP in the lab frame. (b)
The two-stage paradigm of the SDHEP.
                                                                                 √
    There are two distinct scales involved in the SDHEP, one soft scale            −t characterizing
the diffraction subprocess, and one hard scale Q = O(qT ) characterizing the production of
the particles C and D. Then the SDHEP can be pictured as a two-stage process, as shown
in Fig. 3.11(b), being a combination of a diffractive production of a single long-lived state
                                               176


                   C(q1)                                            C(q1)
                            ′
                       −p) H                                          ∗ (p 1)
                  = p                               =           A =
                                                                 ∗  γ           H                  +
           A (p 1
             ∗
                                           B(p2)                                         B(p2)
            F                                                 F
   h(p)            h′ (p′ )              D(q2 )        h(p)         h′ (p′ )            D(q2 )
                                 C(q1)                                         C(q1)
                                   ∗ (p 1)                                       ∗ p 1)
     +                       [q q̄ ]       H             +                   [g g] (      H           +···
                    A =                                              A =
                      ∗                                                ∗
                                                 B(p2)                                          B(p2)
                    F                                                F
          h(p)                  h′ (p′)         D(q2 )      h(p)                h′ (p′)        D(q2)
Fig. 3.12: The representation of the SDHEP in terms of all possible exchanged channels
of the virtual state A∗ (p1 ) between the single-diffractive h → h′ transition [Eq. (3.111)]
and the 2 → 2 hard exclusive process [Eq. (3.112)]. The two gluons in gg channel have
physical polarizations. The q q̄ and gg channels can be accompanied by arbitrary numbers of
collinear longitudinally polarized gluons. The “· · · ” refers to the channels with more than
two physically polarized partons, which are power suppressed compared to the two-parton
case.
                                                        177


A∗ (p1 ),
                             h(p) → A∗ (p1 ) + h′ (p′ ),   with p1 = p − p′ ,            (3.111)
and a hard exclusive 2 → 2 scattering between the two nearly head-on states A∗ (p1 ) and
B(p2 ),
                                  A∗ (p1 ) + B(p2 ) → C(q1 ) + D(q2 ).                   (3.112)
In the c.m. frame of A∗ and B, as a necessary condition for factorization, the transverse
momentum qT of C or D is required to be much greater than the invariant mass of A∗ or B.
    The 2 → 2 hard exclusive process H in Fig. 3.11(b) takes place at a short distance
1/Q ≪ 1/ΛQCD ∼ 1 ∼ fm and is sensitive to the partonic structure of the exchanged state
A∗ (p1 ). The scattering amplitude of the SDHEP should include a sum of all possible partonic
states, as illustrated in Fig. 3.12, which can be schematically described as
                                            X∞ X
                                                      fn        ′
                           MhB→h′ CD =              Fh→h ′ (p, p ) ⊗ Cfn B→CD ,          (3.113)
                                            n=1 fn
where n and fn represent the number and flavor content of the particles in the exchanged state
                     fn
A∗ , respectively, Fh→h         ′                                               ′
                         ′ (p, p ) is a “form factor” responsible for the h → h transition, and
Cfn B→CD denotes the scattering amplitude of the hard part H, along with the sum running
over all possible exchanged states characterized by n and fn . Throughout the discussion
in this thesis, we retain the scattering amplitude Cfn B→CD at the lowest order in the QED
coupling constant for given exchanged state fn and particle types of B, C, and D, while
investigating contributions from QCD to all orders in its coupling constant.
    For n = 1, the only possible case is a virtual photon exchange, i.e., f1 = γ ∗ , which
resembles the Bethe-Heitler process for the DVCS (see [Ji, 1997b] for example). Rather than
                                                   178


probing the partonic structure of h, this channel only gives an access to the electromagnetic
form factor of h evaluated at a relatively soft scale t. As discussed below, the γ ∗ -mediated
subprocess gives a “superleading-power” background for the n ≥ 2 channels, and should
not be disregarded even if suppressed by higher-order QED couplings, unless symmetry
considerations prohibit it. The scattering amplitude of the SDHEP should be expanded in
inverse powers of the hard scale, and then followed by a perturbative factorization for the
leading-power contribution (and subleading-power contribution if needed, see, e.g., [Kang
et al., 2014]). If the n = 1 subprocess is forbidden (as discussed below), then the scattering
amplitude of the SDHEP starts with n = 2 subprocesses.
    For n = 2, we can have QCD subprocesses with f2 = [q q̄ ′ ] or [gg]. This gives the
leading-power contribution that, as shown in the following subsections, can be factorized
into GPDs with corresponding hard coefficients. The channels with n ≥ 3 belong to high-
                                                         √
twist subprocesses that are suppressed by powers of        −t/Q and will be neglected in the
following analysis.
3.2.1      General discussion of the γ ∗ -mediated channel
Before providing the detailed arguments for QCD factorization of SDHEPs, initiated by a
lepton, photon or meson beam, respectively, in the next three subsections, we first give a
general discussion for the γ ∗ -mediated hard subprocesses, corresponding to the n = 1 channel
in Eq. (3.113), independent of the particle types of B, C and D. More detailed discussions
for specific processes will be given in later subsections.
    One difference between the n = 1 and n ≥ 2 subprocesses is that the virtual photon
momentum is fully determined by the diffraction of the hadron h. The amplitude of the
                                               179


γ ∗ -mediated subprocess can be trivially factorized into the electromagnetic form factor of h,
                                 e
                    M(1) = − ⟨h′ (p′ )|J µ (0)|h(p)⟩ · ⟨C(q1 )D(q2 )| (−ieJµ (0)) |B(p2 )⟩
                                 t
                                 e
                           ≡ − F µ (p, p′ ) Hµ (p1 , p2 , q1 , q2 ),                                        (3.114)
                                 t
where the superscript “(1)” refers to the contribution to the SDHEP amplitude from the
                                    P
n = 1 channel, and J µ =                i∈q ei ψ̄i γ µ ψi is the electromagnetic current of quarks, summing
over their flavors “i” and weighted by their fractional charges ei , normalized such that
eu = 2/3 and ed = −1/3. In the second step we defined the hard factor Hµ that includes
the QED coupling −ie, and the electromagnetic form factor,
                                                                                         iσ µν p1ν
      F µ (p, p′ ) = ⟨h′ (p′ )|J µ (0)|h(p)⟩ = F1h (t) ū(p′ )γ µ u(p) − F2h (t) ū(p′ )           u(p),    (3.115)
                                                                                           2mh
which has the leading component F + ∼ O(Q) as the h-h′ system is highly boosted along the
z direction. However, when this component is contracted with Hµ , which scales as O(Q0 )
for each component, we have
                            1 + + −                    1                                      
              F + H− =      + F        p1 H = + F + p1 · H + p1T · HT − p−              1H
                                                                                             +
                                                                                                 ,          (3.116)
                           p1                          p1
where in the bracket, the first term vanishes by the Ward identity of QED, and the other
                  √                                                                                     √
two scale as        −t and t/p+    1 , respectively. So the leading power of F · H scales as              −t and is
given by the transverse polarization of the virtual photon. Therefore, the power counting of
                              √                                                                               √
M(1) is of the order 1/ −t, which is higher than the n = 2 channel by one power of Q/ −t.
     One caution should be noted that it is not appropriate to only keep p+                    1 in the amplitude
                                                                                              √
Hµ (p1 , p2 , q1 , q2 ) because the approximation introduces an error of order                   −t/Q. While this
                                                              180


is power suppressed comparing to the leading contribution from the n = 1 channel, it could
scale at the same order as the contribution from the n = 2 channel since both of them have
the power counting 1/Q. By neglecting all the n ≥ 3 channels, our approximation to the full
                                                √       
SDHEP amplitude is up to the error at O −t/Q2 , so that the 1/Q part should be kept
as exact when evaluating the contribution from the n = 1 channel. This will be explicitly
demonstrated for the single-diffractive real photon electroproduction in Sec. 4.3.
    There is one further subtlety when the γ ∗ -mediated subprocess involves light mesons in
H. The conventional practice is to factorize it into meson distribution amplitudes (DAs).
                                                     √ 
While this is true to the leading power at O 1/ −t , it neglects the power correction of
                     √ 
O(ΛQCD /Q) · O 1/ −t = O(1/Q), which is of the same order as the n = 2 channels, i.e.,
the GPD channels. Keeping the exact 1/Q contribution thus requires the subleading-power
(or, twist-3) factorization for the γ ∗ -mediated subprocess that involves any mesons, which is
beyond the scope of this thesis.
    There are two cases in which the γ ∗ -channel is forbidden. The first is for a flavor-changing
channel with h ̸= h′ that cannot be achieved by the electromagnetic interaction, like the pion-
nucleon scattering processes in [Berger et al., 2001; Qiu and Yu, 2022] which can involve the
proton-neutron transition. The second case is for particular combinations of the particle
types of B, C and D that mandate Hµ (p1 , p2 , q1 , q2 ) = 0 by some symmetries. Apart from
these two cases, we should generally include the γ ∗ -mediated subprocess.
    For example, for the photoproduction of diphotons considered in [Pedrak et al., 2017], one
should include the γ ∗ channel that involves photon-photon scattering in Hµ . Even though
this is suppressed by αem compared to the GPD subprocess that corresponds to the n = 2
                                                                 √
channel, the γ ∗ channel at n = 1 is power enhanced by Q/ −t. In such cases, we need
to carefully compare the contributions from both channels, and to develop an experimental
                                                181


approach to remove the background due to the γ ∗ channel in order to extract GPDs from the
experimental data. One common approach by using azimuthal correlations will be discussed
in Secs. 4.2 and 4.3.
3.2.2        SDHEP with a lepton beam
For single-diffractive hard exclusive electroproduction processes, we have B = C = e. The
other particle D can be a photon γ or a light meson MD . Both of these two processes allow the
γ ∗ -initialized channel with n = 1. For the n = 2 channel, D = γ leads to the deeply virtual
Compton scattering (DVCS) [Ji, 1997b; Radyushkin, 1997], and D = meson corresponds
to the deeply virtual meson production (DVMP) [Brodsky et al., 1994; Frankfurt et al.,
1996]. Both processes have been proved to be factorized into GPDs [Collins and Freund,
1999; Collins et al., 1997]. Here, we will switch the theoretical perspective from [Collins
and Freund, 1999; Collins et al., 1997] by fitting them into the general SDHEP framework.
The proof follows the two-stage paradigm depicted in Eqs. (3.111)–(3.113). This approach
incorporates the γ ∗ -initialized n = 1 channel naturally, and for the n = 2 channel, it leads
to a direct analogy to the large-angle meson scattering processes in Eq. (3.112) by having
A∗ being some meson state carrying the quantum number of the [q q̄ ′ ] or [gg] state. Our
strategy for the proof follows a two-step process introduced in [Qiu and Yu, 2022, 2023a]:
(1) justify the factorization for a simpler 2 → 2 hard exclusive process in Eq. (3.112), which
has been done in Sec. 3.1, and (2) extend the factorization to the full SDHEP in Eq. (3.109)
by addressing extra complications, including especially the difficulty from Glauber gluons.
As expected, we will reproduce the proofs in [Collins and Freund, 1999; Collins et al., 1997].
                                              182


                                                                                          q1
                                                                     p2
                                                            q1                   H
           p2                 q1         p2                                               q2
                                                    H       q2          x+ξ           x−ξ
                     H
           p A                q2         p                  p′        p                   p′
                     p′                             A
                    (a)                            (b)                          (c)
Fig. 3.13: Leading-region graphs of the DVCS for the (a) ERBL region and (b) DGLAP
region of the GPD. (c) illustrates the result after factorizing the collinear subgraph out of
H into a GPD. The two quark lines can be replaced by two transverse gluon lines.
3.2.2.1    Real photon production: D = γ
For n = 1, this gives the Bethe-Heitler process, and the amplitude Hµ in Eq. (3.114) is the
scattering amplitude of γ ∗ (p1 ) + e(p2 ) → e(q1 ) + γ(q2 ) with q1T
                                                                   2
                                                                       ≫ |p21 | = |t|.
    For n = 2, the state A∗ can be either a collinear q q̄ or gg pair, which interacts with
                                                          ∗
the electron beam by exchanging a virtual photon γee         with momentum q = p2 − q1 , similar
to Fig. 3.1(c). This channel is therefore referred to as deeply virtual Compton scattering
(DVCS). The [q q̄] and [gg] state can be accompanied by an arbitrary number of longitudinally
polarized collinear gluons. The traditional treatments are all carried out in the Breit frame
                         ∗
of the virtual photon γee  and hadron beam h [Collins and Freund, 1999; Collins et al., 1997].
Here, we follow the kinematic setup of the SDHEP in Eq. (3.109) to work in the c.m. frame
of the initial-state hadron and electron with the hadron along the z axis. The requirement
                                            ∗
of a high virtuality Q2 = −q 2 for the γee    is equivalent to the requirement of hard transverse
momenta qT for the final-state electron and photon in this frame, since Q2 ∝ qT2 . Hence, the
                  ∗
virtual photon γee   has a short lifetime and belongs to the hard part, and therefore we have
the leading-region diagrams as in Fig. 3.13(a)(b).
    Due to the presence of the diffraction, now we have two types of leading regions. In
the first region [Fig. 3.13(a)], all the h-collinear parton lines go into the hard scattering H
                                                  183


with positive plus momenta. This region is greatly similar to the leading region of meson
scattering, e.g., Fig. 3.1(b). We call it ERBL region. In the other region [Fig. 3.13(b)],
however, we also have some of the h-collinear parton lines go out of H with positive plus
momenta and merge with the beam remnants to form the diffracted hadron h′ . This region
is called DGLAP region. It has no analogy in the large-angle meson scattering so represents
a new feature of the diffractive scattering. In the DGLAP region, not only do we have
long-lived propagating partons lines connecting the collinear subgraph to the hard subgraph
H, but also have long-lived remnant particles propagating along the collinear direction of
h as spectators of the hard interaction. Therefore, one has an opened color object lasting
for a long time throughout the whole scattering. Soft gluons can be exchanged between the
spectators and colored lines along other collinear directions. This will lead to the problem
of Glauber pinch that we will discuss in detail below.
    Luckily, the Glauber region does not cause any issue for the real photon electroproduction
process here, since there is only one collinear subgraph and no soft subgraph. Therefore,
the factorization proof can be directly built on that of the corresponding meson scattering
process treated in Sec. 3.1.1.1.
    For both the ERBL and DGLAP regions, the collinear momenta kiµ are pinched for their
                       √                                                        √
minus components if      −t ≪ p+ 1 ∼ qT . Introducing the scaling variable λ =    −t ≪ qT , the
collinear momentum scaling is the same as in Eq. (3.4). And then the same approximations as
in Eqs. (3.5)–(3.7) can be made to factorize the collinear subgraph from the hard subgraph for
quark-initiated processes. For the diffractive scattering, one no longer has isospin symmetry
to forbid the gluon-initiated channel, so the leading region in Fig. 3.13 contains an extra
                                               184


case when all collinear parton lines are gluons. Then we replace each gluon coupling by
           Hµ (ki ; kH ) g µν Cν (ki ) = Hµ (ki ; kH ) (K µν (ki , n) + Gµν (ki , n)) Cν (ki ),             (3.117)
with
                                          kiµ nν                                      kiµ nν
                      K µν (ki , n) =                ,   Gµν (ki , n) = g µν −                  .           (3.118)
                                       ki · n − iϵ                                 ki · n − iϵ
Note that no replacement of ki → k̂i as Eq. (3.5) has been made in H. A gluon with its
coupling replaced by the K (G) factor is called a K-gluon (G-gluon). When all or all but one
gluons are K-gluons, we get a super-leading power contribution. The region that has two
G-gluons with all the others being K-gluons corresponds to the leading power. When there
are three or more G-gluons, one receives a power suppression. As demonstrated in [Collins
and Rogers, 2008], the super-leading power contribution is cancelled, but those regions still
give nonzero contribution at leading power, which combines with the leading regions to give
the full leading-power contribution.
    After use of Ward identities and sum over regions and graphs, the collinear lines are
factorized out of the hard part, as in Eq. (3.11),
                     Z           "
         (2)               d4 k X q                            q
     Mhe→h′ eγ    =            4
                                        Hβα;ji (k̂, p̂1 − k̂) Cαβ;ij (k; p, p′ )
                         (2π)        q
                                                                                                        i
                                                               g                      g
                                                           +Hνµ;ba    (k̂, p̂1 − k̂) Cµν;ab (k; p, p′ )   , (3.119)
up to terms suppressed by powers of λ/qT , where the superscript “(2)” refers to the contribu-
tion to the SDHEP amplitude from the n = 2 channel. We have included the contributions
from both quark and gluon channels. The collinear factor C q for the quark parton differs
                                                         185


from Eq. (3.12) only in the external hadron states,
                                    Z                             n
       q
                                                                                                 
     Cαβ;ij   (k; p, p′ ) =PA,αα′       d4 y eik·y ⟨h′ (p′ )|T         ψ̄q,β ′ (0)W † (∞, 0; n) j
                                                             × [W (∞, y; n)ψq,α′ (y)]i |h(p)⟩ P̄A,β ′ β ,                (3.120)
but now we are allowed to have more spin structures,
                                 "                                                                                      #
                             δij                    γ −
                                                                                 γ5 γ −    X                       σ ℓ−
      q
    Cαβ;ij   (k; p, p′ ) =        C q,+ (k; p, p′ )      + Ceq,+ (k; p, p′ )            +       C⊥q,+ℓ (k; p, p′ )           ,
                            Nc                       2                             2      ℓ=1,2
                                                                                                                    2
                                                                                                                          αβ
                                                                                                                         (3.121)
with each factor defined as
                                         Z
                                                                            
     C q,+    eq,+
           , C , C⊥   q,+ℓ
                             (k; p, p ) = d4 y eik·y ⟨h′ (p′ )|T ψ̄q (0)W † (∞, 0; n)
                                     ′
                                                        + +                                            
                                                          γ γ γ5 σ +ℓ
                                                    ×         ,           ,          W (∞, y; n)ψq (y) |h(p)⟩. (3.122)
                                                           2        2         2
The collinear factor for the gluon parton is
                                            Z                                nh                          i
        g,µν                       1                            ′    ′                      †
     Cab     (k; p, p′ )  = +                      4
                                                d ye    ik·y
                                                             ⟨h (p )|T              +ν
                                                                                 G (0)WA (∞, 0; n)
                            k (k − p1 )+                                                                   b
                                                                                                   
                                                                    × WA (∞, y; n)G+µ (y) a |h(p)⟩,                      (3.123)
                                              abc µ ν
where Gµν             µ ν
            a = ∂ Aa − ∂ Aa − gf
                                 ν µ
                                                  Ab Ac is the gluon field strength tensor, and WA is the
Wilson line in the adjoint representation,
                                                         Z         ∞                                   
                        WA,ab (∞, y; n) = P exp −g                      dλ nµ Aµc (y    + λn) (f   cab
                                                                                                       ) ,               (3.124)
                                                                  0
                                                              186


obtained by replacing ta in Eq. (2.158) by TAa = −i(f abc ). Due to the antisymmetry of Gµν                  a ,
                              g,µν                                               g,µν
only the components of Cab          with µ, ν = 1, 2 are nonzero, so Cab              also has four independent
Lorentz structures, similar to Eq. (3.121),
          g,ij      δab       h
                                  g,īī ij        g,ij   g,ji
                                                                 g,ij         g,ji     g,īī ij
                                                                                                   i
        Cab    =                C δ + C −C                       + C +C −C δ                          ,  (3.125)
                 2(Nc2 − 1)
where the repeated index ī is summed over, and the quantities C without color subscripts
(a, b) have already included a trace over them. We may rewrite Eq. (3.125) by abusing the
notations of Pauli matrices σk = (σkij ) (k = 1, 2, 3),
                                                        "             3
                                                                                   #
                                 g,ij           δab                 X
                               Cab      =        2
                                                         C0 δ ij +        Ck σkij , ,                    (3.126)
                                            2(Nc − 1)                k=1
with the coefficients determined as
           C0 = tr (C g ) = C g,11 + C g,22 ,                  C2 = tr (C g σ2 ) = −i(C g,21 − C g,12 ),
           C1 = tr (C g σ1 ) = C g,12 + C g,21 ,               C3 = tr (C g σ3 ) = C g,11 − C g,22 ,     (3.127)
where the “tr” is over the Lorentz indices (i, j).
    Define the kinematics associated with the collinear factors,
                                                                (p − p′ )+
        P = (p + p′ )/2,      ∆ = p1 = p − p′ ,         ξ=                 ,   k + = (x + ξ)P + ,        (3.128)
                                                                (p + p′ )+
and so
                 p+ = (1 + ξ)P + ,           p′+ = (1 − ξ)P + ,      (k − p1 )+ = (x − ξ)P + .           (3.129)
                                                        187


Then because only the plus parton momentum flows in H, the momentum integration in
Eq. (3.119) can be reduced to a mere convolution in k + , or in x. This then factorizes the
whole amplitude into GPDs that capture the infrared sensitivity,
                             XZ      1     h
                                       dx F f (x, ξ, t)H f (x, ξ) + Fef (x, ξ, t)H            e f (x, ξ)
             (2)
         Mhe→h′ eγ      =
                               f   −1
                                                                 X                                             i
                                                               +            FTf,i (x, ξ, t)HTf,i (x, ξ)          ,            (3.130)
                                                                      i=1,2
which sums over the parton flavors f = q, g, as illustrated in Fig. 3.13(c). We have defined
the quark and gluon GPDs, obtained by integrating Eqs. (3.122) and (3.127) over kT and
k−,
                                           Z
       q  e q     q,1     q,2                      dy − i(x+ξ)P + y−
      F , F , FT , FT (x, ξ, t) =                         e                                                                   (3.131)
                                                    4π
                                                                                                           
                               × ⟨h′ (p′ )|T ψ̄q (0)W (0, y − ; n) γ + , γ + γ5 , σ +1 , σ +2 ψq (y − ) |h(p)⟩,
                                            Z
       g eg       g,1     g,2                        dy − i(x+ξ)P + y−
      F , F , FT , FT (x, ξ, t) =                            e                                                                (3.132)
                                                   2πP +
                                                                         
                               × δ ij , −σ2ij , σ3ij , σ1ij ⟨h′ (p′ )|T G+j (0)WA (0, y − ; n)G+i (y − ) |h(p)⟩
for the unpolarized, (longitudinally) polarized, and transversity ones. The corresponding
hard coefficients are,
                                                                                 
       Hq, He q , H q,1 , H q,2 (x, ξ) = 1 γ − , γ5 γ − , σ 1− , σ 2−                      q
                                                                                        Hβα;         (k̂, p̂1 − k̂),          (3.133)
                     T       T                                                       αβ         īī
                                               2Nc
                                                     1           1                                      g
       Hg, He g , H g,1 , H g,2 (x, ξ) =                                    δ ij , −σ2ij , σ3ij , σ1ij Hji;āā    (k̂, p̂1 − k̂),
                     T        T                       2          2      2
                                               2(Nc − 1) x − ξ
where the spinor indices (α, β) and transverse Lorentz indices (i, j) are summed over. Note
                                                               188


that the factor 1/(x2 − ξ 2 ) in the gluon hard coefficient does not raise problems for the x
integration at x = ±ξ because such poles are introduced by the artificial use of the field
strength tensor in the gluon GPD definition. The latter contains zeros at x = ±ξ, which
cancel the poles at the hard coefficients.
    For the DVCS, transversity GPDs do not contribute because the massless parton ap-
proximation renders the corresponding hard coefficients to vanish. Then Eq. (3.130) only
has the first line. By a similar argument as Eqs. (3.20)–(3.23), the time ordering can be
dropped [Diehl and Gousset, 1998] in Eq. (3.131), allowing for insertion of physical states.
Then we have the support conditions for the GPDs,
                p+ − k + = (1 − x)P + ≥ 0,    p+ − (p1 − k)+ = (1 + x)P + ≥ 0.        (3.134)
Thus GPDs are only nonzero when x ∈ [−1, 1], which explains the integration range in
Eq. (3.130). Due to the amplitude nature, GPDs are (non-local) matrix elements between
two pure hadron state. There is no way for hadron spin averages to come in before we square
the amplitude. All possible GPDs should be kept unless forbidden by symmetries. This is
different from the collinear factorization of inclusive processes such as DIS, for which the
polarization state of partons is dependent on the target spins state, and polarized PDFs
are nonzero only when the targets are polarized. Moreover, we note that the flavor sum in
Eq. (3.130) is only over all possible quark flavors and gluon, not antiquarks. Because the x
is integrated from −1 to 1, there is no need to introduce antiquark GPDs separately.
    Due to the absence of a soft subgraph, the collinear factors in Eq. (3.131) do not need
further subtraction. The factorization result in Eq. (3.130) is obtained mainly by use of
Ward identities, which applies equally to the leading region R under approximation and to
                                              189


the subtracted smaller regions R′ < R, effectively contained in H. The hard coefficients in
Eq. (3.130) contain subtractions of smaller regions when some lines become collinear, which
can be dealt with recursively using the same factorization procedure.
    However, the GPDs defined in Eq. (3.131), as well as the corresponding hard coefficients
due to collinear subtractions, contain artificial UV divergences, as a result of short circuiting
the kT integration in the collinear factors. They hence need additional renormalization.
Similar to PDFs in Sec. 2.7.7, GPDs can also be multiplicatively renormalized. This can
be used to convert each factor in Eq. (3.130) to a renormalized version, with the same
factorization structure. All the renormalized factors depend additionally on the factorization
scale µ,
                  XZ     1   h                                                     i
        (2)
     Mhe→h′ eγ  =               f             f           e f           e f
                           dx F (x, ξ, t; µ)H (x, ξ; µ) + F (x, ξ, t; µ)H (x, ξ; µ) ,     (3.135)
                   f   −1
which implies a set of evolution equations that can be used to improve the factorization
predictivity.
    Compared to the corresponding DA factorization in Sec. 3.1.1.1, the soft parton issue
can also arise here, i.e., some of the parton momenta may have ki+ ≪ Q, which violates
the scaling in Eq. (3.4), and thus the corresponding approximations. This is termed the
“breakpoint” issue in [Collins et al., 1997]. However, since the region ki+ ∼ 0 ≪ Q is not
pinched, we can deform the contour of k + integration by k + 7→ k + ± iO(Q) [Collins et al.,
1997]. This deformation is allowed because the breakpoint only lies on the boundary between
the ERBL and DGLAP regions, but not at the GPD end points.
    Perturbatively, the soft parton singularity appears in Eq. (3.135) at x = ±ξ. As an
example, the LO DVCS hard coefficient, to be calculated in Eq. (4.119), contains terms that
                                               190


are proportional to 1/(x ± ξ ∓ iε). We can deform the x contour to avoid the poles at ∓ξ.
This is achieved in practical calculations by
                                         1             1
                                                 =P        ± iπ δ(x ± ξ),                (3.136)
                                  x ± ξ ∓ iε          x±ξ
where P denotes principal-value integration.
3.2.2.2     Light meson production: D = meson
Similarly, the single-diffractive hard electroproduction of a light meson MD can be built on
the large-angle meson electron scattering process in Sec. 3.1.1.2. We keep the same definitions
in Eqs. (3.29)–(3.34), and use the same approximations in Eqs. (3.36)–(3.45) except the
Eqs. (3.37) and (3.40), which will be explained below. We will rely on the asymmetric
deformation introduced in Sec. 3.1.1.3 which has been extensively used in Secs. 3.1.2.4 and
3.1.3.3.
                                   q1                             q1
                          p2                               p2              q2
                                     q                             q
                                ∗             q2
                               γee                             ∗
                                                              γee
                                 γ∗     p1
                           p                p′             p                  p′
                                    (a)                              (b)
Fig. 3.14: Examples of LO diagrams for the light meson production in the SDHEP with an
electron beam, for (a) the n = 1 channel and (b) the n = 2 channel for [q q̄ ′ ] case, where the
red thick lines indicate those with a hard qT flow and high virtualities.
    First, the n = 1 γ ∗ -initialized channel exists for a neutral meson production, which gives
the subprocess
                                   γ ∗ (p1 ) + e(p2 ) → e(q1 ) + MD (q2 ).               (3.137)
                                                      191


One LO diagram is shown in Fig. 3.14(a). The slightly off-shell photon γ ∗ (p1 ) scatters with
                              ∗
the highly virtual photon γee   (q = p2 − q1 ) to produce the meson MD . Eq. (3.137) is just
the reversed process of the large-angle real photon production in electron-meson scattering,
discussed in Sec. 3.1.1.1, although now the photon γ ∗ is virtual. As in Sec. 3.1.1.1, we can
also factorize the amplitude of the process [Eq. (3.137)] into the DA of MD to the leading
power of mD /qT , similar to Eq. (3.26). As noted in Sec. 3.2.1, however, this approximation
is only valid at leading power of the process in Eq. (3.137), which is of one power higher
(super-leading) than the n = 2 GPD channel of our main interest. A more consistent
treatment needs to factorize the process in Eq. (3.137) to the subleading power, which is
beyond the scope of this thesis. Alternatively, one may choose to parametrize the amplitude
by the γ ∗ γee
            ∗
               → MD form factor, without use of factorization. The n = 1 channel would be
forbidden for the production of a charged meson like π ± , or of a neutral meson with odd C
parity, such as ρ and J/ψ.
                                  q1                                  q1
                   p2                    q2            p2                     q2
                            H        D                           H        D
                                        S                                    S
                   p                    p′             p                      p′
                             A                                   A
                             (a)                                  (b)
Fig. 3.15: Leading-region graphs (a)(b) for producing a light meson from the SDHEP with
a lepton beam. Depending on the quantum numbers, the quark lines may be replaced by
transversely polarized gluon lines. (c) is the result after factorizing it into the DA and GPD.
    For the n = 2 channel, the diffracted hadron h can exchange a collinear [q q̄ ′ ] or [gg] state
with the hard scattering. The latter only holds when MD is charge neutral. One leading-order
diagram for the quark channel is shown in Fig. 3.14(b). The hard electron scattering still
                                               192


                                                     ∗
happens by exchanging a highly virtual photon γee      , and so this (sub)process is referred to as
deeply virtual meson production (DVMP). The leading regions are shown in Fig. 3.15(a) and
Fig. 3.15(b). In region (b) physically polarized quarks or gluons are attaching the collinear
subgraphs to the soft subgraph; it is power suppressed by the soft-end suppression with
respect to the meson wavefunction, as explained in Sec. 3.1.1.2.
     As for the DVCS in Sec. 3.2.2.1, the diffractive kinematics introduces the extra DGLAP
region, compared to the meson scattering case in Sec. 3.1.1.2. While this does not cause
problems for the DVCS, it does lead to obstacles in factorizing soft gluons out of the A-
collinear subgraph. This is illustrated in a simple model theory in Fig. 3.16, where we have
indicated the chosen soft momentum flows by the thin curved arrowed lines. We make the
following observations:
  (1) DGLAP region has active collinear parton lines both before and after the hard inter-
       actions, and the soft gluons can attach to both, as shown in Figs. 3.16(a) and 3.16(b).
       With the soft momentum flows as indicated, attaching to the initial-state collinear
       parton gives a pole of ks− at O(λ2 )/Q − iϵ, while the final-state one gives a pole of ks−
       at O(λ2 )/Q + iϵ;
  (2) DGLAP region also has some spectator partons going in the forward direction. When
       the soft gluon attaches to the spectator lines, as shown in Fig. 3.16(c), it flows both
       in the same and opposite directions as the target-collinear lines, so that one single
       diagram gives both O(λ2 )/Q ± iϵ poles for ks− contour.8
Diagrams like Fig. 3.16(c) pinch the ks− contour at small values, such that for a Glauber
gluon with the momentum scaling as in Eq. (3.34), one cannot deform the ks− contour to get
   8
     Rerouting the soft momentum flow can change the situation (1) such that it also flows through the
spectators and leads to both kinds of poles.
                                               193


out of the Glauber region, as was allowed by the corresponding 2 → 2 meson scattering in
Eq. (3.35). While the diagrams in Figs. 3.16(a) and 3.16(b) do not directly cause pinch in
the Glauber region, they cannot be trivially dealt with, either. Note that factorizing the soft
gluons from the A-collinear lines requires to first deform soft gluons out of the Glauber region
and then apply Ward identities. Even though we can deform the ks− contour to get out the
Glauber region for both diagrams, the deformation directions are opposite. For Fig. 3.16(a),
we need to replace the gluon coupling by
                                                                                       ν
                                                                                k̂sµ wA
                            JµA (ks , kA ) g µν Sν (ks ) 7→ JµA (k̂s , kA )                   ,          (3.138)
                                                                            ks · wA + iϵ
whereas for Fig. 3.16(b), we need to flip the iϵ sign. This would forbid use of Ward identity
for the soft gluons, since different terms do not combine and cancel. This feature is closely
related to the existence of Glauber pinch for ks− in Fig. 3.16(c).
                      q1                               q1                                 q1
               p2                  q2           p2                   q2            p2                q2
                       q                                q                                  q
                   ∗                                ∗                                  ∗
                  γee                              γee                                γee
                         ks       ks                            ks    ks                          ks
                                                                                             ks
               p                     p′          p                    p′           p                  p′
                         (a)                              (b)                                 (c)
Fig. 3.16: Three example diagrams illustrating the soft gluon exchange between the collinear
subgraphs along the diffractive hadron and the final-state meson, for the DGLAP region of
the GPD in a simple model theory. The green thin curved lines indicate the soft momentum
flows.
     The way out is to note that all the soft ks+ poles come from the D-collinear lines, and
lie on the lower half plane when ks flows from D into S. One may thus deform ks+ as
ks+ 7→ ks+ + iO(Q) while keeping ks− contour unchanged, as was done in Eq. (3.61). While
it is a free choice for the 2 → 2 hard exclusive scattering, this deformation is necessary here
                                                          194


due to the pinch in the DGLAP region of the diffractive process, and it moves all Glauber
gluon momenta to the A-collinear region. For the same reason as discussed in Sec. 3.1.1.3,
the iϵ prescription for k − does not matter so it can be chosen in an arbitrary but consistent
way.
    Then by a similar discussion to Sec. 3.1.2.4, we can first factorize the D-collinear subgraph
out of H, and soft gluons out of D. The same line of arguments applies here for the neutrality
of meson D, the soft cancellation, and that the pair of collinear Wilson lines associated with
D is joined into a finite-length Wilson line along w̄D . This applies to both the approximated
region in Fig. 3.15(a) and smaller regions for subtraction, and reduces the leading region
to Fig. 3.17(a). Then by only attaching to the collinear subgraph A, soft gluons are no
longer pinched. Because all the ks+ poles are of order Q, one may deform the ks+ contour by
order Q to make it a A-collinear momentum. We can thus group the soft subgraph into the
A-collinear subgraph. Then Fig. 3.17(a) is exactly similar to Fig. 3.13(a)(b) for the single-
diffractive real photon electroprodcution, and we can follow the same procedure to factorize
the collinear subgraph associated with the diffracted hadron out of H into the GPD.
    Finally, we achieve the factorization of the amplitude,
        (2)
                      XZ    1     Z  1
                                           i
     Mhe→h′ eMD    =          dx       dz Fhh′ (x, ξ, t; µ)Cie→ej (x, ξ; z; qT , µ) D̄j/D (z, µ), (3.139)
                       i,j −1      0
up to 1/qT power suppressed terms, as diagrammatically shown in Fig. 3.17(b). The hard
coefficient is a scattering of a collinear and on-shell parton pair i along wA off the electron into
another collinear and on-shell parton pair j along wD . It contains collinear subtractions from
                   i
both the GPD Fhh     ′ and DA ϕj/D , but the latter two do not contain further soft subtractions,
as a feature of collinear factorization. We have used the multiplicative renormalizations of
                                                    195


GPD and DA to convert each factor to a renormalized one, which introduces a factorization
scale µ and the associated evolution equations. The sum over i and j runs over all possible
flavors and spin structures.
                                   q1                                                    q1
                  p2                              q2                    p2                    z         q2
                             H                 D                                    H                D
                                                                                            1−z
                                         S                                x+ξ               x−ξ
                  p                         p′                         p                        p′
                             A                                                      A
                             (a)                                                   (b)
Fig. 3.17: (a) Factorization of soft subgraph from the collinear subgraph of the final-state
meson. (b) Factorization of the A-collinear subgraph out of the hard subgraph into GPD.
    For example, the charged pion π + production pe → neπ + only supports the channel
            ¯ which gives the factorization formula,
i = j = [ud],
                       Z 1    Z   1
                                    dz Fepn
        (2)                               u
     Mpe→neπ+      =       dx               (x, ξ, t; µ) C[ud]e→e[u
                                                             ¯      ¯ (x, ξ; z; qT , µ) D̄u/π + (z, µ).
                                                                   d]                                      (3.140)
                        −1      0
To the leading order of QED, the hard coefficient is only nonzero for the polarized GPD Fepn                    u
due to the QED Ward identity,
                                                                 −igµν
                              C[ud]e→e[u
                                    ¯       ¯ ∝ ū(q1 )γµ u(p2 )
                                           d]                            (p̂1 + q̂2 )ν ,                   (3.141)
                                                                   q2
which requires a γ from the GPD. The bare flavor-changing GPD Fepn                        u
                                                                                             is defined as
                           Z
                              dy − i(x+ξ)P + y−
     Fepn
       u,bare
              (x, ξ, t) =           e             ⟨n(p′ )|ψ̄d (0)W (0, y − ; n)γ + γ5 ψu (y − )|p(p)⟩.     (3.142)
                               4π
The neutral pion π 0 production pe → peπ 0 , on the other hand, supports both quark and
                                                         196


gluon channels,
                      X XZ            1    Z   1
                                                 dz Fepi (x, ξ, t; µ) C[iī]e→e[j j̄] (x, ξ; z; qT , µ) D̄j/π0 (z, µ).
        (2)
    Mpe→peπ0       =                    dx
                     i=u,d,g j=u,d   −1      0
                                                                                                                   (3.143)
The flavor-changing GPDs can be related to the flavor-conserving ones by isospin symme-
try [Mankiewicz et al., 1999],
     Fepn
       u
          (x, ξ, t) = Fenp
                        d
                           (x, ξ, t) = Fepu (x, ξ, t) − Fepd (x, ξ, t) = Fend (x, ξ, t) − Fenu (x, ξ, t),          (3.144)
which also applies to F .
3.2.2.3     Extending to virtual photon or heavy quarkonium production
The DVCS and DVMP differ in how the observed particle couples to the hard interaction:
the photon of the DVCS couples directly to the hard collision while the light meson of DVMP
couples to the hard collision via two collinear partons. The factorization proof for the DVCS
should apply equally to the case of producing a virtual photon γf∗ with high qT and low
virtuality Q′2 that decays into a pair of charged leptons. Even if qT ≫ Q′ , there is no large
logarithm of qT /Q′ that spoils perturbation theory, contrary to the inclusive process [Berger
et al., 2002b], because such logarithms are associated with diagrams’ collinear sensitivity,
which require two collinear parton lines to connect the low mass virtual photon to the hard
part, which is suppressed by one power of Q′ /qT compared to the direct photon attachment.
In contrast, the DVMP amplitude has large logarithms of qT /mD , due to the long-distance
evolution of the collinear parton lines. Such logarithms are incorporated by the evolution
equation associated with the factorization formula in Eqs. (3.57) and (3.139).
                                                          197


    For a virtual photon γf∗ with its virtuality Q′ of the same order as qT (but, sufficiently
away from masses of heavy quarkonia), it should belong to the short-distance hard part, and
the whole process becomes e− + h → h′ + 2e− + e+ . This is no longer a 2 → 3 SDHEP-type
process, but we can still relate it to the SDHEP type by considering the kinematic regime
where one of the final-state electrons has a high transverse momentum qT , balanced by the
other e+ e− pair, which also has a large invariant mass Q′ ∼ qT .
    First of all, the γ ∗ -mediated channel at n = 1 is allowed, with the hard scattering
e− + γ ∗ → 2e− + e+ . Second, the n = 2 channel does not unambiguously lead to the double
DVCS (DDVCS) process [Guidal and Vanderhaeghen, 2003] because it is not possible to
distinguish which of the final-state electrons comes from the scattering of the initial-state
                                                                                      
electron. By labeling the final-state electrons and positron as e−             − +
                                                                          1 , e2 , e    , we find that a
                                         
single configuration of e−         − +
                              1 , e2 , e   could correspond to both high-Q′ and low-Q′ processes.
Specifically, let us consider the following three kinematic cases:
                                                                                     √
  (1) All the (e−      − +
                  1 , e2 , e ) have high transverse momenta, of order qT ≫             −t, and the two
      invariant masses (me−1 e+ , me−2 e+ ) are large, of the same order of qT . This case leads
      unambiguously to DDVCS, and the factorization of DVCS can be trivially generalized
      here. But one needs to consider both diagrams with either e−                  −
                                                                           1 or e2 coming from the
      decay of the virtual photon γf∗ .
                                                                                  √
  (2) All the (e−      − +
                 1 , e2 , e ) have high transverse momenta, of order qT ≫            −t, but one of the
      invariant lepton-pair masses, say me−1 e+ , is much less than qT , and the other pair has a
      large invariant mass, i.e., qT ∼ me−2 e+ ≫ me−1 e+ . In this case, one can have (a) (e−          +
                                                                                                   1 ,e )
      comes from the decay of a low-virtuality γf∗ , and (b) (e−        +
                                                                   2 , e ) comes from the decay of
      a high-virtuality γf∗ . While both correspond to the DDVCS processes, it is the case (a)
                                                   198


       with a low-mass electron pair that contributes at a leading power.
                                                               √
  (3) (e−     +
         2 , e ) have high transverse momenta, of order qT ≫     −t, and e−
                                                                          1 has a low transverse
       momentum, much less than qT . Automatically, we have both (me−1 e+ , me−2 e+ ) to be
       large. This gives two different cases: (a) e− 1 comes from the diffraction of the initial-
                                                             ∗
       state electron, which gives out a quasireal photon γee  that scatters with the diffractive
       hadron h and produces a highly virtual photon γf∗ that decays into the (e−     +
                                                                                 2 , e ) pair; (b)
       e−
        2 comes from the hard scattering of the initial-state electron, whose interaction with
       the diffractive hadron h produces a highly virtual photon γf∗ with a high transverse
       momentum, which decays into the (e−        +
                                             1 , e ) pair. Now only the case (b) corresponds to
       the DDVCS process, and case (a) gives a subprocess of (quasi)real photon scattering
       with the hadron, whose factorization will be proved later in Sec. 3.2.3.1. While both
       subprocesses are factorizable, it is the subprocess (a) that gives the leading power
       contribution.
Of course, if the virtual photon γf∗ decays into a lepton pair of other flavors, like a µ+ µ−
pair, then it unambiguously leads to the DDVCS process and can be factorized in the same
way as the DVCS.
     When the γf∗ virtuality Q′ becomes much greater than qT , one starts entering the two-scale
regime. Whether there will be large logarithms of Q′ /qT that requires a new factorization
theorem to be developed is not a trivial problem based on our analysis so far. We leave that
discussion to the future.
     For a heavy quarkonium production, unfortunately, it is not obvious that the factorization
in Sec. 3.2.2.2 can be easily generalized. The key points to the factorization are
   (i) there is a pinch singularity that forces a collinear momentum to have the scaling in
                                               199


       Eq. (3.4), with a leading component and two smaller components;
  (ii) soft gluons can be factorized from the collinear lines.
The exclusive production of a heavy quarkonium naturally has the most contribution from
producing a heavy quark pair with an invariant mass MH ∼ 2mQ , where mQ ≫ ΛQCD
is the heavy quark mass. Since the corresponding heavy quark GPD in h-h′ transition is
suppressed, we do not suffer from the extra region like Fig. 3.15(b). When the transverse
momentum qT of the heavy quarkonium is much greater than mQ , the heavy quark can be
thought of as the active parton line associated with the observed particle D in Fig. 3.15(a),
and the heavy quarkonium is attached to the hard part by a pair of nearly collinear heavy
quark lines, whose momenta scale as
                                            
                            kQ ∼ λ2Q , 1, λQ qT ,   with λQ = mQ /qT ,                   (3.145)
when the heavy quarkonium moves along the minus direction. This pinches the plus mo-
mentum components to be small, and for a soft gluon ks attached to such heavy quark lines,
one may keep only the ks+ component, which allows us to factorize the soft gluon out of the
collinear lines. Hence, for qT ≫ mQ ≫ ΛQCD , one can still factorize the heavy quarkonium
production amplitude into the heavy quarkonium DA, up to the error of O(mQ /qT ). See
[Kang et al., 2014] for a similar discussion of the inclusive production of a heavy quarkonium.
     When mQ ∼ qT ≫ ΛQCD , the error estimated above becomes O(1), which invalidates the
factorization into heavy quarkonium DA. However, if MH /2−mQ ≪ mQ ∼ qT , the formation
of the heavy quarkonium from the produced heavy quark pair might be treated in terms of
the color singlet model [Einhorn and Ellis, 1975; Chang, 1980; Berger and Jones, 1981]
or the velocity expansion of nonrelativistic QCD with color singlet long-distance matrix
                                                200


elements [Bodwin et al., 1995]. For this exclusive production, the soft gluon interaction
from the diffractive hadron with the heavy quark pair at qT ∼ mQ ≫ ΛQCD is expected
to be suppressed by powers of mQ v/qT ∼ v with v being the heavy quark velocity in the
quarkonium’s rest frame. More detailed study for the heavy quarkonium production when
qT ≲ mQ will be presented in a future publication.
3.2.3      SDHEP with a photon beam
For single-diffractive hard exclusive photoproduction processes, we have B = γ. The other
particles C and D can be two elementary particles, one elementary particle and one light
meson, or two light mesons. So we consider the three cases, (1) massive dilepton (CD) =
(l+ l− ) [Berger et al., 2002a; Chatagnon et al., 2021] or diphoton (γγ) production [Pedrak
et al., 2017; Grocholski et al., 2021, 2022], (2) real photon and light meson pair (CD) =
(γMD ) production [Boussarie et al., 2017; Duplančić et al., 2018, 2023a,b; Qiu and Yu,
2023b], and (3) light meson pair (CD) = (MC MD ) production [El Beiyad et al., 2010]. These
are similar to the large-angle photon-meson scattering treated in Sec. 3.1.2. In this section, we
generalize the factorization arguments there to the corresponding single-diffractive processes,
following the same two-stage paradigm as the single-diffractive lepton-hadron scattering in
Sec. 3.2.2.
3.2.3.1     Dilepton or diphoton production: (CD) = (l+ l− ) or (γγ)
Both production processes allow the γ ∗ -mediated n = 1 subprocesses. For the dilepton
production, we have the partonic process γγ ∗ → l+ l− , starting at O(e2 ) in terms of the QED
coupling e, while we have γγ ∗ → γγ for the diphoton production, starting at O(e4 ). Since
                                                                        √ 
this γ ∗ -mediated n = 1 channel has a power enhancement of O qT / −t compared to the
                                              201


n = 2 channel, it cannot be simply neglected even though its scattering amplitude might
require a higher power in QED coupling. A careful quantitive comparison in size between
γ ∗ -mediated n = 1 and GPD-sensitive n = 2 subprocesses is needed in practical evaluation.
                                     q1
                        p2                                p2    q1     q2
                                       q2
                      p                 p′              p                 p′
                              (a)                               (b)
Fig. 3.18: Examples of leading-order diagrams in the n = 2 (GPD) channel for the single-
diffractive hard exclusive photoproduction of massive (a) dilepton and (b) diphoton pro-
cesses.
     For n = 2 channel, these two processes share the same color structure as the DVCS, and
thus the same leading-region graphs in Fig. 3.13 with a proper change of the external lines,
because B, C, and D are all elementary colorless particles. The argument for factorization
into GPDs works in the same way as for the DVCS in Sec. 3.2.2.1 and will not be repeated
here. The process with (CD) = (l+ l− ) happens by producing a timelike photon γ ′∗ in the
exclusive γh → γ ′∗ h′ process followed by the decay γ ′∗ → l+ l− , which is thus called timelike
Compton scattering (TCS), as shown in Fig. 3.18(a). For the process with (CD) = (γγ), all
the three photons couple to the quark lines, as illustrated in Fig. 3.18(b). In both processes,
it is the high qT that provides the hard scale for factorizability, by creating high virtualities
through the invariant mass of the virtual photon in the dilepton case or having the qT flow
through the quark lines in the diphoton case.
     It is important to note that in general, the requirement of a high invariant mass for
the pair of particles (CD) is not the same as requiring a hard qT . This is similar to the
large-angle photon-meson scattering in Sec. 3.1.2.1. For the TCS, it is the invariant mass of
the lepton pair mll that provides the hard scale for the partonic collision, and hence keeping
                                              202


                                       q1                   q1
                                                   p2
                             p2
                                        q2
                                                                 q2
                                 ∗                    γ        γ
                               γ      ′
                            p       p              p              p′
                                 (a)                       (b)
Fig. 3.19: (a) The sample diagram for the γ ∗ -mediated channel of the photoproduction of
a massive lepton pair, where the internal lepton propagator (in red) has a hard virtuality
only when qT is large. (b) At large mll but small qT , the forward scattering diagrams with
two photon exchanges between the diffractive hadron and the quasireal lepton can become
important and compete with the TCS mechanism in Fig. 3.18(a).
mll large is sufficient for TCS to be factorized into GPD, independent of the magnitude
of qT of the observed lepton. However, a hard qT is needed to guarantee the γ ∗ -mediated
n = 1 subprocess γγ ∗ → l+ l− to be a hard scattering process, as illustrated in Fig. 3.19(a).
If qT is too low, then this amplitude introduces another enhancement factor of O(mll /qT ),
                         √
in addition to the mll / −t enhancement of the n = 1 channel, as correctly pointed out
in [Berger et al., 2002a]. Then, this could allow other subprocesses to happen that may
compete with the TCS subprocess in magnitude. For example, one may have an n = 2
channel mediated by f2 = [γγ], as shown in Fig. 3.19(b), which is suppressed by e2 and one
          √
power of    −t/mll compared to the n = 1 channel, but is still one power O(mll /qT ) higher
than the TCS channel. The relative order comparison is then too complicated to be obvious,
and the extraction of GPDs from the TCS amplitude becomes even harder.
                                                  q1
                                           p1
                                                        p2
                                              q2
Fig. 3.20: A sample diagram for the photoproduction of diphoton process at low qT , where
the photon q1 is radiated collinearly by the incoming quark.
    On the other hand, if qT is too low in the diphoton production process, some quark lines
                                              203


could have low virtualities of order qT , as the photons could be radiated from the quark
lines (see Fig. 3.20) almost collinearly, introducing the long-distance physics into the “hard
probe”, which invalidates our factorization arguments.
3.2.3.2    Real photon and light meson pair production: (CD) = (γMD )
For (CD) = (γMD ) with MD being a light meson, the n = 1 channel corresponds to the
subprocess γ ∗ γ → γMD . This is forbidden for a charged meson like π ± , as considered in
[Duplančić et al., 2018], or for a neutral meson with even C-parity, like π 0 , η, etc. In the
high-qT scattering, the n = 1 amplitude can be factorized into the DA of MD .
    The n = 2 channel has the same color structure as the DVMP process in Sec. 3.2.2.2,
and the leading region is also as in Fig. 3.15 just with the proper change of the external
electron lines by photon lines. The argument for factorization then works in the same way,
and is not to be repeated here. For the same reason as the diphoton production process in
the previous subsection, we emphasize the necessity of the hard transverse momentum qT ,
which is not equivalent to requiring a large invariant mass of the γMD pair.
3.2.3.3    Light meson pair production: (CD) = (MC MD )
The single-diffractive photoproduction with (CD) = (MC MD ) differs from the electropro-
duction of a light meson in Sec. 3.2.2.2 by having one more hadron in the final state. This
leads to one more collinear subgraph in another direction but does not make the factorization
proof very different. As for the DVMP, generalizing the proof of the corresponding meson
scattering in Sec. 3.1.2.3 to the diffractive case encounters the trouble of Glauber pinch for
gluons attaching to the diffracted hadron. As a result, we will need to use the asymmetric
contour deformation in Sec. 3.1.2.4.
                                               204


    First, the n = 1 channel is given by the subprocess γ ∗ γ → MC MD , which may or may
not happen depending on the quantum numbers of MC and MD . This was considered first
in [Brodsky and Lepage, 1981]. The t-channel crossing process MA γ → γMD is briefly
discussed in Sec. 3.1.2.2. The time-reversal process MA + MB → γγ was also studied in [Qiu
and Yu, 2022]. Its amplitude can be factorized into the DAs of MC and MD , as a simple
generalization of the factorization proof for the process in Sec. 3.1.1.2.
                                       q1                                 q1
                          S                                             C
                                     C
                     p2                               p2
                              H                                 H         q2
                                          q2                           D
                                     D
                                                                        S
                     p                 p′             p                   p′
                              A                                 A
                              (a)                              (b)
Fig. 3.21: (a) Leading-region graphs for the single-diffractive hard photoproduction of a light
meson pair. (b) is obtained as an intermediate step after factorizing the C-collinear subgraph
out of the hard subgraph H and soft subgraph S.
    For the n = 2 channel, the leading regions is shown in Fig. 3.21(a), where all lines
in the hard part “H” are off shell by order of the hard scale Q ∼ qT , which effectively
makes the contribution from attaching soft gluons to H power suppressed. There could be
additional leading regions in which one or more of the collinear subgraph is connected to
the soft subgraph by one quark or transversely polarized gluon line, while connecting to the
hard subgraph by the other quark or transversely polarized gluon line. Following the same
assumption that such soft end point region is strongly suppressed by the nonperturbative
QCD dynamics from the meson distribution amplitude, we neglect them and consider only
the leading regions in Fig. 3.21(a).
    Extending the factorization of the meson scattering process to the corresponding single-
                                              205


diffractive process is trivial. The only complication arises from the extra DGLAP region
in the single-diffractive channel of the hadron h → h′ , which causes the momentum ks of
the soft gluon coupling to the A-collinear subgraph to be pinched in the Glauber region
for its component ks · wA , as explained in Sec. 3.2.2.2. This makes the use of symmetric
deformation as in Sec. 3.1.2.3 not possible. But the asymmetric deformation strategy in
Sec. 3.1.2.4 applies here with no change, because we never deformed the contour of ks · wA
when ks flows through the A-collinear subgraph. The important step of factorizing the C-
collinear subgraph is shown in Fig. 3.21(b). In the end, the diffractive amplitude is factorized
into the hadron GPD and meson DAs,
                        XZ     1    Z  1
         (2)                                        i
      Mhγ→h′ MC MD   =           dx      dzC dzD Fhh  ′ (x, ξ, t; µ)
                        i,j,k −1     0
                                × Ciγ→jk (x, ξ; zC , zD ; qT , µ) ϕj/C (zC , µ) ϕk/D (zD , µ), (3.146)
up to 1/qT power suppressed terms, where the sum over i, j, and k runs over all possible
flavors and spin structures.
3.2.4      SDHEP with a meson beam
For the SDHEP with a meson beam, we have B being some meson MB , which is usually
a pion or kaon. Similar to the case with a photon beam, we consider three cases for the
particles C and D: (1) massive dilepton (CD) = (l+ l− ) or diphoton (γγ) production; (2)
real photon and light meson pair (CD) = (γMD ) production; and (3) light meson pair
(CD) = (MC MD ) production. The dilepton and diphoton production processes have been
studied in [Berger et al., 2001; Qiu and Yu, 2022], respectively, and their factorizations are
                                                 206


similar to the DVMP process. The processes (2) and (3) have not been considered in the
literature. In this section, we address the factorization of these processes in the framework
of the SDHEP within the two-stage paradigm.
3.2.4.1    Massive dilepton or diphoton production: (CD) = (l+ l− ) or (γγ)
The SDHEPs of massive dilepton and diphoton productions are
                           h(p) + MB (p2 ) → h′ (p′ ) + l− (q1 ) + l+ (q2 ),           (3.147)
and
                            h(p) + MB (p2 ) → h′ (p′ ) + γ(q1 ) + γ(q2 ),              (3.148)
respectively. Both processes have C and D being colorless elementary particles, and they
are similar to the meson production in the SDHEP with a lepton beam in Sec. 3.2.2.2
and the meson-photon pair production in the SDHEP with a photon beam in Sec. 3.2.3.2,
respectively. The difference comes from switching the final-state meson with the initial-
state lepton or photon. The argument for the factorization works in essentially the same
way, with only a slight change due to the meson being in the initial state instead of final
state. In reality, only charged light meson beams such as π ± or K ± are readily accessible
in experiments, so we will consider only those beams. Then charge conservation implies a
flavor change of the diffractive hadron, i.e., h′ ̸= h, which forbids the γ ∗ -mediated n = 1
channel. Therefore, the leading-power contributions to the amplitudes in Eqs. (3.147) and
(3.148) start with the n = 2 channels, which are factorized into the GPDs associated with
the hadron transition h → h′ , as in [Berger et al., 2001; Qiu and Yu, 2022].
    For the process in Eq. (3.147), at the lowest order in QED, the high-qT lepton pair is
                                              207


produced via a timelike photon γll∗ with a high virtuality Q ∼ O(qT ), when it is sufficiently
away from the resonance region of a heavy quarkonium. This process can hence be referred
to as exclusive Drell-Yan process [Berger et al., 2001]. It is this highly virtual photon that
couples directly to the parton lines from the h-MB interaction, whose virtuality Q provides
the hard scale that localizes the parton interactions. This is sufficient for the factorization
argument. Furthermore, due to the lack of γ ∗ -mediated n = 1 subprocess, the requirement of
the high invariant mass for the lepton-pair is a sufficient condition for factorization, allowing
us to release the high qT requirement, which is contrary to the requirement for the lepton-pair
production in the SDHEP with a photon beam, as discussed in Sec. 3.2.3.1.
     In contrast, the process in Eq. (3.148) has the two final-state photons directly couple to
the parton lines, and the hard scale is solely provided by their high transverse momentum
qT , which is both the sufficient and necessary condition for collinear factorization. In the
low-qT regime, one starts to have two widely separated scales in the same process, qT2 ≪ ŝ =
(p−p′ +p2 )2 , just as the photoproduction of diphoton process in Sec. 3.2.3.1, the factorization
for which needs further study.
3.2.4.2     Real photon and light meson pair production: (CD) = (γMD )
Now we consider the process
                            h(p) + MB (p2 ) → h′ (p′ ) + γ(q1 ) + MD (q2 ),               (3.149)
which differs from the photoproduction of a meson pair process in Sec. 3.2.3.3 by switching
the initial-state photon with one of the final-state mesons. The n = 1 channel corresponds to
the subprocess γ ∗ (p1 ) + MB (p2 ) → γ(q1 ) + MD (q2 ), which has been discussed in Sec. 3.1.2.2.
                                                208


Depending on the quantum numbers of MB and MD , this channel may or may not be present.
The amplitude can be factorized into the DAs of MB and MD .
    The amplitude of n = 2 channel can be factorized into a GPD and two DAs, whose proof
can be adapted from Sec. 3.1.2.4 with straightforward modifications: one can first factorize
the D-collinear subgraph and the soft gluons attached to it, and then do the same thing for
B, which is sufficient to complete the proof.
                                                           q1
                                               S
                                                         C
                                      p2
                                          B       H
                                                            q2
                                                         D
                                        p                  p′
                                                 A
Fig. 3.22: Leading-region graphs for the single-diffractive hard mesoproduction of two
mesons. There can be any numbers of soft gluons connecting S to each collinear subgraph.
Depending on the quantum numbers, the quark lines may be replaced by transversely polar-
ized gluon lines. The dots represent arbitrary numbers of longitudinally polarized collinear
gluons.
3.2.4.3    Light meson pair production: (CD) = (MC MD )
Now we consider the process
                          h(p) + MB (p2 ) → h′ (p′ ) + MC (q1 ) + MD (q2 ),             (3.150)
whose corresponding 2 → 2 hard meson scattering is discussed in Sec. 3.1.3.3. The n = 1
channel, γ ∗ (p1 ) + MB (p2 ) → MC (q1 ) + MD (q2 ), which may or may not contribute depending
on the quantum numbers, can be analyzed in the same way as the photon-meson scattering
in Secs. 3.1.2.3 and 3.1.2.4. The n = 2 channel has leading regions shown in Fig. 3.22, under
                                                 209


the assumptions of strong soft-end suppression and a single hard scattering in which all the
parton lines are off shell by the hard scale. Compared to the meson pair photoproduction
process in Sec. 3.2.3.3, there is one more collinear subgraph in the initial state, and factor-
ization works with a simple generalization. In Fig. 3.22 one does not deform the contours
of soft gluon momenta ks for their components ks · wA when they flow in the A-collinear
subgraph. We first factorize C, D, and B from H sequentially, together with the soft gluons
attached to them, and then group the soft gluons into the A-collinear subgraph to complete
the proof in a way similar to Sec. 3.2.3.3. Consequently, the amplitude of the diffractive
process in Eq. (3.150) can be factorized into the GPD and DAs,
                              XZ      1    Z  1
            (2)                                                  i
         MhMB →h′ MC MD   =             dx      dzB dzC dzD Fhh    ′ (x, ξ, t; µ) ϕj/B (zB , µ)
                             i,j,k,l −1     0
                        × Cij→kl (x, ξ; zB , zC , zD ; qT , µ) ϕk/C (zC ; µ) ϕl/D (zD ; µ),     (3.151)
up to 1/qT power suppressed terms, where where the sum over i, j, and k runs over all
possible flavors and spin structures, and the hard coefficient Cij→kl (x, ξ; zB , zC , zD ; qT , µ)
can be calculated as the scattering of two collinear parton pairs i and j into another two
pairs k and l.
3.3      Further discussion on single diffractive processes
In this section, we give a few general remarks on the properties of SDHEPs, and their
factorizability and sensitivities for extracting GPDs.
                                                   210


3.3.1     Two-stage paradigm and factorization
We have presented the arguments to prove the factorization of SDHEPs with different collid-
ing beams and different types of final-state particles. Our proofs follow a unified two-stage
approach by taking advantage of the unique feature of SDHEPs, which can be effectively
                                                                                          √
separated into two stages, as specified in Eqs. (3.111) and (3.112). By requiring qT ≫      −t,
we effectively force the exchanged state A∗ between the single diffractive transition of h → h′
and the hard exclusive 2 → 2 scattering to be a low-mass and long-lived state in comparison
to the timescale ∼ O(1/qT ) of the hard exclusive process, and effectively reduce the SDHEP
into two stages: single diffractive (SD) + hard exclusive (HE) with the quantum interference
                                                            √
between these two subprocesses suppressed by powers of         −t/qT . As emphasized earlier,
requiring large transverse momenta for the final-state particles C and D is not equivalent
                                                         √
to requiring a large invariant mass of them, mCD ≫         −t; the latter does not necessarily
guarantee a hard collision.
    This two-stage paradigm gives a unified picture for the microscopic mechanism of the
SDHEPs, described in Eq. (3.113) and Fig. 3.12. It accounts for the γ ∗ -mediated n = 1
channel in a coherent framework, which is usually regarded as a “byproduct” of the GPD
channel in the literature and can be easily forgotten but which is in fact one power higher
than the GPD channel and should be incorporated unless it is forbidden by some quantum
number conservation.
    Furthermore, this two-stage paradigm leads to a simple methodology for proving factor-
ization of the SDHEPs in Eq. (3.109), in particular, for the n = 2 channel. By treating
the long-lived exchanged state A∗ as a “meson” capturing the quantum number of h → h′
transition, we make the corresponding scattering A∗ + B → C + D effectively a 2 → 2
                                              211


exclusive process with a single hard scale, whose factorization is relatively easier to prove. In
this way, the factorization proof of the SDHEP can focus on its differences from the 2 → 2
hard exclusive process.
    The only difference between the factorization of the 2 → 2 hard exclusive process and
the full SDHEP is that the GPD channel supports both ERBL and DGLAP regions, and a
Glauber pinch can exist for the DGLAP region. However, since we only have one diffractive
hadron, only one component ks ·wA of the soft gluon momentum ks is pinched in the Glauber
region. The factorizability of the corresponding 2 → 2 exclusive process implies that soft
gluons coupling to B, C, and/or D are canceled, which applies equally to the situation
of SDHEPs. The rest of the soft gluons only couple to the diffracted hadron and can be
grouped into the collinear subgraph of the diffractive hadron h → h′ ; see Fig. 3.23 as an
illustration. The factorization of soft gluons leads to the independence among different
collinear subgraphs, and help to establish the factorization of the collinear subgraph of the
diffractive hadron into a universal GPD, and the other collinear subgraphs into universal
meson DAs.
                                C(q1)                                        C(q1)
              S                                            S
                                                                        )
                         )      H                                 A (p 1     H
                                                                   ∗
                   A (p 1
                    ∗
                                      B(p2)                                        B(p2)
                 F                                            F
         h(p)           h′ (p′)    D(q2)             h(p)            h′ (p′)    D(q2)
                          (a)                                         (b)
Fig. 3.23: (a) SDHEP in the general case, with all possible soft gluon connections. (b) The
result of soft cancellation in (a). The cancellation of the soft gluons in the 2 → 2 hard
exclusive scattering implies the same cancellation of the soft gluons that couple to B, C,
and/or D.
                                              212


3.3.2      Assumptions for the exclusive factorization
The keys to collinear factorization are the cancellation of soft subgraphs that connect to
different collinear subgraphs and the factorization of all collinear subgraphs from the infrared-
safe short-distance hard part.
    The first assumption that we made is that the leading active quark lines or transversely
polarized gluon lines from the mesons must be coupled to the hard interaction, but not to the
soft subgraph, for which we effectively assume that we could get an additional suppression
from the expected end point behavior of meson wave function, when one of the active quarks
(or gluons) has a soft momentum, which we have referred to as the soft-end suppression. The
result of this assumption is that, to the leading power, the soft subgraph is only connected to
collinear subgraphs by gluon lines that are longitudinally polarized, for which Ward identity
can be applied to factorize them onto Wilson lines. The soft Wilson lines are only connected
to the rest of the graph by colors, and can be disentangled and factorized from the collinear
subgraphs because the collinear subgraphs are in color singlet states, which is an important
feature of exclusive processes. Consequently, the soft cancellation for the factorization of
SDHEPs is very different from typical soft cancellation for the factorization of inclusive
processes [Collins et al., 1989].
    Another consequence of the soft-end suppression is that we are allowed to constrain the
light-cone parton momenta of the mesons on the real axis and arrive at a definition of meson
DA, ϕ(z) with 0 < z < 1, as argued at the end of Sec. 3.2.2.1.
    This assumption was also applied to most factorizations of exclusive processes involving
high-momentum mesons, notably for the pion form factor and large-angle production pro-
cesses; see the review [Brodsky and Lepage, 1989]. Even though the soft-end region was
                                               213


conjectured to be Sudakov suppressed in [Brodsky and Lepage, 1989], which is more than
the power suppression taken as our assumption, a more extensive discussion on this issue is
still lacking in the literature.
     The second assumption that we implicitly made is that there is only one single hard
interaction in which all the parton lines are effectively off shell by the hard scale. This
applies especially to the mesoproduction of a meson pair process in Secs. 3.1.3.3 and 3.2.4.3.
It is well known that the exclusive hadron-hadron scattering into large-angle hadrons can
happen via multiple hard interactions, which has an enhanced power counting with respect
to the single hard interaction [Landshoff, 1974; Botts and Sterman, 1989]. We have shown
the factorization for the hard exclusive 2 → 2 meson-meson scattering and the correspond-
ing SDHEP with a meson beam for the single hard interaction case. Within the two-stage
paradigm, it is unclear to us whether the factorization of the large-angle meson-meson scat-
tering via multiple hard interactions can imply a corresponding factorization for the SDHEP
with a meson beam; it is left for future study.
     One may also consider representing A∗ as a sum over virtual hadronic states, instead of
the expansion in terms of partonic states like [q q̄ ′ ] and [gg]. However, the exchanged state A∗
in the SDHEP enters a hard collision, which has a resolution scale 1/Q much smaller than the
typical hadronic scale, and therefore it is the partonic degrees of freedom inside the virtual
hadronic state or the diffractive hadron that are probed. For example, the leading-power
contribution from a virtual hadronic state should also be mediated by two active parton
lines, just as in Figs. 3.1(b), 3.3(a), 3.8(a), 3.9(a), and 3.10(a), along with the same short-
distance hard part as the n = 2 partonic channel in connection with GPDs. In principle,
to this power, one should add all the two-parton-mediated contributions from all possible
virtual hadronic states of the same diffractive hadron, which could possibly recover the full
                                                214


contributions from the corresponding GPDs of the same hadron, but, only from their ERBL
region. GPDs also contain the DGLAP region, which cannot be covered by the subprocesses
mediated by virtual hadronic states. The approach of taking out a virtual meson A∗ from
the h → h′ transition, described by some form factor Fh→h   A
                                                               ′ (t), followed by extracting two
parton lines via its distribution amplitude, should also be captured by the GPD of h → h′
transition in a more general sense. The choice to represent A∗ by a single virtual meson state,
like the Sullivan process, is therefore an additional approximation. On the other hand, the
expansion in terms of the number of partons, n, is an expansion in powers of 1/Q.
3.3.3     Non-factorizability of double diffractive processes
From the procedure for proving factorization in the two-stage paradigm, it is easy to under-
stand the importance of the single diffraction for factorizability of the exclusive process. The
whole difficulty from the diffraction is the DGLAP region that pinches one component of the
soft gluon momentum in the Glauber region, and we get away with it by only deforming the
other components associated with other mesons. After factorizing out all the other mesons,
the rest of the soft gluons are only coupled to the diffracted hadron and can be grouped
together into this hadron’s GPDs.
                           N1      (1 − z1 )p1 − ks                  N1′
                           p1    z1 p                                p′1
                                     1 +k                         q1
                                         s
                                        ks              H
                                                      − ks        q2
                            p2                 z 2p 2                p′2
                            N2      (1 − z2 )p2 + ks                 N2′
Fig. 3.24: Diphoton production in a double diffractive hard exclusive scattering process
between two head-on hadrons N1 and N2 along the z axis.
                                                       215


    If we consider the double diffractive process, as shown in Fig. 3.24, the soft gluon ks
exchanged between the remnants along opposite directions is pinched in the Glauber region
for both ks+ and ks− , and thus no deformation can be done to get it out. As a result, this
process cannot be factorized, even if we do have a hard scale provided by the transverse
momentum qT of the final-state photon pair.
    Similar conclusion holds for the inclusive diffractive processes [Soper, 1997; Collins, 1998].
The observation of the diffracted hadron anchors the inclusive sum over the final state
and forbids the use of unitarity to cancel the Glauber gluon exchanges. While the soft
gluon momentum can be deformed out of the Glauber region for single diffractive inclusive
processes [Collins, 1998], in a similar way to the exclusive processes discussed in this thesis, it
does not work for inclusive diffractive hadron-hadron scattering [Landshoff and Polkinghorne,
1971; Henyey and Savit, 1974; Cardy and Winbow, 1974; DeTar et al., 1975; Collins et al.,
1993; Soper, 1997].
    This phenomenon is very similar to the factorization of Drell-Yan process at high twists
[Qiu and Sterman, 1991a,b], where the hadron connected by more than two active partons
to the hard part is analogous to the diffracted hadron here. Even though the extra trans-
versely polarized gluon lines at a high twist may be confused by soft gluons and endangers
factorization, this is still factorizable as one can first factorize soft gluons out of the other
hadron at the leading twist, similar to the procedure for the single diffractive process here
that we first factorize the soft gluons out of the other mesons. This can only be done at the
first subleading twist for which one of the two hadrons still has a twist-2 PDF involved, and
so the Drell-Yan process is not factorizable beyond the first nonvanishing subleading twist,
similar to the nonfactorizability of double diffractive processes.
                                                216


3.3.4     Comparison to high-twist inclusive processes
               x1 + ξ  x1              x2  x2 + ξ         x + x1 x1     x2  x + x2
                             p′  p′
                 p                            p             p                  p
                              (a)                                   (b)
Fig. 3.25: Sample leading-order cut diagrams for (a) DVCS amplitude squared and (b)
inclusive DIS cross section at twist-4. The red thick lines indicate the hard parts, and the
blue lines are collinear partons.
    The factorization of exclusive processes at the amplitude level shares many common fea-
tures with the inclusive process factorization at a high twist. Taking the leading-order DVCS
amplitude as an example, we show the amplitude square as a cut diagram in Fig. 3.25(a),
which is compared with one of the leading-order diagrams of the inclusive DIS at twist-4 in
Fig. 3.25(b). They only differ in that the cut line for the DVCS forces an exclusive final state.
Both diagrams have two collinear parton lines connecting the hadron-collinear subgraph to
the hard part, in both the amplitude to the left of the cut and conjugate amplitude to the
right. In this sense, the DVCS amplitude squared corresponds to a twist-4 contribution
to the cross section of the real photon electroproduction process. On the other hand, the
amplitude squared of the n = 1 channel for the γ ∗ -mediated subprocess corresponds to a
twist-2 contribution (see Fig. 3.26(a)), and the interference between the n = 1 and n = 2
channels corresponds to a twist-3 contribution (see Fig. 3.26(b)).
    In the DVCS amplitude in Fig. 3.25(a), the two partons carry momenta (x1 + ξ)P +
and (x1 − ξ)P + (following the directions indicated by the curved arrow), with x1 integrated
in [−1, 1]. In its conjugate amplitude, the two partons carry momenta (x2 ± ξ)P + with
x2 integrated in the same range. Similarly, for the twist-4 DIS diagram in Fig. 3.25(b),
                                                217


                   2ξ γ ∗           2ξ                2ξ γ ∗           x2  x2 + ξ
                           p′  p′                            p′  p′
                 p                     p            p                         p
                            (a)                                 (b)
Fig. 3.26: Sample cut diagrams of the amplitude squared of the real photon electroproduction
process for (a) the γ ∗ -mediated channel, and (b) the interference between the γ ∗ -mediated
channel and GPD channel. The red thick lines indicate the hard parts, and the blue lines
are collinear partons or photons.
the amplitude part has two collinear partons with momenta (x + x1 )p+ and x1 p+ , with
x1 integrated in [−1, 1 − x]. The conjugate amplitude part has two collinear partons with
momenta (x + x2 )p+ and x2 p+ , with x2 integrated in the same range. In both cases, the
x1 and x2 integrations are not related and to be integrated independently. Only the total
momentum of the two partons, which is 2ξP + for the DVCS and xp+ for the twist-4 DIS, is
observable, whose dependence is probed by the experiment.
    On the other hand, there are soft breakpoint poles of x1 (or x2 ), given by the situations
when one of the two partons has zero momentum, which is x1 = ±ξ for DVCS and x1 = 0
or −x for twist-4 DIS. However, those poles are not pinched and they happen at the middle
part of the x1 integration range. As a result, we can deform the contour of x1 to avoid them,
just as discussed around Eq. (3.136). This situation is contrary to the DA factorization, for
which the soft poles happen at the end points of the DA integration and cannot be deformed
away, which requires us to make the soft-end suppression assumption in Sec. 3.3.2.
                                              218


Chapter 4
Generalized parton distributions
The generalized parton distributions (GPDs) resulting from factorization of single-diffractive
exclusive scattering processes are important nonperturbative parton correlation functions
that reveal many aspects of the confined partonic structures of hadrons. By their universal
operator definitions, GPDs can be studied by themselves. Their values can be obtained by
nonperturbative calculation methods like Lattice QCD [Ji, 2013; Chen et al., 2020; Alexan-
drou et al., 2020; Lin, 2021, 2022; Hashamipour et al., 2023; Bhattacharya et al., 2022a],
which will not be discussed in this thesis, or by fitting to experimental data by virtue of the
factorization theorems discussed in Sec. 3. Nevertheless, the exclusive nature of the GPD
factorization poses substantial challenges for the fitting programs, making the extraction of
GPDs, especially their x dependence, from experimental data, extremely difficult. This is our
focus in this section. First, we will first review some important properties of GPDs, especially
their roles in unveiling the hadron structures. Then we will lay out the phenomenological
framework for the single-diffractive hard exclusive processes (SDHEPs). We will see that the
two-stage paradigm gives a clear description of the azimuthal correlations that arise from
different spin structures of the GPDs. As an illustration, we will discuss the most popular
process, deeply virtual Compton scattering (DVCS), within this framework. Similar to many
other processes, it can probe GPDs only up to a few moments. This information is far from
enough to map out the full x dependence of GPDs. To resolve this issue, following a general
                                                 219


discussion on the x sensitivity of GPD processes, we will introduce a type of processes that
can provide enhanced sensitivity to the x dependence, and demonstrate how well they can
help determine the latter. We will close this section by proposing a global analysis of all
types of observables that can be used for the task of determining GPDs.
4.1      GPD properties
4.1.1     Definitions and spin dependence
As remarked below Eq. (3.134), the GPDs defined in Eq. (3.131) contain full dependence
on the hadron spin states. This dependence shall be separated by decomposing the matrix
elements into independent form factors,
                      Z
                         dy − −ixP + y− ′ ′                               
           q
        F (x, ξ, t) =         e         ⟨p , α |ψ̄q y − /2 γ + ψq −y − /2 |p, α⟩
                          4π
                                                                            
                        1       ′  ′     q          +      q        iσ +α ∆α
                    =      ū(p , α ) H (x, ξ, t)γ − E (x, ξ, t)               u(p, α), (4.1a)
                      2P +                                             2m
                      Z
                         dy − −ixP + y− ′ ′                                  
         e q
        F (x, ξ, t) =         e         ⟨p , α |ψ̄q y − /2 γ + γ5 ψq −y − /2 |p, α⟩
                          4π
                                                                            
                        1       ′  ′   e q          +       e q       γ5 ∆+
                    =      ū(p , α ) H (x, ξ, t)γ γ5 − E (x, ξ, t)            u(p, α), (4.1b)
                      2P +                                              2m
where we take the hadron states as protons for definiteness, and use the kinematic convention
in Eq. (3.128). α and α′ explicitly denote the spin states. The ∆ differs from the usual
convention [Diehl, 2003] by a sign, which has been compensated by the minus sign in front
of E and E   e such that the GPDs are the same. We have dropped the time ordering and
omitted the Wilson lines. The decomposition is done following Lorentz covariance, parity
                                                 220


invariance, and Dirac matrix properties. The same decomposition applies to gluon GPDs,
                     X             Z
                                         dy − −ixP + y− ′ ′ +i −  +j                            
          g
       F (x, ξ, t) =         δ ij
                                             +
                                                e           ⟨p , α |G y /2 G             −y − /2 |p, α⟩
                         i,j            2πP
                                                                                      
                       1          ′    ′      g             +       g         iσ +α ∆α
                   =        ū(p , α ) H (x, ξ, t)γ − E (x, ξ, t)                        u(p, α),           (4.2a)
                     2P +                                                        2m
                     X                   Z
                                              dy − −ixP + y− ′ ′ +i −  +j                           
        e g
       F (x, ξ, t) =         (−iϵT )ij
                                                     e            ⟨p , α |G y /2 G            −y − /2 |p, α⟩
                         i,j                2πP    +
                                                                                    +
                                                                                       
                       1          ′    ′    e (x, ξ, t)γ γ5 − E
                                              g             +        e (x, ξ, t)
                                                                       g         γ5 ∆
                   =        ū(p , α ) H                                                 u(p, α).           (4.2b)
                     2P +                                                         2m
It is the scalar coefficients H q,g , H    e q,g , E q,g , E
                                                           e q,g that are usually referred to as GPDs, which
are constrained to be real functions that are even in ξ. In this thesis we loosely refer to both
these coefficients and F ’s, Fe’s as GPDs. There are also form factor decompositions for the
transversity GPDs, but we will not discuss them in this thesis.
    In the GPD definitions, the parton spin states are dictated by the spinor or tensor
projectors, γ + and γ + γ5 , or δ ij and −iϵij        T , whereas the proton spin structures are selected
by the different form factors. It is, however, not straightforward to quantitatively describe
them. First, the partons in GPDs are not on-shell, but instead we have integrated out
their transverse and minus momentum components (see the discussion above Eq. (3.131)).
Second, the proton states are not both along the z direction, and one can even go to a frame
where both p and p′ are not along the z direction. On the other hand, the parton states
in the hard scattering have been projected to be on-shell along the z direction, and their
spin states can be chosen as the helicities. To unify the whole picture, we now introduce the
concept of light-cone helicity state.
                                                           221


4.1.1.1      Transverse boost and light-cone helicity
A transverse boost Λ(v) is a special Lorentz transformation that takes a momentum k to
k ′µ = Λµ ν (v)k ν by
                                          √ +                     √
                 k ′+ = k + ,  kT′ = kT +   2k v,    k ′− = k − +  2kT · v + k + v 2 ,      (4.3)
where v = (v1 , v2 ) is a transverse vector and v 2 = v12 + v22 . This keeps the plus momentum
invariant but shifts the transverse momentum (the k − transformation is determined by re-
quiring k 2 to be invariant). The transformation matrix Λ(v) can be written in the Cartesian
coordinate system as
                                                                                     
                                  2                  2
                        1 + v /2 v1         v2     v /2            0 v1        v2 0 
                                                                                     
                                                                                     
                         v1
                                       1    0       v1           
                                                                     v1 0         0 v1 
                                                                                        
  Λ(v) = (Λµ ν (v)) =                                      ≃1+                        . (4.4)
                                                                                     
                         v2
                                       0    1       v2    
                                                                    v2 0
                                                                                  0 v2 
                                                                                        
                                                                                     
                                2                       2
                              −v /2 −v1 −v2 1 − v /2                   0 −v1 −v2 0
In the last step, we have taken the small v approximation and thrown away higher-power
terms. This helps identify the transverse boost generators T = (T1 , T2 ) with the usual boost
and rotation generators, K = (K1 , K2 , K3 ) and J = (J1 , J2 , J3 ),
                                    T1 = K1 + J2 ,  T2 = K2 − J1 .                          (4.5)
The transverse boost can then be written as
                                           Λ(v) = e−iT ·v ,                                 (4.6)
                                                 222


                                                                                               
which induces the transverse boost operator Û (v) = exp −iT̂ · v that acts on the quantum
Hilbert space.
     Using Eq. (4.5) and the Poincare algebra, we can get [T1 , T2 ] = 0 and work out their
commutation relations with the momentum operator P̂ µ ,
                                                                √                                      √
                     [T̂ i , P̂ + ] = 0,    [T̂ i , P̂ − ] = − 2 i P̂ i ,      [T̂ i , P̂ j ] = −i δ ij 2 P̂ + ,      (4.7)
where i, j = 1, 2 are the transverse indices, and we take T̂ i = T̂i . By defining P̂ µ (v) =
Û (v)P̂ µ Û −1 (v), we have
                                        ∂ µ                            h            i
                                           P̂    (v)   =   Û (v)(−i)   T̂ i
                                                                             , P̂ µ
                                                                                       Û −1 (v),                     (4.8)
                                      ∂v i
which gives
               ∂ +                        ∂ j                   ij
                                                                   √ +                  ∂ −              √ i
                   P̂ (v) = 0,                P̂   (v)   =   −δ     2P̂ (v),                 P̂ (v)  =  −   2P̂ (v).  (4.9)
              ∂v i                       ∂v i                                          ∂v i
This gives the solution
                                                                √                                  √         
                   Û (v)P̂ µ Û −1 (v) = P̂ + , P̂ − − 2P̂T · v + P̂ + v 2 , P̂T − 2P̂ + v .                        (4.10)
A one-particle state can be specified by its plus and transverse momentum, |k + , kT ⟩, with
its minus momentum component determined as k − = (k 2 + kT2 )/(2k + ). After acting on it a
transverse boost operation, we have
      P̂ µ Û (v)|k + , kT ⟩ = Û (v)P̂ µ (−v)|k + , kT ⟩
                                                                  223


                                             √                          √          
                           = k + , k − + 2kT · v + k + v 2 , kT + 2k + v Û (v)|k + , kT ⟩.    (4.11)
That is,
                                                                     √
                                     Û (v)|k + , kT ⟩ = |k + , kT +   2k + v⟩                 (4.12)
realizes the same momentum transformation as in Eq. (4.3). Together with the normalization
                     ⟨k + , kT |k ′+ , kT′ ⟩ = (2π)3 (2k + ) δ(k + − k ′+ ) δ (2) (kT − kT′ ), (4.13)
Eq. (4.12) facilitates the unitary representation of Û (v) in the Hilbert space. The helicity
quantum number will be specified in the following.
     The usual helicity state |k, λ⟩ is defined by transforming from the basic reference state
|k0 ẑ, λ⟩ by first boosting along the z direction such that it has the same energy as |k, λ⟩,
and then rotating around the y and z axes to reach the momentum k (see Ch. 7 for details),
                             |k, λ⟩ ≡ U (Rz (ϕ))U (Ry (θ))U (Λz (β))|k0 ẑ, λ⟩,                (4.14)
where θ and ϕ are the polar and azimuthal angles of k, respectively. For such helicity state,
the spin quantization axis is the momentum direction, which transforms as we rotate the
momentum. A rotation around the z axis thus transforms |k, λ⟩ in a trivial way (see the
discussion below Eq. (7.38) in Ch. 7),
                                        U (Rz (α))|k, λ⟩ = |Rz (α)k, λ⟩,                       (4.15)
without any phase signifying a spin component along the z direction.
                                                        224


     In a similar way, we define the light-cone helicity state |k + , kT , λ⟩ by transforming from
the basic reference state |k0+ , 0T , λ⟩. First boost along z to reach the plus momentum k + .
                                                              √
Then perform a transverse boost with v = kT /( 2k + ), that is,
                                                                 
                                                            k
                                     +
                                  |k , kT , λ⟩ = U        √ T        |k + , 0T , λ⟩.                  (4.16)
                                                            2k +
This applies to both massless and massive particle states.1 Since the transverse boosts form
an Abelian subgroup, acting a transverse boost on |k + , kT , λ⟩ only changes the momentum
component kT , but keeps k + and λ invariant. Also, by noting [T , K3 ] = iT and thus
                                            e−iK3 β T eiK3 β = e−β T ,                                (4.17)
the light-cone helicity state transforms under a longitudinal boost as
                                                                         
                     e−iK3 β |k + , kT , λ⟩ = e−iK3 β e−iT ·v eiK3 β e−iK3 β |k + , 0T , λ⟩
                                                        −β v)
                                             = e−iT ·(e       |eβ k + , 0T , λ⟩
                                             = |eβ k + , kT , λ⟩,                                     (4.18)
which keeps the light-cone helicity invariant. Similarly, under a rotation around the z direc-
tion, the state transforms as
                                                                         
                      e−iJ3 α |k + , kT , λ⟩ = e−iJ3 α e−iT ·v eiJ3 α e−iJ3 α |k + , 0T , λ⟩
                                             = e−iλα e−iT ·[Rz (α)v] |k + , 0T , λ⟩
   1
     The only exception is for massless states moving along the −z direction, which have zero plus momentum
so cannot be achieved by Eq. (4.16).
                                                        225


                                            = e−iλα |k + , Rz (α)kT , λ⟩,                       (4.19)
which is obtained by using [J3 , T1 ] = iT2 , [J3 , T2 ] = −iT1 and
                                        e−iJ3 α T eiJ3 α = Rz−1 (α) T .                         (4.20)
Eqs. (4.18) and (4.19) establish the light-cone helicity λ with a physical interpretation of the
spin component along the z direction.
     We note the difference between the light-cone helicity state and the canonical spin state.
The latter only applies to a massive state and is defined by boosting the basic reference state
|0, s⟩ along the momentum direction,
        |k, j, s⟩ = e−iK·β |0, j, s⟩ = U (R3 (ϕ)R2 (θ))e−iK3 β U −1 (R3 (ϕ)R2 (θ))|0, j, s⟩,    (4.21)
where k is along the (θ, ϕ) direction, β = k/k 0 and β = |β|. This is related to the helicity
state by
                                                       X
                                    |k, j, s⟩ = eisϕ        djsλ (θ) |k, j, λ⟩.                 (4.22)
                                                         λ
Under a rotation around z, this also transforms as |k, j, s⟩ → e−isα |k, j, s⟩. But under a
boost along z, s is not kept invariant. By using Eqs. (7.41) and (7.42), it transforms into
                                              X
                  U (Λz (β))|k, j, s⟩ = eisϕ        djsλ (θ) |Λz (β)k, j, λ′ ⟩ djλ′ λ (χ(β, k))
                                               λ,λ′
                               X                         
                        = eisϕ      djsλ θ − χ(β, k) |Λz (β)k, j, λ⟩
                                λ
                          Xh                                                ′
                                                                                i
                        =       eisϕ djss′ θ − χ(β, k) − θ′ (Λk) e−is ϕ |Λz (β)k, j, s′ ⟩
                           s′
                                                       226


                       X                                                        
                     =     |Λz (β)k, j, s′ ⟩ Dsj′ s ϕ, θ′ (Λk) − θ + χ(β, k), ϕ          (4.23)
                        s′
where in the third step we used the inverse of Eq. (4.22). In Eq. (4.23), Λz (β) does not
change the azimuthal angle of k, but changes its polar angle to θ′ (Λk). This can be easily
verified not to equal θ − χ(β, k). Therefore, the canonical spin component s shall only be
interpreted as the spin component along the z direction in the rest frame, but not in the
boosted frame.
4.1.1.2    Light-front quantization
One may designate the light-cone helicity states defined in Eq. (4.16) into field decomposi-
tion and define single-particle creation and annihilation operators. For a fermion field, the
amplitude for annihilating a state at some space-time point x is given by
                            ⟨0|ψ(x)|k + , kT , λ⟩ = uλ (k + , kT )e−ik·x ,               (4.24)
which defines the spinor uλ (k + , kT ) associated with this state. By using the definition in
Eq. (4.16), the left-hand side of Eq. (4.24) becomes
                ⟨0|ψ(x)U (v)|k + , 0T , λ⟩ = ⟨0|U −1 (v) ψ(x) U (v)|k + , 0T , λ⟩
                                               
                      = S(v)⟨0|ψ Λ−1 (v)x |k + , 0T , λ⟩ = S(v)uλ (k + , 0T )e−ik·x ,    (4.25)
which therefore gives the definition for the spinor, in a similar way to the state definition,
                                 uλ (k + , kT ) = S(v) uλ (k + , 0T ).                   (4.26)
                                                   227


Here S(v) is the Lorentz group representation for the Dirac spinor associated with the
transverse boost Λ(v). It can be easily solved by using the generator definitions in Eq. (4.5),
and gives the explicit spinor definitions,
                                                                                                             
                                                                                                         kT e−iϕ
                                        m 1                                                    − √2k+ 
                                        √2k+  
                                                       
                                                                                                   
                                                                                                                  
                       h√        i1/2               0                              h√       i1/2         1        
                                                                                                                  
    u+ (k + , kT ) =        2k +                     ,      u− (k + , kT ) =         2k +                 ,
                                                                                                                  
                                        1                                                                  0  
                                                                                                  √m              
                                                                                                2k+           
                                              k√T eiϕ
                                                 2k+
                                                                                                                 1
                                                                                                                      (4.27)
where we took kT = kT (cos ϕ, sin ϕ). The spinors for antiparticles can be obtained in the
same way, or by simply using charge conjugation relation vλ (k + , kT ) = iγ 2 u∗λ (k + , kT ),
                                                                                                         
                                              kT  e−iϕ
                                          √2k+                                                   m 1 
                                                                                                 √2k+   
                                                                                                                  
                         h√       i1/2          −1                                 h√        i1/2             0   
                                                                                                                  
     v+ (k + , kT ) =       2k +                      ,      v− (k + , kT ) =         2k +                    .
                                                                                                                  
                                                     0                                             −1 
                                         √ m                                                                    
                                         2k+                                                                 
                                                                                                                iϕ
                                                      1                                                 − k√T2ke +
                                                                                                                      (4.28)
It is straightforward to verify that the light-cone helicity spinors satisfy the usual normal-
ization relations,
               ūλ (k + , kT ) uλ′ (k + , kT ) = 2mδλλ′ ,       v̄λ (k + , kT ) vλ′ (k + , kT ) = −2mδλλ′ ,
               u†λ (k + , kT ) uλ′ (k + , kT ) = 2Eδλλ′ ,       vλ† (k + , kT ) vλ′ (k + , kT ) = 2Eδλλ′ ,
               ūλ (k + , kT ) γ µ uλ′ (k + , kT ) = v̄λ (k + , kT ) γ µ vλ′ (k + , kT ) = 2k µ δλλ′ ,                (4.29)
                                                            228


                          √
with E = (k + + k − )/ 2, the orthogonality,
                        ūλ (k + , kT ) vλ′ (k + , kT ) = v̄λ (k + , kT ) uλ′ (k + , kT ) = 0,           (4.30)
and the sum rules,
       X                                                   X
           uλ (k + , kT ) ūλ (k + , kT ) = k/ + m,             vλ (k + , kT ) v̄λ (k + , kT ) = k/ − m. (4.31)
        λ                                                   λ
   The same procedure can apply to a vector particle state, which is annihilated by the
vector field by
                                  ⟨0|Aµ (x)|k + , kT , λ⟩ = ϵµλ (k + , kT )e−ik·x ,                      (4.32)
where the polarization vector ϵµλ (k + , kT ) is defined by the transverse boost in a similar way
to Eq. (4.26),
                                       ϵµλ (k + , kT ) = Λµ ν (v) ϵνλ (k + , 0T ).                       (4.33)
For massless vector bosons, the basic polarization vector is transverse, ϵµλ (k + , 0T ) = (0, ϵT , 0),
where ϵT = (ϵ1T , ϵ2T ) is the transverse part. With the help of Eq. (4.4), this then gives the
polarization vector for a general momentum,
                                                                                               
                                        kT · ϵ T           k T · ϵT            + kT · ϵ T
                 ϵµλ (k + , kT )  =      √         , ϵT , − √          = 0 ,                 , ϵT     ,  (4.34)
                                            2k +              2k +                    k+           lc
where the last expression is in light-front coordinates. We note that it has the same transverse
component as the basic vector ϵµλ (k + , 0T ).
   With the fixed definitions of the spinors and polarization vectors for arbitrary momenta,
one can decompose the fields in terms of the light-cone helicity state creation and annihilation
                                                           229


operators. For a fermion field, one has
          XZ         dk + d2 kT  +                                                                               
  ψ(x) =                  √          b(k , kT , λ) uλ (k + , kT ) e−ik·x + d† (k + , kT , λ) vλ (k + , kT ) eik·x ,
           λ
                   (2π)3 2k +
                                                                                                               (4.35)
where b(k + , kT , λ) and d(k + , kT , λ) respectively annihilate a fermion and an anti-fermion
with momentum (k + , kT ) and light-cone helicity λ, and the integration of k + is from 0 to
∞. And for a massless vector field, one has
           XZ        dk + d2 kT  +                                                                                
   µ
 A (x) =                   √         a(k , kT , λ) ϵµλ (k + , kT ) e−ik·x + a† (k + , kT , λ) ϵµ∗
                                                                                               λ  (k + , kT ) eik·x ,
            λ
                    (2π)3 2k +
                                                                                                               (4.36)
where λ = ±1, and we have taken A to be a Hermitian field, as is the case for photons
and gluons. The operators a(k + , kT , λ) and a† (k + , kT , λ) respectively annihilate and create
a vector boson with momentum (k + , kT ) and light-cone helicity λ. Then Eqs. (4.24) and
(4.32) can be realized by Eqs. (4.35) and (4.36) provided the single-particle state definitions,
                                                        √
                                   |k + , kT , λ⟩ =       2k + a† (k + , kT , λ)|0⟩,                           (4.37)
etc., and the commutation relations,
                                                     
                a(k + , kT , λ), a† (k ′+ , kT′ , λ′ ) = (2π)3 δ(k + − k ′+ ) δ (2) (kT − kT′ ) δλλ′ ,         (4.38)
and similar anticommutation relations for the fermion operators.
    Nevertheless, when converting such commutators among the creation and annihilation
operators into the canonical commutators or anticommutators among the fields, one does not
get the “naturally conjectured” equal-x+ commutation relations, but instead, at the last step
                                                           230


                R                       R
one has to use    dk + d2 kT /2k + =       d3 k/2Ek to get the equal-time commutation relations.
The existence of nonzero kT and mass m in the spinors and polarization vectors forbids the
derivation of an equal-x+ commutation relation. The use of light-cone helicity states does
not embed itself into a simply covariant formalism.
    To overcome this problem, we introduce the light-front quantization. Instead of tak-
ing equal-time commutation relations plus time evolutions, one take, right in the beginning,
equal-x+ commutation relations, and evolve everything with respect to x+ , under the “Hamil-
tonian” P+ ,
                                        ∂
                                    i       O(x+ ) = [O(x+ ), P+ ].                       (4.39)
                                      ∂x+
Then one shall immediately notice from the QCD Lagrangian that there are some “bad”
field components that are non-evolving and dynamically dependent on other components,
and some “good” dynamically independent field components. In the light-cone gauge A+ = 0,
the good fields components are
                  γ −γ +                            γ +γ −
        ψG (x) =         ψ(x),    ψ̄G (x) = ψ̄(x)          ,   A⊥ (x) = (A1 (x), A2 (x)), (4.40)
                     2                                 2
where the color indices are omitted.
    The field decompositions for the good fields are particularly simple. We notice that the
spinor projector γ − γ + /2 takes all the spinors in Eqs. (4.27) and (4.28) to their lightlike
versions,
                         γ −γ +
                                uλ (k + , kT ; m) = uλ (k + , 0T ; 0) ≡ uλ (k + ),
                            2
                         γ −γ +
                                vλ (k + , kT ; m) = vλ (k + , 0T ; 0) ≡ vλ (k + ).        (4.41)
                            2
                                                   231


Similarly, the transverse components of the polarization vectors are reduced to ϵµλ (k + , 0T ),
for both massless and massive vector particles. Then, the decompositions in Eqs. (4.35) and
(4.36) become,
            XZ       dk + d2 kT  +                                                                     
  ψG (x) =                √         b(k , kT , λ) uλ (k + ) e−ik·x + d† (k + , kT , λ) vλ (k + ) eik·x , (4.42a)
            λ
                    (2π)3 2k +
           XZ        dk + d2 kT  +                                                                     
    i
  A (x) =                 √         a(k , kT , λ) ϵiλ (k + ) e−ik·x + a† (k + , kT , λ) ϵi∗
                                                                                         λ  (k + ) eik·x , (4.42b)
            λ
                    (2π)3 2k +
where i = 1, 2 and x = (0+ , x− , xT ) is at the zero light-front time. Now there is no place for
k − to come in, and the good field components satisfy the equal-x+ commutation relations,
                                                        γ−
           ψG (x+ , x− , xT ), ψ̄G (x+ , x′− , x′T ) =       δ(x− − x′− ) δ (2) (xT − x′T ),                (4.43)
                                                          2
          i + −                                       i
          A (x , x , xT ), ∂− Aj (x+ , x′− , x′T ) = δ ij δ(x− − x′− ) δ (2) (xT − x′T ).                   (4.44)
                                                          2
4.1.1.3    Parton spin structure
The parton spin structure is best understood in the light cone gauge A+ = 0. The presence
of the Wilson lines in the covariant gauge obscures the parton picture.
    In the quark GPD definitions [Eq. (4.1)], the quark fields sandwich a γ + matrix. Since
                                                                    
                                        +        γ +γ −     +   γ −γ +
                                      γ =                 γ               ,                                 (4.45)
                                                    2              2
both the quark and antiquark fields are projected to be the good field components,
                                     ψ̄γ + (1, γ5 )ψ = ψ̄G γ + (1, γ5 )ψG .                                 (4.46)
                                                       232


Similarly, in the light-cone gauge, the gluon fields in the gluon GPD definitions [Eq. (4.2)]
only have the transverse components, so are also good field components. Thus we can
decompose the fields according to Eq. (4.42), with the partons interpreted as carrying light-
cone helicities. Note that in this picture, the creation and annihilation operators are for
on-shell partons, which may or may not be massless but whose light-cone helicity states are
the same as the massless parton helicity states moving along the z direction.
   Inserting Eq. (4.42a) into Eq. (4.1), the light-cone operators can be expanded as,
     Z
        dy − −ixP + y−
             e         ψ̄q (y − /2) γ + (1, γ5 ) ψq (−y − /2)
         4π
          X Z dk + dk ′+ d2 kT d2 k′
        =                        6
                                          T
                                            θ(k + )θ(k ′+ )
           λλ′
                           (2π)
                                                                                           
               × b† (k ′+ , kT′ , λ′ ) b(k + , kT , λ) δλ′ λ , σλ3′ λ δ 2xP + − (k + + k ′+ )
                                                                                                
                 + d(k ′+ , kT′ , λ′ ) d† (k + , kT , λ) δλ′ λ , −σλ3′ λ δ 2xP + + (k + + k ′+ )
                                                                                                     
                 + b† (k ′+ , kT′ , λ′ ) d† (k + , kT , λ) −σλ1′ λ , −iσλ2′ λ δ 2xP + + (k + − k ′+ )
                                                                                               
                 +d(k ′+ , kT′ , λ′ ) b(k + , kT , λ) −σλ1′ λ , iσλ2′ λ δ 2xP + − (k + − k ′+ ) ,       (4.47)
where we used the explicit results for the light-cone helicity spinor algebra,
                                                            √                     
                     ūλ′ (k ′+ )γ + (1, γ5 )uλ (k + ) = 2 k + k ′+ δλ′ λ , σλ3′ λ ,
                                                            √                        
                      v̄λ′ (k ′+ )γ + (1, γ5 )vλ (k + ) = 2 k + k ′+ δλ′ λ , −σλ3′ λ ,
                                                            √                           
                      ūλ′ (k ′+ )γ + (1, γ5 )vλ (k + ) = 2 k + k ′+ −σλ1′ λ , −iσλ2′ λ ,
                                                            √                         
                      v̄λ′ (k ′+ )γ + (1, γ5 )uλ (k + ) = 2 k + k ′+ −σλ1′ λ , iσλ2′ λ ,                (4.48)
with abuse of the Pauli matrix notations as in Eq. (3.126). Now in each term of Eq. (4.47),
                                                         233


we label the momentum k + as (x + ξ)P + when it corresponds to an annihilated quark
or −(x + ξ)P + when it corresponds to a created antiquark. The momentum k ′+ can be
determined by the δ-function. This gives
   Z
      dy − −ixP + y−
           e         ψ̄q (y − /2) γ + (1, γ5 ) ψq (−y − /2)
       4π
      X        Z
             +    dxd2 kT d2 kT′
    =      P
       λλ′
                       (2π)6
                                                                    
        × b† (x − ξ)P + , kT′ , λ′ b (x + ξ)P + , kT , λ δλ′ λ , σλ3′ λ θ(x + ξ)θ(x − ξ)
                                                                            
           + d† − (x + ξ)P + , kT′ , λ′ d (ξ − x)P + , kT , λ −δλ′ λ , σλ3′ λ θ(ξ − x)θ(−x − ξ)
                                                                               
           + b† (x − ξ)P + , kT′ , λ′ d† − (x + ξ)P + , kT , λ −σλ1′ λ , −iσλ2′ λ θ(x − ξ)θ(−x − ξ)
                                                                                         
           +d (ξ − x)P + , kT′ , λ′ b (x + ξ)P + , kT , λ −σλ1′ λ , iσλ2′ λ θ(ξ − x)θ(x + ξ) , (4.49)
where the θ-functions arise from the constraints k + > 0 and k ′+ > 0. In the second term,
we used dd† = −d† d to reverse the operator order and relabeled the momenta and helicities.
This results in an extra minus sign to the unpolarized antiquark GPD F q̄ but the correct
sign for the polarized antiquark GPD Feq̄ to have the helicity polarization interpretation.
Depending on the sign of ξ, only three terms of Eq. (4.49) can survive. For the case ξ > 0,
the term b† d† that corresponds to the creation of a [q q̄] pair is not allowed. For the remaining
three terms,
    • when x > ξ, it is the b† b term that annihilates a quark with light-cone momentum
      fraction (x + ξ) and helicity λ, and then inserts back a quark with with light-cone
      momentum fraction (x − ξ) and helicity λ′ = λ. The GPD F q simply adds the two
      helicity states so is unpolarized, whereas Feq takes the difference and thus corresponds
      to the helicity polarization;
                                                       234


    • when x < −ξ, it is the d† d term that annihilates an antiquark with light-cone mo-
      mentum fraction (ξ − x) and helicity λ, and then inserts back an antiquark with with
      light-cone momentum fraction (−ξ−x) and helicity λ′ = λ. Similarly, F q is unpolarized
      and Feq is helicity polarized;
    • when −ξ < x < ξ, it is the db term that annihilates a pair of quark and antiquark, with
      light-cone momentum fractions (ξ ± x) and helicities (λ, λ′ ) respectively. Due to the σ 1
      or σ 2 structure, we either have (λ, λ′ ) = (+, −) or (−, +) so that the |qλ q̄λ′ ⟩ state has
      zero net helicity. The GPD F q adds the two helicity configurations, |q+ q̄− ⟩ + |q− q̄+ ⟩,
      so is unpolarized, while Feq takes the difference, |q+ q̄− ⟩ − |q− q̄+ ⟩, and is polarized.
When inserting Eq. (4.49) between the hadron states ⟨p′ |·|p⟩, momentum conservation yields
two more δ-functions for x and kT′ to kill the corresponding integrations, leaving us with
only kT integration. It also constrains x within [−1, 1], as argued around Eq. (3.134).
    Due to the different physical interpretations of GPDs at different x, we call the region
with ξ < |x| < 1 DGLAP region, which contains two subregions, one for quark and the other
for antiquark, and the region with |x| < ξ ERBL region.
    The gluon GPDs have similar decompositions as Eq. (4.49), which we will not repeat
here but refer to [Diehl, 2003] for details.
4.1.1.4    Proton spin structure
In Eqs. (4.1) and (4.2), each GPD is defined for a certain parton spin structure. The form
factor decomposition for each GPD corresponds to different proton spin structure. Following
the discussion above Sec. 4.1.1.1, we also describe the proton spin using light-cone helicity
states. With the notation Γα,α′ = (2P + )−1 ū(p′ , α′ )Γu(p, α) and using the explicit spinor
                                                235


forms in Eq. (4.27), we have
                         +                    p             +            
                         γ ++ = γ + −− = 1 − ξ 2 ,               γ +− = γ + −+ = 0,
                                              p             +                   
                γ + γ5  ++
                           = − γ + γ5 −− = 1 − ξ 2 ,             γ γ5 +− = γ + γ5 −+ = 0,                (4.50)
for the helicity non-flipping structures, and
                                                        
                         −iσ +α ∆α             −iσ +α ∆α              −ξ 2
                                          =                      =p            ,
                           2m         ++            2m       −−        1 − ξ2
                                                           ∗                 √
                         −iσ +α ∆α                 −iσ +α ∆α              iϕ∆     t0 − t
                                          =−                       = −e ·                   ,
                           2m         +−              2m       −+                  2m
                                                     
                         −γ5 ∆+                −γ5 ∆+               −ξ 2
                                       =−                     =p            ,
                          2m ++                   2m −−             1 − ξ2
                                                  ∗                  √
                         −γ5 ∆+             −γ5 ∆+                 iϕ∆      t0 − t
                                       =                   = −ξ e ·                ,                     (4.51)
                          2m +−                2m −+                        2m
for the helicity flipping structures, where t0 = −4ξ 2 m2 /(1 − ξ 2 ) is the maximum value of t
at a given ξ. Here we describe the diffraction of p → p′ by using the azimuthal angle ϕ∆ of
∆. Making explicit the proton helicity labels in the GPDs in Eqs. (4.1) and (4.2) as Fαα′
and Feαα′ , we have
                     p                     2
                                                                                           √
                                          ξ                              ∗          iϕ∆        t0 − t
    F++  = F−− = 1 − ξ 2 H −                     E ,       F+− =    −F−+      = −e        ·           E,
                                       1 − ξ2                                                  2m
                        p                     2
                                                                                           √
                                             ξ      e ,                                       t0 − t e
    Fe++ = −Fe−− = 1 − ξ H      2   e−             E       Fe+− = Fe−+∗
                                                                           = −ξeiϕ∆     ·            E,  (4.52)
                                          1 − ξ2                                              2m
which applies to both quark and gluon GPDs.
    In this way, the GPDs H and H       e are associated with proton helicity non-flipping channels,
whereas the GPDs E and E       e are with proton helicity flipping ones. Since we are dealing with
                                                      236


parton helicity non-flipping GPDs, the proton helicity flipping breaks the light-cone angular
momentum conservation by one unit. This is compensated by the linear power of a nonzero
∆T , which is described in the lab frame by the phase eiϕ∆ and the factor
                                                   s
                                     √                 1+ξ
                                        t0 − t =              ∆T .                                  (4.53)
                                                       1−ξ
    The information contained in the GPDs E and E             e can only be probed by the exclusive
diffractive processes. In most experiments, the diffracted proton spin is not observed. If we
only consider the GPD channels, then the unpolarized proton scattering cross section will
depend on E (or E)   e through their squares, or their interference with H and H,             e which is,
however, suppressed by the small ξ 2 . By having the initial-state proton to be transversely
polarized, the associated azimuthal asymmetry observables will have leading dependence on
E (or E)e through its product with the H (or H)          e GPDs, as we will see in Secs. 4.6 and
4.7. If we also have the γ ∗ -mediated channel at n = 1, like the Bethe-Heitler process, its
interference with the GPD channels can also offer linear dependence on both H (or H)              e and
       e as we will see in Sec. 4.3.
E (or E),
4.1.2      Moments and sum rules
Because GPDs only have support in x ∈ [−1, 1], taking the x moments converts them into
the matrix elements of local twist-2 operators,
     Z 1                     Z ∞
              n   q
         dx x F (x, ξ, t) =       dxxn F q (x, ξ, t)
      −1                      −∞
             Z  ∞     Z                n               
                        dy −
                                 i ∂           −ixP + y −
                                                                                              
          =        dx                        e              ⟨p′ , α′ |ψ̄q y − /2 γ + ψq −y − /2 |p, α⟩
              −∞         4π     P + ∂y −
                                                  237


              Z       Z   ∞                                           n
                 dy −                 −ixP + y −       −i ∂                                                          
          =                   dx e                       +          −
                                                                               ⟨p′ , α′ |ψ̄q y − /2 γ + ψq −y − /2 |p, α⟩
                  4π    −∞                            P ∂y
              Z                                        n
                 dy −              + −           1               ′      ′             −
                                                                                             + ↔+ n                
          =           (2π)δ(P y )                 +
                                                             ⟨p    , α    |ψ̄  q    y   /2   γ (i ∂ ) ψq −y − /2 |p, α⟩
                  4π                           P
                   1         ′     ′            +
                                                        ↔
          =       +  n+1
                          ⟨p   , α   |ψ̄q (0) γ      (i ∂ + )n ψq (0) |p, α⟩,                                             (4.54)
              2(P )
       ↔        →      ←                                                                             ↔      →     ←
where ∂ + = ( ∂ + − ∂ + )/2 will become the covariant derivative D+ = (D+ − D+ )/2 once the
Wilson line is included. Similar relations apply to the other GPDs,
    Z  1
                                            1                                                ↔
          dx xn Feq (x, ξ, t) =             +  n+1
                                                      ⟨p ′
                                                           , α ′
                                                                 | ψ̄ q  (0)     γ +
                                                                                      γ 5 (i D + n
                                                                                                ) ψq (0) |p, α⟩,
      −1                               2(P )
    Z  1
                                              1                                             ↔
          dx xn−1 F g (x, ξ, t) =                      δ ij ⟨p′ , α′ |G+i (0) (iD+            A)
                                                                                                  n−1 +j
                                                                                                     G (0) |p, α⟩,
      −1                                 (P + )n+1
    Z  1
                                              1                                                   ↔
          dx xn−1 Feg (x, ξ, t) =                      (−iϵ     ij
                                                                T   )⟨p    ′
                                                                             , α ′
                                                                                   |G  +i
                                                                                           (0) (i D + n−1 +j
                                                                                                    A)   G (0) |p, α⟩,    (4.55)
      −1                                 (P + )n+1
where we weight the gluon GPDs by xn−1 instead of xn such that the local twist-2 operators
have spin (n + 1), similar to the quark case.
   The off-forward matrix elements of the twist-2 operators can be decomposed into inde-
pendent form factors based on Lorentz covariance, parity, and time reversal symmetries.
Taking + for all the Lorentz indices then leads to important polynomiality properties for the
GPDs,
         Z  1                                Xn
              dx xn H q (x, ξ, t) =                  (2ξ)i Aqn+1,i (t) + mod(n, 2)(2ξ)n+1 Cn+1                q
                                                                                                                 (t),   (4.56a)
           −1                              i=0,2,···
         Z  1                               X n
                                                                   q                                          q
               dx xn E q (x, ξ, t) =                (2ξ)i Bn+1,i             (t) − mod(n, 2)(2ξ)n+1 Cn+1         (t),   (4.56b)
           −1                             i=0,2,···
      Z  1                                   Xn
           dx xn−1 H g (x, ξ, t) =                   (2ξ)i Agn+1,i (t) + mod(n, 2)(2ξ)n+1 Cn+1                g
                                                                                                                 (t),   (4.56c)
       −1                                  i=0,2,···
                                                                     238


       Z 1                              Xn
                                                         g                                      g
           dx xn−1 E g (x, ξ, t) =              (2ξ)i Bn+1,i   (t) − mod(n, 2)(2ξ)n+1 Cn+1        (t), (4.56d)
        −1                            i=0,2,···
for the unpolarized GPDs, and
                               Z  1                               Xn
                                     dx xn H e q (x, ξ, t) =                    eq (t),
                                                                          (2ξ)i A                      (4.57a)
                                                                                  n+1,i
                                 −1                             i=0,2,···
                               Z   1                              Xn
                                     dx xn E e q (x, ξ, t) =                    e q (t),
                                                                          (2ξ)i B                      (4.57b)
                                                                                  n+1,i
                                 −1                             i=0,2,···
                           Z   1                                  Xn
                                 dx xn−1 H   e g (x, ξ, t) =                    eg (t),
                                                                          (2ξ)i A                      (4.57c)
                                                                                  n+1,i
                             −1                                 i=0,2,···
                            Z  1                                  Xn
                                  dx xn−1 E  e g (x, ξ, t) =                    e g (t).
                                                                          (2ξ)i B                      (4.57d)
                                                                                  n+1,i
                             −1                                 i=0,2,···
That is, the x moments of GPDs reduce to even polynomials of ξ. For unpolarized GPDs,
the maximum power of ξ is equal to the spin of the corresponding twist-2 operator, whereas
for polarized GPDs, it is the spin minus 1.
    The low-order moments are related to the matrix elements of physical currents that can
be probed in experiments. For n = 0, the twist-2 quark operators become the electric and
axial currents,
                                      Jqµ = ψ̄q γ µ ψq ,     Jq5µ = ψ̄γ µ γ5 ψ,                         (4.58)
which can be accessed experimentally through electromagnetic and weak interactions, re-
spectively, giving the Dirac and Pauli form factors,
                                                                                       
                                                                             i σ µα ∆α
                     ⟨p ′
                          |Jqµ (0)|p⟩   = ū(p ) ′
                                                     F1q (t) γ µ −  F2q (t)               u(p),         (4.59)
                                                                                2m
                                                          239


and the axial and pseudoscalar form factors,
                                                                                             
                                                                                       γ5 ∆µ
                        ⟨p ′
                             |Jq5µ (0)|p⟩        ′
                                          = ū(p )     gAq (t) γ µ γ5     −   gPq (t)           u(p).               (4.60)
                                                                                        2m
Taking µ = + then relates them to the form factors of GPDs in Eqs. (4.56) and (4.57),
                 Z 1                                                             Z  1
    Aq1,0 (t)  =              q
                      dx H (x, ξ, t) =     F1q (t),                q
                                                                B1,0    (t)  =         dx E q (x, ξ, t) = F2q (t),
                  −1                                                               −1
                 Z 1                                                             Z  1
    Aeq1,0 (t) =      dx He q (x, ξ, t) = g q (t),              Be1,0
                                                                   q
                                                                        (t) =          dx Ee q (x, ξ, t) = g q (t). (4.61)
                                            A                                                               P
                  −1                                                               −1
Since twist-2 gluon operators start from spin 2, there are no corresponding relations for gluon
GPDs.
   For n = 1, the spin-2 twist-2 operators are just the energy momentum tensor [Polyakov
and Schweitzer, 2018],
                  Z  1
                                                    1         ′     ′
                                                                                         ↔
                       dx x F q (x, ξ, t) =          +   2
                                                           ⟨p   , α   |ψ̄ q (0)  γ +
                                                                                      (iD  +
                                                                                             )ψq (0) |p, α⟩
                   −1                          2(P )
                                                   1
                                           =               ⟨p′ , α′ |Tq++ (0)|p, α⟩,
                                               2(P + )2
                    Z  1
                                                   1
                         dx F g (x, ξ, t) =              ⟨p′ , α′ |G+µ (0) Gµ + (0) |p, α⟩
                      −1                       (P + )2
                                                   1
                                           =             ⟨p′ , α′ |Tg++ (0)|p, α⟩,                                  (4.62)
                                               (P + )2
where the energy momentum tensors are
                               1  ↔µ ν            ↔                                     ↔         
                   Tqµν                               ν µ
                         = ψ̄q i D γ + i D γ ψq − g ψ̄q iγ · D − mq ψq ,    µν
                               2
                                             1
                   Tgµν  = Ga,µρ Ga ρ ν + g µν (Gaρσ )2 .                                                           (4.63)
                                             4
                                                           240


The latter can be decomposed into the so-called gravitational form factors,
                                            
                                                     γ (µ P ν)           iP (µ σ ν)ρ ∆ρ
       ′
    ⟨p , α  ′
              |Tiµν (0)|p, α⟩          ′  ′
                               = ū(p , α ) Ai (t)             − Bi (t)
                                                         2                    4m
                                                                                                  
                                                             ∆µ ∆ν − g µν ∆2                   µν
                                                 +Di (t)                         + m c̄i (t) g      u(p, α),  (4.64)
                                                                    4m
where i = q, g, and we used the notation a(µ bν) = aµ bν + aν bµ . Taking µ = ν = + gives
                                                                           
         ⟨p′ , α′ |Ti++ (0)|p, α⟩ = P + ū(p′ , α′ )    Ai (t) + ξ 2 Di (t) γ +
                                                                                              
                                                                          2
                                                                                   iσ +ρ ∆ρ
                                                          − Bi (t) − ξ Di (t)                   u(p, α).      (4.65)
                                                                                       2m
Comparing this with Eqs. (4.62)(4.1a) and (4.2a), we have the sum rules,
     Z   1                                                     Z  1
                     q                        2
           dx x H (x, ξ, t) = Aq (t) + ξ Dq (t),                     dx x E q (x, ξ, t) = Bq (t) − ξ 2 Dq (t),
      −1                                                         −1
     Z   1                                                     Z  1
                                                                                                              
           dx H g (x, ξ, t) = 2 Ag (t) + ξ 2 Dg (t) ,                dx E g (x, ξ, t) = 2 Bg (t) − ξ 2 Dg (t) ,
      −1                                                         −1
                                                                                                              (4.66)
which relate the gravitational form factors to the GPD moments. While the former cannot
be easily measured in experiments, the latter can in principle be measured (or calculated in
lattice QCD) and give a probe to the energy momentum tensor. This can uncover certain
global dynamic properties inside the proton.
    In the case of PDFs, p = p′ , ∆ = 0, and ξ = t = 0. Then Eq. (4.65) reduces only to the
Ai factor,
                   ⟨p, α′ |Ti++ (0)|p, α⟩ = p+ Ai (0) ū(p, α′ )γ + u(p, α) = 2(p+ )2 Ai (0) δαα′ ,           (4.67)
                                                           241


and Eq. (4.66) reduces to the forward limit which only gives access to the total momentum
fraction Ai (0) of each parton flavor,
                       Z  1                  Z  1
                                   q
                            dx x f (x) =          dx x (f q (x) + f q̄ (x)) = Aq (0),
                         −1                   0
                       Z  1                    Z   1
                                   g
                            dx x f (x) = 2           dx x f g (x) = 2Ag (0).              (4.68)
                         −1                      0
Accessing the same operators as the PDFs, the GPDs have the capability of probing the
gravitational form factors because they are associated with off-forward kinematics, which
opens up the dependence on ξ and t and the related form factors.
    By combining the moments of H and E in Eq. (4.66), the D terms cancel and we get the
sum rule,
                 Z  1
                      dx x (H q (x, ξ, t) + E q (x, ξ, t)) = Aq (t) + Bq (t) = 2Jq (t),
                  −1
                 Z  1
                      dx (H g (x, ξ, t) + E g (x, ξ, t)) = 2 (Ag (t) + Bg (t)) = 4Jg (t), (4.69)
                  −1
where the Ja (t) form factor is related to the angular momentum sum of the parton a, nor-
malized by
                                              X                1
                                                    Ja (0) = .                            (4.70)
                                               a
                                                               2
Eq. (4.69) then gives the angular momentum sum rules for partons inside the proton [Ji,
1997a],
                                        Z  1
                                 1                                              
                          Jq = lim           dx x H q (x, ξ, t) + E q (x, ξ, t) ,
                                 2 t→0    −1
                                        Z  1
                                 1                                            
                          Jg =     lim       dx H g (x, ξ, t) + E g (x, ξ, t) .           (4.71)
                                 4 t→0    −1
                                                      242


Therefore, the measurement of GPDs, especially the construction of the moments of their x
distributions, gives important handles to the partonic dynamics inside a hadron.
4.1.3      Two-scale nature and hadron tomography
The relation of GPDs to form factors [Eq. (4.61)] encodes a great aspect of internal hadron
structures, as pointed out by [Burkardt, 2000, 2003]. On the one hand, Eq. (4.61) can be
viewed as a decomposition of form factors in the x space. In the limit of ξ → 0, x is the
momentum fraction of the active quark. Since the Fourier transform of the form factors with
respect to t gives a spatial distribution of the quarks, the Fourier transform of the GPDs
with respect to t in the limit of ξ → 0 should give a decomposition of the spatial image of
the partons in the x space. On the other hand, in the forward limit, the electric current
operator projects out the electric charge Qp of the proton, while in the off-forward case, the
extra t dependence maps out a spatial distribution of that quantum number. Similarly, since
the (non-local light-cone) GPD operator projects out the x distribution in the forward limit,
i.e., the PDF, it should further map out a spatial distribution of the PDF for each given
value of x in the off-forward case. To put it more formally, we expect
                                    Z
                                      d2 ∆T i∆T ·bT
                       f (x, bT ) =         e       H(x, ξ = 0, t = −∆2T )                (4.72)
                                      (2π)2
to be the parton number density in the three-dimensional space of the longitudinal momen-
tum fraction x and transverse spatial position bT . Now we give a detailed derivation of
Eq. (4.72).
     To discuss the transverse spatial distribution, we need to first localize the proton states
in the coordinate space. The easiest way is to construct a transverse position eigenstate from
                                               243


the light-cone helicity state in the momentum space as Eq. (4.16),
                                                   Z
                                  +                    d2 pT −ipT ·bT +
                                |p , bT , λ⟩ =                 e           |p , pT , λ⟩,                   (4.73)
                                                       (2π)2
which is normalized by
                ⟨p′+ , b′T , λ′ |p+ , bT , λ⟩ = 2π (2p+ ) δλλ′ δ(p+ − p′+ ) δ (2) (bT − b′T ).             (4.74)
Such state can be created by the operator
                                                   Z
                              †   +                    d2 kT −ikT ·bT † +
                            b (k , bT , λ) =                   e           b (k , kT , λ),                 (4.75)
                                                       (2π)2
with the commutation relation,
               
                 b(k + , bT , λ), b† (k ′+ , b′T , λ′ ) = 2π δλλ′ δ(k + − k ′+ ) δ (2) (bT − b′T ),
                                                         
                b(k + , bT , λ), b(k ′+ , b′T , λ′ ) = b† (k + , bT , λ), b† (k ′+ , b′T , λ′ ) = 0.       (4.76)
From this one can define a parton number operator
                                  X Z dk + d2 bT
                           N̂ =                            b† (k + , bT , λ) b(k + , bT , λ),              (4.77)
                                       λ           2π
which is normalized properly to give a particle number interpretation since
            [b(k + , bT , λ), N̂ ] = b(k + , bT , λ),       [b† (k + , bT , λ), N̂ ] = −b† (k + , bT , λ). (4.78)
Then the number density operator for partons with light-cone helicity λ and longitudinal
                                                          244


momentum fraction x while situated at the transverse distance bT from the proton center is,
                                  dN̂         P+ † +
                                          =        b (k , bT , λ) b(k + , bT , λ),                          (4.79)
                                dx d2 bT      2π
where x = k + /P + with P + the proton momentum.
    With all these defined, the parton density in the x and bT space is
                               P + ⟨P + , 0T , α′ |b† (k + , bT , λ′ ) b(k + , bT , λ)|P + , 0T , α⟩
                f (x, bT ) =                                                                         ,      (4.80)
                               2π                       ⟨P + , 0T |P + , 0T ⟩
where we also allow off-diagonal parton and proton helicities, and the infinite normalization
in the denominator can formally resolve the infinity in the numerator,
                                                                                  Z        Z
                  +         +                  +        +    (2)                +       −      d 2 pT
              ⟨P , 0T |P , 0T ⟩ = 2π (2P )δ(0 )δ (0T ) = 2P                          dx                .    (4.81)
                                                                                               (2π)2
The parton creation and annihilation operators can be related to the fermion fields through
                                               Z
                                         1                    + −
                        +
                     b(k , bT , λ) = √             dx− eik x ū(k + , λ)γ + ψG (x− , bT ),
                                         2k +
                                               Z
                     † +                 1                     + −
                    b (k , bT , λ) =   √           dx− e−ik x ψ̄G (x− , bT )γ + u(k + , λ),                 (4.82)
                                         2k  +
where the “light-cone time” x+ is taken to 0. The numerator in Eq. (4.80) has the expression
                                                   Z
                  ′      −                             d2 pT d2 p′T + ′ ′
        +                          +
      ⟨P , 0T , α |O(x , bT )|P , 0T , α⟩ =                      4
                                                                      ⟨P , pT , α |O(x− , bT )|P + , pT , α⟩
                                                          (2π)
              Z
                   d2 pT d2 p′T −i(p′T −pT )·bT + ′ ′
           =                    e               ⟨P , pT , α |O(0− , 0T )|P + , pT , α⟩,                     (4.83)
                      (2π)4
where the operator O(x− , bT ) is at the coordinate (0+ , x− , bT ). Then inserting Eq. (4.82) to
                                                       245


Eq. (4.80) gives the numerator
                     Z                                Z
                 1       d2 pT d2 p′T −i(p′T −pT )·bT                     + (x−x′ )−
  ⟨· · · ⟩ = +                         e                  dx− dx′− e−ik
                2k          (2π)4
                    × ⟨P + , p′T , α′ |ψ̄G (x− − x′− , 0T )γ + u(k + , λ′ )ū(k + , λ)γ + ψG (0, 0T )|P + , pT , α⟩
               Z 2 ′ Z                      Z 2                Z
                    d pT           ′−     1      d ∆T i∆T ·bT                   + −
            =             2
                                dx          +         2
                                                        e            dx− e−ik x
                    (2π)                 2k      (2π)
                    × ⟨P + , −∆T , α′ |ψ̄G (x− , 0T )γ + u(k + , λ′ )ū(k + , λ)γ + ψG (0, 0T )|P + , 0T , α⟩, (4.84)
where in the second step we performed a transverse boost to set the pT of the initial proton
state to zero. The operator is left invariant under such a boost because only the minus
coordinate component is nonzero and the Lorentz index of the Dirac matrix is plus. Now
we make a simplification by only considering the diagonal parton helicity elements, λ =
λ′ = ±1/2; the off-diagonal ones are related to transversity GPDs beyond the scope of our
discussion. Then we can use
                                                      X
                         u(k + , ±)ū(k + , ±) = P±          u(k + , λ)ū(k + , λ) = k + P± γ − ,                 (4.85)
                                                           λ
where P± = (1 ± γ5 )/2. Inserting this back to Eq. (4.84), with the other factors included in
Eq. (4.80) gives
                       Z
     ±                     d2 ∆T i∆T ·bT
   fαα   ′ (x, bT ) =             e
                           (2π)2
                            Z
                                dx− −ik+ x− +
                         ×            e        ⟨P , −∆T , α′ |ψ̄(x− , 0T )γ + P± ψ(0, 0T )|P + , 0T , α⟩. (4.86)
                                 4π
The second line in Eq. (4.86) is exactly the GPD at ξ = 0, but we have allowed for arbitrary
parton and proton helicities.
                                                           246


    For a general proton spin state described by the spin vector s = (s1 , s2 , λp ), we have the
three-dimensional quark number density,
                                     Z
                                         d2 ∆T i∆T ·bT
                        f (x, bT ) =           e        ραα′ (s)Fαα′ (x, −∆2T ),                 (4.87)
                                         (2π)2
and helicity density,
                                      Z
                                          d2 ∆T i∆T ·bT
                       ∆f (x, bT ) =            e        ραα′ (s)Feαα′ (x, −∆2T ),               (4.88)
                                          (2π)2
where ρ(s) = (1 + σ · s)/2 is the proton spin density matrix and (F, Fe)αα′ (x, −∆2T ) ≡
(F, Fe)αα′ (x, ξ = 0, t = −∆2T ) are the GPDs defined in Eq. (4.1) at ξ = 0 and with the
subscripts denoting the proton helicities. According to Eq. (4.52), we have
                                                                ∗           ∆T
        F++ = F−− = H(x, 0, −∆2T ),                F+− = −F−+      = −eiϕ∆       E(x, 0, −∆2T ),
                                                                            2m
        Fe++ = −Fe−− = H(x,e 0, −∆2T ),            Fe+− = Fe−+∗
                                                                  = 0,                           (4.89)
where ∆T = |∆T | for definiteness. This then gives
                     Z                                                                        
                        d2 ∆T i∆T ·bT                 2       i                            2
        f (x, bT ) =            e         H(x, 0, −∆T ) −       (s1 ∆2 − s2 ∆1 )E(x, 0, −∆T )
                        (2π)2                              2m
                                    1
                   = fH (x, bT ) −     [sT × ∇bT fE (x, bT )]z                                   (4.90)
                                   2m
for the unpolarized parton density, where fH,E are the Fourier transforms of H and E,
respectively. Note that both H and E are rotationally invariant with respect to ∆T , so
                                                  247


fH,E (x, bT ) = fH,E (x, b2T ) and
                                                    ∂fE (x, b2T )        ∂fE (x, b2T )
                     ∇bT fE (x, bT ) = ∇bT b2T                     = 2b T               .  (4.91)
                                                         ∂b2T                 ∂b2T
Hence,
                                                       (sT × bT )z ∂fE (x, b2T )
                           f (x, bT ) = fH (x, b2T ) −                            ,        (4.92)
                                                             m           ∂b2T
so that the unpolarized proton contains an isotropic parton density in the transverse plane
whereas a nonzero transverse spin sT yields an asymmetry in the perpendicular direction,
whose magnitude is quantified by the gradient of the GPD fE with respect to b2T . The parton
helicity distribution, on the other hand, is only governed by the GPD H,              e
                                     Z
                                        d2 ∆T i∆T ·bT e
                 ∆f (x, bT ) = λp            2
                                               e        H(x, 0, −∆2T ) = λp fHe (x, b2T ), (4.93)
                                        (2π)
which is isotropic and has no distortion from the proton’s transverse spin.
    The derivation from Eq. (4.80) to Eq. (4.93) is general and applies to the antiquark and
gluon cases as well. The generalization to transversely polarized partons is also straightfor-
ward. In this way, we showed that the GPDs encode transverse spatial parton images at
each slice of the longitudinal momentum fraction x. This is sometimes referred to as QCD
tomography, or proton imaging. Compared with PDFs, such tomography is due to the extra
scale t, which is a nonperturbative soft scale. In the SDHEPs (and other processes) for
probing GPDs, one has two separated scales — one hard scale Q = qT that localizes the
interaction to identify partons, and one soft scale t that gives an extra handle for probing the
confined parton images. Such two-scale property is also true for TMDs, where one also has
a hard scale Q for probing parton degrees of freedom, but instead of t, one has a low trans-
                                                     248


verse momentum scale QT ≪ Q to probe the (nonperturbative) transverse partonic motion.
TMDs and GPDs thus provide complementary information about the parton dynamics.
4.2       The SDHEP kinematics and observables
By the two-stage paradigm described at the beginning of Sec. 3.2, we introduce a two-
step description of the SDHEP kinematics. First, the Lab frame is naturally chosen as the
c.m. frame of the colliding beams h(p) and B(p2 ), with h along the ẑLab direction. The
kinematics of the diffraction subprocess [Eq. (3.111)] is fully captured by the momentum
transfer vector ∆ = p1 = p − p′ through its invariant mass t = ∆2 , azimuthal angle ϕ∆ , and
the variable ξ related to its leading momentum component, ξ = ∆+ /(p + p′ )+ . Once the
diffraction subprocess (and thereby the c.m. energy of the 2 → 2 hard scattering subprocess
[Eq. (3.112)]) is determined, we perform a Lorentz transformation to the SDHEP frame to
describe the 2 → 2 subprocess, defined as the c.m. frame of A∗ B with A∗ along the ẑ axis, in
terms of the polar and azimuthal angles (θ, ϕ) of C(q1 ) (or D(q2 )). This is shown in Fig. 4.1,
where the diffraction subprocess [Eq. (3.111)] happens in the blue plane (diffraction plane),
and the hard scattering subprocess [Eq. (3.112)] happens in the orange plane (scattering
plane). The x̂ axis lies on the diffraction plane and points to the same direction as ∆T in
the Lab frame, and the ŷ axis is perpendicular to the diffraction plane, determined by the
right-hand rule.
    In a fixed Lab frame coordinate system, the ϕ∆ distribution is nontrivial only when h has
a transverse spin to break the azimuthally rotational invariance. To simplify the kinematic
description, we choose the x̂Lab axis of the Lab frame to be along the diffraction direction,
x̂Lab = ∆T /∆T , and ŷLab = ẑLab × x̂Lab is determined accordingly. This choice varies event
                                               249


                      ŷ lab ⃗
                             sT
                                                                scatteri
                              ϕS    x̂lab                               ng       C (q1 )
                                                                  plane
                                  ẑlab           ∆T
                                   h(p               ∗
               iff
                   r                   )          F A (p1 )                  θ             ϕ
               p ac
                 la ti                                               H
                     n on            ŷ x̂
                                                  h ′(
             d
                      e                                  p ′)                B (p2 )
                                             ẑ
                                                                D(q2 )
Fig. 4.1: The two frames for describing the SDHEP process. The Lab frame x̂Lab -ŷLab -ẑLab
is the c.m. frame of the colliding h and B beams, with ẑLab along the direction of h and x̂Lab
along the diffraction direction, ∆T . The SDHEP frame x̂-ŷ-ẑ is the c.m. frame of A∗ and B,
with ẑ along the A∗ momentum direction and x̂ on the diffraction plane along the ∆T in the
Lab frame. F denotes the (nonperturbative) diffraction process h → h′ + A∗ , which happens
in the blue plane (“diffraction plane”), and H denotes the hard interaction between A∗ and
B to produce C and D, which happens in the orange plane (“scattering plane”). The two
planes form an angle of ϕ and intersect at the collision axis between A∗ and B.
                                                    250


by event, and trades the azimuthal angle ϕ∆ of the diffraction in a fixed Lab frame for the
azimuthal angle ϕS of the transverse spin sT of h in the varying x̂Lab -ŷLab frame. This is
also illustrated in Fig. 4.1 but it needs to be emphasized that the x̂Lab -ŷLab -ẑLab coordinate
system together with sT and ϕS should be considered to be in the Lab frame, not in the
c.m. frame of the hard scattering (that is, the SDHEP frame).
    The c.m. energy square of the 2 → 2 hard scattering is
                                                 − +               + −     (t + ∆2T )m22
    ŝ = (∆ + p2 )2 = t + m22 + 2(∆+ p−                     2
                                         2 + ∆ p2 ) = t + m2 + 2∆ p2 +                   ,  (4.94)
                                                                              2∆+ p−2
which is written in terms of the kinematic variables in the Lab frame. Here m2 is the mass of
the beam particle B, which may be a lepton, photon, or light meson. As elaborated in Ch. 3,
the factorizability into GPD (and light meson DA) necessarily requires ŝ ≳ qT2 ≫ t, m22 , so
that we can approximate ŝ as
                                             ŝ ≃ 2∆+ p−
                                                       2,                                   (4.95)
up to error of order O((t, m22 )/qT2 ). This can be related to the overall c.m. energy square,
                                                                       2 2
                                                   m2 m22               m , m2
                        2
          s = (p + p2 ) = m + 2
                                 m22  +  2p+ p−2
                                                              + −
                                                 + + − = 2p p2 + O                ,         (4.96)
                                                   2p p2                    s
by
                                                   
                                            2ξ + −         2ξ
                                  ŝ ≃ 2          p p2 ≃        s.                          (4.97)
                                           1+ξ            1+ξ
    Going from the Lab frame to the SDHEP frame is most easily done when we neglect the
mass of particle B (which is within the error of factorization) and approximate its momentum
by p̂2 = (0, p−            −
               2 , 0T ) = p2 n. In both frames, this momentum is along the minus light-cone
                                                  251


direction, so that the Lorentz transformation connecting the two frames can be constructed
as a Lorentz transformation S that leaves the lightlike vector n invariant, followed by a boost
along ẑ to enter the c.m. frame of A∗ B. Since an arbitrary vector r transforms under S with
its plus component unchanged, (Sr)+ = (Sr) · n = (Sr) · (Sn) = r · n = r+ , S is exactly a
transverse boost introduced in Eq. (4.3), which can be parametrized by a two-dimensional
transverse vector vT . By requiring S to transform ∆ = (∆+ , (t + ∆2T )/2∆+ , (∆T , 0)) in
the Lab frame to (S∆) = (∆+ , t/2∆+ , 0T ) in the SDHEP frame gives the solution vT =
        √          
  −∆T / 2∆+ , 0 , where we have used the fact that the x̂lab is chosen along the direction of
∆T . Therefore, the transformation S takes any vector rµ to
                                                                  2                  !
                                                    ∆T     r+   ∆T              r+
    r = (r+ , r− , rx , ry ) → S · r =  r+ , r− − rx + +              , rx − ∆T + , ry   ,   (4.98)
                                                    ∆       2   ∆+              ∆
where we have written explicitly rT = (rx , ry ). Following S, we may approximate ∆ by
∆ˆ = ∆+ n̄ and perform a trivial boost along ẑ to make ∆     ˆ and p̂2 have the same energy.
    The fact that it is the transverse boost that connects the Lab and the SDHEP frame
means that GPD definitions in Eqs. (4.1) and (4.2) are the same in both frames, so are the
GPDs H, H,  e E, and E.      e As a result, one can use the same factorization formulas in the two
frames, making the use of the SDHEP frame rather convenient for describing the 2 → 2 hard
scattering subprocess. We note that the transformation from the Lab to the SDHEP frame
can also be achieved by boosting along −p′ [Berger et al., 2002a], but expressing in terms of
the transverse boost elucidates the light-cone kinematics more clearly.
    Now we give the differential cross section formula for SDHEPs in terms of the two-stage
description. Given that the amplitude M can be written as convolutions of GPDs and hard
                                                   252


coefficients equally in both frames, the cross section σ is
             Z
          1         d 3 p′         d 3 q1        d3 q2
     σ=                                                    |M|2 (2π)4 δ (4) (p + p2 − p′ − q1 − q2 ),  (4.99)
          2s    (2π)3 2Ep′ (2π)3 2Eq1 (2π)3 2Eq2
where |M|2 is the amplitude squared, averaged over the initial-state spins with corresponding
density matrices and summed over final-state spins. Rewriting d3 p′ /(2Ep′ ) as d4 p′ δ + (p′2 −
m2 ) and trading p′ for ∆ gives
                  Z
               1        d4 ∆ +
          σ=                   δ ((p − ∆)2 − m2 )
               2s     (2π)3
                        Z
                                d 3 q1         d3 q2
                    ×                                   |M|2 (2π)4 δ (4) (∆ + p2 − q1 − q2 ),         (4.100)
                             (2π)3 2Eq1 (2π)3 2Eq2
where the first line describes the diffraction kinematics while the second line is for the 2 → 2
hard scattering. Taking advantage of the Lorentz invariance of each integration measure,
the delta functions, and the amplitude squared, we express the first line in the Lab frame
and the second line in the SDHEP frame. This gives
                             Z
                                  d4 ∆ +                               d|t| dξ dϕS
                                        3
                                          δ ((p − ∆)2 − m2 ) =                       ,                (4.101)
                                 (2π)                                2(2π)3 (1 + ξ)2
for the diffraction part, and, by approximating ∆ and p2 by ∆                 ˆ and p̂2 ,
                Z
                        d3 q1           d 3 q2                ˆ + p̂2 − q1 − q2 ) = d cos θ dϕ ,
                           3               3
                                                 (2π)4 δ (4) (∆                                       (4.102)
                    (2π) 2Eq1 (2π) 2Eq2                                                8(2π)2
for the hard-scattering part, which can be equally expressed in terms of the transverse
momentum qT of C or D. Note that in Eq. (4.101), we have traded ϕ∆ for ϕS since we
choose an event-by-event x̂Lab axis along the direction of ∆T , which makes the azimuthal
                                                          253


angle of the hadron’s transverse spin sT , which is fixed in the lab setting, vary event by
event. Combining Eqs. (4.100)(4.101) and (4.102) gives the fully differential cross section,
                                     dσ                      |M|2
                                                    =                    .                  (4.103)
                           d|t| dξ dϕS d cos θ dϕ       (4π)5 (1 + ξ)2 s
    Building the 2 → 2 hard scattering kinematics on top of the diffraction plane, the angle
ϕ describes the angular correlation between the diffraction and the hard collision. Its distri-
bution is solely determined by the spin states of A∗ and B. If we denote the helicities of A∗
and B by λA and λB , respectively, then the ϕ dependence of the hard scattering amplitude
is captured by a phase factor, ei(λA −λB )ϕ .
    For the n = 1 channel, A∗ = γ ∗ has three helicity states (+1, 0, −1). For the n = 2
channel, the quark GPDs have three possible helicities λqA = 0 or ±1, where λqA = 0 has
two independent contributions from the unpolarized and polarized GPDs, while λqA = ±1 is
given by the two transversity GPDs. Similarly, the gluon GPDs also have three helicities
λgA = 0 or ±2, with λgA = 0 receiving contributions from both the unpolarized and polarized
GPDs and λgA = ±2 from the two transversity GPDs.
    Because of the exclusive nature, the SDHEP cross section can receive contributions from
the interferences among any two of A∗ = γ ∗ , [q q̄ ′ ] and [gg] channels as well as their different
polarization states. Schematically, the scattering amplitude can be written as
                                 X
                          M∼         ei(λA −λB )ϕ FA∗ ⊗ CA∗ B→CD (ŝ, θ),                   (4.104)
                                  A∗
in accordance with Eq. (3.113), where FA∗ is the hadron structure function associated with
the diffraction h → h′ + A∗ , and C is the corresponding hard scattering amplitude, with the
                                                 254


ϕ dependence factored out. Squaring the amplitude in Eq. (4.104) can cause interference
of any two different channels A∗ . Having a transversely polarized beam B can also induce
interference of its different helicity states. The interference between (λA , λB ) and (λ′A , λ′B )
leads to the azimuthal correlations
                        cos(∆λA − ∆λB )ϕ, and/or sin(∆λA − ∆λB )ϕ,                        (4.105)
depending on details of the interaction, where ∆λA,B = λA,B − λ′A,B . Extracting different
trigonometric components of the azimuthal distribution is a great way to disentangle different
GPD contributions, in a way similar to using the angular modulations in the SIDIS to extract
different TMDs [Bacchetta et al., 2007]. Similarly, the angular distribution of the lepton pair
in the Drell-Yan process [Lam and Tung, 1978] was studied to capture richer structures of
QCD dynamics than the production rate alone.
4.3       DVCS as an SDHEP
The simplest SDHEP example is the real photon electroproduction in Sec. 3.2.2.1, which
gives the DVCS at n = 2. Usually calculations of the hard coefficients are carried out in
                                                              ∗          ∗
the c.m. frame of the parton pair and the virtual photon γee    . Since γee has a hard virtuality
and short lifetime, distinguishing the amplitudes according to its helicity somewhat obscures
the underlying physics. On the other hand, in the n = 1 channel, the virtual photon γ ∗
that connects the diffracted hadron and the hard part has a long lifetime. This gives it
the physical significance of being a “quasi-real” particle. Hence we may discuss the hard
scattering amplitudes in terms of its helicity state. With these considerations, in this section,
                                               255


we adopt the setup outlined in Sec. 4.2 and reformulate the DVCS within the SDHEP frame.
4.3.1      Hard coefficients of the DVCS
For the n = 2 channel, the DVCS amplitude can be factorized into GPDs, as given in
Eq. (3.135),
               XZ     1       h                                                                                   i
   Mα1 α2 λ2 =           dx F q (x, ξ, t) Cα1 α2 λ2 (x, ξ; ŝ, θ, ϕ) + Feq (x, ξ, t) C  eα1 α2 λ2 (x, ξ; ŝ, θ, ϕ) ,
                 q   −1
                                                                                                               (4.106)
where ŝ is given in Eq. (4.94), θ and ϕ are defined in Fig. 4.1 as the final-state electron
direction, and α1 , α2 , and λ2 are helicities of the initial-state electron, final-state electron
and photon in the SDHEP frame, respectively. Following discussions in Sec. 4.2, the GPDs
take the same values in the SDHEP frame as in the Lab frame. We work at LO, where
only quark GPDs contribute and we suppress the factorization scale dependence. The hard
coefficients are given by the scattering of a collinear [q q̄] pair of zero helicity with an electron,
                             [q q̄](p̂1 , 0) + e(p2 , α1 ) → e(q1 , α2 ) + γ(q2 , λ2 ),                        (4.107)
as given by the two diagrams in Fig. 4.2. The momentum p̂1 is projected on shell as in
Eq. (3.5). The SDHEP frame is the c.m. frame of the hard scattering in Eq. (4.107). By
introducing two auxiliary light-like vectors w and w̄,
            1                                1                      1
      w̄ = √ (1, ⃗q1 /|⃗q1 |) ,     w = √ (1, −⃗q1 /|⃗q1 |) = √ (1, ⃗q2 /|⃗q2 |) ,       w · w̄ = 1,           (4.108)
             2                                2                      2
                                                          256


the kinematics can be described as
                         p                  p                 p                     p
                  p̂1 =   ŝ/2 n̄,     p2 =   ŝ/2 n,  q1 =      ŝ/2 w̄,   q2 =     ŝ/2 w, (4.109)
with n and n̄ being the same as Eq. (2.59) and the scalar products,
                   w · n = w̄ · n̄ = (1 − cos θ)/2,     w · n̄ = w̄ · n = (1 + cos θ)/2.     (4.110)
     As indicated by Eq. (4.107) and Fig. 4.2, we calculate the hard coefficients by thinking of
both the quark and antiquark as entering the hard interaction, carrying momenta (ξ + x)P̂
and (ξ − x)P̂ , respectively. The results can be directly extrapolated to the full x range
[−1, 1] by keeping all related iϵ prescriptions explicitly. Following the two-stage paradigm,
it is convenient to make analogy between the DVCS and the corresponding meson process in
Sec. 3.1.1.1, such that it is natural to introduce a variable change x → z = (x+ξ)/(2ξ). Then
the two quarks carry momenta z p̂1 and (1 − z)p̂1 , respectively. The hard coefficients C and
Ce are then obtained by contracting the amputated parton lines with γ · p̂1 /2 and γ5 γ · p̂1 /2
(with an additional 1/2ξ factor), for the unpolarized and polarized GPDs, respectively.
                                      e q1                                     e q1
                                     ∗                                        ∗
                                   γ         e                              γ          e
                      p1                                      p1
                                       q     p2                                 q      p2
                                γ     q2                                  γ    q2
Fig. 4.2: The LO diagrams to the hard exclusive subprocess of the DVCS, initialized by the
state f2 = [q q̄]. The red thick lines indicate the propagators with high virtualities and thus
belong to the hard part.
                                                     257


    The P-even hard coefficient C is,
                                                                                           
                    ie2q e3           1           p̂/1 ∗      µ      1          p̂/1 µ ∗
   2ξCα1 α2 λ2 = − 2 ū2 γµ u1 2             Tr       /ϵ k/1 γ + 2         Tr       γ k/2/ϵ λ2   , (4.111)
                      q            k1 + iϵ          2 λ2          k2 + iϵ         2
where eq is the electric charge of the quark q with eu = 2/3 and ed = −1/3, and we have taken
the electron charge to be −1. Here u1 = u(p2 , α1 ) and u2 = u(q1 , α2 ) are the initial- and
final-state electron spinors, respectively, ϵλ2 = ϵλ2 (q2 ) is the final-state photon polarization
vector, and we have the following internal momentum definitions,
          q = p 2 − q1 ,                  q 2 = −2p2 · q1 = −ŝ (1 + cos θ)/2,
         k1 = q2 − (1 − z)p̂1 ,          k12 = −2(1 − z)p̂1 · q2 = −(1 − z) ŝ (1 + cos θ)/2,
         k2 = z p̂1 − q2 ,               k22 = −2z p̂1 · q2 = −z ŝ (1 + cos θ)/2.                 (4.112)
One can immediately notice that the two terms in the curly bracket in Eq. (4.111) are related
to each other by a charge conjugation and z → 1 − z, up to an overall minus sign, so we
have Cα1 α2 λ2 (z) = −Cα1 α2 λ2 (1 − z), or
                                  Cα1 α2 λ2 (x, ξ) = −Cα1 α2 λ2 (−x, ξ)                            (4.113)
in terms of the original GPD variable x. This property goes beyond LO and ensures that
only the charge-conjugation-even (C-even) unpolarized GPD component
                                F + (x, ξ, t) ≡ F (x, ξ, t) − F (−x, ξ, t)                         (4.114)
                                                      258


is probed by the DVCS. Evaluating the fermion traces, we have
                                                                                                    
                      ie2q e3            ∗
                                               µν n̄µ wν + n̄ν wµ                   1             1
   2ξCα1 α2 λ2  = − 2 ū2 γµ u1 ϵλ2 ,ν g −                                                    −          .  (4.115)
                        q                                        n̄ · w         1 − z − iϵ z − iϵ
    Similarly, replacing p/1 in Eq. (4.111) by γ5 p/1 gives the P-odd hard coefficient,
                                                                                              
                              ie2q e3                    −i ϵn̄wµν          1              1
          2ξ C eα1 α2 λ2 = −                      ∗
                                       ū2 γµ u1 ϵλ2 ,ν                              +           ,          (4.116)
                                q2                          n̄ · w       1 − z − iϵ z − iϵ
which is invariant under z → 1 − z (or equivalently, x → −x) and thus probes the C-even
polarized GPD component
                                    Fe+ (x, ξ, t) ≡ Fe(x, ξ, t) + Fe(−x, ξ, t),                             (4.117)
a property that goes beyond LO.
    Inserting explicit forms for the spinors and polarization vector into Eqs. (4.115) and
(4.116), we have the helicity amplitudes in the SDHEP frame,
                            r                                  
                              2            1              1           4 cos(θ/2) ∓iϕ/2
     2ξC±±±     =  −e2q e3      ·                 −                ·               e      ,                (4.118a)
                              ŝ 1 − z − iϵ            z − iϵ (1 + cos θ)2
                            r                                  
                              2            1              1           4 cos(θ/2)
     2ξC±±∓     = +e2q e3       ·                 −               ·              2
                                                                                   sin2 (θ/2) e∓iϕ/2 ,     (4.118b)
                              ŝ 1 − z − iϵ            z − iϵ (1 + cos θ)
                            r                                  
        e±±±                  2            1              1              1
     2ξ C       = ∓e2q e3       ·                 +               ·            e∓iϕ/2 ,                    (4.118c)
                              ŝ 1 − z − iϵ            z − iϵ cos(θ/2)
                            r                                  
        e±±∓                  2            1              1          sin2 (θ/2) ∓iϕ/2
     2ξ C       = ∓e2q e3       ·                 +               ·             e       .                  (4.118d)
                              ŝ 1 − z − iϵ            z − iϵ        cos(θ/2)
All the other helicity amplitudes are zero. We see that the z dependence factors out of the θ
and ϕ dependence, while the latter are experimental observables. In terms of x, the z factor
                                                          259


becomes
                                                   
                         1       1              1              1               1
                                         ∓            =               ∓               ,               (4.119)
                        2ξ 1 − z − iϵ z − iϵ             ξ − x − iϵ ξ + x − iϵ
which gives the (negative) “zeroth GPD moments”,
                           Z                                           Z
                             1
                                   F + (x, ξ, t)                          1
                                                                               Fe+ (x, ξ, t)
         −F0+ (ξ, t) ≡−        dx                ,  −Fe0+ (ξ, t) ≡ −        dx               ,        (4.120)
                            −1     x − ξ + iϵ                            −1     x − ξ + iϵ
when convoluting with F and Fe, respectively. Then the DVCS amplitudes in Eq. (4.106)
are
                   r                                                                      
                     2 ∓iϕ/2 X 2         q,+        4 cos(θ/2)       e q,+          1
    MDVCS
       ±±±   =e  3
                        e         eq F0 (ξ, t)                    ± F0 (ξ, t)                ,       (4.121a)
                     ŝ        q
                                                   (1 + cos θ)2                 cos(θ/2)
                   r
                     2 2                 X                       4 cos(θ/2)                       1
                                                                                                        
    MDVCS
       ±±∓   =e  3
                        sin (θ/2)e ∓iϕ/2        2      q,+
                                               eq −F0 (ξ, t)                      e q,+
                                                                               ± F0 (ξ, t)                ,
                     ŝ                    q
                                                                 (1 + cos θ)2                  cos(θ/2)
                                                                                                     (4.121b)
which are written as helicity amplitudes for the initial-state electron and final-state elec-
tron and photon, constituting four independent helicity structures. The diffracted hadron
helicities are implicitly encoded in the GPDs.
4.3.2      Bethe-Heitler process
As we noted in Sec. 3.2.1 and confirmed in Eq. (4.121), the DVCS amplitudes, which corre-
spond to the n = 2 channel for the real photon electroproduction process, have the power
                                          √
counting O(1/Q), with Q ∼ qT ∼ ŝ being the hard scale. In contrast, the γ ∗ -mediated
channel at n = 1, i.e., the Bethe-Heitler (BH) process, counts at a more leading power,
      p
O(1/ |t|), as discussed in Sec. 3.2.1. So it contributes more to the overall cross section.
                                                     260


In this section, we calculate the BH amplitudes in the SDHEP frame, where the γ ∗ carries
momentum p1 and collides with the electron in the c.m. frame,
                             γ ∗ (p1 , λ) + e(p2 , α1 ) → e(q1 , α2 ) + γ(q2 , λ2 ),                          (4.122)
where the helicity index λ of γ ∗ is to be specified in the following. One difference from the
DVCS amplitude calculation is that here we need to keep p1 as exact because neglecting
its virtuality may induce an error of order 1/Q in the amplitude, which is suppressed with
respect to the BH amplitude, but not to the DVCS. By denoting ŝ as the unapproximated
c.m. energy square of the hard collision system, we have the kinematics,
                 √                                                 √                               
                   ŝ ŝ + t            ŝ − t                         ŝ ŝ − t               ŝ − t
            p1 =                , 0, 0,          ,           p2 =                   , 0, 0, −            ,
                  2        ŝ              ŝ                         2        ŝ                 ŝ
                 √                                                   √
                   ŝ                                                  ŝ
            q1 =      (1, q1 /|q1 |) ,                       q2 =          (1, −q1 /|q1 |) .                  (4.123)
                  2                                                   2
    As discussed in Sec. 3.2.1, the BH amplitude reduces to the electromagnetic form factor
of the hadron, as given in Eqs. (3.114) and (3.115). In the SDHEP frame, the amplitude
structure simplifies by use of Ward identities for both F and H,
                 F · p1 = F + p−              − +
                                     1 + F p1 = 0,       H · p1 = H + p−             − +
                                                                            1 + H p1 = 0,                     (4.124)
                                                                                                     p
such that F + H− = F − H+ . By the power counting (F + , F − , FT ) ∼ (Q, t/Q,                         |t|), we have
(H+ , H− , HT ) ∼ (1, t/Q2 , 1) in the SDHEP frame, instead of the superficial power counting
Hµ ∼ O(1). Then the scalar product of F and H is
                                                              X
    F · H = 2F + H− − FT · HT = 2(F · n)(n̄ · H) −                        [F · ϵ∗λ (p1 )] [ϵλ (p1 ) · H] .    (4.125)
                                                                   λ=±
                                                       261


Although this is only valid in the SDHEP frame, it equips the virtual photon with well-defined
polarization states. The first term on the right-hand side of Eq. (4.125) corresponds to a
longitudinally polarized photon state, whose polarization vector ϵ0 (p1 ) ≡ n̄ is contracted with
H. Along with the 1/t factor from the photon propagator, it contributes to the amplitude
at the power 1/t × t/Q = 1/Q, the same as the GPD channel. So when calculating the
                                                                       p
hard coefficient n̄ · H, we may keep only the leading power in |t|/Q. The second term in
Eq. (4.125) corresponds to two transverse polarization states, with the polarization vectors
defined as
                                                                 √
                                    ϵµ± (p1 ) = (0, ∓1, −i, 0)/ 2.                           (4.126)
                                                                        p         p
It contributes to the amplitude with a power counting of 1/t ×            |t| ∼ 1/ |t|, which is one
power higher than the GPD channel. Hence we need to keep the hard coefficient Hλ ≡ ϵλ · H
                                  p
up to the subleading power in |t|/Q.
    The above discussion of the γ ∗ channel is generic to all processes. For the real photon
electroproduction, the subprocess in Eq. (4.122) is an elementary scattering. As we will
see through explicit calculations, the t dependence in ϵ± · H arises from kinematic effects.
Since there is no singularity associated with t → 0 in H, the subleading term in H± starts
at t/Q2 , which is power suppressed with respect to the GPD channel. Thus, it is a valid
approximation to also neglect t in the transverse amplitudes.
                                                                   q1
                            e                                  e
                               q1
                         γ∗                 e               γ∗      k2       e
                                 k1        p2                               p2
                         p1                                 p1
                                    γ     q2                           q2   γ
 Fig. 4.3: The LO diagrams for the BH subprocess, initialized by the virtual photon state.
    The hard scattering diagrams for the BH amplitude are shown in Fig. 4.3. Denoting
                                                   262


these hard coefficients as Hλ,α1 α2 λ2 ≡ ϵλ · Hα1 α2 λ2 with λ = ±1, 0 being the helicity of the
virtual photon γ ∗ , and α1 , α2 , and λ2 being helicities of the initial-state electron, final-state
electron and photon, respectively. Explicitly, we have
                                                                                    
                                           2       1             ∗      1 ∗
                        Hλ,α1 α2 λ2 = −ie ū2         /ϵ λ k/1/ϵ λ2 + 2 /ϵ λ2 k/2/ϵ λ u1 ,             (4.127)
                                                  k12                   k2
where k1 = p2 − q2 , k2 = p1 + p2 , and u1 , u2 , and ϵλ2 are the same as in Eq. (4.111). The
helicity amplitudes for the longitudinal photon polarization are
                      t    1                                        t                               
    H0,±±∓ = −2e2 p                cos(θ/2)e∓iϕ/2 = −2e2 cos(θ/2)e∓iϕ/2 + O t2 /s2 ,                   (4.128)
                      s 1 − t/s                                     s
which start at O(t/s) as argued below Eq. (4.124). In the second step we only kept the
leading terms. The transverse photon polarizations give
                          r
                    2e2            t ±iϕ/2                              2e2
   H±,±±±  =∓                1−      e                        =∓                e±iϕ/2 + O(t/s),
                sin(θ/2)           s                                 sin(θ/2)
                e2 (1 + cos θ) t       1
   H±,±±∓ = ∓                     p           e±iϕ/2          = 0 + O(t/s),
                    sin(θ/2) s 1 − t/s
                                    1
   H±,∓∓± = ±2e2 sin(θ/2) p                e±3iϕ/2            = ±2e2 sin(θ/2) e±3iϕ/2 + O(t/s),        (4.129)
                                  1 − t/s
which scale as O(1), with subleading terms of O(t/s). All other helicity amplitudes are zero.
    Together with Eqs. (3.114) and (4.125), we can determine the helicity amplitudes for the
BH process,
                   2e3       e±iϕ/2         p              
      MBH±±± = ∓        F±             +O       |t|/Q2 ,                                              (4.130a)
                     t     sin(θ/2)
                4e3                          2e3                                      p          
                                     ∓iϕ/2                              ∓3iϕ/2
      MBH±±∓ =        F0 cos(θ/2)e         ∓       F∓ sin(θ/2)e                 +O              2
                                                                                           |t|/Q .    (4.130b)
                 ŝ                            t
                                                    263


Similar to Eq. (4.121), there are four independent helicity amplitudes. Each of them are
obtained by summing over the helicity of the intermediate virtual photon γ ∗ . The diffracted
hadron helicities are implicitly encoded in the form factors F0 = F · n = F + and F± = F · ϵ∗± .
4.3.3     Combining the DVCS and Bethe-Heitler processes
The full amplitude of the single-diffractive real photon electroproduction is obtained by the
channel decomposition, with the first two powers given by the BH and DVCS amplitudes,
                                                        p       
                             M = MBH + MDVCS + O           |t|/Q2 .                   (4.131)
                                                                                        p
As we have seen in Secs. 4.3.1 and 4.3.2, the leading-power contribution starts at O(1/ |t|)
and is given by the transversely polarized photon channel in the BH amplitude,
                                     ∗
                                        T                            ∗
                                                                         T
               MLPαα′ ±±± = Fαα′ · ϵ± A±±± ,     MLPαα′ ±±∓ = Fαα′ · ϵ∓ A±±∓ ,        (4.132)
where we have made explicit the dependence on the hadron helicities α and α′ which is
encoded in the EM form factor Fαα′ , and the reduced hard scattering amplitudes AT can be
easily matched by comparing with Eq. (4.130). The next-to-leading power contribution is at
O(1/Q), given by the DVCS amplitude and the longitudinally polarized photon channel in
the BH amplitude,
                    MNLP                       e e
                       αα′ ±±± = Fαα′ G±±± + Fαα′ G±±± ,                             (4.133a)
                    MNLP                       e e                      L
                       αα′ ±±∓ = Fαα′ G±±∓ + Fαα′ G±±∓ + (Fαα′ · n) A±±∓ .           (4.133b)
                                              264


Here we defined the shorthand notations for the GPD convolutions,
                          X                                                                    
                                      q,+                1        ′   ′         +        iσ +∆
  Fαα′  = Fαα′ (ξ, t) ≡          e2q F0,αα ′ (ξ, t) =        ū(p , α ) H γ − E                   u(p, α),
                               q                       2P +                                2m
                          X                                                                     +
                                                                                                   
                                                         1                                  γ5 ∆
  Feαα′ = Feαα′ (ξ, t) ≡         eq Fe0,αα′ (ξ, t) =
                                   2  q,+                         ′   ′
                                                             ū(p , α ) H   e γ γ5 − Ee
                                                                                +
                                                                                                     u(p, α), (4.134)
                               q                       2P +                                  2m
with similar definitions for the (complex-valued) GPD moments H, E, H,                          e and E, e
                                                  X
                            (H, E, H, e E)e ≡           e2q (H0+ , E0+ , H      e0+ )(ξ, t).
                                                                         e 0+ , E                             (4.135)
                                                      q
The associated hard coefficients G and Ge in Eq. (4.133) can be obtained by matching
with Eq. (4.121). The longitudinal BH channel only contributes to the helicity amplitudes
MNLP                                            L
   αα′ ±±∓ and its hard coefficients A can be easily obtained from Eq. (4.130).
    We allow arbitrary polarization states of the hadron and electron beams in the Lab frame.
They can be introduced by averaging over the initial-state hadron and electron spins by the
density matrices ρN               e
                      αᾱ and ραe ᾱe . In the Lab frame, they can be written in terms of the spin
Bloch vectors sN = (sT , λN ) and se = (seT , λe ). Since we neglect the electron mass, seT
cannot enter the amplitude square in this process, and λe is Lorentz invariant. The initial-
state hadron spin average can be performed in a Lorentz covariant way by using the spin
4-vector S µ and using
                          X                                                                
                                                                1                      γ5 S
                                                                                          /
                               u(p, α)ρN  αᾱ (s)ū (p, ᾱ)  = (p/ + m) 1 +                   ,               (4.136)
                          α,ᾱ
                                                                2                       m
                                                                                      µ
where S µ transforms covariantly and takes the explicit form Srest                         = m(0, s) in the hadron
                                                          265


rest frame in terms of Cartesian coordinates. Transforming it to the Lab frame, we have
                                                                        
                                   µ               +          m2
                                SLab    =     λN p , −λN + , m sT             ,               (4.137)
                                                              2p           lc
in the lightfront coordinates, together with the diffracted hadron momenta,
                          2
                                                                                        
                    + m                             1 − ξ + 1 + ξ m2 + ∆2T
          pµLab = p , +,0 ,             p′µ
                                          Lab  =             p ,                    , −∆T   . (4.138)
                       2p                           1+ξ          1 − ξ 2p+
The polarization vectors of the virtual photon in Eq. (4.126) are written in the SDHEP
frame, and transform into
                                                            ∆T · ϵT µ
                                         ϵµLab,± = ϵµ +              n                        (4.139)
                                                              ∆+
in the Lab frame by the inversion of Eq. (4.98). In both Eqs. (4.138) and (4.139), we note
that ∆T is a transverse 2-component vector in the Lab frame, taken along the x direction,
∆T = (∆T , 0).
    The leading power of the amplitude square starts at O(1/t), given by the square of the
transverse BH channels in Eq. (4.132),
         |M|2LP = ρN     e         LP             LP∗
                    αᾱ ραe ᾱe Mαα′ αe α′e λ Mᾱα′ ᾱe α′e λ
                                                                                         
                 = ρe++ |AT+++ |2 ⟨F · ϵ∗+ F ∗ · ϵ+ ⟩ + |AT++− |2 ⟨F · ϵ∗− F ∗ · ϵ− ⟩
                                                                                           
                   + ρe−− |AT−−− |2 ⟨F · ϵ∗− F ∗ · ϵ− ⟩ + |AT−−+ |2 ⟨F · ϵ∗+ F ∗ · ϵ+ ⟩ ,     (4.140)
where the repeated helicity indices are summed over and we introduced the notation
                                                                         ∗
                             ⟨F · ϵ∗λ F ∗ · ϵλ′ ⟩ ≡ ρN             ∗
                                                     αᾱ (Fαα′ · ϵλ ) (Fᾱα′ · ϵλ′ )          (4.141)
                                                      266


for the hadron spin average. From Eq. (4.130), we see that
                                       2                                                  2
                                    2e3           1                                     2e3
     |AT+++ |2 =  |AT−−− |2   =                 2       ,   |AT++− |2 =  |AT−−+ |2 =            sin2 (θ/2),
                                      t      sin (θ/2)                                   t
such that
                            2                                                           
                         2e3             1                           2
         |M|2LP    =                            (B0 + λe B1 ) + sin (θ/2) (B0 − λe B1 ) ,               (4.142)
                          t         sin2 (θ/2)
where
           1                                              
     B0 =       ⟨F · ϵ∗+ F ∗ · ϵ+ ⟩ + ⟨F · ϵ∗− F ∗ · ϵ− ⟩
           2                                           
                          1 − ξ2           2       t
         = − 2m +    2
                                   t     F1 −          F − t(F1 + F2 )2 ,
                                                        2
                            2ξ 2                4m2 2
           1                                              
     B1 =       ⟨F · ϵ∗+ F ∗ · ϵ+ ⟩ − ⟨F · ϵ∗− F ∗ · ϵ− ⟩
           2                                                                                     
                                          4ξm2       t               sT · ∆T        2      1+ξ
         = −(F1 + F2 ) λN F1                      +       + t F2 +               4m F1 +          t F2 ,
                                          1+ξ ξ                         2m                    ξ
                                                                                                        (4.143)
with sT · ∆T = sT ∆T cos ϕS evaluated in the Lab frame. Organizing Eq. (4.142) in terms of
the polarization parameters, we have
                                       2                                                  
                                 2e3 m                                      ∆T cos ϕS LP
                 |M|2LP =                  ΣLPUU
                                                                LP
                                                    1 + λe λN ALL + λe sT              AT L ,           (4.144)
                                    t                                           2m
with the dimensionless polarization parameters,
                                                                                                    
                   1                            1 − ξ 2 −t                   t             t
   ΣLP
     UU  =                        2
                          + sin (θ/2)                        −2        2
                                                                     F1 −          2
                                                                                  F − 2 (F1 + F2 ) ,     2
              sin2 (θ/2)                          2ξ 2 m2                  4m2 2         m
                                                                                                       (4.145a)
                                                        267


                                                                                             
                           1                                              −t        4ξ        t
    ALP
      LL  = Σ−1UU
                                         2
                                    − sin (θ/2) (F1 + F2 ) F1                 −            − 2 F2 ,   (4.145b)
                     sin2 (θ/2)                                          ξm2 1 + ξ           m
                                                                                        
                           1                                                 1 + ξ −t
    ALP
      TL  = Σ−1UU        2          − sin2 (θ/2) (F1 + F2 ) −4F1 +                       F2 .         (4.145c)
                     sin (θ/2)                                                 ξ m2
Note that the unpolarized part ΣU U is positive definite by the kinematic constraint −t ≥
4ξ 2 m2 /(1 − ξ 2 ). As evident form Eq. (4.130), at the leading power, different γ ∗ helicities are
associated with different helicity structures of the hard scattering amplitude. By neglecting
the electron mass, there is no interference between the right-handed and left-handed γ ∗
states, and thus there is no nontrivial ϕ correlation between the diffraction and scattering
planes.
                                                                                            p
     The subleading power of the amplitude square is of order O(1/Q |t|), given by the
interference of the transverse BH amplitudes with the longitudinal BH amplitude and DVCS
amplitude in Eq. (4.132),
                                                                    
    |M|2NLP = 2 Re ρN           e        NLP          LP∗
                           αᾱ ραe ᾱe Mαα′ αe α′e λ Mᾱα′ ᾱe α′e λ
                     
                                                                        ∗
       = 2ρ++ Re G+++ ⟨FF ∗ · ϵ+ ⟩ + Ge+++ ⟨FF
            e                                                e ∗ · ϵ+ ⟩ AT+++
                                                                                                    
                                                                                            
               + G++− ⟨FF ∗ · ϵ− ⟩ + Ge++− ⟨FF        e ∗ · ϵ− ⟩ + AL ⟨F · n F ∗ · ϵ− ⟩ AT ∗
                                                                         ++−                    ++−
                     
                                                                        
       + 2ρ−− Re G−−− ⟨FF ∗ · ϵ− ⟩ + Ge−−− ⟨FF
             e                                                e ∗ · ϵ− ⟩ AT ∗
                                                                           −−−
                                                                                                    
                                                                                             T∗
               + G−−+ ⟨FF · ϵ+ ⟩ + Ge−−+ ⟨FF
                                  ∗                   e · ϵ+ ⟩ + A−−+ ⟨F · n F · ϵ+ ⟩ A−−+ ,
                                                           ∗             L             ∗
                                                                                                       (4.146)
where we introduced notations similar to Eq. (4.141), e.g.,
                                       ⟨FF ∗ · ϵ± ⟩ ≡ ρN                 ∗
                                                        αᾱ (Fαα′ ) (Fᾱα′ · ϵ± ) .                    (4.147)
                                                          268


Using the explicit forms of the hard scattering amplitudes, we have
                   r 
               4e6   2         4                                                          
  |M|2NLP  =                            Re e−iϕ ρe++ ⟨FF ∗ · ϵ+ ⟩ − eiϕ ρe−− ⟨FF ∗ · ϵ− ⟩
               −t    ŝ sθ (1 + cθ )
                           2       h                                               i
                                     −iϕ e       e   ∗
                        + Re e ρ++ ⟨FF · ϵ+ ⟩ + e ρ−− ⟨FF · ϵ− ⟩ iϕ e    e   ∗
                           sθ
                           sθ (1 − cθ )                                                     
                        −             2
                                         Re eiϕ ρe++ ⟨FF ∗ · ϵ− ⟩ − e−iϕ ρe−− ⟨FF ∗ · ϵ+ ⟩
                            (1 + cθ )
                           sθ (1 − cθ )      h                                               i
                        +                       iϕ e      e  ∗           −iϕ e
                                         Re e ρ++ ⟨FF · ϵ− ⟩ + e ρ−− ⟨FF · ϵ+ ⟩  e   ∗
                           2(1 + cθ )
                                                                                                   
                           sθ 1          iϕ e              ∗           −iϕ e            ∗
                                                                                                 
                        +           Re e ρ++ ⟨F · n F · ϵ− ⟩ − e ρ−− ⟨F · n F · ϵ+ ⟩ . (4.148)
                           2ξ P +
Clearly, it is composed of the interference of the left- or right-handed γ ∗ in the BH channel
with the unpolarized GPD, polarized GPD, and the longitudinally polarized γ ∗ from the BH
channel. The latter three all correspond to exchanges of helicity-0 states, so there will be
cos ϕ and sin ϕ correlations. Furthermore, the hadron spin average introduces dependence
on sT , which enters linearly through sT · ∆T ∝ cos ϕS or sT × ∆T ∝ sin ϕS and contributes
nontrivial azimuthal diffractive distributions. In this way, we organize Eq. (4.148) according
to the spin and azimuthal dependence,
                           r
                    4e6 m      1  NLP                                                        
        |M|2NLP  =                   ΣU U + λe λN ΣNLP   LL    cos ϕ + λe ΣNLP
                                                                             U L + λN ΣLU
                                                                                          NLP
                                                                                                 sin ϕ
                      −t       ŝ
                                                                      NLP
                                                                                        
                                     +sT ΣNLP T U,1 cos ϕS sin ϕ + ΣT U,2 sin ϕS cos ϕ
                                                                                           
                                     +λe sT ΣNLP                          NLP
                                                 T L,1 cos ϕS cos ϕ + ΣT L,2 sin ϕS sin ϕ      ,       (4.149)
with the polarization coefficients
                                               
           ∆T 1 + ξ 2sθ                  t               4 + (1 − cθ )2                e
  ΣNLP
    UU   =                        2
                               F1 −          F 2
                                                   −   ξ                (F1 + F2 ) Re H
           2m ξ          ξ             4m2 2                   sθ
                                                       269


                                                                                     
                          2    4 + (1 − cθ )2                           t
                      −                                 F1 Re H −            F2 Re E        ,               (4.150a)
                         sθ         1 + cθ                             4m2
                                                                                                      
           ∆T                           1+ξ                      3 − cθ
ΣNLP
 LL    =−        2(F1 + F2 ) sθ                  F1 + F2 +                 ((1 + ξ) Re H + ξ Re E)
           2m                              ξ                        sθ
                                                                                                   
                            3 − cθ 1 + ξ                 e − ξF1 + (1 + ξ)            t
                      +sθ                      F1 Re H                                   F2 Re Ee , (4.150b)
                            1 − cθ        ξ                                         4m2
                                                                                                     
           ∆T 1 + ξ 2(3 − cθ )                          t                                           e
ΣNLP
 UL    =−                            F1 Im H −                               2
                                                            F2 Im E + ξ cθ/2 (F1 + F2 ) Im H ,              (4.150c)
           2m ξ            sθ                         4m2
                                  "
           ∆T 4 + (1 − cθ )2             F1 + F2                                         1+ξ             e
ΣNLP
 LU    =−                                    2
                                                     ((1 + ξ) Im H + ξ Im E) +                   F1 Im H
           2m            sθ                 cθ/2                                            ξ
                                                                
                                       t                        e
                      − ξF1 +              (1 + ξ)F2 Im E ,                                                 (4.150d)
                                    4m2
                         (                              2                           
         4 + (1 − cθ )2 F1 + F2                              ξ           t
ΣNLP
 T U,1 =                                  ξ Im H +                  +            Im E
              sθ              c2θ/2                        1 + ξ 2 4m2
                                                         2                                          
                        1 − ξ2 t                              ξ              t
           − ξF1 −                     F2 Im H      e−             F1 +          (F1 + ξF2 ) Im Ee , (4.150e)
                            ξ 4m2                           1+ξ            4m2
                                                                    
         4 + (1 − cθ )2                             e       t        e
ΣNLP
 T U,2 =                    ξ(F1 + F2 ) Im H +                  Im E
              sθ                                          4m2
                                                                                                       )
              1                1 − ξ 2 −t                                      t               ξt
           − 2        ξF1 +                    F2 Im H +             ξ+               F1 +         F2 Im E       ,
             cθ/2                 ξ 4m2                                    4ξm2               4m2
                                                                                                            (4.150f)
                                                2                           
                        3 − cθ                         ξ          t
ΣNLP
 T L,1 = 2(F1 + F2 )               ξ Re H +                  +          Re E
                           sθ                       1 + ξ 4m2
                                                                 
                                                  ξ           t
                             +sθ F1 +                   +            F2
                                               1 + ξ 4ξm2
                                                              
             4 − (1 − cθ )2                     t 1 − ξ2               e
           −                       ξF1 −                     F2 Re H
                     sθ                      4m2 ξ
                                       2                                              
                                             ξ            t             ξt
                                  +                 +           F1 +          F2 Re Ee ,                    (4.150g)
                                           1 + ξ 4m2                   4m2
                                                                                                 
                                     t                   4 − (1 − cθ )2            e+      t
ΣNLP
 T L,2 = (F1 + F2 ) 2 F1 +               F 2 sθ − ξ                           Re H              Re Ee
                                  4m2                           sθ                       4m2
                                                                                                         
               3 − cθ               1 − ξ2 t                                       t               ξt
           +2               ξF1 −                    F2 Re H + ξ +                        F1 +         F2 Re E .
                  sθ                     ξ 4m2                                  4ξm2              4m2
                                                                                                            (4.150h)
                                                        270


    Combining the leading power (LP) in Eq. (4.144) and next-to-leading power (NLP) in
Eq. (4.149) contribution and inserting them into Eq. (4.103), we have the differential cross
section for the real photon electroproduction process,
                                                        
              dσ                1       dσ unpol                                   ∆T cos ϕS LP
                           =       2
                                                      · 1 + λe λN ALP   LL + λe sT             AT L
    d|t| dξ dϕS d cos θ dϕ    (2π) d|t| dξ d cos θ                                     2m
                                                                                                   
                                     + ANLP U U + λe λN ALL
                                                              NLP
                                                                    cos ϕ + λe ANLPU L + λN ALU
                                                                                               NLP
                                                                                                      sin ϕ
                                                                                         
                                     +sT ANLP                            NLP
                                               T U,1 cos ϕS sin ϕ + AT U,2 sin ϕS cos ϕ
                                                                                            
                                     +λe sT ANLP                           NLP
                                                  T L,1 cos ϕS cos ϕ + AT L,2 sin ϕS sin ϕ     ,      (4.151)
where
                                     dσ unpol              λ3e    m2 LP
                                                    =                  Σ                              (4.152)
                                  d|t| dξ d cos θ       (1 + ξ)2 s t2 U U
is the unpolarized differential cross section with the azimuthal dependence integrated out,
                                               LP           LP
the LP polarization parameters ΣLP     U U , ALL , and AT L are given in Eq. (4.145), and the NLP
ones are obtained from Eq. (4.150) by
        (ANLP     NLP    NLP    NLP    NLP      NLP      NLP    NLP
           U U , ALL , AU L , ALU , AT U,1 , AT U,2 , AT L,1 , AT L,2 )
                     −t 1
                 =    √        (ΣNLP     NLP     NLP      NLP    NLP      NLP    NLP    NLP
                                 U U , ΣLL , ΣU L , ΣLU , ΣT U,1 , ΣT U,2 , ΣT L,1 , ΣT L,2 ),        (4.153)
                   m ŝ ΣLPUU
with ŝ determined from the hard-scattering kinematics or approximated by Eq. (4.97), the
error of which is power suppressed. Evidently, the NLP does not change the event rate, but
only introduce azimuthal cos ϕ and sin ϕ modulations. Measuring the latter offers an efficient
way to determine the GPDs, with knowledge of the form factors F1,2 from other experiments.
Notably, GPDs enter in a linear way, and we have in total 8 GPD factors together with 8
                                                     271


NLP polarization parameters, with ANLP           NLP   NLP         NLP
                                         U U , ALL , AT L,1 , and AT L,2 depending on their real
parts, and ANLP     NLP    NLP          NLP
             U L , ALU , AT U,1 , and AT U,2 on their imaginary parts. In principle, the DVCS
can fully determine the GPD moments H, E, H,       e and E,
                                                          e for both real and imaginary parts.
Especially, their imaginary parts can directly constrain the GPD values at x = ±ξ.
    By neglecting the three-parton channels in Eq. (3.113), the amplitude in Eq. (4.131) is
                         p
valid up to the error of |t|/Q2 , and the cross section in Eq. (4.151) is accurate up to the
error of 1/Q2 . The square of the DVCS amplitude contributes at O(1/Q2 ) which mixes with
                                                                        p
the interference between MBH and the power suppressed term O( |t|/Q2 ) in Eq. (4.131),
so will not be included.
4.3.4     Comments on related processes
As evident from the definitions in Eqs. (4.120) and (4.135), the DVCS can constrain the
GPDs only up to their moments. There are three associated limitations: (1) only C-even
GPD combinations are probed, (2) the quark flavors cannot be disentangled, and (3) the
x dependence cannot be fully extracted, except the values at x = ±ξ. The first limitation
is because the hard part of the DVCS only has two external photons, which automatically
selects C-even parton combinations. By including more processes like the photoproduction
of two real photons [Pedrak et al., 2017; Grocholski et al., 2021, 2022], one can gain exclusive
access to C-odd GPD combinations. This process also gives different GPD flavor combina-
tions by being proportional to the cubes of the quark electric charges, instead of squares
as the DVCS. Similarly, in the DVMP, the quark charge enters linearly and gives another
independent way to disentangle the quark flavors. By selecting different produced mesons,
we may also gain sensitivity to different GPD combinations, including flavor changing GPDs.
    Nevertheless, the x dependence cannot be accessed by simply including multiple pro-
                                                272


cesses. For all the above-mentioned processes, the dependence of the amplitudes on the
GPDs is through the moments like Eq. (4.120), at least to the leading perturbative order.
Even with a complete separation of flavors and charge conjugation combinations, only know-
ing the moment for each GPD is not sufficient to map out the full x dependence. As will be
further discussed in the following sections, new types of processes with enhanced sensitivity
to the GPD x dependence are needed.
4.4        Sensitivity to x-dependence of GPDs: a general
           discussion
As we have seen in Sec. 4.3, the DVCS and similar processes only probe the GPDs via certain
moments, which are not sufficient to determine the full x dependence. In this section, we
give a systematic discussion on the sensitivity of different SDHEPs to the x-dependence of
GPDs. As shown in Eq. (4.104), the x dependence is probed by the hard part, which is a
function of the kinematic variables ŝ, qT or θ, and ϕ. While ŝ is determined by ξ in Eq. (4.97)
and the ϕ dependence is determined by the spin structures as in Eq. (4.104), which can be
useful to disentangle different GPD components, only the qT or θ dependence can be closely
connected to the x dependence of GPD convolutions.
      Here we are considering the x-sensitivity from the tree-level hard part C(x, Q), where
Q is the external observable(s) not associated with the diffractive hadron;2 in the context
of SDHEP kinematics in Sec. 4.2, Q can be qT or θ of the final-state particle C or D. We
consider the two types of sensitivity:
    2
      Even though the GPD variable ξ is also in the hard coefficient C and is directly observable from the
diffracted hadron momentum, we do not consider it to be included in Q, but instead it always comes with
x and is suppressed in C(x, Q).
                                                   273


  (I) Moment-type sensitivity: C(x, Q) factorizes into an x-dependent part and Q-dependent
      part,
                                        C(x, Q) = G(x) T (Q).                          (4.154)
      In this case, the measurement of the Q distribution, which is fully captured by the
      predictable T (Q), does not help in probing the x-dependence of GPDs, and all the
      sensitivity is in the moment-type quantity
                                        Z 1
                                            dx G(x) F (x, ξ, t).                       (4.155)
                                         −1
      We call a process with only moment-type sensitivity a type-I process.
 (II) Enhanced sensitivity: C(x, Q) does not factorize, in the sense of Eq. (4.154). Then, the
      distribution of Q depends on the detailed x distribution in the GPD. To some extent,
      Q is the “conjugate variable” of x, and they are related in the amplitude
                                            Z 1
                                 M(Q) ∼         dx C(x, Q) F (x, ξ, t)                 (4.156)
                                             −1
      through the transformation kernel C(x, Q), which is, in general, not invertible, of
      course. We call a process with enhanced sensitivity a type-II process.
    Only having moment-type sensitivity is far from enough, even with next-to-leading-order
hard coefficients and evolution effects included [Bertone et al., 2021], as also confirmed in
practical fits of GPDs [Diehl et al., 2005; Hashamipour et al., 2020, 2022; Guo et al., 2022].
Given the complicated functional dependence of the GPD on x plus its entanglement with
ξ and t variables, one should have as much enhanced sensitivity as possible while having as
                                              274


many independent moment constraints. Among the processes that have been studied in the
literature, only the DDVCS [Guidal and Vanderhaeghen, 2003], photoproduction of photon-
meson pair [Boussarie et al., 2017; Duplančić et al., 2018, 2023a,b; Qiu and Yu, 2023b], and
meson-production of diphoton [Qiu and Yu, 2022] processes are type-II processes, and all the
other processes [Ji, 1997b; Radyushkin, 1997; Brodsky et al., 1994; Frankfurt et al., 1996;
Berger et al., 2002a, 2001; Pedrak et al., 2017] belong to type I.
                                                                           q1
                                                          p2
                          p2 q1        q2
                                                             l1   q     l2
                             l1     l2
                          k            k ′                                  q2
                                                         k            ′
                                                                    k
                                (a)                            (b)
Fig. 4.4: Sample diagrams for the hard scattering of the single diffractive (a) photoproduction
of diphoton process, and (b) photoproduction of photon-meson pair process. The red thick
lines indicate the propagators in the hard part, and the blue lines are amputated parton
lines that are put on shell and massless.
     A careful examination of the denominator structure of the LO hard part of the partonic
scattering can help understand and identify the difference in the x-sensitivity between these
two types of processes. The type-I processes have one common feature that every internal
propagator can be made to have one end connect to two on-shell massless external lines,
whether the external line is an amputated parton line or a real massless particle.
     Take the photoproduction of diphoton process, with one of its hard scattering diagrams
in Fig. 4.4(a), as an example, the propagator of momentum l1 is connected to an amputated
parton line of on-shell momentum k = (x + ξ)P̂ and the incoming photon line of momentum
p2 , while the propagator of momentum l2 is connected to an amputated parton line of
momentum k ′ = (x − ξ)P̂ and the outgoing photon line of momentum q2 . In the c.m. frame
                                             275


of the hard exclusive collision as defined in Fig. 4.1, we have
                                                                                                 p
     P̂ µ = P + , 0− , 0T ,        pµ2 = 0+ , p−
                                               2 , 0T ,      ∆+ = p+    1 = 2ξP
                                                                                   +
                                                                                       = p− 2 =      ŝ/2 ,  (4.157)
and the final-state momenta q1 and q2 , which define the hard scale qT ,
                           √                 r                   r                      !
                               ŝ                ŝ 1  + cos θ      ŝ 1  − cos  θ
                   q1µ =          (1, n) =                     ,                   , qT       ,             (4.158a)
                             2                   2       2          2       2
                                                                                           lc
                           √                    r                   r                         !
                               ŝ                   ŝ 1 − cos  θ      ŝ 1 +  cos  θ
                   q2µ =          (1, −n) =                       ,                   , −qT        ,        (4.158b)
                             2                      2      2           2      2
                                                                                                lc
where we present them first in terms of Cartesian coordinates with n being a unit spatial
vector defined as ⃗q1 /|⃗q1 | and then in light-front coordinates, and we also introduced the polar
                                   √
angle θ to represent qT (= ŝ sin θ/2).
    With all external momenta defined in Eqs. (4.157) and (4.158), we can express the vir-
tuality of the internal momentum l1 as
                                                                       x+ξ
                            l12 = 2k · p2 = 2(x + ξ)P̂ · p2 =                 ŝ ≡ xξ ŝ ,                   (4.159)
                                                                         2ξ
where xξ = (x + ξ)/2ξ is the same as the z variable defined in Sec. 4.3.1. Similarly, we have
the virtuality of the other internal momentum l2 as
                           l22 = 2k ′ · q2 = 2(x − ξ)P̂ · q2 = x′ξ · cos2 (θ/2) ŝ ,                         (4.160)
where x′ξ = (x − ξ)/2ξ = xξ − 1. And then the hard coefficient of the diagram Fig. 4.4(a)
                                                         276


takes a factorized form,
                                                 "                    #
                                    1                      1                   1
          C(x, ξ, cos θ) ∝ 2           2
                                               ∝              ′
                                                                        ·    2
                                                                                           (4.161)
                            (l1 + iε)(l2 + iε)     (xξ + iε)(xξ + iε)     cos (θ/2)
in which the dependence on θ (or equivalently, qT ) is factorized from the momentum fraction
x of the relative momentum of the active [q q̄] pair.
     This is an immediate consequence of having the internal propagator directly connected to
two external on-shell massless particles. Generally, as a result of connecting to two on-shell
massless lines with momenta r1 and r2 , the virtuality of the internal propagator is just a
product r1 · r2 , which simply factorizes into a GPD-x (or DA-z) dependent factor and a
factor that depends on the external observable such as θ in Eq. (4.160). This example also
indicates that the poles of x take place at xξ = 0 or x′ξ = 0, that is, x = ± ξ, which are at
the boundary points between the DGLAP and ERBL regions.
     In contrast, a type-II process has at least one internal line in the hard part that cannot be
made to have either end connect to two on-shell massless lines. We take the photoproduction
of a photon-meson pair as an example, for which one hard scattering diagram is shown in
Fig. 4.4(b). The kinematics is the same as in Eqs. (4.157) and (4.158), and two of the
propagators, l1 and l2 , are the same as the previous diphoton production example, given in
Eqs. (4.159) and (4.160).
     However, the gluon propagator q is connected to l1 on one end and l2 on the other end,
both of which are not on shell. Letting the outgoing quark line along q1 have its momentum
zq1 , we have the gluon momentum,
                             q = k + p2 − zq1 = (x + ξ)P̂ + p2 − zq1 ,                     (4.162)
                                                 277


which has the virtuality
                                                                         
                             q 2 = ŝ xξ 1 − z sin2 (θ/2) − z cos2 (θ/2) .            (4.163)
This leads to a hard coefficient that does not take a simple factorized form to separate the
(xξ , z) dependence from the observable θ, and therefore the distribution of θ contains extra
sensitivity to the shape of x and z in the GPD and DA, respectively.
     Compared to Eq. (4.161), the gluon propagator in Eq. (4.163) leads to some new poles
of x, at
                                      z cos2 (θ/2)
                             xξ =                    ∈ [0, 1], for z ∈ [0, 1],        (4.164)
                                    1 − z sin2 (θ/2)
which corresponds to x ∈ [−ξ, ξ], and thus lies in the ERBL region. These are not pinched
poles, so do not pose any theoretical obstacles, but are just the regions where we need to
deform the contour of x to avoid them.
     Switching the roles of p2 and q1 in Fig. 4.4(b), we can get the diphoton mesoproduction
process, as the kinematically crossing counterpart of the photon-meson pair photoproduction.
The same gluon now has the momentum
                     q ′ = k − q1 + (1 − z)p2 = (x + ξ)P̂ − q1 + (1 − z)p2 ,          (4.165)
with the virtuality
                                                                             
                         q ′2 = ŝ xξ cos2 (θ/2) − z − (1 − z) cos2 (θ/2) ,           (4.166)
                                                    278


which equally does not factorize. It gives another new pole of x at
                         cos2 (θ/2) (1 − z)
                  xξ =                         ∈ [1, ∞) ∪ (−∞, 0], for z ∈ [0, 1],       (4.167)
                            cos2 (θ/2) − z
which corresponds to |x| ≥ ξ and lies in the DGLAP region. This process therefore differs
from the photoproduction one by giving complementary sensitivity to the x dependence.
   Similarly, in Fig. 4.4(a), if we make the photon q2 virtual in the diphoton production
process, the photon momenta in Eq. (4.158) will become
                             √                            √
                               ŝ                           ŝ
                     q1µ =        (1 − ζ)(1, n) ,   q2µ =      (1 + ζ, −(1 − ζ)n) ,      (4.168)
                              2                            2
where ζ = Q′2 /ŝ with Q′2 = q22 being the virtuality of the photon q2 that decays into a lepton
pair. Then the propagator l2 becomes
                                                                         
                          l22 = ŝ x′ξ cos2 (θ/2) + ζ 1 + x′ξ sin2 (θ/2) ,               (4.169)
which differs from Eq. (4.160) by having an additional term proportional to ζ that introduces
an extra scale dependence. By varying ζ and θ, one can get extra sensitivity to the x-
dependence of the GPD. This is the same mechanism that gives the enhanced x-sensitivity
as the DDVCS process [Guidal and Vanderhaeghen, 2003]. This propagator [Eq. (4.169)]
leads to a new pole of x at
                                        −ζ
                    x′ξ =                               ∈ [−1, −ζ], for θ ∈ [0, π],      (4.170)
                           cos2 (θ/2)   + ζ sin2 (θ/2)
that is x ∈ [−ξ, (1 − 2ζ)ξ] ⊂ [−ξ, ξ], which is again inside the ERBL region.
                                                    279


     By comparison, the type-I processes are usually topologically or kinematically simpler
than the type-II processes, so their theoretical analysis and hard coefficient calculations are
usually easier. The type-II processes introduce enhanced sensitivity to the x dependence by
having extra scale dependence that entangles with the x flow. For the three type-II exam-
ples we have just examined, the photon-meson pair production and the diphoton production
processes differ from the DVMP process by having one extra photon attaching to the ac-
tive parton lines, while the virtual photon production process differs from the real photon
production process by having an extra scale Q′ which is in turn achieved by having that
photon decay into two leptons. In general, extra scale dependence is introduced by more
complicated topology,3 which is usually the necessary condition for enhanced sensitivity.
     One important role that the SDHEP plays is that it sets a template for listing a number
of processes, which we have categorized according to the beam types. We have shown the
proof of factorization in a general sense. Within this framework, one shall study as many
independent processes as possible, which should in turn constrain the x dependence of GPDs
as much as possible.
4.5        Shadow GPDs
Only having type-I processes has an intrinsic problem that it is always possible to construct
a function SG (x, ξ, t), called a shadow GPD [Bertone et al., 2021], which has a vanishing
moment in Eq. (4.155) and forward limit,
                            Z 1
                                dx SG (x, ξ, t) G(x, ξ) = 0,     SG (x, 0, 0) = 0.                (4.171)
                             −1
   3
     Here, we consider virtual or massive particles as having more complicated topology than real massless
particles, even in the case when the mass scale is not associated with virtual particle decay.
                                                     280


Without including evolution effects and high-order corrections, shadow GPDs can never
be disentangled from the “regular” GPDs in type-I processes. From the discussion below
Eq. (4.161), the kernels G(x, ξ) at LO for type-I processes are limited to 1/(x − ξ ± iϵ) and
1/(x + ξ ± iϵ).4 Hence it is straightforward to construct analytic shadow GPDs at LO.
     We define the shadow GPDs S(x, ξ) as having null forward limits and moment integrals
in Eq. (4.120), while having the same polynomiality and time reversal properties as normal
GPDs. That is, we require
                                                                                 Z 1
                                                                                        S(x, ξ)
   S(x, −ξ) = S(x, ξ),      S(±1, ξ) = 0,      S(x, 0) = 0,       S(±ξ, ξ) = 0,      dx           = 0,
                                                                                  −1     x−ξ
                                                                                                 (4.172)
and the (n + 1)-th moment of S(x, ξ) to be an even polynomial of ξ of at most n-th order,
                                Z  1                    Xn
                                     dx xn S(x, ξ) =            (2ξ)i Sn+1,i .                   (4.173)
                                  −1                  i=0,2,···
Note that we have dropped the t dependence in S, which may be introduced [Moffat et al.,
2023] to relax the small ξ suppression (due to S(x, 0) = 0) in Eq. (4.172), and a possible ξ n+1
term in Eq. (4.173) which is associated with the D-term. We will construct a shadow D-term
separately below. As we have seen in Sec. 4.3.1 and will also see in Secs. 4.6.2 and 4.7.2, it
is either the C-even or C-odd GPD combination that enters the scattering amplitudes, so we
require S(x, ξ) to be either odd or even in x, when it is to be added to the GPDs H(x, ξ, t)
                     e ξ, t) and E(x,
and E(x, ξ, t) or H(x,                e ξ, t). This has allowed us to leave out the condition
R1
  −1
     dx S(x, ξ)/(x + ξ) = 0 in Eq. (4.172) from which it can be inferred. Besides, we also
require the first moment of the shadow GPD to vanish since that can be constrained by the
   4
     In the most general case, they can also be multiplied by powers of x and ξ, whose integrals with the
GPDs can be reduced to simple moments by use of identities like x/(x − ξ + iϵ) = 1 + ξ/(x − ξ + iϵ).
                                                   281


electromagnetic form factor measurements (see Eq. (4.61)), i.e.,
                                       Z 1
                                           dx S(x, ξ) = S1,0 = 0.                        (4.174)
                                        −1
    The conditions in Eq. (4.172) lead to some general constraints on the shadow GPDs. In
low energy scattering such as at JLab, the accessible ξ values are small, ξ ≪ 1. The zeros
at x = ±ξ then severely constrain the shadow GPD values in the ERBL region, which can
only grow up to a certain power of ξ. In this case, the integral in Eq. (4.174) and the last
equation in Eq. (4.172) mainly receive contributions from the DGLAP region, which must
be highly suppressed. As a result, the shadow GPDs must have extra zeros in the DGLAP
region, but not necessarily in the ERBL region.
    As we shall see later, such oscillation will strongly suppress the contributions from the
DGLAP regions of shadow GPDs to the special integrals in Eqs. (4.213) and (4.252), which we
have briefly discussed in Sec. 4.4. Even though it is the special integrals that distinguish the
type-II processes from type-I ones by producing cos θ distributions that depend sensitively
on the GPDs and have capability of disentangling shadow GPDs, the oscillation of the latter
at small ξ makes them difficult to be probed. As a consequence, in the examples to be shown
in Secs. 4.6.4 and 4.7.4, large weights are needed to produce significant effects.
    In contrast, at a larger ξ, the ERBL region can become stronger, and such constraints
no longer exist.
    To construct specific models for shadow GPDs, we choose the following ansatz,
                   Ŝ(x, ξ; n, a, b; c) = K0 ξ 2 xa (x2 − ξ 2 ) (1 − x2 )b · Q2n (x, c), (4.175)
                                                   282


where a ≥ 0 and b, n > 0 are integers, and Q2n (x, c) = 1 + c x2 + · · · + q2n (c) x2n is an even
x polynomial of (2n)-th order. This parametrization automatically satisfies the first four
conditions in Eq. (4.172). Since it is only a fourth order polynomial of ξ, the polynomiality
condition can also be readily satisfied. We have fixed the power of (x2 − ξ 2 ) to be unity; a
higher power further suppresses the ERBL region and would lead to an even smaller impact
on the integrals in Eqs. (4.213) and (4.252). For given a and b, we choose n to be the
minimum integer such that Eq. (4.175) satisfies all the conditions in Eqs. (4.172)(4.173)
and (4.174). The single parameter c is allowed to tune the shape of the shadow GPD.
For any given choice, we fix the normalization K0 (which is independent of ξ) such that
R1
 −1
    dx [Ŝ(x, ξ; n, a, b; c)]2 = 1 when ξ = 0.1.
    We choose the GK model as the standard GPDs, H0 (x, ξ, t) and H                     e 0 (x, ξ, t), and vary
them by adding shadow GPDs to the u quark GPD. For the unpolarized GPD, since the
H + that will enter the special integrals in Eqs. (4.213) and (4.252) is an odd function of x,
we choose n = 3 and (a, b) = (1, 2) or (1, 6). The parameter c is chosen to maximize the
integral in Eq. (4.174) from the DGLAP region, which gives c = −11 or −17, respectively.
That is, we defined two shadow GPDs,
                 S1 (x, ξ) = Ŝ(x, ξ; 3, 1, 2; −11),      S2 (x, ξ) = Ŝ(x, ξ; 3, 1, 6; −17),            (4.176)
which, added to the GK model H0u , make up two other models,
                           Hiu (x, ξ, t) = H0u (x, ξ, t) + Si (x, ξ),   for i = 1, 2.                    (4.177)
Similarly, for the polarized GPD, we choose n = 3 and (a, b) = (0, 2) or (0, 6), and c = −24
                                                       283


or −40. This gives two other shadow GPDs,
                Se1 (x, ξ) = Ŝ(x, ξ; 3, 0, 2; −24),     Se2 (x, ξ) = Ŝ(x, ξ; 3, 0, 6; −40),   (4.178)
and GPD models with
                         e u (x, ξ, t) = H
                         H                e u (x, ξ, t) + Sei (x, ξ),     for i = 1, 2.         (4.179)
                           i                0
The GPDs of other flavors are kept unchanged from the GK model.
    For the unpolarized GPD, an additional term proportional to mod(n, 2)(2ξ)n+1 can exist
on the right hand side of Eq. (4.173), just as in Eq. (4.56). This comes from the D-term in
the double distribution representation,
                  Z  1     Z  1−|β|                                                               
     q
   H (x, ξ, t) =       dβ           dα δ(x − β − ξα) f q (β, α, t) + sgn(ξ) Dq (x/ξ, t) θ ξ 2 − x2 ,
                   −1        −1+|β|
                                                                                                (4.180)
where Dq (x, t) is an odd function of x. Now we can construct a shadow GPD specifically for
this term. Similarly, to retain the conditions in Eq. (4.172), we drop the t dependence and
choose the “shadow D-term” Ds (x) such that
                                                                   Z   1
                                                                             Ds (x)
                       Ds (−x) = −Ds (x),         Ds (1) = 0,            dx         = 0,        (4.181)
                                                                      −1     x−1
where the subscript “s” is to remind that this D-term is to be part of the shadow GPD,
but not to be the requirement for the D-terms in the normal GPDs. Note that since the
D-term automatically disappears in the forward limit, its magnitude does not necessarily
suffer from the suppression when ξ is small. Because of the last condition in Eq. (4.181), a
                                                      284


                                                         u
          H u(x, ξ = 0.2, t = -0.2 GeV2, μ = 2 GeV)     H (x, ξ = 0.2, t = -0.2 GeV2, μ = 2 GeV)
     5
     0
                                                                               
                            H0 = HGK                                     H 0 = H GK
                                                                                   
                            H1 = H0 + S1                                 H 1 = H 0 + S1
                                                                                   
                            H2 = H0 + S2                                 H 2 = H 0 + S2
    -5
                            H3 = H0 + Ds
       -1         -0.5          0        0.5        1 -1       -0.5           0         0.5      1
                                x                                             x
                                                         u
         H u(x, ξ = 0.4, t = -0.2 GeV2, μ = 2 GeV)     H (x, ξ = 0.4, t = -0.2 GeV2, μ = 2 GeV)
    40
    20
     0
                           H0
                                                                   
   -20                     H1                                     H0
                           H2                                      
                                                                  H1
                           H3                                      
   -40                                                            H2
       -1         -0.5          0        0.5        1 -1       -0.5           0         0.5      1
                                x                                             x
Fig. 4.5: GPD models given in Eqs. (4.177)(4.179) and (4.183). The upper row is for a small
ξ = 0.2 while the lower row is for a large ξ = 0.4. In the lower row, the thick black lines are
the GK model, which are small compared to the shadow GPDs and hard to notice in the
figure.
                                                  285


shadow D-term cannot be probed by the dispersion relation in the DVCS data [Girod et al.,
2008; Jo et al., 2015], but it can modify the D-term in the gravitational form factor. We
choose the ansatz for the shadow D-term
                                                                      
                                                          7
                Ds (x) = J0 x (1 − x ) · 1 + c x − (3c + 5)x4
                                     2             2
                                                                         θ(1 − x2 ) ,       (4.182)
                                                         15
                                                                   R1
with c = 50 and the normalization factor J0 chosen to make           −1
                                                                        dx Ds2 (x) = 1. Adding this
to the u quark GPD H0u gives another GPD model,
                             H3u (x, ξ, t) = H0u (x, ξ, t) + Ds (x/ξ).                      (4.183)
    These GPD models are shown in Fig. 4.5 for the u quark at a small ξ = 0.2 and a large
ξ = 0.4, with a fixed t = −0.2 GeV2 and evolution scale µ = 2 GeV, for the GK model. As
expected, at a small ξ, the shadow GPDs are small in the ERBL region, being dominated
by the DGLAP region, whereas at a larger ξ = 0.4, one can immediately notice that the
shadow GPDs (not the shadow D-term) scale up with ξ very rapidly and the ERBL region
becomes dominant.
4.6       Single diffractive hard exclusive diphoton mesopro-
          duction
Now let us study the single diffractive hard exclusive diphoton production in nucleon-pion
collisions,
                           N (p) + π(p2 ) → N ′ (p′ ) + γ(q1 ) + γ(q2 ),                    (4.184)
                                               286


which was introduced in [Qiu and Yu, 2022]. Here N can be a proton (p) or a neutron (n)
and π can be π − or π + , making up various exclusive processes, such as p π − → nγγ and
n π + → pγγ, and those that could be measured with a pion beam at J-PARC [Aoki et al.,
2021] and AMBER [Adams et al., 2018] experiments. The pion beam can also be replaced
by a kaon beam and makes up more processes. The exclusive process, p π − → nγγ, could be
made analogous to the π + π − collision by thinking of the p → n transition as taking a virtual
π + out of the proton, carrying momentum ∆ = p − p′ and colliding with π − to produce two
hard photons exclusively.
    As discussed in Sec. 3.2.4.1, the charged meson beam requires a flavor change of the
nucleon and forbids the γ ∗ -mediated channel at n = 1. The leading contribution to the
scattering amplitude of Eq. (4.184) thus comes from the quark-antiquark pair exchange at
n = 2 (the gluon pair exchange is also forbidden by the charge conservation), which is
factorized into nucleon transition GPDs and the pion DA, as argued in Sec. 3.2.4.1. Taking
the p π − → nγγ process as an example, the factorization formula of the amplitude is
                    Z 1    Z   1    h
        Mλ 1 λ 2 =      dx       dz Fpnu           eλ1 λ2 (x, ξ, z; ŝ, θ, ϕ)
                                         (x, ξ, t) C
                     −1      0
                                                                                    i
                                          +Fepn
                                              u
                                                (x, ξ, t) Cλ1 λ2 (x, ξ, z; ŝ, θ, ϕ) Dd/π− (z),     (4.185)
where we have suppressed the factorization scale, and λ1 and λ2 are the helicities of the final-
state two photons in the SDHEP frame. The polarized nucleon transition GPD Fepn                   u
                                                                                                    (x, ξ, t)
                                                                  u
is defined in Eq. (3.142), and the unpolarized one Fpn              (x, ξ, t) can be similarly obtained by
removing the γ5 . Both take form factor decompositions similar to Eq. (4.1),
                                                                                     
        u                1       ′   ′    u             +       u            iσ +α ∆α
      Fpn (x, ξ, t) =       ū(p , α ) Hpn (x, ξ, t)γ − Epn (x, ξ, t)                   u(p, α),   (4.186a)
                       2P +                                                     2m
                                                     287


                                                                                     +
                                                                                        
                          1                                                      γ5 ∆
       Fepn
          u
            (x, ξ, t) =           ′   ′
                             ū(p , α ) H  e pn (x, ξ, t)γ γ5 − E
                                             u            +        epn (x, ξ, t)
                                                                     u
                                                                                           u(p, α),         (4.186b)
                        2P +                                                      2m
thereby defining the flavor transition GPDs Hpn             u      u
                                                               , Epn  ,He pn
                                                                          u
                                                                             , and E epn
                                                                                      u
                                                                                         , which have the same
isospin relations to the flavor-diagonal GPDs as Eq. (3.144),
                                    u
                                 Hpn   (x, ξ, t) = Hpu (x, ξ, t) − Hpd (x, ξ, t),                             (4.187)
and similarly for Epn   u
                          ,He u , and E  e u . The DA Dd/π− (z) is similarly defined as Eq. (3.58),
                             pn           pn
                          Z
                              dy + izp−2 y+
             Dd/π− (z) =            e         ⟨0|ū(0)γ − γ5 Φ(0, y + n̄; n̄) d(y + )|π − (p2 )⟩,             (4.188)
                               4π
which can be related to the DA Du/π+ (z) in Eq. (3.58) by isospin symmetry,
                                                                        ifπ
                                   Dd/π− (z) = −Du/π+ (z) = −               ϕ(z).                             (4.189)
                                                                         2
Similar factorization formulae can be written for the n π + → pγγ and other processes,
with the GPDs replaced by Fnp         d
                                        (x, ξ, t) and Fenp d
                                                              (x, ξ, t), which are equal to Fpn       u
                                                                                                        (x, ξ, t) and
Fepn
  u
     (x, ξ, t) by isospin symmetry, and the DA by Du/π+ (z).
4.6.1        Calculation of the hard coefficients
The hard coefficients C     eλ1 λ2 and Cλ1 λ2 are the helicity amplitudes of the collision between
two collinear quark-antiquark pairs, [q1 q̄2 ] from the diffracted nucleon which carries light-like
momenta (ξ ± x)P + n̄, respectively, and [q2 q̄1 ] from the annihilated pion carrying light-like
momenta zp−                         −                                   −
                2 n and (1 − z)p2 n respectively. For the pπ collision, we have (q1 , q2 ) = (u, d),
while for the nπ + collision, we have (q1 , q2 ) = (d, u). In both processes, C                  eλ1 λ2 is obtained
                                                         288


by contracting the amputated [q1 q̄2 ] legs with the spinor projector γ · n̄/2, and Cλ1 λ2 with
γ5 γ · n̄/2. The amputated [q1 q̄2 ] legs are contracted with γ5 γ · n/2 in any cases. In this way,
the hard coefficient C  e associated with the unpolarized GPD F is parity-odd (P-odd), while
the C associated with the polarized GPD Fe is parity-even (P-even). The hard coefficients
are then obtained by averaging over the colors of each quark pair, and then multiplied by
an extra factor ŝ/2.
    Note that we are taking the convention for both partons from the diffracted nucleon to
enter the hard collisions. This makes the calculation of the hard coefficients simpler and the
two-stage paradigm more transparent. By a variable change z1 = (x + ξ)/2ξ, the [q1 q̄2 ] pair
carries momenta z1 p+                      +
                        1 n̄ and (1 − z1 )p1 n̄. To ease the notation, we also denote z2 = z. In the
c.m. frame of the hard collision, we take
                                                   r       s
                                              −       ŝ       ξs
                                     p+1 = p2 =          =         .                         (4.190)
                                                      2      1+ξ
The hard-scattering diagrams are shown in Fig. 4.6. They are classified into two types:
(A) there are 8 type-A diagrams for which the two photon lines are attached to two different
fermion lines, which are denoted as (A1 )–(A4 ), and (A′1 )–(A′4 ) that are obtained by switching
the two photons, and (B) there are 12 type-B diagrams for which both photon lines are
attached to one single fermion line, where (B1 )–(B3 ) and (B1′ )–(B3′ ) have both photons
attached to the q1 quark line that carries electric charge e1 e, and (B4 )–(B6 ) and (B4′ )–(B6′ )
have them attached to the q2 quark line that carries electric charge e2 e.
    By defining two auxiliary light-like vectors w and w̄ as
             1                         1                     1
       w̄ = √ (1, ⃗q1 /|⃗q1 |) , w = √ (1, −⃗q1 /|⃗q1 |) = √ (1, ⃗q2 /|⃗q2 |) , w · w̄ = 1,  (4.191)
              2                          2                    2
                                                    289


                q1
      z1 p̂1             z̄2 p̂2
                   q
      z̄1 p̂1   q2       z2 p̂2
               (A1 )                           (A2 )         (A3 )                  (A4 )
                                 q1 q2
                     z1 p̂1            z̄2 p̂2
                                     q
                     z̄1 p̂1           z2 p̂2
                                 (B1 )               (B2 )              (B3 )
                     z1 p̂1            z̄2 p̂2
                                     q
                     z̄1 p̂1     q1 q2 z2 p̂2
                                 (B4 )               (B5 )              (B6 )
Fig. 4.6: Hard scattering diagrams for the diffractive nucleon-pion scattering into a photon
pair. The two incoming quark lines on the left are from the diffracted nucleon, carrying
momenta z1 p̂1 and z̄1 p̂1 ≡ (1 − z1 )p̂1 , respectively, where p̂1 = (∆ · n)n̄. The two incoming
quark lines on the right are to form the annihilated pion, carrying momenta z2 p̂2 and z̄2 p̂2 ≡
(1 − z2 )p̂2 , respectively, where p̂2 = (p2 · n̄)n. The variables z1 and z2 are related to x and z
by z1 = (x + ξ)/2ξ and z2 = z (see the text). Another set of diagrams are also to be included
by switching the two photon lines, giving 20 diagrams in total.
                                                     290


we can write the photon momenta as
                                              p                                 p
                        q1 = (q1 · w)w̄ = w̄    ŝ/2,   q2 = (q2 · w̄)w = w        ŝ/2,     (4.192)
with the scalar products,
                  w · n = w̄ · n̄ = (1 − cos θ)/2,      w · n̄ = w̄ · n = (1 + cos θ)/2.     (4.193)
   For type-A diagrams, the intermediate gluon propagator carries the hard transverse mo-
mentum qT ,
                                                           r
                                                              ŝ
                  qA = −z1 p̂1 − (1 − z2 )p̂2 + q1 = −           (z1 n̄ + (1 − z2 )n − w̄) , (4.194)
                                                              2
with a virtuality,
                   ŝ 
          qA2 = −       z1 z2 + (1 − z1 )(1 − z2 ) − cos θ [z1 z2 − (1 − z1 )(1 − z2 )] .    (4.195)
                   2
Switching the two photons changes q1 and w̄ in Eq. (4.194) by q2 and w, respectively, and
results in a similar gluon virtuality,
                    ŝ 
          qA2 ′ = −      z1 z2 + (1 − z1 )(1 − z2 ) + cos θ [z1 z2 − (1 − z1 )(1 − z2 )] .   (4.196)
                    2
The type-A gluons have special propagators whose dependence on the external observable
θ cannot be simply factored out, so they induce enhanced sensitivity to the x dependence
of GPDs, following the criterion in Sec. 4.4. In contrast, the type-B diagrams have simple
                                                     291


gluon propagators whose virtualities are
                                    qB2 = (1 − z1 )z2 ŝ,       qB′2 = z1 (1 − z2 )ŝ,                           (4.197)
for diagrams (B1 , B1′ , · · · , B3 , B3′ ) and (B4 , B4′ , · · · , B6 , B6′ ), respectively. These simply fac-
torize into factors that only depend on z1 and z2 , but not θ at all, following the structure
of Eq. (4.154). Similar are the quark propagators for all diagrams. As a result, the type-B
diagrams only yield moment-type sensitivity.
    Denoting the hard coefficients as
                    Cλ1 λ2 (z1 , z2 ; ŝ, θ) = Cµν (z1 , z2 ; ŝ, n, n̄, w, w̄) ϵµ∗           ν∗
                                                                                     λ1 (q1 )ϵλ2 (q2 ),         (4.198a)
                    Ceλ1 λ2 (z1 , z2 ; ŝ, θ) = C eµν (z1 , z2 ; ŝ, n, n̄, w, w̄) ϵµ∗ (q1 )ϵν∗ (q2 ),          (4.198b)
                                                                                     λ1       λ2
the γ5 structures in the corresponding fermion spinor traces determine that C                           eµν contains one
antisymmetric Levi-Civita tensor, while Cµν does not. The Ward identities
                                 w̄µ C eµν = C eµν wν = w̄µ C µν = C µν wν = 0                                   (4.199)
then allows us to decompose them into gauge-invariant tensor structures,
                                                          
                                         µν      n̄µ⊥ n̄ν⊥
  C µν
       =  C+ (−g⊥  µν
                      ) + C−         −g⊥    +2 2             + Cw n̄µ⊥ wν + Cw̄ w̄µ n̄ν⊥ + Cw̄w w̄µ wν , (4.200a)
                                                    n̄⊥
               µ ν n̄⊥ ww̄                                    µ ν n̄⊥ ww̄                       
   eµν     e     n̄⊥ ϵ           − ϵµn̄⊥ ww̄ n̄ν⊥          e       n̄⊥ ϵ          + ϵµn̄⊥ ww̄ n̄ν⊥
  C    = i C+                                         + iC−
                                n̄2⊥                                             n̄2⊥
         + iCew̄ w̄µ ϵν n̄⊥ ww̄ + iC  ew ϵµn̄⊥ ww̄ wν ,                                                         (4.200b)
                                                            292


where we defined
                                                                                      µν
                                  g⊥ µν
                                         = g µν − wµ w̄ν − w̄µ wν ,         n̄µ⊥ = g⊥     n̄ν .                (4.201)
When contracting with the polarization vectors as Eq. (4.198), the terms in Eq. (4.200) that
are proportional to w̄µ or wν vanish. The helicity amplitudes are purely determined by the
first two tensor structures,
                                                      N                               N
                                C++ = C−− =                C+ , C+− = C−+ =                 C− ,
                                                       ŝ                              ŝ
                              e++ = −C      e−−       N e         e+− = −C      e−+ = N C       e− ,
                             C                     =       C+ , C                                              (4.202)
                                                       ŝ                                  ŝ
where N = 2ie2 g 2 CF /Nc and the four independent hard coefficients are
                           2        z1 z2 + (1 − z1 ) (1 − z2 )
  2ξC+ = (e1 − e2 )2 2 · P
                          sθ          z1 z2 (1 − z1 )(1 − z2 )
                                                                                                 
        2iπ        2      2      δ(z1 ) δ(1 − z1 )                         δ(z1 ) δ(1 − z1 )
     + 2 · (e1 − e2 )                     −                  − 2e1 e2             +                      ,     (4.203)
         sθ                        z2           1 − z2                       z2         1 − z2
                           2           z1 + z2 − 2z1 z2                                      z1 − z2
  2ξC− = (e1 − e2 )2        2
                               ·P                                + (e21 − e22 )P
                          sθ        z1 z2 (1 − z1 )(1 − z2 )                      z1 z2 (1 − z1 )(1 − z2 )
                    2e1 e2                         (z1 (1 − z1 ) + z2 (1 − z2 )) (z1 z2 + (1 − z1 )(1 − z2 ))
     +P                                 ·
          z1 z2 (1 − z1 )(1 − z2 ) (2z1 z2 + (1 − cθ )(1 − z1 − z2 )) (2z1 z2 + (1 + cθ )(1 − z1 − z2 ))
                                                                                                        
                         2     δ(1 − z1 )         δ(z1 )          2      2 2       δ(z1 )        δ(1 − z1 )
     − iπ (e1 − e2 )                         +               − (e1 − e2 ) 2                   −
                                    z2           1 − z2                     sθ 1 − z2                 z2
                                 2
                                                                
                          1 + sθ δ(1 − z1 )               δ(z1 )        1
             + 2e1 e2          2
                                                     +              + 2×
                              sθ             z2           1 − z2        sθ
                                                                                                   
                                 2                                      cθ                  1
                    × sgn cθ/2 − z2 δ(z1 − ρ(z2 ))                                 −
                                                                    z2 (1 − z2 ) c2θ/2 − z2
                                                                                                      
                                            2
                                                                            cθ                 1
                        + sgn z2 − sθ/2 δ(z1 − ρe(z2 ))                              −                       , (4.204)
                                                                       z2 (1 − z2 ) z2 − s2θ/2
     e+ = (e1 − e2 )2 −2 · P
  2ξ C
                                            1 − z1 − z2
                             2
                           sθ        z1 z2 (1 − z1 )(1 − z2 )
                                                                                                 
        2πi        2      2      δ(1 − z1 ) δ(z1 )                         δ(z1 ) δ(1 − z1 )
     − 2 · (e1 − e2 )                           +            − 2e1 e2             −                      ,     (4.205)
         sθ                        1 − z2            z2                      z2         1 − z2
                                                              293


    e− = (e1 − e2 )2      2cθ               z1 − z2
 2ξ C                       2
                               ·P
                          sθ       z1 z2 (1 − z1 )(1 − z2 )
                   2e1 e2 cθ                                   (z1 − z2 )(1 − z1 − z2 )2
     +P                              ·
          z1 z2 (1 − z1 )(1 − z2 ) (2z1 z2 + (1 − cθ )(1 − z1 − z2 )) (2z1 z2 + (1 + cθ )(1 − z1 − z2 ))
                                                                                         
        2πi       2     2        δ(1 − z1 )     δ(z1 )                    δ(z1 )   δ(1 − z1 )
     − 2 (e1 − e2 )cθ                        +             − e1 e2 cθ            −
         sθ                          z2         1 − z2                   1 − z2        z2
                                                                                                      
                   z1 − z2            2                                  2                      
              +                 sgn cθ/2 − z2 δ(z1 − ρ(z2 )) − sgn sθ/2 − z2 δ(z1 − ρe(z2 ))              ,
                 z2 (1 − z2 )
                                                                                                        (4.206)
where P indicates that the hard coefficients should be understood in the sense of principle-
value integration for z1 (or x), when convoluted with the GPD and DA, and we used the
notation (cθ , sθ , cθ/2 , sθ/2 ) = (cos θ, sin θ, cos(θ/2), sin(θ/2)).
    The special gluon propagators in the type-A diagrams (cf. Eqs. (4.195)(4.196)) introduce
new poles of z1 in addition to 0 and 1,
                        (1 + cos θ)(1 − z2 )        cos2 (θ/2)(1 − z2 )
            ρ(z2 ) =                            =                       ,                              (4.207a)
                          1 + cos θ − 2z2             cos2 (θ/2) − z2
                        (1 − cos θ)(1 − z2 )        sin2 (θ/2)(1 − z2 )
            ρe(z2 ) =                            =                       = 1 − ρ(1 − z2 ),            (4.207b)
                          1 − cos θ − 2z2             sin2 (θ/2) − z2
which have small imaginary parts by the iϵ prescription,
                                                                                 
                             iϵ sgn z2 − cos2 (θ/2) ,        iϵ sgn z2 − sin2 (θ/2) ,                   (4.208)
respectively. In terms of x = ξ(2z1 − 1), Eq. (4.207) translates to the special poles,
                                                                                             
                         (1 + cos θ)(1 − z) + (1 − cos θ)z                  1 − z + tan2 (θ/2)z
   xp (ξ, z, θ) = ξ ·                                               =ξ·                           ,    (4.209a)
                         (1 + cos θ)(1 − z) − (1 − cos θ)z                  1 − z − tan2 (θ/2)z
                                                   
                         tan2 (θ/2)(1 − z) + z
   ep (ξ, z, θ) = ξ ·
   x                                                  = −xp (ξ, 1 − z, θ).                            (4.209b)
                         tan2 (θ/2)(1 − z) − z
                                                         294


As z goes from 0 to 1, the two poles go from ξ to ∞ on the lower half complex x plane, and
then from −∞ to −ξ on the upper half plane. They then cover the whole DGLAP regions
of GPDs.
     We immediately see from Eqs. (4.203)–(4.206) that the e21 or e22 proportional terms,
which come from type-B diagrams, only carry moment-type sensitivity, while the e1 e2 terms
coming also from type-A diagrams carry enhanced z1 (or x) sensitivity. We have organized
Eqs. (4.203)–(4.206) in terms of (e1 − e2 )2 , (e21 − e22 ), and e1 e2 , such that if we had a neutral
pion beam, e1 = e2 = eu or ed , and then the moment-type terms proportional to (e1 − e2 )2
or (e21 − e22 ) would be cancelled, further enhancing the x-sensitivity in e1 e2 terms.
     Charge conjugation symmetry amounts to (z1 , z2 ) ↔ (1 − z1 , 1 − z2 ) (or equivalently,
(x, z) ↔ (−x, 1 − z)) and e1 ↔ e2 . Due to the presence of two γ5 matrices in the evaluation
of C± , the charge-conjugation-even (C-even) (e1 −e2 )2 and e1 e2 are manifestly invariant under
(z1 , z2 ) ↔ (1−z1 , 1−z2 ), while the charge-conjugation-odd (C-odd) (e21 −e22 ) term flips a sign.
Given the symmetry property D(z) = D(1−z) of the DA, the first two terms are probing the
C-even GPD combination Fe+ (x, ξ, t) ≡ Fe(x, ξ, t) + Fe(−x, ξ, t), whereas the (e21 − e22 ) term is
probing the C-odd GPD combination Fe− (x, ξ, t) ≡ Fe(x, ξ, t) − Fe(−x, ξ, t). Similarly, since
there is only one γ5 matrix in the evaluation of C     e± , the C-even (e1 − e2 )2 and e1 e2 terms flip
signs under (z1 , z2 ) ↔ (1 − z1 , 1 − z2 ), while the C-odd (e21 − e22 ) term is manifestly invariant.
As a result, the first two terms are probing the C-even GPD combination F + (x, ξ, t) ≡
F (x, ξ, t) − F (−x, ξ, t), whereas the (e21 − e22 ) term is probing the C-odd GPD combination
F − (x, ξ, t) ≡ F (x, ξ, t) + F (−x, ξ, t).
     The pπ − and nπ + channels differ from each other by changing (e1 , e2 ) = (eu , ed ) to
(ed , eu ), which does not affect the (e1 − e2 )2 and e1 e2 terms, but flips the signs of the (e21 − e22 )
terms. So combining both channels helps distinguish C-even and C-odd GPD components.
                                                   295


4.6.2      Helicity amplitudes
The convolutions of the hard coefficients in Eqs. (4.203)–(4.206) with the GPD and DA
can be simplified by using symmetry property of the DA. Specifically, by introducing the
notation
                                      Z  1      Z   1
                         Cα[F ]   ≡         dx        dz Fe(x, ξ, t) D(z) Cα (x, ξ; z; θ),
                              e
                                        −1        0
                                      Z 1       Z   1
                         Ce[F ] ≡           dx        dz F (x, ξ, t) D(z) C     eα (x, ξ; z; θ),                      (4.210)
                            α
                                        −1        0
with α being any single or double helicity indices, we have
      [Fe]      2D0 n                  2 e+
                                                              h
                                                                  2      2 e−                          e +
                                                                                                                     io
    C+ = −               (e 1  −  e 2 )  F  0  (ξ, t) +   iπ    (e1 −  e 2 ) F   (ξ,  ξ, t) + 2e    e
                                                                                                  1 2 F    (ξ, ξ, t)     ,
               sin2 θ
                                                                                                                     (4.211a)
                                                                              
      [Fe]                2             2 e+                      e +
    C−     = − (e1 − e2 ) D0                  F (ξ, t) + iπ F (ξ, ξ, t)
                                    sin2 θ 0
                                                                          
                  2    2           e −               2iπ e−
             − (e1 − e2 ) D0 F0 (ξ, t) +                   F (ξ, ξ, t)
                                                    sin2 θ
                     Z 1                                                                   
                                   D(z)          1 + cos θ − 2z                    1
             + e1 e2        dz                             2          −                         · I[Fe+ ; ξ, t, z, θ]
                        0        z(1   −  z)           sin θ              1  + cos   θ −  2z
                            2D0        h                                                    i
                                         e +
                       − 2 · F0 (ξ, t) + iπ (1 + sin θ) F (ξ, ξ, t) ,  2      e +
                                                                                                                    (4.211b)
                          sin θ
                2D0                                          2                                                    
    e+[F ]
    C      =−           (e 1  −   e2 ) 2 +
                                         F 0  (ξ, t)  +  iπ   (e 1 −  e 2
                                                                        2 ) F −
                                                                                (ξ, ξ, t)  + 2e  1 e2 F +
                                                                                                          (ξ, ξ, t)    ,
               sin2 θ
                                                                                                                     (4.211c)
                                                                                              
    e−[F ] = − 2 cos θ · D0 (e1 − e2 )2 F0+ (ξ, t) + iπ (e21 − e22 ) F − (ξ, ξ, t)
    C
                sin2 θ
                     Z 1                                                                   
                                   D(z)          1 + cos θ − 2z                    1
             + e1 e2        dz                             2          +                        · I[F + ; ξ, t, z, θ]
                        0        z(1 − z)              sin θ              1 + cos θ − 2z
                                                                                   
                          2 cos θ D0  +                             +
                                                                                 
                       −                   · F0 (ξ, t) + iπ F (ξ, ξ, t) ,                                           (4.211d)
                               sin2 θ
                                                            296


where we defined the “zeroth moments” of the DA and GPDs,
                              Z   1                            Z  1
                                    dz D(z)                           dx F(x, ξ, t)
                       D0 ≡                  ,   F0 (ξ, t) ≡ P                      ,             (4.212)
                                0       z                        −1      x−ξ
and the special GPD integral,
                                      Z 1
                                                             F(x, ξ, t)
                 I[F; ξ, t, z, θ] ≡       dx                                            ,         (4.213)
                                       −1    x − xp (ξ, z, θ) + iϵ sgn [cos2 (θ/2) − z]
where F can take any GPD function such as F ± , Fe± , H ± , etc.. The special integral in
Eq. (4.213) yields a function of z that depends sensitively on the x distribution of the GPD.
On a further integration of z under the profiling of a given DA, this maps out a distribution
of θ that contains enhanced sensitivity to the GPD x dependence. Since the pole xp lies on
the DGLAP region, the enhanced sensitivity is mainly on that region.
4.6.3       Cross section and single nucleon spin asymmetry
First, we write Eq. (4.185) explicitly as
                                                     +α
                                                                                             
               1       ′   ′   e  [H]   +    e [E] iσ     ∆α      [H]    +         [E] γ5 ∆+
   Mλ 1 λ 2 =      ū(p , α ) Cλ1 λ2 γ − Cλ1 λ2               + Cλ1 λ2 γ γ5 − Cλ1 λ2           u(p, α),
                                                                    e               e
              2P +                                    2m                                2m
                                                                                                  (4.214)
which is written in the SDHEP frame and where the subscripts (λ1 , λ2 ) refer to the two final-
state photon helicities. As argued in Sec. 4.2, because the Lorentz transformation connecting
the Lab and SDHEP frames is a transverse boost, the factorization formula [Eq. (4.234)] can
be written equally in the Lab and SDHEP frames, and the corresponding GPDs F , Fe, H, E,
 e and E
H,       e take the same values in both frames. This has allowed us to write the coefficients
                                                   297


C and C,e which contain the kinematics of the final state, in the SDHEP frame.
    Now, we omit the photon helicities in the notations, sum over the initial-state nucleon
spin, and average over the initial-state nucleon spin using the density matrix,
                                                                              
                                     1                   1  1 + λ sx − isy 
                     ραα′ (sT , λ) = (1 + s · σ) =                            ,          (4.215)
                                     2                   2
                                                             sx + isy 1 − λ
where s = (sx , sy , λ) = (sT cos ϕS , sT sin ϕS , λ) is the spin Bloch vector of the initial-state
nucleon. The spin average can be conveniently done by introducing a covariant spin vector
S µ via
               X                                         1
                     u(p, α) ραα′ (sT , λ) ū(p, α′ ) = (γ · p + m) (1 + γ5 γ · S/m) ,     (4.216)
               α,α′
                                                         2
with S µ = (λp+ , −λm2 /(2p+ ), msT ) in the light-front coordinates in the Lab frame. It is
important to notice that S enters the calculation at most in a linear way. The evaluation of
the spinor algebra thus involves three Lorentz invariants associated with S,
                                                                
                                       ′        λ          4ξm2
                       p · S = 0,    p ·S =          −t +          + m sT · ∆T ,
                                                2          1+ξ
                            ′
                       ϵnpp S = mp+ (sT × ∆T )z = −mp+ sT ∆T sin ϕS .                      (4.217)
Then the nucleon-spin averaged amplitude square can be expressed as
                                                      sT · ∆T 1 (sT × ∆T )z 2
         |M|2 = ⟨MM∗ ⟩N N ′ = A0 + λ A1 +                      A2 +              A2 ,      (4.218)
                                                         2m             2m
where the photon helicities are left open but suppressed. The nucleon spin independent part
                                                     298


A0 is
                                                                                               2
                   2
      A0 = (1 − ξ ) C C   e [H] e [H]∗
                                       +C C [H]
                                              e    [H]∗
                                                    e
                                                          − ξ2 +
                                                                         t      e[E] C
                                                                                C     e[E]∗ − ξ t C [E]  e
                                                                                                           C [E]∗
                                                                                                              e
                                                                       4m2                       4m   2
                                                                                        
                − ξ2 C    e[H] Ce[E]∗ + C  e[E] Ce[H]∗ + C [H] e
                                                                  C [E]∗ + C [E] C [H]∗ .
                                                                     e          e    e
                                                                                                                (4.219)
The nucleon helicity dependent part A1 is
                                                                                                             
                 2     e [H] [H]∗          e e [H]∗
                                          [H]                     t         ξ2         e [E] [E]∗        e e [E]∗
                                                                                                        [E]
    A1 = (1 − ξ ) C C                +C C               −ξ             +              C C           +C C
                                e                                                               e
                                                                4m2 1 + ξ
                                                                                      
              − ξ2 C   e[H] C [E]∗
                                 e         e e [H]∗
                                     + C [E] C
                                                             e e [E]∗
                                                      + C [H] C        +C  e[E] C [H]∗
                                                                                   e
                                                                                          .                     (4.220)
The nucleon transverse spin dependent parts A12 and A22 are
                        h                                                                     i
                              e e [E]∗      e[E] C [H]∗          e[H] C [E]∗         e e [H]∗
        A12 = (1 + ξ) C [H] C            +C              −ξ C                 + C [E] C
                                                     e                    e
                                                          
                   − ξ2 C     e[E] C [E]∗
                                       e         e e [E]∗
                                          + C [E] C          ,                                                (4.221a)
                           h                                                                    i
          2
        A2 = i (1 + ξ) C C   e [H] e [E]∗
                                          −C Ce [E] e [H]∗
                                                           −ξ C C    [H]
                                                                      e    [E]∗
                                                                            e
                                                                                −C C [E]
                                                                                       e    [H]∗
                                                                                             e
                                                                                                    .         (4.221b)
In Eqs. (4.219)–(4.221), the omitted photon helicity indices take (λ1 , λ2 ) in the helicity
amplitudes C and C,  e and (λ′1 , λ′2 ) in the complex conjugate ones C ∗ and C                   e∗ .
    The values of ξ and t are limited in a certain experiment for a fixed collision energy. By
introducing an upper cut for t, say |t| ≤ 0.2 GeV2 , we have ξ ≤ 0.218, which suppresses all
the E and E e related terms in A0 and A1 by at least a factor of 0.048. Neglecting the ξ 2 and
t suppressed terms, we have the approximations for Eqs. (4.219)–(4.221),
                                                     
     A0 = (1 − ξ 2 ) C e[H] C e[H]∗ + C [H] e
                                               C [H]∗ + O(ξ 2 , t/m2 ),
                                                  e
                                                                                                              (4.222a)
                                                         299


                                                       
      A1 = (1 − ξ 2 ) C e[H] C [H]∗e          e e [H]∗
                                       + C [H] C           + O(ξ 2 , ξt/m2 ),                              (4.222b)
                    h                                                                    i
                           e e [E]∗      e[E] C [H]∗          e[H] C [E]∗        e e [H]∗
      A12 = (1 + ξ) C [H] C           +C              −ξ C                + C [E] C           + O(ξ 2 ),   (4.222c)
                                                  e                   e
                      h                                                                     i
        2
      A2 = i (1 + ξ) C C e  [H] e [E]∗
                                       −C Ce [E] e [H]∗
                                                        −ξ C C    [H]
                                                                   e   [E]∗
                                                                        e
                                                                            −C C  [E]
                                                                                   e    [H]∗
                                                                                         e
                                                                                                ,          (4.222d)
so that the leading effect of the unpolarized cross section A and helicity asymmetry A1 is
from the GPDs H and H,        e whereas that of the transverse spin asymmetries A21,2 also receives
contribution from the GPDs E and E.             e
    The helicity amplitude structure in Eq. (4.202) encodes various polarization degrees of
freedom of the two photons. These are, unfortunately, not measured at a collider detector.
The only polarization observable is of the initial-state nucleon. Tracing over the two photon
helicities in Eq. (4.222), we get
                         2                                                    
                      N              [H]         [H]        e  [H] 2     e [H] 2
                                 |C+ | + |C− | + |C+ | + |C− | + O(ξ 2 , t/m2 ),
                2                     e 2          e 2
  A0 = 2(1 − ξ )                                                                                           (4.223a)
                      ŝ
                      2 n                                                e e                        o
    2                  N               e+[H] C
                                             e+[E]∗ + C e−[H] C
                                                              e−[E]∗ − ξ C+[H]      [E]∗      [H] [E]∗
  A2 = −4(1 + ξ)                Im C                                             C+ + C− C−              , (4.223b)
                                                                                               e   e
                        ŝ
  A1 = A12 = 0.                                                                                            (4.223c)
Without observing the photon polarizations, the only nucleon spin asymmetry is with respect
to its transverse spin component perpendicular to the diffraction plane, which gives a sin ϕS
correlation.
    Inserting Eq. (4.218) to Eq. (4.103), with the explicit forms in Eq. (4.223), and inte-
grating over the trivial ϕ dependence, we have the differential cross section of the diphoton
                                                           300


production,
                   dσ               1         dσ
                               =                         [1 + sT AN (t, ξ, cos θ) sin ϕS ] ,       (4.224)
          d|t| dξ dϕS d cos θ      2π d|t| dξ d cos θ
where
                                                                 2
                                   dσ                          CF       1 − ξ2
                                               = 2π αe αs                         ΣU               (4.225)
                            d|t| dξ d cos θ                    Nc        ξ 2 s3
is the unpolarized differential cross section, with
                              [H]          [H]
                    ΣU = |C+ |2 + |C− |2 + |C
                               e            e
                                                     e+[H] |2 + |C
                                                                 e−[H] |2 + O(ξ 2 , t/m2 ),        (4.226)
and
                ∆T /m        n                                    e e                        o
      AN =               Im C   e+[H] C
                                      e+[E]∗ + Ce−[H] C
                                                      e−[E]∗ − ξ C+[H]        [E]∗
                                                                          C+ + C− C−
                                                                                     [H]
                                                                                      e  [E]∗
                                                                                          e
                                                                                                   (4.227)
              (1 − ξ)ΣU
is the single transverse spin asymmetry (SSA) of the initial-state nucleon. Here ∆T can be
determined from t and ξ by
                                              p
                                                −(1 − ξ 2 )t − 4ξ 2 m2
                                     ∆T =                                   .                      (4.228)
                                                         1+ξ
To the leading accuracy, the unpolarized cross section allows to probe the GPDs H and
 e while the SSA gives an opportunity to probe the GPD E and E,
H,                                                                                   e both with enhanced
x-sensitivity.
                                                      301


4.6.4       Numerical results
Now we present some numerical results for the diphoton production process, especially on the
enhanced x-sensitivity. We consider the pion beam as accessed at the J-PARC [Aoki et al.,
2021] or AMBER [Adams et al., 2018] experiment, with an energy Eπ = 20 or 150 GeV,
respectively. Both facilities only probe unpolarized nucleon targets, so only give the differ-
ential cross section observable in Eq. (4.225), which receives contributions from the GPDs
H and H. e For simplicity, we fix the DA to the asymptotic form with ϕ(z) = 6z(1 − z), and
renormalization and factorization scales to 2 GeV, turning off QCD evolution effects.
                                                               80
                                                                     t = -0.2GeV2, ξ = 0.2
       0                                                       60
                                                               40
    - 50    t = -0.2GeV2, ξ = 0.2
                    +
                Re(I [H0+])
                                      +
                                 Re(I [S2+])
                                                               20
   - 100
                    +
                Im(I [H0+])
                                      +
                                 Im(I [S2+])                    0
                    +                 +
                Re(I [S1 ])
                       +
                                 Re(I [Ds ])
                                         +                                    +             +           +
                                                             - 20          -
                                                                       Re(I [H GK ])
                                                                                           -
                                                                                      Re(I [S 1 ])
                                                                                                        -
                                                                                                    Re(I [S 2 ])
                    +                 +
                Im(I [S1 ])
                       +
                                 Im(I [Ds ])
                                         +
                                                                           -  +           - +         - +
   - 150                                                               Im(I [H GK ])  Im(I [S 1 ])  Im(I [S 2 ])
                                                             - 40
        0.0        0.2           0.4         0.6       0.8       0.0         0.2        0.4        0.6           0.8
                                cosθ                                                   cosθ
Fig. 4.7: The special integral in Eq. (4.230), evaluated for the u quark GPDs in the GK
model and the shadow GPDs defined in Eqs. (4.176) (4.178) and (4.183). On the left are
a = 1 for the unpolarized GPD H and associated shadow GPDs, while on the right are
a = −1 for the polarized GPD H             e and associated shadow GPDs. The superscripts “+” refer
to C-even components defined as Eqs. (4.114) and (4.117). By symmetry, I¯+ is odd with
respect to cos θ and I¯− is even, so only the parts with cos θ > 0 are shown.
    The main thing of our interest is how the special integral in Eq. (4.213) helps to distin-
guish the GPDs from shadow GPDs. Since this integral is to be convoluted with the DA,
we further define the profiled special integral,
                        Z    1                                                      
  ¯a                                 ϕ(z)       1 + cos θ − 2z              a
  I [ϕ, F; ξ, t, θ] =          dz                               +                      · I[F; ξ, t, z, θ], (4.229)
                           0      z(1 − z)          sin2 θ         1 + cos θ − 2z
                                                         302


where we have temporarily removed the constant coefficient in the DA, and a = 1 corresponds
to F = H + and −1 to F = H       e + . With the asymptotic DA, the z variable in Eq. (4.229) can
be readily integrated to give
                                               Z   1
                        I¯a [ϕ, F; ξ, t, θ] =        dx F(x, ξ, t)K a (x, ξ, cos θ),             (4.230)
                                                −1
with the kernel K a (x, ξ, cos θ) being
                                                                                            
      a                               ξ 2 (1 − c2 )                     1          c(x − ξc)
   K (x, ξ, c) = 6 a L(x, ξ, c) +                      L(x, ξ, c) +            +
                                       (x − ξc)2                    ξ(1 − c2 ) ξ 2 (1 − c2 )2
                                                                             2               
                                            2     2                  1        ξ (1 − c2 )
                               + iπ · θ(x − ξ ) sgn(x) ·                                  + a , (4.231)
                                                                −2(x − ξc) (x − ξc)2
where c ≡ cos θ and
                                                   1            (1 − c)(x + ξ)
                            L(x, ξ, c) =                   ln                   .                (4.232)
                                             −2(x − ξc)         (1 + c)(x − ξ)
At given t and ξ, Eq. (4.231) can be integrated for different GPDs to yield distributions of
cos θ. In Fig. 4.7 we show results of the special integral in Eq. (4.230) for both the “standard”
GPDs in the GK model and the shadow GPDs and shadow D-term. An interesting feature
can be immediately noticed: On the left-hand side, the imaginary part of the integral I¯+ for
H0+ is much greater than all the others and the large oscillation in the DGLAP region of the
shadow GPDs diminish their integrals a lot, especially for the imaginary parts; whereas on
the right-hand side, different components of the integration results are of similar orders, and
the shadow GPD integrals supersede the GK model ones.
    When combined into the whole amplitudes, the special integrals are further weighted by
e1 e2 , which is −2/9 for a charged pion reaction and further suppresses their contribution. In
Fig. 4.8, we show the helicity amplitudes (squared) in Eq. (4.211) for the GK model, with
                                                       303


                                    u                                                                     u
                                                                                  Helicity amplitudes for HGK
            Helicity amplitudes for H GK                                1.0
        3
              t = -0.2GeV2, ξ = 0.2                                     0.8                          
                                                                                        |C + 2       |C - 2
                    |C+ 2           |C- 2                               0.6               +
                                                                                        |C - /I 2         |I
                                                                                                               +2
        2                                                                                            
                                                                        0.4
 A.U.                                                           A.U.
                             -2               -2                                               +           +
                    |C- /I               |I                                             2Re[(I ) · (C -/I )]
                              -
                    2Re[(I ) · (C-/I )]
                                         -                              0.2
        1
                                                                        0.0
                                                                       - 0.2       t = -0.2GeV2, ξ = 0.2
        0
                                                                       - 0.4
        0.0         0.2            0.4             0.6   0.8                0.0         0.2          0.4            0.6   0.8
                                  cosθ                                                              cosθ
                                                                 [H]
Fig. 4.8: Helicity amplitudes squared of (in arbitrary scales) C± and C
                                                                  e
                                                                          e±[H] in Eq. (4.211)
evaluated at t = −0.2 GeV2 and ξ = 0.2 for the u quark GPD H        e u and H u of the GK
                                                                     GK         GK
                                       [H]
model, respectively. The amplitudes C− and C
                                        e
                                               e−[H] contain the special integrals I¯− [H]
                                                                                        e and
¯+                                                                                    ¯−
I [H], and we have displayed three separate contributions. On the left figure, C− /I refers
                    [H]                          e removed, and on the right figure, C
to the amplitude C− with contribution from I¯− [H]
                     e
                                                                                        e− /I¯+
removes the contribution from I¯+ [H].
explicitly shown the contributions from the special integrals, including both their interference
with the other contributions. Both figures are in arbitrary scales, but the relative sizes are
exact among all the curves and between the two figures. So the P-even amplitudes (left)
are larger than the P-odd ones (right). In both figures, the helicity-conserving amplitudes
                                                                                                         [H]
are greater than the helicity-flipping ones. For the P-even amplitude C− , the contribution
                                                                                                          e
                                                                                 e−[H] , they
from the special integral is much smaller than the rest, while for the P-odd one C
are of similar orders. In both cases, the special integrals have destructive interference with
the other terms.
    Therefore, the capability of using the special integrals to distinguish different GPDs is
limited. Especially, given the fact that we only have the unpolarized cross section observable
in Eq. (4.226) as the sum over all the amplitude squares, which causes the large “background”
                   [H]   e+[H] . To demonstrate the distinguishing power of the diphoton
contribution from C+ and C
                    e
process, we weight the shadow GPDs by 6 in Eq. (4.177) and 2 in Eq. (4.178), and weight the
                                                               304


      40 Hpn                                               u
             u
               (x, ξ = 0.2, t = -0.2 GeV2, μ = 2 GeV)     H pn(x, ξ = 0.2, t = -0.2 GeV2, μ = 2 GeV)
      20
       0
                                                                            
                          H0                                               H0
                                                                            
                          H1                                               H1
     -20                  H2
                                                                            
                                                                           H2
                          H3
        -1          -0.5           0       0.5         1 -1        -0.5          0       0.5        1
                                   x                                             x
 Fig. 4.9: Flavor transition GPD models used for the diphoton production in pπ − collision.
                                                           1.8 (b) Relative q distribution
       35 (a) Absolute qT distribution J-PARC                                  T              J-PARC
       30                                                  1.6
       25                                                  1.4
       20                                                  1.2
       15                                                  1.0
       10                                                  0.8
         1.0        1.2        1.4     1.6       1.8          1.0       1.2        1.4     1.6     1.8
Fig. 4.10: Unpolarized differential cross section in the transverse momentum qT , for the
diphoton diphoton production in pπ − collision at the J-PARC energy Eπ = 20 GeV, evalu-
ated for different GPD models in Fig. 4.9 at t = −0.2 GeV2 and ξ = 0.2. (a) is the absolute
qT distribution, and (b) exhibits the distribution ratio of each model to the GK model.
                                                     305


      0.50 (a) Absolute qT distribution   AMBER        1.8 (b) Relative qT distribution   AMBER
                                                       1.6
      0.10                                             1.4
      0.05                                             1.2
                                                       1.0
      0.01                                             0.8
           1        2          3        4       5          1        2          3        4       5
Fig. 4.11: Same as Fig. 4.10, but for the AMBER energy Eπ = 150 GeV. The vertical axis
of (a) is in log scale.
shadow D-term by 24 in Eq. (4.183), as a liberal choice. This results in the flavor transition
GPD models to be used for the pπ − scattering.
    The typical pion beam energy is 20 GeV at the J-PARC experiment [Aoki et al., 2021]
and 150 GeV at the AMBER experiment [Adams et al., 2018]. Since the photons in the final
state are identical particles, there is symmetry between the forward and backward regions,
so it is convenient to convert the cos θ distribution in Eq. (4.225) to a qT distribution, which
                                    √
gives a Jacobian peak at qT = ŝ/2. The results are shown in Figs. 4.10 and 4.11 for the J-
PARC and AMBER energies, respectively, for the GPD models in Fig. 4.9 at the phase space
point (t, ξ) = (−0.2 GeV2 , 0.2), including both the absolute and the relative distributions.
Importantly, both the absolute rates and relative shapes of the qT distributions are altered
by the added shadow GPDs. By the chosen weights, each one alters the qT distribution
by a comparable amount. Only by measuring the unpolarized qT distributions, one is more
sensitive to the polarized GPD H      e than the unpolarized GPD H. Since it requires a much
larger weight for the shadow D-term to reach the same impact as the others, the diphoton
process is more sensitive to the DGLAP region than the ERBL region, as we have anticipated
                                                 306


below Eq. (4.213). Clearly, at a higher collision energy at AMBER, the cross section at each
phase space point is much lower than that at J-PARC, whereas the kinematic coverage is
much wider and gives similar sensitivity in terms of the relative qT distribution shapes.
4.7        Single diffractive hard exclusive photon-meson pair
           photoproduction
In this section, we study the crossing process of Eq. (4.184), the single diffractive hard
exclusive photon-pion pair production in nucleon-photon collisions,
                               N (p) + γ(p2 ) → N ′ (p′ ) + π(q1 ) + γ(q2 ),              (4.233)
which was studied in [Boussarie et al., 2017; Duplančić et al., 2018; Qiu and Yu, 2023a;
Duplančić et al., 2023a,b; Qiu and Yu, 2023b]. The final-state pion can be replaced by any
other light meson, and N ′ does not have to be restricted to nucleons. Similar to the process
in Eq. (4.184), the γ ∗ -mediated channel is forbidden by charge conservation for charged pion
production or by the charge parity in the hard scattering for neutral pion production. In the
SDHEP kinematic region [Eq. (3.110)], the leading configuration of the process [Eq. (4.233)]
happens via an exchange of a collinear parton pair. Following the discussion in Sec. 3.2.3.2,
the amplitude of Eq. (4.233) can be factorized into the GPDs for the hadron transition
N → N ′ , a DA for the formation of the final-state pion, and perturbatively calculable
coefficients,
                        XZ      1    Z  1   h
         [F,Fe]
     MN γλ →N ′ πγλ′ =            dx                           ef f′′ (x, ξ, z; ŝ, θ, ϕ)
                                          dz FNf N ′ (x, ξ, t) C λλ
                         f,f ′ −1     0
                                                  307


                                                                                       i
                                                               ff′
                                          +FeNf N ′ (x, ξ, t) Cλλ ′ (x, ξ, z; ŝ, θ, ϕ)  D̄f ′ /π (z), (4.234)
where f = [q q̄] and [gg] for quark and gluon GPDs, respectively, if N ′ = N , or f = [q q̄ ′ ] for
transition GPDs with N ̸= N ′ , and correspondingly, f ′ = [q q̄] or [q q̄ ′ ] for the pion DA D̄f ′ /π .
                          ff′      ef f ′
The hard coefficients Cλλ   ′ and Cλλ′ are helicity amplitudes for the photon scattering off a
collinear on-shell parton pair f with λ and λ′ denoting the photon helicities in the SDHEP
frame [Fig. 4.1]. The correction to the factorization in Eq. (4.234) is suppressed by powers
of |t|/qT2 ≪ 1.
    As we will see shortly, similar to the process [Eq. (4.184)], in the crossing process
[Eq. (4.233)], the transverse momentum, or equivalently, the polar angle θ, of the pion
or final-state photon, provides an additional sensitive handle to the x dependence of GPDs.
However, there are some major differences from the diphoton production process that high-
light the photoproduction one:
    • the crossing kinematics provides an enhanced x-sensitivity mainly in the ERBL region,
       complementary to the diphoton production;
    • while only charged pion beams are accessible for the diphoton production, one can
       readily select a neutral pion product in the photoproduction process; this makes it not
       restricted to the flavor transition GPDs, and, more importantly, will further enhance
       the x-sensitivity due to a cancellation of certain “moment” terms; and
    • the polarization of the initial-state photon beam can be easily controlled, as can be
       realized in the JLab Hall D facility [Al Ghoul et al., 2017]. This allows the study of
       various polarization asymmetries, enabling to disentangle different types of GPDs.
                                                    308


4.7.1        Calculation of the hard coefficients
Similar to Sec. 4.6.1, the hard coefficients C       ef f ′ and C f f ′ are the helicity amplitudes of
                                                      λ1 λ2        λ1 λ2
the collision f (p̂1 ) + γ(p2 ) → f ′ (q̂1 ) + γ(q2 ), where p̂1 = (∆ · n)n̄ with n = (0+ , 1− , 0T )
and n̄ = (1+ , 0− , 0T ), and q̂1 = (q1 · w)w̄ with w and w̄ defined as in Eq. (4.191). For the
quark GPD channel, f = f ′ = [q1 q̄2 ], with the two quarks q1 and q̄2 from the diffracted
nucleon carrying light-like momenta (ξ ± x)P + n̄, respectively, and those for the produced
pion carrying light-like momenta z q̂1 and (1 − z)q̂1 , respectively. We neglect the gluon GPD
in this thesis.
     For the π + production, we have (q1 , q2 ) = (u, d), while for the π − collision, we have
(q1 , q2 ) = (d, u). For the π 0 production, we have q1 = q2 = u or d. The hard coefficients C     eλλ′
(Cλλ′ ) are obtained from the diagrams in Fig. 4.12 by amputating the parton lines associated
with the diffracted proton and produced pion, and contracting them with γ · n̄/2 (γ5 γ · n̄/2)
and γ5 γ · w̄/2, respectively, for the unpolarized (longitudinally polarized) GPD. So similar
to Sec. 4.6.1, the hard coefficient C   e associated with the unpolarized GPD F is parity-odd
(P-odd), while the C associated with the polarized GPD Fe is parity-even (P-even). The
hard coefficients are then obtained by averaging over the colors of each quark pair, and then
multiplied by an extra factor ŝ/2.
     Similar to Sec. 4.6.1, we take the convention for both partons from the diffracted nucleon
to enter the hard collisions and introduce the variable change z1 = (x + ξ)/2ξ and z2 = z.
Then the incoming [q1 q̄2 ] pair carries momenta z1 p̂1 and (1 − z1 )p̂1 , respectively. In the
c.m. frame of the hard collision, we take
                            r                r                r               r
                               ŝ              ŝ               ŝ              ŝ
                      p̂1 =       n̄, p2 =        n,    q̂1 =      w̄,   q2 =      w.          (4.235)
                               2               2                2               2
                                                    309


                 q2
       z1 p̂1             z2 q̂1
                    q
       z̄1 p̂1  p2        z̄2 q̂1
               (A1 )                             (A2 )       (A3 )                (A4 )
                                  p 2 q2
                      z1 p̂1             z2 q̂1
                                       q
                      z̄1 p̂1            z̄2 q̂1
                                  (B1 )                (B2 )           (B3 )
                      z1 p̂1             z2 q̂1
                                       q
                      z̄1 p̂1     p 2 q2 z̄2 q̂1
                                  (B4 )                (B5 )           (B6 )
Fig. 4.12: Hard scattering diagrams for the photon-proton scattering into a photon-pion pair.
The two incoming fermion lines on the left are from the diffracted nucleon, carrying momenta
z1 p̂1 and z̄1 p̂1 ≡ (1 − z1 )p̂1 , respectively. The two outgoing fermion lines on the right are
to form the produced pion, carrying momenta z2 q̂1 and z̄2 q̂1 ≡ (1 − z2 )q̂1 , respectively. The
variables z1 and z2 are related to x and z by z1 = (x + ξ)/2ξ and z2 = z (see the text).
Another set of diagrams are also to be included by switching the two photon lines, giving 20
diagrams in total.
                                                       310


Similar to Fig. 4.6, the diagrams in Fig. 4.12 are classified into 8 type-A ones and 12 type-B
ones. The kinematic crossing of Eq. (4.184) introduces different intermediate type-A gluon
propagators,
                                                                  r
                                                                       ŝ
                          qA = −z1 p̂1 + z2 q̂1 + q2 = −                  (z1 n̄ − z2 w̄ − w) ,           (4.236)
                                                                       2
and qA′ = qA (q2 → −p2 , w → −n) obtained by switching the two photons in Fig. 4.12. They
have virtualities different from the ones in Fig. 4.6,
                                                                                        
                                  qA2 = ŝ (1 − z1 )z2 − cos2 (θ/2)z1 (1 − z2 ) ,                        (4.237a)
                                                                                        
                                 qA2 ′ = ŝ z1 (1 − z2 ) − cos2 (θ/2)(1 − z1 )z2 ,                       (4.237b)
which contain non-factorizable θ dependence and thereby induce enhanced sensitivity to the
x dependence of GPDs, following the criterion in Sec. 4.4. The type-B diagrams have simple
gluon propagators with virtualities
                   qB2 = −(1 − z1 )(1 − z2 )ŝ sin2 (θ/2),                qB′2 = −z1 z2 ŝ sin2 (θ/2),    (4.238)
for diagrams (B1 , B1′ , · · · , B3 , B3′ ) and (B4 , B4′ , · · · , B6 , B6′ ), respectively. Their θ dependence
similarly factorizes out of z1 and z2 , following the structure of Eq. (4.154), and similarly for
the quark propagators of all diagrams. So the type-B diagrams only yield moment-type
sensitivity.
    Similar to Eq. (4.198), we denote the hard coefficients in Eq. (4.234) as
             Cλ1 λ2 (z1 , z2 ; ŝ, θ, ϕ) = Cµν (z1 , z2 ; ŝ, n, n̄, w, w̄) ϵµλ1 (p2 )ϵν∗λ2 (q2 ),       (4.239a)
             Ceλ1 λ2 (z1 , z2 ; ŝ, θ, ϕ) = Ceµν (z1 , z2 ; ŝ, n, n̄, w, w̄) ϵµ (p2 )ϵν∗ (q2 ),         (4.239b)
                                                                                 λ1      λ2
                                                           311


neglecting the superscripts f f ′ . By the parity property, C                     eµν contains one antisymmetric
Levi-Civita tensor, while Cµν does not. Now, the Ward identities
                                  nµ C eµν = C    eµν wν = nµ C µν = C µν wν = 0                          (4.240)
motivate us to expand the tensor structure in the light-cone basis formed by n and w. Any
vector V µ can be decomposed in this basis as
                                                (V · n)wµ + (V · w)nµ
                                      Vµ =                                    + V⊥µ ,                     (4.241)
                                                              n·w
where the transverse component can be projected by acting on V the tensor projector,
                                              µν                w µ nν + nµ w ν
                                            g⊥    = g µν −                      .                         (4.242)
                                                                      n·w
Then we can decompose Cµν and C                eµν into gauge-invariant tensor structures as,
                                                             
                      µν                   µν       n̄µ⊥ n̄ν⊥
   C µν
        = C− (−g⊥        ) + C+       −g⊥     +2 2              + Cn nµ n̄ν⊥ + Cw n̄µ⊥ wν + Cnw nµ wν ,  (4.243a)
                                                       n̄⊥
                 µ νnwn̄⊥             µnwn̄⊥ ν
                                                                 µ νnwn̄⊥          µnwn̄⊥ ν
                                                                                                 
   eµν     e−      n̄ ⊥ ε        +   ε         n̄ ⊥           e+    n̄⊥ ε      −   ε       n̄⊥
   C    = iC                                            + iC
                             n · w n̄2⊥                                    n · w n̄2⊥
          + iC en nµ ενnwn̄⊥ + iC     ew εµnwn̄⊥ wν ,                                                    (4.243b)
where we defined n̄µ⊥ = g⊥       µν
                                     n̄ν , similarly to Eq. (4.201). When contracting with the polar-
ization vectors as Eq. (4.239), the terms in Eq. (4.243) that are proportional to nµ or wν
vanish. The helicity amplitudes are purely determined by the first two tensor structures,
                                 N ∓iϕ                                                       N
     C±± (z1 , z2 ; ŝ, θ, ϕ) =       e C+ (z1 , z2 ; θ), C±∓ (z1 , z2 ; ŝ, θ, ϕ) = e∓iϕ C− (z1 , z2 ; θ),
                                  ŝ                                                          ŝ
                                                               312


  e±± (z1 , z2 ; ŝ, θ, ϕ) = ± N e∓iϕ C
  C                                           e+ (z1 , z2 ; θ), C e±∓ (z1 , z2 ; ŝ, θ, ϕ) = ± N e∓iϕ C   e− (z1 , z2 ; θ),
                                    ŝ                                                             ŝ
                                                                                                                     (4.244)
where N = 2ie2 g 2 CF /Nc and the four independent hard coefficients are
                                         (1 − z1 )(1 − z2 ) + z1 z2 e21 − e22                     z1 + z2 − 1
 2ξC+ = −(e1 − e2 )2 · t2θ/2 P                                            + 2           ·P
                                          2z1 z2 (1 − z1 )(1 − z2 )           sθ/2          z1 z2 (1 − z1 )(1 − z2 )
      e1 e2                  s2θ/2
    +        P                                 ×
        2       z1 z2 (1 − z1 )(1 − z2 )
                        (z1 (1 − z2 ) + (1 − z1 )z2 ) (z1 (1 − z1 ) + z2 (1 − z2 ))
            ×                                                                                     
                   (1 − z1 )z2 − c2θ/2 z1 (1 − z2 ) z1 (1 − z2 ) − c2θ/2 (1 − z1 )z2
                                                                                          δ(1 − z ) δ(z ) 
                          2 2cθ      δ(1 − z1 ) δ(z1 )              e21 − e22          −2               1          1
    + iπ (e1 − e2 ) 2                              +            +               1 + sθ/2                    −
                             sθ        1 − z2            z2              2                         1 − z2         z2
                                                                       
                 e1 e2 2                         δ(1 − z1 ) δ(z1 )                      e1 e2
            −              tθ/2 − 2s−2  θ/2                    +            − 2                       ×
                   2                               1 − z2          z2          2 cθ/2 z2 (1 − z2 )
                                                                                                     
                         z1        2 z2                                 2 z1       z2
                ×             + cθ/2         δ (z1 − ρ(z2 )) + cθ/2 +                    δ (z1 − ρe(z2 ))     ,      (4.245)
                         z2            z1                                   z2 z1
                                       z1 (1 − z2 ) + (1 − z1 )z2
 2ξC− = −(e1 − e2 )2 t2θ/2 P                                                                                         (4.246)
                                        2z1 z2 (1 − z1 )(1 − z2 )
                                                                                                               
              (e1 − e2 )2 e1 e2                 δ(1 − z1 )       δ(z1 )        e21 − e22 δ(z1 )          δ(1 − z1 )
    − iπ                       + 2                           +               +                         −                    ,
                   2c2θ/2           cθ/2            z2          1 − z2              2        1 − z2          z2
                            2              
    e+ = − (e1 − e2 ) 1 + c−2 P
 2ξ C
                                                        z1 + z2 − 1
                                                                                                                     (4.247)
                                        θ/2
                     2                            z1 z2 (1 − z1 )(1 − z2 )
      e1 e2               1 + c2θ/2
    +        P                                 ×
        2       z1 z2 (1 − z1 )(1 − z2 )
                                            (z1 + z2 − 1)(z1 − z2 )2
            ×                                                                                     
                   (1 − z1 )z2 − c2θ/2 z1 (1 − z2 ) z1 (1 − z2 ) − c2θ/2 (1 − z1 )z2
                                                                                   
             2(e1 − e2 )2                    4      t2θ/2       δ(z1 ) δ(1 − z1 )               e1 e2 2
    + iπ                        + e1 e2 2 −                              −                  −         t ×
                     sθ2
                                             sθ       2           z2          1 − z2              2 θ/2
                                                                                                                   
            z1 + z2 − 1                                                   e21 − e22 δ(1 − z1 ) δ(z1 )
                               δ (z1 − ρ(z2 )) + δ (z1 − ρe(z2 )) +                                      +                ,
            z2 (1 − z2 )                                                        2 t2θ/2        1 − z2         z2
    e− = (e1 − e2 )2 t2 P                     z1 − z2
 2ξ C                       θ/2                                                                                      (4.248)
                                    2z1 z2 (1 − z1 )(1 − z2 )
                                                                                                               
              (e1 − e2 )2 e1 e2                  δ(z1 )      δ(1 − z1 )        e21 − e22 δ(z1 )          δ(1 − z1 )
    − iπ                       + 2                        −                  +                         +                    ,
                   2c2θ/2           cθ/2        1 − z2           z2                 2        1 − z2          z2
                                                              313


where we introduced tθ/2 = tan(θ/2) and used the same notations for P and (cθ , sθ , cθ/2 , sθ/2 )
as in Eqs. (4.203)–(4.206).
    The special gluon propagators in the type-A diagrams (cf. Eq. (4.237)) introduce special
poles of z1 ,
                                                z2
                          ρ(z2 ) =            2
                                                              ,                             (4.249a)
                                    z2 +  cos (θ/2)(1  − z2 )
                                         cos2 (θ/2) z2
                          ρe(z2 ) =                         = 1 − ρ(1 − z2 ),               (4.249b)
                                    1 − z2 + cos2 (θ/2)z2
which both lie between 0 and 1, and have small positive and negative imaginary parts by
the iϵ prescription, respectively. In terms of x = ξ(2z1 − 1), Eq. (4.249) translates to the
special poles,
                                                              
                                       z − cos2 (θ/2)(1 − z)
                  xp (ξ, z, θ) = ξ ·                             ,                          (4.250a)
                                       z + cos2 (θ/2)(1 − z)
                                      2                       
                                       cos (θ/2)z − (1 − z)
                  ep (ξ, z, θ) = ξ ·
                  x                                              = −xp (ξ, 1 − z, θ),       (4.250b)
                                       cos2 (θ/2)z + (1 − z)
which cross the whole ERBL region as z goes from 0 to 1, complementary to the diphoton
production process (see Eq. (4.209)).
    Similar to Eqs. (4.203)–(4.206) for the diphoton process, we have organized Eqs. (4.245)–
(4.248) in terms of (e1 − e2 )2 , (e21 − e22 ), and e1 e2 , where the first two kinds of terms only
carry moment-type sensitivity, whereas the e1 e2 terms carry enhanced z1 (or x) sensitivity.
For a neutral pion production channel, e1 = e2 , and then the first two kinds of terms
are cancelled, which further enhances the x-sensitivity in the e1 e2 terms. By the charge
conjugation symmetry, the (e1 − e2 )2 and e1 e2 terms are probing C-even GPDs Fe+ and F + ,
whereas the (e21 − e22 ) terms are probing C-odd GPDs Fe− and F − . Also, the two processes
                                                   314


pγ → nπ + γ and pγ → nπ + γ are related by isospin symmetry, which is broken by the (e21 −e22 )
terms. Combining both channels then can help distinguish C-odd GPD components from
C-even ones. The same is true for the two processes pγ → pπ 0 γ and nγ → nπ 0 γ.
4.7.2      Helicity amplitudes
The convolutions of the hard coefficients in Eqs. (4.245)–(4.248) with the GPD and DA can
be simplified by using symmetry property of the DA. With the same notation in Eq. (4.210),
we have
                                                                                                 
          [Fe]              2         1 2        e +               2iπ cos θ e+
       C+      = (e1 − e2 ) D̄0 tθ/2 · F0 (ξ, t) +                               · F (ξ, ξ, t)                         (4.251a)
                                      2                              sin2 θ
                                                                                                              
                                              2                           iπ      3  −  cos  θ
                      2      2
                 − (e1 − e2 ) D̄0                      · Fe (ξ, t) −
                                                            −
                                                                               ·                · Fe (ξ, ξ, t)
                                                                                                    −
                                         1 − cos θ 0                       2 1 − cos θ
                          (Z                     "                                            #
                    e1 e2        1
                                    dz D̄(z)                 1             c2θ/2 + z s2θ/2
                 −                                   2             2
                                                                       +            2
                                                                                                 · J[Fe+ ; ξ, t, z, θ]
                     2          0 z(1 − z) cθ/2 + z sθ/2                          cθ/2
                                     "                                                 !               #)
                                                                                   2
                              −D̄0 tθ/2 Fe0 (ξ, t) − iπ tθ/2 − 2
                                          2     +                      2
                                                                                          Fe (ξ, ξ, t)
                                                                                             +
                                                                                                             ,
                                                                                 sθ/2
                                   "                                                     #
                            2
                 (e1 − e2 )                                      iπ
                               D̄0 t2θ/2 · Fe0+ (ξ, t) − 2 · Fe+ (ξ, ξ, t)
          [Fe]
       C−      =
                     2                                          cθ/2
                           "                                                              #
                               2       2
                              e − e2 e−                          e1 e2
                 + iπ D̄0 1               · F (ξ, ξ, t) − 2 · Fe+ (ξ, ξ, t) ,                                          (4.251b)
                                  2                              cθ/2
                                                                                                      
        e+[F ]              2             3 + cos θ             +              2iπ          +
       C       = (e1 − e2 ) D̄0                            · F (ξ, t) −                 · F (ξ, ξ, t)                  (4.251c)
                                      2 (1 + cos θ) 0                         sin2 θ
                                   iπ 1 + cos θ
                 + (e21 − e22 ) ·       ·                · D̄0 · F − (ξ, ξ, t)
                          (Z        2      1 −  cos
                                                "    θ                                        #
                    e1 e2        1
                                    dz D̄(z)                 1             c2θ/2 + z s2θ/2
                 +                                   2             2
                                                                       −            2
                                                                                                 · J[F + ; ξ, t, z, θ]
                     2         0   z(1    −  z)    c θ/2  +    z s θ/2            c θ/2
                                                                                                                 
                                        3 + cos θ           +                        8         2       +
                              +D̄0                    · F (ξ, t) − iπ                      − tθ/2 F (ξ, ξ, t) ,
                                        1 + cos θ 0                              sin2 θ
                                         "                                                 #
        e−[F ]      (e1 − e2 )2                                       iπ
       C       =−                  D̄0 t2θ/2 · F0+ (ξ, t) − 2 F + (ξ, ξ, t)
                          2                                          cθ/2
                                                              315


                           "                                                  #
                             e21 − e22      −             e1 e2     +
                − iπ D̄0                · F (ξ, ξ, t) − 2 · F (ξ, ξ, t) ,                            (4.251d)
                                 2                        cθ/2
where we used the same moment notations as Eq. (4.212), and defined the special GPD
integral,
                                                    Z  1
                                                                 F(x, ξ, t)
                              J[F; ξ, t, z, θ] ≡         dx                         ,                 (4.252)
                                                      −1     x − xp (ξ, z, θ) + iϵ
where F can take any GPD function such as F ± , Fe± , H ± , etc., and the pole xp is given in
Eq. (4.250a). The special integral will map out a distribution of θ that contains enhanced
sensitivity to the GPD x dependence. Since the pole xp lies on the ERBL region, the
enhanced sensitivity is mainly in that region.
4.7.3      Cross section and polarization asymmetries
The treatment of the nucleon spins works in the same way as Sec. 4.6.3 for the diphoton
production process. We first write Eq. (4.234) explicitly as
                                                     +α                                 +
                                                                                           
             1       ′   ′     e ′ γ −C
                                 [H]   +     e ′
                                               [E] iσ    ∆ α      [ H]   +       [ e γ5 ∆
                                                                                   E]
  Mλλ′   =       ū(p , α ) C                                 + Cλλ′ γ γ5 − Cλλ′             u(p, α), (4.253)
                                                                    e
                                 λλ            λλ
            2P +                                      2m                              2m
which is written in the SDHEP frame, with the subscripts λ and λ′ referring to the photon
helicities. Following the same derivation, we get the same Eqs. (4.218)–(4.222), just with
different photon helicity labels that are left implicit.
    Now we sum over the final-state photon helicity and average over the initial-state one by
the density matrix,
                                                                           
                                                                       2iϕγ
                                              1  1 + λγ        −ζ e 
                                     ργλλ̄ =                               ,                        (4.254)
                                              2          −2iϕγ
                                                   −ζ e          1 − λγ
                                                        316


where λγ is the helicity of the photon beam, and ζ > 0 is the linear polarization degree,
along the azimuthal direction ϕγ in the SDHEP frame. Introducing the shorthand notation,
                                              [F1 ]    [F2 ]
                                                                  X         [F ]   γ      [F ]
                                          ⟨C1       C2       ⟩≡         C1,λλ 1
                                                                                 ′ρ    C 2 ,
                                                                                   λλ̄ 2,λ̄λ′
                                                                                                                          (4.255)
                                                                  λλ̄λ′
where Ci can stand for C or C           e and Fi for any GPD, we have the photon polarization averaged
result for each amplitude component in Eq. (4.222),
                                    2 h
         [F1 ]   [F2 ]∗          N          [F ]     [F ]∗        [F ]     [F ]∗
      ⟨C       C        ⟩=                C+ 1 C+ 2 + C− 1 C− 2
                                  ŝ
                                                                                                                 i
                                                                                [F ] [F ]∗          [F ] [F ]∗
                                              −ζ cos 2(ϕ − ϕγ ) C+ 1 C− 2 + C− 1 C+ 2                                 ,  (4.256a)
                                    2 h
       e[F1 ] Ce[F2 ]∗ ⟩ =       N        e+[F1 ] C
                                                  e+[F2 ]∗ + C  e−[F1 ] C
                                                                        e−[F2 ]∗
      ⟨C                                  C
                                  ŝ
                                                                                                                 i
                                              +ζ cos 2(ϕ − ϕγ ) C                     e−[F2 ]∗ + C
                                                                              e+[F1 ] C           e−[F1 ] C
                                                                                                          e+[F2 ]∗ ,     (4.256b)
                                    2 h                                               
         [F1 ] e[F2 ]∗           N                  [F ] e [F2 ]∗        [F1 ] e [F2 ]∗
      ⟨C       C        ⟩=                λγ C+ 1 C          +     +  C  −     C  −
                                  ŝ
                                                                                                                   i
                                                                                 [F ] e [F2 ]∗       [F1 ] e [F2 ]∗
                                              −iζ sin 2(ϕ − ϕγ ) C+ 1 C                  −      + C  −    C  +         , (4.256c)
                                    2 h                                               
       e[F1 ] C [F2 ]∗ ⟩ =       N               e+[F1 ] C+[F2 ]∗ + C  e−[F1 ] C−[F2 ]∗
      ⟨C                                  λγ    C
                                  ŝ
                                                                                                                   i
                                              +iζ sin 2(ϕ − ϕγ ) C            e+[F1 ] C−[F2 ]∗ + Ce−[F1 ] C+[F2 ]∗ .     (4.256d)
The the result of Eq. (4.222) with the photon polarization averaged is,
                                2 n
                             N             [H]             [H]          e+[H] |2 + |C   e−[H] |2
     A0 = (1 − ξ )     2
                                       |C+ |2 + |C− |2 + |C
                                            e                e
                             ŝ
                                                h                                      io
                +2ζ cos 2(ϕ − ϕγ ) Re C             e+[H] Ce−[H]∗ − C+[H]   e
                                                                               C−
                                                                                  [H]∗
                                                                                   e
                                                                                             + O(ξ 2 , t/m2 ),           (4.257a)
                             2 n                h                                      i
                         2    N                       e+[H] C+[H]∗       e−[H] C−[H]∗
     A1 = 2(1 − ξ )                      λγ Re C                     +C
                                                               e                     e
                                ŝ
                                                                  317


                                         h                                 io
               −ζ sin 2(ϕ − ϕγ ) Im C      e+[H] C−[H]∗
                                                     e
                                                         +C  e−[H] C+[H]∗
                                                                       e
                                                                                + O(ξ 2 , ξt/m2 ),                (4.257b)
                      2 n                h                                                                     i
       1                N                    e+[E] C+[H]∗      e−[E] C−[H]∗          e+[H] C+[E]∗    e−[H] C−[E]∗
     A2 = 2(1 + ξ)                 λγ Re C                +C                 −ξ C                 +C
                                                       e                 e                     e               e
                          ŝ
                                         h                                                                      io
               −ζ sin 2(ϕ − ϕγ ) Im C      e+[E] C−[H]∗
                                                     e
                                                         +C  e−[E] C+[H]∗
                                                                       e
                                                                           −ξ C    e+[H] C−[E]∗
                                                                                             e
                                                                                                 +Ce−[H] C+[E]∗
                                                                                                             e
               + O(ξ 2 ),                                                                                         (4.257c)
                         2 n h                                                   e e                          i
       2                     N              e+[H] C
                                                  e+[E]∗ + C e−[H] Ce−[E]∗ − ξ C+[H]        [E]∗      [H]   [E]∗
     A2 = −2(1 + ξ)                   Im C                                               C+ + C− C−
                                                                                                       e     e
                             ŝ
                                         h                                        e e                           io
               +ζ cos 2(ϕ − ϕγ ) Im C      e+[H] Ce−[E]∗ + C e−[H] Ce+[E]∗ + ξ C+[H]       [E]∗
                                                                                         C− + C− C+
                                                                                                     [H]
                                                                                                       e   [E]∗
                                                                                                             e
                                                                                                                     ,
                                                                                                                  (4.257d)
Inserting Eq. (4.218) to Eq. (4.103), with the explicit forms in Eq. (4.257), we have the
differential cross section of the photoproduction process,
                                                                 
                 dσ                     1            dσ
                                  =                                1 + λN λγ ALL + ζAU T cos 2(ϕ − ϕγ )
       d|t| dξ dϕS d cos θ dϕ         (2π)2 d|t| dξ d cos θ
                                                                            
                                                                 sT ∆T
                        + λN ζALT sin 2(ϕ − ϕγ ) +                            AT U sin ϕS + λγ AT L cos ϕ∆
                                                              m(1 − ξ)
                                                                                                       
                                1                                       2
                        + ζAT T cos ϕS sin 2(ϕ − ϕγ ) + ζAT T sin ϕS cos 2(ϕ − ϕγ ) .                              (4.258)
where λN in place of λ in Eq. (4.218) is the helicity of the initial-state nucleon
                                                                       2
                                       dσ                          CF       1 − ξ2
                                                   = π αe αs                         ΣU U                          (4.259)
                                d|t| dξ d cos θ                    Nc        ξ 2 s3
is the unpolarized differential cross section, with
                                  [H]
                   ΣU U = |C+ |2 + |C− |2 + |C
                                   e           [H]
                                                e
                                                          e+[H] |2 + |C  e−[H] |2 + O(ξ 2 , t/m2 ).                (4.260)
                                                          318


The presence of the beam polarizations give rise to various asymmetries,
                        h                                 i
       ALL = 2 Σ−1   Re  Ce+[H] C+[H]∗
                                    e
                                        + C e−[H] C−[H]∗
                                                      e
                                                            ,                                          (4.261a)
                 UU
                        h                                 i
                 −1       e [H] e [H]∗
       AU T = 2 ΣU U Re C+ C− − C+ C−
                                              [H]
                                               e     [H]∗
                                                      e
                                                            ,                                          (4.261b)
                           h                                 i
       ALT = −2 Σ−1   Im    Ce+[H] C−[H]∗
                                      e
                                          +   Ce−[H] C+[H]∗
                                                        e
                                                               ,                                       (4.261c)
                   UU
                      h                                        e e                             i
       AT U = Σ−1  Im   e+[H] C
                        C      e+[E]∗ + Ce−[H] Ce−[E]∗ − ξ C+[H]       C +
                                                                          [E]∗
                                                                               + C −
                                                                                    [H]
                                                                                     e
                                                                                        C −
                                                                                           [E]∗
                                                                                            e
                                                                                                   ,   (4.261d)
                UU
                      h                                                                        i
       AT L = Σ−1 Re C  e+[E] C+[H]∗
                                  e
                                      +C e−[E] C−[H]∗
                                                   e
                                                        −ξ C     e+[H] C+[E]∗
                                                                           e
                                                                               +C e−[H] C−[E]∗
                                                                                            e
                                                                                                   ,   (4.261e)
                UU
                        h                                                                        i
       A1T T = −Σ−1  Im   Ce+[E] C−[H]∗
                                    e
                                        +  Ce−[E] C+[H]∗
                                                      e
                                                          −   ξ    Ce+[H] C−[E]∗
                                                                             e
                                                                                 +  Ce−[H] C+[E]∗
                                                                                              e
                                                                                                     , (4.261f)
                  UU
                      h                                        e e                             i
         2      −1      e [H] e [E]∗     e [H] e [E]∗              [H] [E]∗         [H]   [E]∗
       AT T = ΣU U Im C+ C− + C− C+ + ξ C+ C− + C− C+                                              ,   (4.261g)
                                                                                     e      e
which are approximations of the full expressions up to the errors suppressed by ξ 2 or t/m2 .
The ALL can be measured through the double helicity asymmetry,
                                                   1 σ(λN , λγ ) − σ(λN , −λγ )
                      ALL (t, ξ, cos θ) =                                                   ,           (4.262)
                                               λN λγ σ(λN , λγ ) + σ(λN , −λγ )
where σ stands for the differential cross section in (t, ξ, cos θ) with ϕS and ϕ integrated out.
All the other asymmetries can be measured through the azimuthal modulations.
    To the leading accuracy, the differential cross section without sT allows to probe the
GPDs H and H,  e whereas with a nonzero sT , we also gain access to probe the GPD E and
e both with enhanced x-sensitivity.
E,
4.7.4      Numerical results
Now we present the numerical results for the photoproduction process on the enhanced
x-sensitivity. We target our analysis toward the JLab Hall D experiment, with a photon
                                                      319


beam Eγ = 9 GeV of arbitrary polarization and a proton target that can be unpolarized
or longitudinally polarized. With a potential upgrade to JLab 22 GeV, a photon beam
Eγ = 22 GeV can also be accessed. Similarly, we fix the DA to the asymptotic form with
ϕ(z) = 6z(1 − z), and renormalization and factorization scales to 2 GeV, turning off QCD
evolution effects.
                                                            u
            H u(x, ξ = 0.2, t = -0.2 GeV2, μ = 2 GeV)      H (x, ξ = 0.2, t = -0.2 GeV2, μ = 2 GeV)
        10
         5
         0
                                                                                  
                            H0 = HGK                                        H 0 = H GK
        -5                                                                             
                                                                            H 1 = H GK + S 1
                            H1 = HGK + S1
                                                                                       
                            H2 = HGK + S2                                   H 2 = H GK + S 2
       -10                  H3 = HGK + Ds
          -1        -0.5          0        0.5        1 -1         -0.5          0          0.5     1
                                  x                                              x
       Fig. 4.13: Choices of the u-quark GPD models for the photoproduction process.
    As we shall see here, this photoproduction process gives better sensitivity to GPDs, so
we lower the weights of both the shadow GPDs and D-term to 2 in Eqs. (4.177)(4.179) and
(4.183), in contrast to the diphoton process in Sec. 4.6.4. The resultant GPD models for the
u quark are displayed in Fig. 4.13.
    In Fig. 4.14, we show the unpolarized differential cross section in Eq. (4.259) together
with the various asymmetries in Eq. (4.261) for π 0 production as a function of its polar angle
θ in the SDHEP frame at Eγ = 9 GeV. Since the amplitudes C− and C
                                                                             [H]
                                                                               e
                                                                                       e−[H] only depend on
GPDs through their moments (Eq. (4.251)), they are not visible to the shadow GPDs. On
the other hand, the the amplitudes C+ and C
                                             [H]
                                               e
                                                      e+[H] have nontrivial cos θ dependence through
the special GPD integral in Eq. (4.252). Therefore, GPDs with different x-dependence lead
to different rate and asymmetries. In particular, the ALT is sensitive to the imaginary parts
                                                   320


             250 (a) d σ / dt d ξ dcosθ [pb/GeV2]                  p γ  p π0 γ                (b) AUT -0.3
                                                         
             200                  ( H0 , H 0 )     ( H3 , H 0 )
                                                                                                           -0.5
             150                  ( H1 , H 0 )     ( H0 , H 1 )
                                                         
                                  ( H2 , H 0 )     ( H0 , H 2 )                                              -0.7
             100
               50                                                                                            -0.9
              0.8 (c) ALL                                                                      (d) ALT
                                                                                                             0
              0.4
                                                                                                             -0.1
                0                                                                       Eγ = 9 GeV
                                                                         t = -0.2 GeV2, ξ = 0.2              -0.2
            -0.4
                   -0.5  -0.25       0         0.25       0.5    -0.5  -0.25     0          0.25       0.5
                                  cosθ                                        cosθ
Fig. 4.14: Unpolarized rate (a) and polarization asymmetries (b)-(d) as functions of cos θ at
(t, ξ) = (−0.2 GeV2 , 0.2), using different GPD sets as given in Fig. 4.13.
             140 (a) d σ / dt d ξ dcosθ [pb/GeV2]                  p γ  n π+ γ                (b) AUT 0.2
                         Eγ = 9 GeV                                                                          0
             100
                          t = -0.2 GeV2, ξ = 0.2                                                             -0.2
                                                                                                      
                                                                               ( H0 , H 0 )     ( H3 , H 0 )
               60                                                                                          -0.4
                                                                               ( H1 , H 0 )     ( H0 , H 1 )
                                                                                                           -0.6
                                                                               ( H2 , H 0 )     ( H0 , H 2 )
               20                                                                                            -0.8
                   (c) ALL                                                                     (d) ALT 0
            -0.2
                                                                                                             -0.1
            -0.4
                                                                                                             -0.2
            -0.6                                                                                             -0.3
            -0.8                                                                                             -0.4
                   -0.5  -0.25       0         0.25       0.5    -0.5  -0.25     0          0.25       0.5
                                  cosθ                                        cosθ
                  Fig. 4.15: Same as Fig. 4.14, but for the pγ → nπ + γ process.
                                                               321


              120 (a)
                       d σ / dt d ξ dcosθ [pb/GeV2]                n γ  p π- γ          (b) AUT 0.3
              100                                       
                                  ( H0 , H 0 )   ( H3 , H 0 )                                     0.1
               80                                                                               -0.1
                                  ( H1 , H 0 )   ( H0 , H 1 )
               60                                                                               -0.3
                                  ( H2 , H 0 )   ( H0 , H 2 )
               40                                                                                 -0.5
                                                                                                  -0.7
               20
                                                                                                  -0.9
                0 (c ) A                                                                 (d) ALT  0
                         LL                    Eγ = 9 GeV
             -0.2              t = -0.2 GeV2, ξ = 0.2                                             -0.1
             -0.4                                                                                 -0.2
             -0.6                                                                                 -0.3
             -0.8                                                                                 -0.4
               -1                                                                                 -0.5
                   -0.5   -0.25        0       0.25         0.5  -0.5  -0.25    0     0.25    0.5
                                     cosθ                                      cosθ
                  Fig. 4.16: Same as Fig. 4.14, but for the nγ → pπ − γ process.
of the amplitudes, which are generated in the ERBL region, and has better sensitivity to the
shadow D-term than the other three observables as shown in Fig. 4.14.
    As for the diphoton production process in Sec. 4.6.4, the oscillation of shadow GPDs in the
DGLAP region generally causes a big cancellation in their contributions to the amplitudes,
while the sensitivity is more positively correlated with the GPD magnitude in the ERBL
region. The shadow Sei associated with the x-dependence of the polarized GPD H                         e gives
bigger contribution to the amplitude C+ than Si to C
                                                      [S]
                                                        e
                                                                        e+[S] due to charge symmetry property,
so can be better probed.
    Contrary to diphoton production process, now we have various polarization observables
that have linear dependence on the helicity-flipping amplitudes, such as the AU T and ALT in
Eq. (4.261), thereby on the special integrals. The helicity-conserving amplitudes, which are
blind to shadow GPDs, now do not just come as large backgrounds, but are large coefficients
of the special integrals to enhance the discriminative power. This therefore gives better
opportunities to probe the GPDs.
                                                               322


 100                                                                              0
       (a) d σ / dt d ξ dcosθ [pb/GeV2]            p γ  p π0 γ          (b) AUT
   80                                                                             -0.2
                                      
                   ( H0 , H 0 ) ( H3 , H 0 )
   60                                                                           -0.4
                   ( H1 , H 0 ) ( H0 , H 1 )
   40                                                                           -0.6
                   ( H2 , H 0 ) ( H0 , H 2 )
   20                                                                             -0.8
                                                                                  -1
  0.9 (c) A
             LL                                                          (d) ALT
  0.6                                                                             0
  0.3
                                                                                  -0.1
    0
-0.3                                                              Eγ = 17 GeV
                                                                                  -0.2
-0.6                                                    t = -0.2 GeV2, ξ = 0.2
                                                                                  -0.3
    -0.8      -0.4           0    0.4         -0.8   -0.4      0       0.4     0.8
                          cosθ                               cosθ
          Fig. 4.17: Same as Fig. 4.14, but with Eγ = 17 GeV.
   70 (a) d σ / dt d ξ dcosθ [pb/GeV2]             p γ  n π+ γ          (b) AUT 0.2
                                                                                0
   50              ( H0 , H 0 ) ( H3 , H 0 )
                                                                                -0.2
                   ( H1 , H 0 ) ( H0 , H 1 )
                                                                                  -0.4
   30                      
                   ( H2 , H 0 )
                                       
                                ( H0 , H 2 )                      Eγ = 17 GeV
                                                                                  -0.6
                                                        t = -0.2 GeV2, ξ = 0.2    -0.8
   10
       (c) ALL                                                           (d) ALT
    0                                                                             0
-0.2                                                                              -0.1
-0.4                                                                              -0.2
-0.6                                                                              -0.3
-0.8                                                                              -0.4
    -0.8      -0.4           0    0.4         -0.8   -0.4      0       0.4     0.8
                          cosθ                               cosθ
          Fig. 4.18: Same as Fig. 4.15, but with Eγ = 17 GeV.
                                             323


                70 (a) d σ / dt d ξ dcosθ [pb/GeV ]
                                                              2       n γ  p π- γ    (b) AUT 0.3
                                                                                             0.1
                50               ( H0 , H 0 )    ( H3 , H 0 )
                                                                                               -0.1
                                                       
                                 ( H1 , H 0 )    ( H0 , H 1 )                                  -0.3
                30                                     
                                                                                               -0.5
                                 ( H2 , H 0 )    ( H0 , H 2 )
                                                                                               -0.7
                10                                                                             -0.9
                     (c) ALL                  Eγ = 17 GeV                             (d) ALT 0
                 0
                               t = -0.2 GeV2, ξ = 0.2                                          -0.1
              -0.2
              -0.4                                                                             -0.2
              -0.6                                                                             -0.3
              -0.8                                                                             -0.4
               -1                                                                              -0.5
                 -0.8       -0.4           0       0.4           -0.8   -0.4    0   0.4     0.8
                                        cosθ                                   cosθ
                        Fig. 4.19: Same as Fig. 4.16, but with Eγ = 17 GeV.
    Furthermore, for the neutral pion production, we can eliminate terms proportional to
(e1 − e2 )2 or (e21 − e22 ) in the amplitudes [Eq. (4.251)] since e1 = e2 , which effectively removes
a good number of moment-type terms, giving the maximum amount of entanglement and
the most sensitivity to GPDs’ x-dependence. In Fig. 4.15, we present the same study for the
pγ → nπ + γ process. With different flavor combination, it provides different x-sensitivity.
The nγ → pπ − γ process gives a similar result in Fig. 4.16, but with a smaller production rate.
Although the charged pion channels have less sensitivity in the absolute distributions, they
yield greater polarization asymmetry values than the neutral pion channel so can provide
equally significant and complementary results.
    For ungraded JLab energy, the photon beam with Eγ = 17 GeV yields similar results,
shown in Fig. 4.17. The rate at each phase space point decreases as the energy increases,
but the kinematic coverage becomes wider, giving a complementary opportunity to probe
the x dependence via wider cos θ distributions.
    As demonstrated in Figs. 4.14–4.19, both the production rate and asymmetries are sizable
                                                                324


and measurable, making the photoproduction process uniquely different from DVCS and
other processes in terms of its enhanced sensitivity for extracting the x-dependence of GPDs.
                                             325


Chapter 5
Summary and Outlook
One of the dominant features of QCD is that colors are fully entangled and confined within
hadrons, which makes the internal structure of hadrons by no means like the atomic structure,
where electrons are bound to nucleus in a sparsely distributed space. On the contrary, inside
a hadron, quarks and gluons are densely distributed and strongly tied together. As a result,
it is less useful to describe the hadronic structure using the concept of wavefunction in non-
relativistic quantum mechanics. Instead, the study of hadrons’ internal dynamics is to use
parton correlation functions, which are the expectation values of a set of parton fields in a
hadron state. A full understanding of partonic hadron structure can be obtained by knowing
all possible parton correlation functions.
     However, the correlation functions are by definition nonperturbative and require exper-
imental measurement, given the lack of a full nonperturbative calculation method. The
connection of the correlation functions to experimental observables is given by QCD factor-
ization theorems. At a hard scattering process involving hadrons, one can show that the
scattering cross section or amplitude can be factorized into certain parton correlation func-
tions with perturbatively calculable hard coefficients, to the leading power of the hard scale.
Depending on the specific type of processes, one end up with different type of parton corre-
lation functions, which have operator definitions, can be studied on their own, and uncover
different aspects of the hadronic structures.
                                               326


    For inclusive processes, the factorization of their cross sections leads to the forward parton
distribution functions, which capture the one-dimensional longitudinal parton correlation on
the light cone within a fast-moving hadron, and transverse-momentum-dependent parton
distribution functions, which in addition capture the parton correlations in the transverse
plane. Both distributions correspond to cut diagrams, and are expressed as the diagonal
matrix elements of parton operators.
    Exclusive processes, on the other hand, are factorized at amplitude level into new types of
parton correlation functions, among which are the meson or baryon distribution amplitudes
that play the role of hadron wavefunctions on the light cone, and the generalized parton
distributions (GPDs), which form the main part of this thesis. Among others, the GPDs
entail three-dimensional parton pictures in the space of parton momentum fraction x and
transverse position bT . We have shown a general class of 2 → 3 processes in Sec. 3.2, the
single diffractive hard exclusive processes, whose amplitudes can be factorized into GPDs
and which can provide useful experimental probes to GPDs.
    While two of the three variables (x, ξ, t) of GPDs are directly related to the measured
momenta of the diffractive hadron, p − p′ , it is the relative momentum fraction x of the
two exchanged partons, [q q̄ ′ ] or [gg], between the diffractive hadron and the hard probe
that is the most difficult one to extract from the experimental measurement, while it is
the most important one to define the slices of the hadron’s spatial tomography. We have
systematically examined the sensitivity of various SDHEPs for extracting the x-dependence
of GPDs in Sec. 4.4, and divided the sensitivity into two types: moment type and enhanced
type. We argued that the requirement for enhanced sensitivity on x is to have at least one
internal propagator in the hard part that is not connected to two on-shell massless external
lines on either of its ends, which usually requires observing more than one external particle
                                               327


that comes out of the hard scattering. We gave two example processes, the hard diphoton
production in single-diffractive pion-nucleon collision, and single-diffractive photoproduction
of a hard photon-pion pair. These two processes give complementary enhanced sensitivity to
the x dependence of GPDs, which were demonstrated by using the shadow GPDs. d Given
both the theoretical and experimental difficulties to unambiguously extract the x-dependence
of GPDs, one should not only study as many independent GPD-related processes as possible,
but also identify more processes that yield enhanced sensitivity to the x dependence of GPDs.
With a generic factorization proof, the SDHEP can serve as a framework to identify and
categorize all specific processes for the study of GPDs. In this thesis, we categorized these
processes in terms of the type of the beam colliding with the diffractive hadron. With the
two-stage paradigm of the SDHEP, we are well motivated for the search of new processes for
extracting GPDs, and in particular, their x-dependence.
                                              328


              Part II
   Single Transverse Polarization
Phenomena at High-Energy Colliders
                 329


Chapter 6
Introduction
Spins are unique features of quantum mechanics, as a product of quantum Lorentz sym-
metry [Wigner, 1939; Weinberg, 2005]. At high-energy colliders such as the Large Hadron
Collider (LHC), however, spin phenomena are relatively rarely discussed, because (1) the
LHC is an unpolarized proton-proton collider, so usually it does not produce polarized par-
ticles; and (2) the detectors of high-energy colliders only record the energy and momentum
information, but do not measure the spins, so even if a particle is produced polarized, the
spin information will be lost. Both obstacles can be overcome. First, the Standard Model
(SM) contains parity-violating weak interactions, so particles can be produced with net spin
polarizations along their momentum directions, or net helicities. Furthermore, even without
parity violations, there can be significant transverse polarizations produced even at unpo-
larized colliders. In both contexts, the polarization refers to a single particle, with all other
particles’ spins unobserved, so it belongs to the regime of single polarization production, sim-
ilar to the discussion of single spin asymmetry at polarized colliders. Second, even though the
high-energy detectors do not directly measure spins, if the polarized particle is unstable and
decays into other particles, its polarization information will be imprinted on the kinematic
distributions, especially angular distributions, of the decay products. This is because the
polarization of the mother particle breaks the spatial rotational invariance, and so it leads
to certain angular distributions, which can be determined by rotation group properties.
                                               330


    The same story holds for high-energy quarks and gluons produced in the hard collisions.
Due to the asymptotic freedom, such particles are produced as (quasi-)free particles, with well
defined polarization properties. But as they travel away from each other, the color interaction
among them becomes stronger and stronger, and eventually turns each fast-moving quark
or gluon into a jet of hadrons. It may be argued that such hadronization process will wash
out all the original parton spin information. But it is more presumably motivated from
the high-energy jetty event structures that only soft gluons are exchanged among the hard
partons to neutralize colors [Collins, 1993]. Perturbatively, soft gluon exchanges do not
change the spins of hard partons, so we can expect the polarization of the quark or gluon
produced from the hard collision to be preserved when it fragments into a jet. As a result,
the angular distribution of the jet constituents will reflect the polarization state of the parton
that initiates the jet.
    Therefore, it is equally feasible to study spin phenomena at high-energy colliders as well
as at low-energy experiments. This leads to much more observables than the pure production
rates. Especially, as we will elaborate in this thesis, the transverse polarization corresponds
to the quantum interference between different helicity states. Such information would be lost
had one not measured the decay distributions. The spin-sensitive observables hence provide
new tests on the interaction structures of the SM.
    This rest of this thesis is devoted to the study of single transverse polarization phenomena
at such high-energy colliders as the LHC. Historically, such study dates back to 1976 when it
was discovered at Fermilab that the inclusively produced Λ0 hyperon in hadron collisions had
a substantial transverse polarization [Bunce et al., 1976; Heller et al., 1978]. This triggered a
number of both experimental and theoretical studies until today. Among the early theoretical
works was Ref. [Kane et al., 1978], where it was realized that the single transverse spin of
                                                 331


a quark is an infrared-safe observable in Quantum Chromodynamics (QCD), which can be
calculated perturbatively by virtue of the asymptotic freedom. Following the observation
that only the transverse spin component perpendicular to the scattering plane is allowed by
parity conservation, the authors argued that this must be sourced by the imaginary part of
the interference between a helicity-conserving and a helicity-flipping amplitudes. Therefore,
one necessarily requires a nonzero quark mass to flip the quark helicity and a threshold effect
at loop level to generate a nonzero phase. So then in the scaling limit, the quark polarization
                          √
is suppressed by αs mq / s, where αs is the strong coupling due to the loop effect, mq is the
                   √
quark mass, and      s is the scattering energy.
    Although this means the single transverse spin of a strange quark produced at high-energy
collisions would be too small to explain the observed large Λ0 polarization [Dharmaratna and
Goldstein, 1990], it does imply the possibility of having a largely polarized top quark [Kane
et al., 1992], which is the heaviest quark in the SM and whose polarization could be a new
probe for new physics. Any deviation from the SM prediction, especially a nonzero transverse
spin within the production plane, could indicate the existence of a new interaction or even
CP violation.
    One advantage of the transverse spin is that it leads to a nontrivial azimuthal correlation
of the decay products with the spin direction, as a result of breaking the rotational invariance.
Since a transverse spin is the interference between different helicity states, λ1 and λ2 , the
specific correlation form can be easily obtained from rotational properties as cos(λ1 − λ2 )ϕ
and/or sin(λ1 − λ2 )ϕ, with ϕ characterizing the overall azimuthal direction of the decay
products. Such correlations can be readily measured to determine the value of the transverse
polarization. Unlike the helicity polarization that leads to a forward-backward asymmetry
for the decay products with respect to the momentum direction of the mother particle, the
                                               332


azimuthal correlations resulting from the transverse polarization stay invariant when the
polarized particle is boosted. This makes them a source of new jet substructures for boosted
objects. However, due to the spin-half nature, the azimuthal correlations associated with
a transversely polarized quark are cos ϕ and/or sin ϕ, the observation of which requires to
identify the flavor of the decay products. For example, in a jet initialized by a transversely
polarized u quark, one may be observing the correlation of a charged pion π + with the
polarization direction.
    On the other hand, a gluon can also be produced in high-energy collisions with a linear
polarization, as was noticed around the same time as the transverse quark spin [Brodsky
et al., 1978]. Contrary to the latter, though, the linear gluon polarization does not suffer
from the mass and high-order suppression, and can in principle be produced at leading
order with a large magnitude [Brodsky et al., 1978; Olsen et al., 1980; Devoto et al., 1980,
1979; DeGrand and Petersson, 1980; Petersson and Pire, 1980; Olsen et al., 1981; Devoto
and Repko, 1982; Korner and Schiller, 1981; Olsen and Olsen, 1984; Hara and Sakai, 1989;
Jacobsen and Olsen, 1990; Groote et al., 1997, 1999; Groote, 2002; Yu et al., 2022]. Since
gluons are spin-one massless particles, their linear polarization is the interference between a
+1 and a −1 helicity states, with a helicity flip by two units. Hence, they will leave cos 2ϕ
and/or sin 2ϕ azimuthal correlations in the fragmented jets. Such correlations are invariant
under ϕ → ϕ+π so do not require distinguishing the particle flavors, but instead they will be
reflected as an azimuthal anisotropy in the energy deposition. Observation of such polarized
gluon jet substructure could be easier than for the polarized quark ones. As we will show,
this can serve as a new tool to probe CP -violating interactions.
    Similar effects also apply to massive vector bosons like the W and Z, which can also be
produced with a linear polarization when they carry a nonzero transverse momentum. Such
                                              333


phenomena have actually been noticed all along when one studies the angular functions of
the Drell-Yan pair in their rest frame [Lam and Tung, 1978]. However, one may still gain
some insights when framing in terms of linear polarizations. Especially, as one goes to the
boosted regime, a W or Z may be produced with a very high transverse momentum such
that their decay products are highly collimated. In particular, when they decay hadronically,
it may not be easily determined whether they are QCD jets or are indeed from the heavy
boson decays, and one cannot simply reconstruct the rest frame for each event. Carefully
designed jet substructure observables must be employed to tag the boosted objects. Then the
angular function decomposition back in the rest frame loses its advantage, but the azimuthal
correlation substructures due to the linear polarization retain their simplicity and can be used
to tag the observed jets.
    Linearly polarized gluons can not only be produced from hard collisions, but also can
exist ubiquitously elsewhere, such as from heavy meson decay [Brodsky et al., 1978; Koller
et al., 1981; Robinett, 1991] and from parton showering [DeGrand and Petersson, 1980].
In particular, it has been noticed that a linearly polarized gluon can be emitted in the
shower of an unpolarized parton and lead to nontrivial cos 2ϕ correlations [Chen et al., 2021,
2022; Karlberg et al., 2021; Hamilton et al., 2022]. The reason for this is that a 1 → 2
splitting in the boosted parton showering defines a plane and allows a linear polarization
along or perpendicular to the plane. This resembles the gluon Boer-Mulders function in the
transverse-momentum-dependent QCD factorization [Mulders and Rodrigues, 2001; Nadol-
sky et al., 2007; Boer et al., 2011; Catani and Grazzini, 2012; Sun et al., 2011; Qiu et al.,
2011; Boer et al., 2012], for which a linearly polarized gluon distribution can exist in an
unpolarized hadron target when the gluon carries a nonzero transverse momentum.
    Again, similar effects can be extended to massive vector bosons, which can come from
                                              334


the decay of a boosted heavy object like a top quark or Higgs boson [Yu and Yuan, 2022b].
For the same reason, the intermediate vector boson can carry a linear polarization and then
decay into light particles preferentially along the direction parallel or perpendicular to the
polarization direction. This leads to a more complicated azimuthal correlation in the original
boosted heavy particle. The minimal configuration is a 1 → 3 decay. When extended to
the hadronic decay mode, the intermediate linearly polarized vector boson gives rise to an
azimuthally inhomogeneous energy deposition pattern that makes the whole “fat” jet more
circular or planar. Such phenomena could be measured as a precision test of the SM and
probe for new physics.
    The rest of this thesis is organized as the following. First, to lay the foundation of
the polarization study, I will review in Ch. 7 the definitions of the spin states and their
Lorentz transformation properties which are governed by the corresponding little group,
mainly following the discussion in [Weinberg, 2005]. Along the line will be derived the
explicit little group forms for some important cases that will be used in later sections. Then
in Ch. 8, I will discuss the transverse spins of quarks, using the top quark as a main example.
The discussion is mainly as an introduction for the vector boson polarization in the following
chapter, with most being known in the literature. A brief comparison between the single
quark polarization and the quark spin-spin correlations is given at the end of this chapter.
Next, Ch. 9 is devoted to the study of linear vector boson polarization at the LHC. This
forms the main part of the rest of this thesis. We will first discuss the linear polarization of
a gluon as produced directly from a hard collision. This discussion leads to the definition of
a polarized gluon jet function, which provides a concrete procedure for measuring the gluon
polarization at the LHC. As will be explained, such measurement will provide a sensitive
probe for possible CP -violating effects. Then we will discuss the linear polarization of a
                                                335


vector boson that comes from the decay of a boosted heavy object. The focus will be on a
boosted top quark that decays into a bottom quark and a W boson, which further decays
into a lepton pair or quark pair. We will give a physical argument of why the W boson
can be linearly polarized in the boosted regime. The derivation will clarify it as a general
phenomenon that a boosted 1 → 3 decay system can exhibit such azimuthal correlation if
it is mediated by a vector boson. Finally, in Ch. 10, we conclude our discussion and present
the outlook.
                                             336


Chapter 7
Poincare group representation and
little group transformation
In this chapter, we review the Poincaré group representation and the associated little group,
following Ch. 2.5 of [Weinberg, 2005].
7.1      General formalism
Setting the Poincaré symmetry as the fundamental spacetime symmetry, we identify states
that can transform into each other under a Poincaré transformation as belonging to the same
particle species. The Poincaré symmetry transformation acts on the coordinate space as
                                   xµ → x′µ = Λµ ν xν + aµ ,                              (7.1)
defined to make gµν dxµ dxν invariant, with gµν = diag{1, −1, −1, −1}. This defines the
Lorentz transformation Λµ ν and the translation aµ , such that gµν Λµ ρ Λν σ = gρσ . Eq. (7.1)
induces a unitary operator U (Λ, a) on the Hilbert space, the whole set of which forms the
Poincaré group and satisfies
                             U (Λ, a) U (Λ′ , a′ ) = U (ΛΛ′ , a + Λa′ ),                  (7.2)
                                                  337


and
                      U (1, 0) = 1,   U † (Λ, a) = U −1 (Λ, a) = U (Λ−1 , −Λ−1 a).       (7.3)
We also define the Lorentz transformation U (Λ) ≡ U (Λ, 0), whose set forms the Lorentz
group, as a unitary subgroup of the Poincaré group.
    By the translation properties, we label each single-particle state by its momentum pµ and
some internal quantum number collectively denoted as σ,
                                           P̂ µ |p, σ⟩ = pµ |p, σ⟩,                      (7.4)
where P̂ µ is the momentum operator, defined as the generator of the translation group,
                                              U (1, a) = eiP̂ ·a .                       (7.5)
The momenta of two states can be related to each other under Lorentz transformation only
if they have the same mass m2 = pµ pµ and, if m2 > 0, sign of p0 . Since
                                              h                      i
                  P̂ µ U (Λ)|p, σ⟩ = U (Λ) U −1 (Λ)P̂ µ U (Λ) |p, σ⟩
                                               h          i
                                    = U (Λ) Λµ ν P̂ µ |p, σ⟩ = [Λµ ν pµ ] U (Λ)|p, σ⟩,
the state U (Λ)|p, σ⟩ has a momentum Λp, and hence can be expanded as
                                                   X
                                U (Λ)|p, σ⟩ =           |Λp, σ ′ ⟩ Dσ′ σ (Λ, p).         (7.6)
                                                    σ′
                                                       338


This forms the unitary representation of the Lorentz group, which is infinitely dimensional,
                     "                           #
                       XZ             d3 p′                                                              
     U (Λ)|p, σ⟩ =                       3
                                                     |p′ , σ ′ ⟩ (2π)3 (2Ep′ )δ (3) (p′ − Λp) Dσ′ σ (Λ, p) ,  (7.7)
                       σ′
                                  (2π) 2Ep′
where Λp is the three-vector part of Λp. By choosing the normalization
                                ⟨p′ , σ ′ |p, σ⟩ = (2π)3 (2Ep ) δ (3) (p − p′ ) δσσ′ ,                        (7.8)
we can write the representation matrix as
                     ⟨p′ , σ ′ |U (Λ)|p, σ⟩ = (2π)3 (2Ep′ ) δ (3) (p′ − Λp) Dσ′ σ (Λ, p)                      (7.9)
in the (p, σ) space. The matrix Dσ′ σ (Λ, p) is also unitary,
                                       X
                                             Dσ† ′′ σ′ (Λ, p)Dσ′ σ (Λ, p) = δσ′′ σ ,                         (7.10)
                                         σ′
and satisfies the multiplication rule,
                                                        X
                               Dσ′′ σ (ΛΛ′ , p) =              Dσ′′ σ′ (Λ, Λ′ p)Dσ′ σ (Λ′ , p),              (7.11)
                                                          σ′
with
                                 D(1, p) = 1,             D−1 (Λ, p) = D(Λ−1 , Λp).                          (7.12)
    To find the representation matrix of the Lorentz group, it is necessary to first clearly define
each particle state |p, σ⟩, especially the quantum number σ. For a particular particle, we
define all its states by relating to the “standard” states with a canonical reference momentum
                                                                339


k, of which the quantum number σ is defined for all values. This can be chosen as
                                           k = (m, 0, 0, 0)                              (7.13)
for a massive particle with m2 = k 2 > 0, or
                      k = (E0 , 0, 0, E0 ) (with E0 = 1 GeV for example)                 (7.14)
for a massless particle. Any other possible momentum p is related to k by a standard Lorentz
transformation L(p) ≡ L(p; k),
                                           pµ = Lµ ν (p)k ν ,                            (7.15)
where we suppress the dependence on k since it is common for all the states of a particular
particle. L(p) can be standardly defined by first boosting along the +ẑ direction by U (Λz (β))
such that p1 = Λz (β)k has the same energy as p, and then rotating p1 to the same direction
as p by first rotating around the y axis by θ and then around the z axis by ϕ,
                   L(p) = R(θ, ϕ)Λz (β) = Rz (ϕ)Ry (θ)Λz (β) = Λp̂ (β)R(θ, ϕ),           (7.16)
where θ and ϕ are the polar and azimuthal angles of p, respectively. The last step gives an
alternative definition of L(p) that first rotates k to the direction of p by R(θ, ϕ) and then
boosts along the direction of p by Λp̂ (β) = R(θ, ϕ)Λz (β)R−1 (θ, ϕ) to reach the same energy.
The induced Lorentz transformation U (L(p)) in the Hilbert space is thus
                    U (L(p)) = U (R(θ, ϕ))U (Λz (β)) = U (Λp̂ (β))U (R(θ, ϕ)),           (7.17)
                                                340


with
                      U (R(θ, ϕ)) = e−iJz ϕ e−iJy θ ,    U (Λn̂ (β)) = e−iK·n̂β .        (7.18)
   The state |p, σ⟩ is defined as the Lorentz transformation of |k, σ⟩ under U (L(p)),
                                    |p, σ⟩ ≡ U (L(p))|k, σ⟩.                             (7.19)
Under an arbitrary Lorentz transformation U (Λ), the state |p, σ⟩ becomes
                      U (Λ)|p, σ⟩ = U (Λ)U (L(p))|k, σ⟩ = U (ΛL(p))|k, σ⟩
                                  = U (L(Λp)) U (L−1 (Λp)ΛL(p))|k, σ⟩.                   (7.20)
Note that although the transformation ΛL(p) brings the momentum k to Λp, it is not
necessarily equal to L(Λp). But it does imply that the transformation
                                   W (Λ, p) ≡ L−1 (Λp)ΛL(p)                              (7.21)
keeps k invariant,
                                       W µ ν (Λ, p)k ν = k µ .                           (7.22)
Now for a specific momentum k, all the Lorentz transformations that leave it invariant form a
subgroup of the Lorentz group, which is called the little group. A little group transformation
W thus only mixes the quantum number σ, and its representation can be easily obtained,
                                               X
                               U (W )|k, σ⟩ =         |k, σ ′ ⟩Dσ′ σ (W ),               (7.23)
                                                σ′
                                                341


where D(W ) is a unitary matrix. Plugging Eq. (7.23) back into Eq. (7.20) then gives the
Lorentz group representation,
                                                                                                X
         U (Λ)|p, σ⟩ = U (L(Λp))U (W (Λ, p))|k, σ⟩ = U (L(Λp))                                       |k, σ ′ ⟩Dσ′ σ (W (Λ, p))
                                                                                                  σ′
                                X
                          =           |Λp, σ ′ ⟩Dσ′ σ (W (Λ, p)).                                                               (7.24)
                                 σ′
Compared with Eq. (7.6), we get
                                                   Dσ′ σ (Λ, p) = Dσ′ σ (W (Λ, p)),                                             (7.25)
which explains why the same symbol “D” is used. In this way, the Lorentz group representa-
tion is induced from its little group representation. The task of obtaining the transformation
behavior of |p, σ⟩ under U (Λ) is then reduced to finding the corresponding little group trans-
formation W (Λ, p). For this purpose, we need to clearly define k for each particle and the
corresponding L(p). We will do that separately for massive and massless particles.
    Before that, let us first work out the Lorentz transformation behavior of a scattering
amplitude. The helicity amplitude of a scattering (p1 , σ1 ; p2 , σ2 ; . . .) → (q1 , λ1 ; q2 , λ2 ; . . .) is
obtained from the scattering S-operator by
      Mσ1 ,σ2 ,...; λ1 ,λ2 ,... (p1 , p2 , . . . ; q1 , q2 , . . .) ∼ ⟨q1 , λ1 ; q2 , λ2 ; . . . |S|p1 , σ1 ; p2 , σ2 ; . . .⟩. (7.26)
If we transform the scattering system to another frame by Λ, the new helicity amplitude
                                                                      342


becomes
  Mσ1 ,σ2 ,...; λ1 ,λ2 ,... (Λp1 , Λp2 , . . . ; Λq1 , Λq2 , . . .) ∼ ⟨Λq1 , λ1 ; Λq2 , λ2 ; . . . |S|Λp1 , σ1 ; Λp2 , σ2 ; . . .⟩.
                                                                                                                              (7.27)
Their relations can be obtained by using S = U (Λ)SU −1 (Λ) in Eq. (7.27),
                                                                                                                    
       ⟨Λq1 , λ1 ; . . . |S|Λp1 , σ1 ; . . .⟩ = ⟨Λq1 , λ1 ; . . . |U (Λ) S U −1 (Λ)|Λp1 , σ1 ; . . .⟩                         (7.28)
Now Eq. (7.24) gives
                                       X                                            X
       U −1 (Λ)|Λp, σ⟩ =                   |p, σ ′ ⟩Dσ′ σ (W (Λ−1 , Λp)) =                  |p, σ ′ ⟩Dσ−1′ σ (W (Λ, p)),      (7.29)
                                        σ′                                            σ′
where we used W (Λ−1 , Λp) = W −1 (Λ, p) by the definition in Eq. (7.21). So then Eq. (7.28)
becomes
  ⟨Λq1 , λ1 ; . . . |S|Λp1 , σ1 ; . . .⟩
               X                                                                             h                           i
     =                        Dλ1 λ′1 (W (Λ, q1 )) . . . ⟨q1 , λ′1 ; . . . |S|p1 , σ1′ ; . . .⟩ Dσ† ′ σ1 (W (Λ, p1 )) . . . . (7.30)
                                                                                                      1
          λ′1 ,...; σ1′ ,...
Therefore, helicity amplitudes in different frames are connected by a unitary transformation,
  Mσ1 ,...; λ1 ,... (Λp1 , . . . ; Λq1 , . . .)
          X                                                                                   h                          i
                                                                                                     †
   =                      Dλ1 λ1 (W (Λ, q1 )) . . . Mσ1 ,...; λ1 ,... (p1 , . . . ; q1 , . . .) Dσ′ σ1 (W (Λ, p1 )) . . . . (7.31)
                                ′                          ′      ′
                                                                                                       1
     λ′1 ,...; σ1′ ,...
Because of the unitarity of the representation matrices D’s, multiplying the helicity ampli-
tude by its complex conjugate and summing over all helicities leads to a Lorentz invariant
                                                                    343


unpolarized amplitude square.
7.2       Massive case: m > 0
The little group for a massive particle is the three-dimensional rotation group, SO(3). For
the reference momentum k, we define the quantum number σ to be the angular momentum
component along ẑ. Particles can be further decomposed into different species according
to different irreducible representations Dj of SO(3). This introduces a total spin quantum
number j, so we label the state as |k, j, σ⟩,
                                                                          j
                                                                       X
                  Jz |k, j, σ⟩ = σ|k, j, σ⟩,   U (W )|k, j, σ⟩ =              |k, j, σ ′ ⟩Dσj ′ σ (W ), (7.32)
                                                                     σ ′ =−j
where Dj (W ) is the (2j + 1)-dimensional irreducible representation matrix of the SO(3)
group. For an arbitrary momentum p, the quantum number σ in the state |p, j, σ⟩, which is
related to |k, j, σ⟩ by Eqs. (7.16) and (7.19), is defined as the helicity,
     (J · p̂)|p, j, σ⟩ = U (R(θ, ϕ))Jz U −1 (R(θ, ϕ))|p, j, σ⟩ = U (R(θ, ϕ))Jz U (Λz (β))|k, j, σ⟩
                       = U (R(θ, ϕ))U (Λz (β))Jz |k, j, σ⟩ = σ|p, j, σ⟩,                                (7.33)
                                                              p
where we have used Eq. (7.17) with β = |p|/ p2 + m2 , and that Jz commutes with
U (Λz (β)). By Eq. (7.24), we can get the representation of a general Lorentz transformation
U (Λ), which mixes different helicity states of a given particle,
                                                   j
                                                X
                              U (Λ)|p, j, σ⟩ =         |Λp, j, σ ′ ⟩Dσj ′ σ (W (Λ, p)).                 (7.34)
                                               σ ′ =−j
                                                       344


     The little group transformation W (Λ, p) for a general Λ and p is not easily worked out.
Here we only consider two special cases.
7.2.1        Pure Rotation: Λ = R̂
The first is for a pure rotation Λ = R̂. It only changes the direction p̂ to R̂p̂, but does not
change its energy. Following our notation of R(n̂) as the standard rotation that takes ẑ to
n̂, we have
                                                 h                     i
                           W (R̂, p) = Λ−1
                                         z   (β)   R −1
                                                        (R̂p̂)R̂R(p̂)    Λz (β).            (7.35)
The rotation matrices in the square bracket first take ẑ to p̂, then to R̂p̂, and then back to
ẑ, so it is at most a rotation around ẑ,
                                  R−1 (R̂p̂)R̂R(p̂) = Rz (δ(R̂, p̂)).                       (7.36)
Inserting this back to Eq. (7.35) gives
                                      W (R̂, p) = Rz (δ(R̂, p̂)).                           (7.37)
So the little group for a rotation R̂ is merely a rotation around z. The corresponding Lorentz
representation is thus a pure phase under,
                                 U (R̂)|p, j, σ⟩ = e−iσδ(R̂,p̂) |p, j, σ⟩,                  (7.38)
which keeps σ invariant. We consider three special examples of R̂:
   (1) R̂ = R(θ, ϕ)Rz (γ)R−1 (θ, ϕ) is a rotation around p̂ by γ, which gives δ(R̂, p̂) = γ;
                                                  345


  (2) R̂ = Rz (γ) is a rotation around ẑ by γ, which gives δ(R̂, p̂) = 0;
  (3) R̂ = Rz (ϕ)Ry (γ)Rz−1 (ϕ) is a rotation of the ẑ-p̂ plane (usually defined as the inclusive
      scattering plane) by γ, which gives δ(R̂, p̂) = 0.
7.2.2      Boost along ẑ: Λ = Λz (β̂)
The second case is for a pure Lorentz boost along the z direction. This is useful in two
circumstances. First, the spin state of a particle produced from a hard scattering can be
usually calculated easily in the c.m. frame. But at a hadron collider such as the LHC, each
hard scattering event in the lab frame differs from the c.m. frame event by a longitudinal
boost along ẑ. Second, if the particle is produced from a heavy particle decay, its spin state is
easily worked out in the rest frame of the mother particle. But the latter is likely boosted in
the lab frame. The connection between the two frames requires a boost along the momentum
of the mother particle which we can define as ẑ.
    Denote v, θ, and ϕ as the speed, polar angle, and azimuthal angle of p. The boost Λz (β̂)
transforms it to p′ = Λz (β̂)p, with speed v ′ , polar angle θ′ , and azimuthal angle ϕ′ . They are
related by
                                q                           s
                        v sin θ   1 − β̂ 2                          (1 − β̂ 2 )(1 − v 2 )
              tan θ′ =                     ,   ϕ′ = ϕ, v′ =     1−                        .  (7.39)
                           β̂ + v cos θ                              (1 + β̂v cos θ)2
By Eq. (7.21), the little group transformation is
                   W (Λ, p) = Λ−1       ′    −1 ′   −1
                                   z (v )Ry (θ )Rz (ϕ)Λz (β̂)Rz (ϕ)Ry (θ)Λz (v)
                              = Λ−1     ′    −1 ′
                                   z (v )Ry (θ )Λz (β̂)Ry (θ)Λz (v),                         (7.40)
                                                   346


where the ϕ dependence cancels since Rz commutes with Λz . Note that Eq. (7.40) only
involves boosts along ẑ and rotation around ŷ, which all keep the vector y µ = (0, 0, 1, 0)
unchanged. So the resulting little group must be a rotation Ry (χ) around ŷ. This is verified
by an explicit calculation, which gives
                                                      v + β̂ cos θ
    W (Λ, p) = Ry (χ),   cos χ = q                                                     ,    χ ∈ [0, π]. (7.41)
                                        (1 + β̂v cos θ)2 − (1 − β̂ 2 )(1 − v 2 )
Such a nontrivial little group transformation causes the boost Λz (β̂) to mix the helicity
states,
                                                 j
                                              X
                    U (Λz (β̂)) |p, j, σ⟩ =          |Λz (β̂)p, j, σ ′ ⟩ djσ′ σ (χ(β̂, p)),             (7.42)
                                             σ ′ =−j
where dj is the Wigner-d function, being the representation matrix of U (Ry (χ)).
7.3      Massless case: m = 0
The little group that keeps invariant the standard reference momentum vector k = (1, 0, 0, 1)
(suppressing the irrelevant E0 factor in this section) is isomorphic to ISO(2), the two-
dimensional translation and rotation group. We will follow [Weinberg, 2005] for the deriva-
tion.
    First introduce an auxiliary vector tµ = (1, 0, 0, 0). The little group transformation W
has the properties
      W µν kν = kµ,  (W t)µ kµ = (W t) · (W k) = t · k = 1,              (W t)µ (W t)µ = t2 = 1.        (7.43)
                                                   347


The second property of those implies
                                      W µ ν tν = (1 + ζ, α, β, ζ)                       (7.44)
and the third one constrains
                                                 α2 + β 2
                                             ζ=           .                             (7.45)
                                                    2
This determines the first column of the W matrix, W µ 0 . The first condition in Eq. (7.43)
further constrains the last column, W µ 3 . The remaining two columns can be determined by
Lorentz group properties up to some degrees of freedom. One solution for W is
                                                                    
                                                1 + ζ      α β   −ζ 
                                                                    
                                                                    
                                                 α         1 0 −α 
                                                                    
                           S = (S µ ν (α, β)) =                     .                 (7.46)
                                                                    
                                                 β         0 1 −β 
                                                                    
                                                                    
                                                     ζ      α β 1−ζ
To find the most general form of W , we notice that by W t = St, the transformation S −1 W
leaves t invariant, so S −1 W ∈ SO(3). On the other hand, S −1 W also leaves k invariant, so it
can only be a rotation Rz (θ) around ẑ. Hence, we have the general expression for the little
group element,
                                  W = W (α, β, θ) = S(α, β)Rz (θ),                      (7.47)
which has three parameters α, β, and θ.
    The little group multiplication properties can be worked out straightforward, and we get:
   1. the subgroup formed by S is Abelian and has a simple addition rule for the parameters
      (α, β): S(α, β)S(α′ , β ′ ) = S(α + α′ , β + β ′ );
                                                 348


   2. the subgroup formed by Rz has the same property: Rz (θ)Rz (θ′ ) = Rz (θ + θ′ ); and
   3. the parameters (α, β) have a simple rotation property under the action of Rz (θ):
      Rz (θ)S(α, β)Rz−1 (θ) = S(α cos θ − β sin θ, α sin θ + β cos θ).
The first and third properties together mean that the elements S form an invariant Abelian
subgroup, so that the little group is not semi-simple. If we denote the v = (α, β), the
multiplication rules will become more transparent,
                  S(v)S(v ′ ) = S(v + v ′ ), Rz (θ)S(v)Rz−1 (θ) = S(Rz (θ)v),          (7.48)
where in the expression Rz (θ)v, Rz (θ) is the rotation matrix adapted to the x-y plane in
an obvious way. This clearly shows its isomorphism to ISO(2) on the x-y plane, which
transforms a point (x, y) to Rz (θ)(x, y) + v. with v corresponding to the two-dimensional
translation vector, and θ the rotation angle around ẑ.
    In the neighborhood of the identity element, the little group element W (α, β, θ) can be
expanded around α = β = θ = 0, which gives
                                                      
                                    0     α  β     0 
                                                      
                                                      
                                    α     0 −θ −α
                                                      
                          W ≃1+                       
                                                      
                                    β     θ 0 −β 
                                                      
                                                      
                                      0    α β      0
                              = 1 − i(K1 α + K2 β + J1 β − J2 α + J3 θ)
                              ≡ 1 − i(A α + B β + J3 θ),                               (7.49)
                                              349


such that the little group is spanned by three generators,
                                A = K1 − J2 ,      B = K2 + J 1 ,    J3 .             (7.50)
Here Ki and Ji are the representation matrices of the Lorentz group in the vector space.
The corresponding Lie algebra is
                          [J3 , A] = iB,    [J3 , B] = −iA,     [A, B] = 0.           (7.51)
A finite little group element W (α, β, θ) can then be generated from the exponential
                                   W (α, β, θ) = e−i(Aα+Bβ) e−iJ3 θ .                 (7.52)
So far we have been working on the 4-dimensional Lorentz group representation in the
Minkowski space. This also induces a unitary representation on the Hilbert space,
                                                                   ˆ
                                 U (W (α, β, θ)) = e−i(Âα+B̂β) e−iJ3 θ ,             (7.53)
where Â, B̂, and Jˆ3 are Hermitian operators and have the same properties as Eqs. (7.50)
and (7.51).
    The little group ISO(2) contains all Lorentz group elements that leave k invariant. The
transformation property of the σ index in the state |k, σ⟩ under the little group gives a
physical definition of σ. Because only two of the Hermitian generators, A and B, commute,
                                                  350


we may orient the reference state to be a simultaneous eigenstate, |k, a, b⟩, of P̂ µ , A, and B,
                            A|k, a, b⟩ = a|k, a, b⟩,      B|k, a, b⟩ = b|k, a, b⟩,               (7.54)
with (a, b) the quantum numbers charactering the state, together with the momentum k.
Now we define
                                        (Aθ , Bθ ) = e−iJ3 θ (A, B)eiJ3 θ .                      (7.55)
Applying a derivative with respect to θ using Eq. (7.51) gives
                                                               
                         d                            0 −1
                            (Aθ , Bθ ) = (Aθ , Bθ )             = −i(Aθ , Bθ )σ2 .             (7.56)
                         dθ
                                                        1 0
The solution is obtained by an exponentiation,
   (Aθ , Bθ ) = (A, B) e−iσ2 θ = (A, B)Rz (θ) = (A cos θ + B sin θ, −A sin θ + B cos θ).         (7.57)
Then from Eq. (7.54), we have
              Ae−iJ3 θ |k, a, b⟩ = e−iJ3 θ A−θ |k, a, b⟩ = (a cos θ − b sin θ)e−iJ3 θ |k, a, b⟩,
              Be−iJ3 θ |k, a, b⟩ = e−iJ3 θ B−θ |k, a, b⟩ = (a sin θ + b cos θ)e−iJ3 θ |k, a, b⟩, (7.58)
and so a rotation Rz (θ) mixes the two quantum numbers a and b,
                        e−iJ3 θ |k, a, b⟩ = |k, a cos θ − b sin θ, a sin θ + b cos θ⟩.           (7.59)
                                                      351


Such a continuous spectrum is not observed in nature, and hence we must have a = b = 0.
    While J3 does not commute with A or B and thus they generally cannot have simultane-
ous eigenstates, now A and B have zero eigenvalues, so the state can also be a simultaneous
eigenstate of J3 ,
                                  |k, σ⟩ ≡ |k, (a, b) = (0, 0), σ⟩,                     (7.60)
with
                        A|k, σ⟩ = 0,    B|k, σ⟩ = 0,     J3 |k, σ⟩ = σ|k, σ⟩,           (7.61)
without violating Eq. (7.51). The quantum numbers associated with A and B thus become
redundant and σ has the physical meaning of helicity. The little group transformation is
                               U (W (α, β, θ))|k, σ⟩ = e−iσθ |k, σ⟩.                    (7.62)
This then induces the Lorentz group representation,
                                  U (Λ)|k, σ⟩ = e−iσθ(Λ,p) |k, σ⟩,                      (7.63)
with θ(Λ, p) determined by
                           W (Λ, p) = L−1 (Λp)ΛL(p) = S(α, β)Rz (θ),                    (7.64)
where S(α, β) is the little group element defined to have the form in Eq. (7.46).
    Similar to Sec. 7.2, now we give two special cases where the little group can be explicitly
evaluated.
                                                352


7.3.1      Pure Rotation: Λ = R̂
A pure rotation Λ = R̂ on massless states has the same effects as it applies on massive states,
since it only involves the momentum directions. The little group transformation is the same
as Eq. (7.37). Compared with Eq. (7.64), we have
                                     α = β = 0,      θ = δ(R̂, p̂),                         (7.65)
which gives the same Lorentz representation as Eq. (7.38),
                                    U (R̂)|p, σ⟩ = e−iσδ(R̂,p̂) |p, σ⟩.                     (7.66)
7.3.2      Boost along ẑ: Λ = Λz (β̂)
A pure Lorentz boost along ẑ results in a similar expression like Eq. (7.40), just with different
values for θ′ , v, and v ′ . Before evaluating it, we make the observation that none of the
transformations in Eq. (7.40) changes y µ = (0, 0, 1, 0). On the other hand, the little group
element in Eq. (7.64) takes it to
                     y ′µ = (β cos θ − α sin θ, − sin θ, cos θ, β cos θ − α sin θ).         (7.67)
Therefore, we must have
                                              β = θ = 0.                                    (7.68)
                                                  353


This is easily verified by an explicit evaluation of Eq. (7.40), which gives
                                             E0 β̂ sin θp
                                     α=−                      ,                       (7.69)
                                              E 1 + β̂ cos θp
where E0 and E are the energies of k and p, respectively, and θp is the polar angle of p. As
a result, even if the corresponding little group transformation is not identity, the Lorentz
boost along ẑ does leave the helicity invariant,
                                     U (Λz )|p, σ⟩ = |Λz p, σ⟩,                       (7.70)
which is in contrast to the massive case in Eq. (7.42).
                                                354


Chapter 8
Polarization of fermions at
high-energy colliders
8.1       Fermion spin density matrix
At high-energy colliders, fermion spins are usually described in the helicity basis {|p, ±⟩}. A
general fermion spin state is described by the density matrix, defined as
                            1/2
                           ραα′ (p) = ⟨p, α|ρ̂1/2 |p, α′ ⟩,  α, α′ = ±1/2,                   (8.1)
with ρ̂1/2 being the spin density operator. It is a 2 × 2 Hermitian matrix with a unity trace,
so can be decomposed in terms of the Pauli matrices σ = (σ1 , σ2 , σ3 ),
                                                                                     
             1/2       1                       1  1 + λ(p)           b1 (p) − ib2 (p)
            ραα′ (p) =   (1 + s(p) · σ)αα′ =                                             , (8.2)
                       2                       2
                                                     b1 (p) + ib2 (p)     1 − λ(p)
                                                                                       αα′
which defines the spin vector s(p) = (b1 (p), b2 (p), λ(p)) for the fermion. From now on, we will
suppress the momentum dependence of the density matrix and spin vector, unless necessary.
                                                  355


The positivity condition requires
                                                
                                       det ρ1/2 = 1 − s2 ≥ 0,                           (8.3)
which means
                                         s2 = b21 + b22 + λ2 ≤ 1,                       (8.4)
where s2 = 1 refers to a pure state and s2 < 1 to a mixed state.
    Under a general Lorentz transformation Λ, the density matrix becomes
                                                                   
                         ραα′ (p) → ⟨Λp, α| U (Λ) ρ̂ U −1 (Λ) |Λp, α′ ⟩,                (8.5)
where we have temporarily suppressed the superscript “1/2” because it applies to all cases.
Then using Eq. (7.29), we have
                                  X
                    ραα′ (p) →           Dαᾱ (W (Λ, p))ρᾱᾱ′ (p)Dᾱ† ′ α′ (W (Λ, p)), (8.6)
                                  ᾱ,ᾱ′
which transforms in a similar way to the helicitiy amplitude [Eq. (7.30)]. As a result, a
general Lorentz transformation mixes different components of the density matrix.
    The physical meaning of (b1 , b2 , λ) can be examined through their properties under a
rotation R̂(ϕ) around the momentum direction. That gives the little group W (R̂(ϕ), p) =
Rz (ϕ), by Eq. (7.37), and thus
                                  1/2                    1/2          ′
                                 ραα′ (s′ ) = e−i α ϕ ραα′ (s) e+i α ϕ ,                (8.7)
                                                    356


which gives
                   λ′ = λ,     b′1 = b1 cos ϕ − b2 sin ϕ,   b′2 = b1 sin ϕ + b2 cos ϕ.          (8.8)
              1/2      1/2
Hence λ = ρ++ − ρ−− is the “net” helicity of the fermion, which is unchanged under the
rotation R̂(ϕ), and bT ≡ (b1 , b2 ) is the transverse spin of the fermion, which rotates as a
two-dimensional vector.
    Let us choose the particle momentum direction as the z direction, and the two perpen-
dicular directions as x and y directions. Since each Pauli matrix σ i can be decomposed into
the spin eigenstates along the i-th direction,
                                σ i = |i⟩⟨i| − | − i⟩⟨−i|,   (i = x, y, z)                      (8.9)
where the bra and ket notations are abused to refer to two-component spinors, then Eq. (8.2)
implies
                         1
                ρ1/2 =     [1 + b1 (|x⟩⟨x| − | − x⟩⟨−x|)
                         2
                              + b2 (|y⟩⟨y| − | − y⟩⟨−y|) + λ (|z⟩⟨z| − | − z⟩⟨−z|)] ,         (8.10)
where the z direction is along the particle momentum. This gives a clear physical meaning
for each component of the spin vector,
                                     1/2                   1/2                  1/2
                     b1 = ρ1/2x − ρ−x ,      b2 = ρy1/2 − ρ−y ,   λ = ρ1/2
                                                                         z − ρ−z ,            (8.11)
        1/2
where ρi    ≡ ⟨i|ρ1/2 |i⟩. That is, (1 ± si )/2 is the probability for the particle spin to be along
the i or −i direction.
                                                    357


8.2       Singly polarized fermion production: general dis-
          cussion
In this section, we focus on the production of a singly polarized fermion, that is, we only
observe the polarization of a certain fermion in the final state, and inclusively sum over all
the other particles’ spins. Intuitively, such an unpolarized scattering should not produce
a singly polarized particle. However, there is an interesting correlation between spins and
momenta, which can yield singly polarized particle.
    We consider a 2 → 2 scattering
                           a(p1 , α1 ) + b(p2 , α2 ) → c(p3 , α3 ) + f (p, α),              (8.12)
in which f is the fermion whose spin α we observe. In the c.m. frame, we choose a to be along
the ẑlab direction, which together with the fermion momentum direction p̂(θf , ϕf ) defines a
scattering plane, whose normal is ẑlab × p̂. The spin density matrix of f can be obtained
form the helicity amplitude Mα1 α2 α3 α (p1 , p2 , p3 , p) by
                                        P
                                           α1 ,α2 ,α3  Mα1 α2 α3 α M∗α1 α2 α3 α′
                            ραα′ (p) =     P                                 2   ,          (8.13)
                                               α1 ,α2 ,α3 ,α4 |Mα1 α2 α3 α4 |
which in turn defines the spin vector s = (bT , λ) through Eq. (8.2). A nonzero λ implies the
asymmetry between productions of a right-handed f and a left-handed f . The transverse
spin bT is provided by the off-diagonal elements of ρ, which is given by the interference of two
amplitudes, Mα1 α2 α3 + and Mα1 α2 α3 − , which differ by only flipping the helicity of f . Before
going further, let us first clarify with respect to which axes the transverse spin is defined.
                                                    358


    Note that the state vector |p, α⟩ used in the calculation of the helicity amplitude is
constructed from a reference state |k, α⟩ in the standard way as specified in Eq. (7.16).
Similarly, the transverse spin eigenstates |p, ⊥, φ⟩, defined as linear superpositions of the
helicity eigenstates,
                                                  1                                
                               |p, ⊥, φ⟩ = √ e−iφ/2 |p, +⟩ + eiφ/2 |p, −⟩ ,                             (8.14)
                                                   2
are obtained from |k, ⊥, φ⟩ by the same set of transformations in Eq. (7.16). Since the same
definitions of bT in Eq. (8.11) hold for the density matrix in Eq. (8.13) using the transverse
spin basis |p, ⊥, φ⟩, the reference directions x̂ and ŷ with respect to which bT is defined are
obtained from the lab frame x̂lab and ŷlab by first rotating around ŷlab by angle θf and then
around ẑlab by angle ϕf . And the ẑ direction referred to by λ in Eq. (8.13) is the direction
p̂. Therefore, we have
                                                       ẑlab × ẑ
                                    ẑ = p̂,     ŷ =              , x̂ = ŷ × ẑ,                      (8.15)
                                                      |ẑlab × ẑ|
such that x̂ and ŷ are perpendicular to the particle momentum, with x̂ lying on the scattering
plane, and ŷ perpendicular.
8.2.1       Constraints from parity conservation
Assuming parity conservation, the helicity amplitude has the property
               Mα1 ,α2 ,α3 ,α (p1 , p2 , p3 , p) = (phase) × M−α1 ,−α2 ,−α3 ,−α (p̄1 , p̄2 , p̄3 , p̄), (8.16)
where the overall phase is independent of α’s and p̄µi = pi,µ . The parity inversion not only
flips all the helicities, but also flips all the momenta. To relate back to the original scattering,
we need to perform a further rotation on the scattering plane by π. This rotation will restore
                                                           359


all the momenta but retain the flipped helicities. So overall we are examining the symmetry
transformation
                                    UP = U (R3 (ϕf ))U (R2 (π))U −1 (R3 (ϕf ))P,                          (8.17)
where P is the parity operator. The rotation operation in Eq. (8.17) is similar to the third
rotation case below Eq. (7.38), which gives an identity little group transformation. However,
one key difference is that the rotation R2 (π) will change the polar angle θi of a particle to
θi + π, which will cross the boundary of the θ domain: θ ∈ [0, π]. The discontinuity of the
SO(3) topology will thus play a nontrivial role. It introduces an extra phase that depends
on the helicity,
    Mα1 ,α2 ,α3 ,α (p1 , p2 , p3 , p) = (phase) · eiδ123 (−1)α−1/2 M−α1 ,−α2 ,−α3 ,−α (p1 , p2 , p3 , p), (8.18)
where δ123 is the phase associated with the particles a, b, c, which may depend on α1,2,3 ,
but will eventually cancel when we multiply M by its complex conjugate in Eq. (8.13). The
phase (−1)α−1/2 for the particle f gives an extra minus sign when α = −1/2. Such phase will
also cancel in the diagonal elements of ρ, but not in the off-diagonal elements, and therefore
will set a special constraint on the transverse spin.
    Using Eq. (8.18), we can get the parity relation for the density matrix,
                                  P                 α+α′ −1
                                     α1 ,α2 ,α3 (−1)        M−α1 ,−α2 ,−α3 ,−α    M∗−α1 ,−α2 ,−α3 ,−α′
                  ραα′ (p) =                         P
                                                       α1 ,α2 ,α3 ,α4 |Mα1 α2 α3 α4 |2
                                              ′
                              = (−1)α+α −1 ρ−α,−α′ (p),                                                   (8.19)
which means
                                              ρ++ = ρ−− ,       ρ+− = −ρ−+ ,                              (8.20)
                                                            360


or equivalently,
                                                λ = b1 = 0.                                          (8.21)
Therefore, b2 is the only allowed spin degree of freedom if parity is conserved. In the case
with parity violation, all three components are not forbidden.
8.2.2     Constraints from the amplitude structure
In the general case, the scattering amplitude has both real and imaginary parts,
                          Mα1 α2 α3 α = Re Mα1 α2 α3 α + i Im Mα1 α2 α3 α .                          (8.22)
Introducing the shorthand notation
                            X                                             X
              Aα ∗ Bα′ ≡            Aα1 α2 α3 α Bα1 α2 α3 α′ ,   |A|2 =            |Aα1 α2 α3 α |2 , (8.23)
                           α1 α2 α3                                     α1 α2 α3 α
we have
                              (Re Mα + i Im Mα ) ∗ (Re Mα′ − i Im Mα′ )
                     ραα′ =                                                             ,            (8.24)
                                              | Re M|2 + | Im M|2
which gives the spin vector
                                 M+ ∗ M+∗ − M− ∗ M−∗
                          λ=                                   ,
                                 M+ ∗ M+∗ + M− ∗ M−∗
                                  Re M+ ∗ Re M− + Im M+ ∗ Im M−
                         b1 = 2                                                     ,
                                             | Re M|2 + | Im M|2
                                  Re M+ ∗ Im M− − Im M+ ∗ Re M−
                         b2 = 2                                                     .                (8.25)
                                             | Re M|2 + | Im M|2
                                                    361


Therefore, b2 can only exist if the amplitude has an imaginary part. In a parity-conserving
perturbation theory, such a phase can only occur through loops, which necessarily suppresses
b2 by the coupling constant. In contrast, there is no such constraint for λ and b1 , which can
be produced at tree level as long as there is parity violation.
8.2.3      Constraints from chiral symmetry
It is crucial that transverse spin arises from the interference between two amplitudes which
only differ in the observed fermion helicity. In terms of the cut diagram notation, a fermion
line always forms a closed loop. A nonzero interference thus requires that the fermion helicity
be flipped at some point of the fermion loop. This can only happen if (1) the fermion is
massive, or (2) there is a Yukawa or tensor interaction vertex. In the SM, there is no tensor
interaction, and the only Yukawa interaction is the source for the fermion mass, so the
necessary condition to generate a single transverse spin is to have the fermion being massive.
This in turn means that the magnitude of the transverse spin will be proportional to the
                                                                     √
fermion mass mf , which is compensated by the scattering energy        s. That means,
                                                 mf
                                            bT ∝ √ ,                                     (8.26)
                                                   s
which will be highly suppressed at high energies.
    On the other hand, in a new physics or effective field theory scenario with tensor interac-
tions, one may have a single transverse spin that does not suffer from such suppression. One
recent work [Wen et al., 2023] employed this effect to explore the impacts of single transverse
spin asymmetry at transversely polarized lepton colliders on dimension-six electron dipole
operators. Similar study can be done for final-state single fermion spin by measuring the
                                               362


heavy fermion spins through their decays.
8.2.4     Short summary
Summarizing this section, we note that the transverse spin b2 is the only spin degree of
freedom allowed by parity, but it must require an imaginary part of the amplitude, which
only occurs beyond tree level. In contrast, the other two spin degrees of freedom, λ and b1 ,
can occur at tree level as long as parity is violated. Furthermore, a single transverse spin
can only happen to massive fermions, not to massive ones, and the magnitude is suppressed
by the fermion mass.
    We note that the derivation for the parity relation in Eq. (8.19) only applies to 2 → 2
scattering and inclusive one-particle production. It cannot be trivially extended to more
complicated final states. But it is generally true that a single transverse spin is allowed by
symmetries.
8.3      Example: s-channel single top production
In this section, we illustrate the single spin production with the example of s-channel single
top quark production at the LHC. This is well suited for illustrating all the points discussed
in Sec. 8.2 because (1) the top quark is a massive fermion whose mass mt is not negligible
at the LHC energy, which makes the production of a transverse spin possible; (2) the top
quark is produced via an s-channel W boson, whose interaction violates parity, such that
a nonzero λ and b1 can be produced at tree level; (3) beyond tree level, the one-loop QCD
correction can trigger a threshold effect to generate an imaginary part in the amplitude, so
that a b2 can be produced.
                                               363


   At LO, we consider the partonic process u(p1 , α1 ) + d(p                   ¯ 2 , α2 ) → t(q1 , σ1 ) + b̄(q2 , σ2 ),
which happens through an s-channel W + boson, in its c.m. frame, with u along ẑ, and t
along n̂(θ, ϕ). The kinematics can be easily worked out by momentum conservation,
                           √                                  √                                
                               s                                 s s ± m2t            s − m2t
                   p1,2  =       (1, 0, 0, ±1),       q1,2 =                    ,±            n̂ ,            (8.27)
                            2                                   2          s             s
with s = (p1 + p2 )2 being the partonic c.m. energy squared. The scattering amplitude iM is
                                                                                                  
                                    −ig µ                        −ig µν                   −ig ν
    iMα1 α2 σ1 σ2 = ū(q1 , σ1 )     √ γ PL v(q2 , σ2 )                    v̄(p2 , α2 ) √ γ PL u(p1 , α1 )
                                        2                       s − m2w                     2
                          ig 2
                  =              2
                                     [ū(q1 , σ1 )γ µ PL v(q2 , σ2 )] [v̄(p2 , α2 )γµ PL u(p1 , α1 )] ,       (8.28)
                     2(s − mw )
where g is the SU(2) gauge coupling. The helicity structure is greatly simplified by the left-
handed vector current interaction and that u, d,              ¯ and b̄ are taken massless. This constrains
α1 = −1/2 and α2 = σ2 = +1/2. So we are only reduced to two helicity amplitudes, with
σ1 = ±1/2, wherein only the −1/2 helicity can exist if we take mt → 0, and thus the
amplitude with σ1 = +1/2 is proportional to mt . By explicit calculation, we obtain
                                                                                                r
                                                          mt                        g 2 s e−iϕ          m2t
   M−+−+ = N (1 + cos θ),             M−+++       = N √ sin θ,            N =−                     1−       . (8.29)
                                                            s                       2 s − m2w           s
A few remarks are in order.
   • The initial (ud)  ¯ state has a nonzero spin, α1 − α2 = −1, along ẑ, which gives a phase
      factor ei(α1 −α2 )ϕ = e−iϕ . This phase applies to both amplitudes, M−+−+ and M−+++ ,
      and will cancel when we take a product between an amplitude and a complex conjugate
      amplitude.
                                                          364


                                                         p
     • The factor N contains a threshold factor             1 − m2t /s to suppress the amplitude as
       s ≳ m2t . As s ≫ m2t , N approaches a constant −(g 2 /2)e−iϕ .
     • M−+−+ is the only amplitude that survives as s ≫ m2t , and it favors production of
       the top quark in the forward region, controlled by the angular function d1−1,−1 (θ) ∝
       (1 + cos θ), as a result of the left-handed coupling. In contrast, the amplitude M−+++
       flips the top quark helicity by a mass insertion, and so is only significant when s is not
       much greater than m2t . The angular distribution is controlled by d11,0 (θ) ∝ sin θ, which
       is symmetric between forward and backward regions.
     The density matrix of the top quark can be easily calculated by Eq. (8.13),
                                             M−,+,α,+ M∗−,+,α′ ,+
                                  ρtαα′ =                             ,                       (8.30)
                                          |M−+−+ |2 + |M−+++ |2
which gives
                                                                                         
                                       −1         m2t     2
                                                        sin θ         mt
                                                                      √   sin θ(1 + cos θ)
              m2t                                   s                  s
      ρt =        sin2 θ + (1 + cos θ)2                                                  ,  (8.31)
               s                              mt                                     2
                                              √
                                                s
                                                  sin θ(1 + cos θ)        (1 + cos θ)
where the factor in front plays the role of normalizing ρt . Clearly, due to the fact that there
are only two non-zero helicity amplitudes, the density matrix has the structure
                                                           
                                                    2
                                                a ab
                                           ρt ∼            ,                                (8.32)
                                                          2
                                                   ab b
up to a normalization. This immediately leads to det ρt = 0 such that the polarization vector
|st | = 1, recalling Eq. (8.3). As a result, the top quark must be at a pure spin state at LO; in
                                                  365


other words, it is 100% polarized. From Eq. (8.31), one can obtain the polarization vector,
            s − m2t + (s + m2t ) cos θ          mt             2s · sin θ
     λ=−                               ,  b1 = √                                 ,   b2 = 0, (8.33)
            s + m2t + (s − m2t ) cos θ            s s + mt + (s − m2t ) cos θ
                                                             2
from which we can easily verify λ2 + b21 = 1.
   We note that the polarization vector expression in Eq. (8.33) holds only in the partonic
c.m. frame. When the whole system is boosted along ẑ by Λz (β), the density matrix trans-
forms according to Eq. (8.6). The little group corresponding to such boost has been obtained
in Eq. (7.41), which is a rotation around ŷ by χ, with
                                         v + β cos θ                          s − m2t
                 cos χ = p                                            ,    v=         .      (8.34)
                            (1 + βv cos θ)2 − (1 − β 2 )(1 − v 2 )            s + m2t
Therefore, Eq. (8.6) becomes
                                                                       †
                             ρt (st ) → d1/2 (χ) · ρt (st ) · d1/2 (χ) ,                     (8.35)
which keeps b2 invariant but mixes b1 with λ,
                       λ → λ cos χ − b1 sin χ,      b1 → b1 cos χ + λ sin χ.                 (8.36)
Because of this mixing, it is necessary to analyze the top polarization in the partonic c.m.
frame event by event.
   The mixing [Eq. (8.36)] does not alter the fact λ2 + b21 = 1, and also provides a physical
                                                366


understanding for the full polarization. If we take an infinite boost with β = 1, which gives
                                                           √
                                    v + cos θ                 1 − v 2 sin θ
                           cos χ =             ,   sin χ =                  ,             (8.37)
                                   1 + v cos θ               1 + v cos θ
then Eq. (8.36) gives
                                        λ → −1,      b1 → 0,                              (8.38)
so that the top quark becomes completely left-handed. This agrees with the physical picture
that the infinite boost takes t and b̄ to be collinear along ẑ. Their spins sum up to −1
along ẑ, equal to the spin of the initial state. Such “infinite momentum frame” explains why
the top quark is 100% polarized, and the nonzero b1 in the “finite momentum frame” is a
result of polarization mixing when going from the “infinite momentum frame” to the “finite
momentum frame”.
8.4       Observing the fermion spin
Spins are usually not directly measured at high-energy colliders, so that information is lost
in merely constructing the production rates. However, if the polarized particle decays, the
kinematic distributions of the decay products are likely to retain the spin information of the
mother particle. This is the case for the heavy fermions in the SM, especially for the top
quark.
    This is best illustrated in the rest frame of top quark, constructed by boosting along −pt
with the same set of coordinate system x̂-ŷ-ẑ as Eq. (8.15), with p̂ = pt /|pt | the direction
of the top quark momentum pt . The helicity amplitude of the decay t(αt ) → W + (αw )b(αb )
                                                 367


can be written generally as [Tung, 1985]
                                         1/2∗                                     ⋆   1/2
        Mαt αw αb (θ⋆ , ϕ⋆ ) = Aαw ,αb Dαt ,αw −αb (ϕ⋆ , θ⋆ , 0) = Aαw ,αb eiαt ϕ dαt ,αw −αb (θ⋆ ). (8.39)
where α’s denote the helicities, with αt with respect to ẑ, and d1/2 is the Wigner d-function.
The angles θ⋆ and ϕ⋆ characterize the W boson direction in the top rest frame. The coefficient
Aαw ,αb does not depend on top helicity or the angles. The angular distribution of the W is
given by
                            dΓt
                              ⋆   ⋆
                                    ∝ ρtαt α′t (st )Mαt αw αb (θ⋆ , ϕ⋆ )M∗α′t αw αb (θ⋆ , ϕ⋆ ),      (8.40)
                       d cos θ dϕ
where the summation over repeated indices is implied, and st = (b1 , b2 , λ) is the top spin
vector. Substituting Eq. (8.39) for the amplitudes in Eq. (8.40) gives
                                  1       dΓt            1
                                              ⋆    ⋆
                                                     =       [1 + κw st · Ω⋆ ] ,                     (8.41)
                                 Γt d cos θ dϕ          4π
where st · Ω = b1 sin θ⋆ cos ϕ⋆ + b2 sin θ⋆ sin ϕ⋆ + λ cos θ⋆ and
                                |A1,1/2 |2 − |A0,1/2 |2 + |A0,−1/2 |2 − |A−1,−1/2 |2
                          κw =                                                                       (8.42)
                                |A1,1/2 |2 + |A0,1/2 |2 + |A−1,−1/2 |2 + |A0,−1/2 |2
is the spin analyzing power for the W . As a result, the nonzero polarization st of the mother
particle leads to an asymmetric decay distribution with respect to the polarization direction.
In the chosen x̂-ŷ-ẑ frame, the longitudinal polarization λ leads to a forward-backward
asymmetry while the transverse spin bT introduces an azimuthal asymmetry.
    It is worth noting that using such single-particle distribution as a spin observable belongs
                                                       368


to the single-spin phenomenon. It relies on parity violation in the tbW interaction. As can
be obviously noticed from Eq. (8.42), if the top decay process preserves parity, one would
have κw = 0, which offers no ability to probe the top polarization. In the SM with a purely
left-handed tbW current interaction and neglecting the b mass, the LO coefficients Aλw ,λb are
                                 q                                    q
                                                                                          mt
                 A−1,−1/2 = −g      m2t  −  m2w ,   A0,−1/2    = −g m2t − m2w √                ,         (8.43)
                                                                                          2mw
with all the others being 0. This gives the spin-analyzing power for W as
                                              m2t − 2m2w
                                       κw =                   ≃ 0.4.                                     (8.44)
                                              m2t + 2m2w
It is positive, so the W prefers to be along the spin direction of t, as a result of being
dominantly produced with a longitudinal polarization.
    In the real situation, the W boson from top decay rapidly decays to another fermion-
anti-fermion pair f f¯′ , so that the top quark decays into three particles, t → bf f¯′ . The
single-particle distribution in Eq. (8.45) simply generalizes to the three-body decay,
                                1       dΓt           1
                                           ⋆   ⋆
                                                  =       [1 + κi st · Ω⋆i ] ,                           (8.45)
                                Γt d cos θi dϕi     4π
where i can be b, f , or f¯′ , Ω⋆i = (sin θi⋆ cos ϕ⋆i , sin θi⋆ sin ϕ⋆i , cos θi⋆ ) is its direction in the top
rest frame, and κi is the corresponding spin-analyzing power. This general form [Eq. (8.45)]
holds because of rotational invariance and the fact that spin vectors st appears at most in a
linear form.
    Eq. (8.45) can be marginalized to give a cos θ⋆ distribution that exclusively probes the
                                                   369


helicity λ,
                                  1 dΓt            1
                                            ⋆
                                              = (1 + κi λ cos θi⋆ ),                        (8.46)
                                 Γt d cos θi       2
or a ϕ⋆ distribution that only probes the transverse spin bT ,
                     1 dΓt     1 h     π                                                i
                                                        ⋆       ⋆             ⋆      ⋆
                            =      1 +   κ i (b 1 sin θi  cos ϕ i + b 2 sin θi  sin ϕi )  . (8.47)
                    Γt dϕ⋆i   2π       4
While the θi⋆ and ϕ⋆i distributions play similar roles in the rest frame, they do not when the
top quark is boosted. In the boosted frame, the θi distribution becomes highly distorted such
that all the decay products prefer to be collinear with t regardless of the value of λ. Thus
Eq. (8.46) loses its power as a spin polarimeter in the boosted frame. The azimuthal angle ϕi ,
on the other hand, remains unchanged by the boost, so Eq. (8.47) still gives a good method
for measuring the transverse spin. Since at the LHC, the top quark can be produced with
a large boost, the transverse spin stands out against the longitudinal polarization, which is
one of the reasons why we study the transverse spin in this thesis. While one can convert
the polar angle distribution [Eq. (8.46)] into the energy fraction distribution of the daughter
particles and produces a new polarimeter for λ, that is not the focus of our discussion in this
thesis. We will instead give another method for measuring λ for a highly boosted top quark,
based on the azimuthal correlation among the three daughter particles of the top decay, in
Sec. 9.4.
8.5       Compared to fermion spin correlation
We have seen that the single transverse spin of a fermion is produced via a single helicity flip
in the amplitude. This comes with a mass penalty in the SM. So as we go to higher energies,
                                                   370


a single transverse spin is usually suppressed. However, if we measure the transverse spins
of two fermions at the same time, the effects of double helicity flips in the product of the
amplitude and conjugate amplitude cancel each other and do not necessary require mass
insertions. We call this transverse spin correlation. It is not necessarily suppressed as one
goes to high energies. In this section, we lay down the general formalism for describing the
spin correlations between two fermions.
    Suppose a pair of fermions, f1 and f2 , are produced in a certain process,
                     a(p1 ) + b(p2 ) → f1 (k1 , α1 ) + f2 (k2 , α2 ) + X,                 (α1 , α2 = ±1/2)                   (8.48)
where we suppress the helicities of all other particles, which will be traced over. The helicity
amplitude is denoted as
                                                 Mα1 α2 (p1 , p2 ; k1 , k2 ).                                                (8.49)
Similar to Eq. (8.13), but now with both helicity indices open, we can get the multi-
dimensional spin density matrix for the two fermions,
                                                                Mα1 α2 M∗α′ α′
                                           ρα1 ,α2 ;α′1 ,α′2 =P                 1 2
                                                                                      2
                                                                                        ,                                    (8.50)
                                                                 ᾱ1 ᾱ2 |M ᾱ 1 ᾱ2 |
where the momentum dependence has been suppressed. This matrix can be thought of the
direct product of two single-fermion density matrices, so can be decomposed as [Bernreuther
et al., 2015]
                       "                                                                         3
                                                                                                                           #
                     1                                                                         X
 ρα1 ,α2 ;α′1 ,α′2 =     δα1 α′1 δα2 α′2 + s1 · σα1 α′1 δα2 α′2 + δα1 α′1 s2 · σα2 α′2 +            Cij σαi 1 α′1 σαj 2 α′ , (8.51)
                     4                                                                        i,j=1
                                                                                                                         2
                                                               371


or symbolically,
                                1                                                                
                          ρ=       1 ⊗ 1 + (s1 · σ) ⊗ 1 + 1 ⊗ (s2 · σ) + Cij σ i ⊗ σ j .                     (8.52)
                                4
This defines two single fermion spin vectors, s1 and s2 , and a spin-spin correlation matrix
Cij . They are all real-valued by the Hermiticity of ρ.
     Note that by a chosen convention of the fermion helicity spinors, the Pauli matrix decom-
position in Eq. (8.51) automatically determines the x-y-z systems for the spin parameters
s1 = (sx1 , sy1 , sz1 ), s2 = (sx2 , sy2 , sz2 ), and Cij = (Cxx , Cxy , · · · , Czz ). The x-y-z system for either
fermion is determined according to Eq. (8.15). In the c.m. frame of the exact 2 → 2 kine-
matics, the two systems have the same x direction and opposite y and z directions, with x
on the scattering plane and y perpendicular.
     Evidently, the single spin information in s1 and s2 is independent from that in the spin
correlation Cij . Tracing over the helicity of f1 or f2 in Eq. (8.50) or (8.51) reduces to the
single fermion spin density matrix in Eq. (8.13) or (8.2).
     Similar to the parity constraint for a single fermion density matrix in Eq. (8.19), in an
exact 2 → 2 kinematics, parity symmetry (if it holds) constrains Eq. (8.51) by
                                                             ′       ′
                                ρα1 ,α2 ;α′1 ,α′2 = (−1)α1 +α1 +α2 +α2 −2 ρ−α1 ,−α2 ;−α′1 ,−α′2 .            (8.53)
In terms of the Pauli matrix decompositions in Eq. (8.51), the right-hand side of Eq. (8.53)
differs from the left-hand side by adding a minus sign to all occurrences of σ 1 and σ 3 , so this
constrains the spin parameters by
                            sx1 = sz1 = sx2 = sz2 = 0,       Cxy = Cyx = Cyz = Czy = 0.                      (8.54)
                                                             372


Different from the single fermion spins, for which parity only allows the component (sy )
perpendicular to the scattering plane, the correlation of two helicity polarizations (Czz ) or
helicity and transverse spin (Cxz and Czx ) is allowed by parity. Also, the transverse spin
correlations Cxx and Cyy along each transverse direction are allowed.
    The spin parameters can be obtained from Eq. (8.51) by tracing with corresponding Pauli
matrices,
      si1 = σαi ′1 α1 ρα1 ,ᾱ2 ;α′1 ,ᾱ2 ,   si2 = σαi ′2 α2 ρᾱ1 ,α2 ;ᾱ1 ,α′2 ,  Cij = σαi ′1 α1 σαj ′ α2 ρα1 ,α2 ;α′1 ,α′2 , (8.55)
                                                                                                       2
where repeated indices are summed over and i, j run from 1 to 3. This gives, e.g.,
                                                                                                         
                       sy1 = i (ρ+,ᾱ2 ;−,ᾱ2 − ρ−,ᾱ2 ;+,ᾱ2 ) = −2 Im M+ᾱ2 M∗−ᾱ2 |M|2 ,
                                                                                               
                    Czz = |M++ |2 + |M−− |2 − |M+− |2 − |M−+ |2                                     |M|2 ,
                                                                                 
                    Cxx = 2 Re M++ M∗−− + M+− M∗−+ |M|2 ,
                                                                                  
                    Cxy = −2 Im M++ M∗−− − M+− M∗−+ |M|2 .                                                                      (8.56)
Thus, a single transverse spin corresponds to the interference of different helicity states for
a single fermion, whereas a transverse spin correlation is due to the interference of different
helicity states for a fermion pair. Of course, in another language, this is the entanglement
of two fermions in the helicity space. While a single transverse spin requires a single fermion
helicity flip so is suppressed at high energy limit by the fermion mass (at least for the current
interactions in the SM), the transverse spin correlations flip the fermion helicity twice so are
not necessarily suppressed.
                                                                    373


Chapter 9
Linear polarization of vector bosons
at high-energy colliders
The transverse spin of fermions is an interesting phenomenon that is readily overlooked by
only examining the total production rate. It encodes the quantum interference information
at high-energy scattering experiments, and reveals itself as an azimuthal distribution, cos ϕ
or sin ϕ, which can be easily measured at high-energy colliders. The natural question, then,
is whether a similar phenomenon holds for vector bosons.
    Unlike fermions, the spin of a vector boson cannot be pictured as an arrow pointing to
a certain direction in the space, but we have the same understanding that (1) a transverse
spin is an interference of different helicity states, and (2) it sets a special direction in the
transverse plane, which breaks the rotational invariance around the particle momentum. For
a massless vector boson, superposition of the two helicity states |±⟩ can make up the familiar
linear polarization states,
                              −1                         i
                       |x⟩ = √ (|+⟩ − |−⟩) ,      |y⟩ = √ (|+⟩ + |−⟩) ,                     (9.1)
                                2                          2
which transform as a transverse vector under a rotation Rz (ϕ) around the momentum direc-
                                               374


tion,
                      |x⟩ → cos ϕ |x⟩ + sin ϕ |y⟩,    |y⟩ → cos ϕ |y⟩ − sin ϕ |x⟩.        (9.2)
They, therefore, play the counterpart roles of the fermion transverse spins for the vector
bosons. A massive vector boson, on the other hand, has one additional helicity state |0⟩,
which can make up extra “transverse spin” states by superposing with |x⟩ and |y⟩.
    The mass suppression of single fermion transverse spin arises from the chiral symmetry
inherent in a massless fermion. In contrast, for a vector boson, no chiral symmetry protects
the helicity from flipping. Consequently, a vector boson’s linear polarization is more easily
produced. Given that the helicity can now flip by up to two units, we can observe cos 2ϕ and
sin 2ϕ azimuthal patterns in the decay products of the vector boson. This makes the linear
polarization phenomenon more intricate than the fermion transverse spin. In this section,
we will introduce the formalism for describing linear polarization phenomena and present
two physical examples where it can manifest and yield intriguing observational signals.
9.1      Vector boson spin density matrix
Massless vector bosons, such as gluons and photons, have only two helicity states, so their
spin density matrix is also a 2 × 2 Hermitian matrix with a unity trace, like the fermion
case. We can also decompose this into Pauli matrices, thereby defining three polarization
parameters ξ = (ξ1 , ξ2 , ξ3 ) in the helicity basis,
                                                                                      
                         1                      1  1 + ξ3 (p)        ξ1 (p) − iξ2 (p)
               ρ1λλ′ (p) = (1 + ξ(p) · σ) =                                           . (9.3)
                             2                   2
                                                      ξ1 (p) + iξ2 (p)    1 − ξ3 (p)
                                                  375


As for the fermion case, ξ3 = ρ1++ − ρ1−− is the net helicity, while the off-diagonal elements
(ξ1 , ξ2 ), being interference of different helicity states, are the linear polarization. In terms of
the linear polarization state
                                               −1                         
                                         |ϕ⟩ = √ e−iϕ |+⟩ − eiϕ |−⟩                              (9.4)
                                                 2
along the ϕ direction (in the transverse plane), (ξ1 , ξ2 ) can be represented as
                        ξ1 = ρ1+− + ρ1−+ = ⟨π/2|ρ̂1 |π/2⟩ − ⟨0|ρ̂1 |0⟩ = ρ1yy − ρ1xx ,          (9.5a)
                        ξ2 = i(ρ1+− − ρ1−+ ) = ⟨3π/4|ρ̂1 |3π/4⟩ − ⟨π/4|ρ̂1 |π/4⟩.               (9.5b)
Thus they are differences of the linear polarization degrees along two orthogonal directions, as
shown in Fig. 9.1, with respect to the same x̂-ŷ-ẑ coordinate system as defined in Eq. (8.15).
Under a rotation around the gluon momentum direction by ϕ, the density matrix changes as
                                                                      ′
                                  ρ1λλ′ (ξ) → ρ1λλ′ (ξ ′ ) = e−i(λ−λ )ϕ ρ1λλ′ (ξ),               (9.6)
so that ξ transforms as
                   ξ3′ = ξ3 , ξ1′ = cos 2ϕ ξ1 − sin 2ϕ ξ2 ,       ξ2′ = cos 2ϕ ξ2 + sin 2ϕ ξ1 .  (9.7)
This shows the difference of the linear polarization ξ⊥ = (ξ1 , ξ2 ) from the fermion transverse
spin bT : the former transforms like a spin-2 tensor, whereas the latter like a spin-1 vector.
     Massive vector bosons, on the other hand, have one extra longitudinal polarization |0⟩,
                                                         376


                         ξ1        ŷ                           ξ2          ŷ
                               |y⟩                                  |3π/4⟩
                                        ϕ    x̂                                  ϕ   x̂
                                       |x⟩
                                                                     |π/4⟩
                                     cos 2ϕ                                  sin 2ϕ
Fig. 9.1: Interpretations of the polarization ξ1 and ξ2 in the linear polarization basis, and
the associated azimuthal angular distributions.
which extends the density matrix to 3 × 3. We parametrize it as
                                                                                            
                               1    δL     J3    J1 +Qxz −i(J2 +Qyz )           ξ−iQxy
                              3
                                 + + 6     2
                                                          √
                                                         2 2                       2         
                                                                                          
                 ρM         J1 +Qxz +i(J2 +Qyz )                        J1 −Qxz −i(J2 −Qyz ) 
                                                                                             ,      (9.8)
                  λλ′ =             √                  1−δL                       √
                                   2 2                   3                      2 2         
                                                                                            
                                  ξ+iQxy         J1 −Qxz +i(J2 −Qyz )      1      δL    J3
                                      2
                                                          √
                                                         2 2               3
                                                                              + 6 − 2
on the helicity basis, in terms of the eight real polarization parameters (J1 , J2 , J3 , Qxy , Qyz ,
Qxz , δL , ξ). We have suppressed their dependence on the vector boson momentum p. Under
the rotation by ϕ around p̂, the same transformation in Eq. (9.6) holds for ρM , which gives
the transformation behaviors of the polarization parameters,
                                                                                           
      ′
  J1     Q′xz    cos ϕ − sin ϕ J1 Qxz 
                                                                 ′
                                                            ξ  cos 2ϕ − sin 2ϕ  ξ 
              =                                ,            =                              ,
    J2′     ′
          Qyz        sin ϕ cos ϕ            J2 Qyz              ′
                                                              Qxy             sin 2ϕ cos 2ϕ       Qxy
                                                                                                     (9.9)
with J3 and δL unchanged. In this way, all the parameters in off-diagonal elements behave
like transverse spins. As interference between |±⟩ and |0⟩, the parameters (J1 , J2 , Qxz , Qyz )
behave like transverse vectors with spin 1. Similarly, as interference between |+⟩ and |−⟩, (ξ,
Qxy ) are like transverse vectors with spin 2. The density matrix for a massive vector boson
reduces to the massless case by taking (J1 , J2 , Qxz , Qyz ) → 0, and equating (ξ, Qxy , J3 ) with
                                                    377


(ξ1 , ξ2 , ξ3 ).
     The physical meaning of (ξ1 , ξ2 , ξ3 ) carry through to (ξ, Qxy , J3 ) for the massive case, as
the linear polarization states and helicity. The parameter δL characterizes the longitudinal
polarization state. In the rest frame of the massive vector boson, J1 , J2 , and J3 are the
angular momentum (spin) components along the x, y, and z directions, which can be obtained
by tracing ρM with the spin operators,
                                                                                
                    0 1 0                    0 −i 0                  1 0 0 
                 1 
                             
                                           1 
                                                             
                                                             
                                                                         
                                                                         
                                                                                     
                                                                                     
           Jˆ1 = √ 1 0 1 ,       Jˆ2 = √  i 0 −i ,            Jˆ3 = 0 0 0  .             (9.10)
                  2                       2                                   
                                                                                  
                      0 1 0                      0 i       0               0 0 −1
The other five parameters can be made analogous to the electric quadrupole moments. By
transforming Eq. (9.8) into the linear polarization basis, constituted by Eq. (9.1) and |z⟩ =
|0⟩, we have
                                                                               
                                     2   δL
                                  + − ξ −Qxy − iJ3 −Qxz + iJ2 
                                     3    3
                           1                                                  
                        M                                                      
                       ρij = −Qxy + iJ3           2
                                                      + δ L
                                                            + ξ   −Q  yz − iJ 1 .             (9.11)
                               2                  3     3                      
                                                                               
                                   −Qxz − iJ2 −Qyz + iJ1 23 (1 − δL )
This immediately gives the quadrupole moments,
                                                            M
                 Qxy = −(ρM      M
                           xy + ρyx ),  Qyz = −(ρM   yz + ρzy ),  Qxz = −(ρM         M
                                                                               xz + ρzx ),     (9.12)
for the off-diagonal elements, and
                                                                M      M
                              ξ = ρM        M
                                    yy − ρxx ,   δL = ρMxx + ρyy − 2ρzz ,                      (9.13)
                                                  378


for the diagonal elements. This representation also gives a physical picture for the transfor-
mation in Eq. (9.9).
    Under a general Lorentz transformation Λ, the spin-1 density matrices also transform as
Eq. (8.6), with the transformation matrix D determined by the little group W (Λ, p) in the
same way. For massless vector bosons, this matrix D is only a phase, Dλλ′ = e−iλθ(Λ,p) δλλ′
[Eq. (7.63)], which only mixes between ξ1 and ξ2 , but does not change ξ3 . In particular,
for the special Lorentz boost along ẑ, we have θ(Λ, p) = 0 and D is an identity matrix
[Eq. (7.70)], which does not change ξ at all. For massive vector bosons, in contrast, D can
be an arbitrary rotation that mixes among various components of the polarization, especially
it can mix the linear polarization (ξ, Qxy ) with other components. As a result, even if the
linear polarization is 0 in one frame, it is likely to be nonzero in some other frame. We will
make further use of this fact in Sec. 9.4.
9.2       Parity constraint on the vector boson polarization
We observed in Sec. 8.2 that the single transverse spin of a fermion can only be produced if it
is massive. Parity-conserving cases further constrain b2 to be the only possible polarization
degree of freedom, which can only appear through threshold effects at a loop level so is
destined to be small. The linear polarization of a vector boson, on the other hand, does not
suffer from these constraints, because there is no counterpart of chiral symmetry to protect
the helicity of a vector boson from being flipped. So, in general, we should expect a nonzero
linear polarization to be produced for a vector boson.
    Similar to Eq. (8.19), if the vector boson is produced in a 2 → 2 process via a parity-
                                                379


conserving interaction, its polarization density matrix would satisfy
                                                               ′
                                     ρ1−λ,−λ′ (p) = (−1)λ+λ ρ1λλ′ (p),                      (9.14)
which is obtained by performing the same UP transformation in Eq. (8.17) and using the
transformation behavior of a vector boson state,
                                         UP |p, λ⟩ = (−1)λ |p, −λ⟩.                         (9.15)
Eq. (9.14) applies to both massless and massive vector bosons. For a massless vector boson,
it implies ρ1++ = ρ1−− and ρ1+− = ρ1−+ , such that only the linear polarization ξ1 is allowed to
be nonzero, while ξ2 and the helicity ξ3 are forbidden. This reduces Eq. (9.3) to
                                                   
                                  1  1 ξ1 
                            ρ1λλ′ =                ,   (if parity conserves.)             (9.16)
                                       2
                                            ξ1 1
Since ξ1 = 2 Re(ρ1+− ), it does not require an imaginary part from the amplitude, so it can
appear at tree level. The same conclusion also holds for a massive vector boson, for which
Eq. (9.14) means J1 = J3 = Qyz = Qxy = 0, and we are only allowed to have nonzero δL ,
Qxz , J2 , or ξ. Then Eq. (9.8) is reduced to
                                                                
                                2+δL      Qxz√−iJ2
                                 3             2
                                                         ξ       
                         1                                     
                             Qxz +iJ2                           
                ρM
                 λλ ′   =    √            2(1−δL )
                                                         √ −iJ2  ,
                                                      −Qxz           (if parity conserves.) (9.17)
                          2       2           3           2     
                                                                
                                          −Qxz√ +iJ2    2+δL
                                 ξ              2         3
    There are no other general symmetries to constrain the density matrix. For a particular
                                                     380


situation, one only needs to examine whether a single helicity flip is allowed for the vector
boson under study.
    The parity-conserving cases include pure QED and/or QCD production, but not the
processes involving EW or other parity-violating new physics interactions. In the latter
case, all the polarization parameters are in principle not forbidden, and they are mixed under
transformations between different frames. Then the parity-violating polarization parameters
would be sensitively dependent on the parity-violating interactions, so can serve as useful
probes. This will be illustrated in Sec. 9.3 for the gluon polarization.
9.3       Linearly polarized gluon and CP violation
As we see in Eq. (9.16), linear gluon polarization is generally allowed for the ξ1 degree of
freedom, in the parity-conserving case. Since gluons are charge neutral, the parity property
is equivalent to the CP property, so that ξ1 is CP even whereas ξ2 and ξ3 are CP -odd
polarization degrees of freedom. Measuring nonzero ξ2 and/or ξ3 can therefore serve as new
probes of CP -violating interactions. In this section, we illustrate this by considering the
linear gluon polarization in the associative Higgs and gluon jet production, which happens
in the SM from a gluon fusion through a top quark loop.
    Even though the LHC is an unpolarized proton-proton collider (so the gluon partons
are also unpolarized), the hard scattering gg → hg can serve as a “polarizer” to produce
a gluon with substantial polarization ξ. As this gluon further fragments into a jet, its
polarization will modulate the kinematic distribution of the jet constituents; in particular,
the linear polarization breaks the rotational invariance around the jet direction to leave
a nontrivial azimuthal distribution, which can be projected out by weighting each event
                                              381


with some azimuth-sensitive observable. The azimuthally weighted cross section σw of the
inclusive h + g production at a pp collider can be factorized into a hard scattering coefficient
multiplied by a polarized gluon jet function, in much the same way as the factorization for
an unpolarized jet function [Berger et al., 2003; Almeida et al., 2009a,b] or fragmentation
function [Nayak et al., 2005; Collins, 2013],
                                                                                
                       dσw             dσ̂ dJ(ξ(pT , yg ), m2J , ϕ)         mJ
                                 =                                   +O        ,R ,      (9.18)
                dyg dp2T dm2J dϕ    dyg dp2T            dϕ                  pT
in terms of the rapidity yg and transverse momentum pT of the gluon jet in the c.m. frame
of the hg system and the jet mass mJ and azimuthal substructure ϕ, which is to be defined
shortly.
9.3.1      Production of polarized gluons in the hard scattering
In Eq. (9.18), the hard coefficient
                                      dσ̂                 |M|2
                                             = L(s, ŝ)        √                         (9.19)
                                   dyg dp2T             16πEh ŝ
is the differential cross section for the on-shell gluon production, with respect to the gluon
rapidity yg and transverse momentum pT in the partonic c.m. frame. The s and ŝ are the
c.m. energies squared for the pp and hg systems, respectively, and Eh is the Higgs energy in
the partonic c.m. frame; they are sufficiently determined by pT and yg ,
                                                        √
                      Eh = (m2H + p2T cosh2 yg )1/2 ,     ŝ = pT cosh yg + Eh ,         (9.20)
                                                382


where mH is the Higgs mass. In Eq. (9.19),
                                      Z  1                                 
                                    1       dx                      ŝ
                         L(s, ŝ) =            fg/p (x, µF )fg/p       , µF                    (9.21)
                                    s  ŝ/s x                      xs
is the gluon-gluon parton luminosity, with the factorization scale chosen at µF = pT in
the parton distribution function (PDF) fg/p (x, µF ) of the proton. We have used the LO
kinematics to integrate out the Higgs phase space.
    Since the linear gluon polarization degrees are produced from the hard polarizer and
depend sensitively on the interaction structure of the hard scattering, the azimuthal sub-
structure of the gluon jet can help probe the CP property of the Higgs-top interaction, which
we parametrize by an effective operator,
                         yt                          yt
                 L ⊃ − √ h t̄ (κ + i κe γ5 ) t = − √ κt h t̄ (cos α + i sin α γ5 ) t ,         (9.22)
                          2                           2
               √
where yt =       2mt /v is the Yukuwa coupling of Higgs and top quark in the SM, and
    e) parametrize the CP -even and CP -odd htt̄ interactions, respectively, which are usually
(κ, κ
reparametrized as (κ, κe) = κt (cos α, sin α), with α being the CP phase. Pinning down the
CP nature of this interaction is an important program being pursued at the LHC [Sirunyan
et al., 2021, 2020; Aad et al., 2020; Tumasyan et al., 2023; Aad et al., 2020; ATL, 2023]. Any
deviation from a Standard-Model-like htt̄ coupling, i.e., (κ, κ      e) = (1, 0) or (κt , α) = (1, 0),
could indicate new physics as well as provide a potential source for the CP violation as
required by the baryogenesis [Sakharov, 1967]. Unlike CP -violating Higgs interactions with
vector bosons, which arise from dimension-six operators, CP -violating effects in Eq. (9.22)
occur via a dimension-four operator and can be potentially larger.
                                                 383


    Numerous approaches have been proposed for determining the CP phase, either directly
via associated Higgs and top production [Ellis et al., 2014; Boudjema et al., 2015; Buckley
and Goncalves, 2016; Gritsan et al., 2016; Mileo et al., 2016; Amor Dos Santos et al., 2017;
Azevedo et al., 2018; Li et al., 2018; Gonçalves et al., 2018; Faroughy et al., 2020; Bortolato
et al., 2021; Cao et al., 2021; Gonçalves et al., 2022; Patrick et al., 2020], or indirectly
via Higgs or top induced loop effects [Brod et al., 2013; Dolan et al., 2014; Englert et al.,
2013; Bernlochner et al., 2019; Englert et al., 2019; Gritsan et al., 2020; Bahl et al., 2020;
Martini et al., 2021]. The sensitivity to α can be enhanced by using observables that are
odd under CP transformation [Mileo et al., 2016; Gonçalves et al., 2018]. Machine learning
techniques have also been considered [Patrick et al., 2020; Ren et al., 2020; Bortolato et al.,
2021; Bahl and Brass, 2022; Barman et al., 2022] aiming to optimize the sensitivity. The
current experimental bounds from direct measurements are |α| ≤ 35◦ [Sirunyan et al., 2020],
|α| ≤ 48◦ [Tumasyan et al., 2023], and |α| ≤ 63◦ [ATL, 2023] at 68% C.L., and |α| > 43◦
has been excluded at 95% C.L. [Aad et al., 2020], by various Higgs detection channels. This
makes it necessary to have more complementary observables to further constrain α, at the
upcoming High-Luminosity LHC (HL-LHC) [Apollinari et al., 2017] and a possible future
pp collider at 100 TeV (FCC-hh) [Mangano and Mangano, 2017].
    In the following, we examine how ξ1 and ξ2 can serve as useful probes of the CP phase;
especially, we propose ξ2 as a new CP -odd observable. This ξ2 is a genuine CP -odd observ-
able that is constructed purely out of the kinematic information in the gluon jet, and not via
a neutral state of charged particles and antiparticles [Han and Li, 2010]. Such CP sensitiv-
ity would not be possible in the hg production process without the gluon jet substructure,
which has not been considered previously. Furthermore, we note that associated Higgs-top
production and indirect measurements via hV or V V production also depend on the hV V
                                              384


couplings and require assumptions on the latter, whereas hg production only depends on the
htt̄ coupling.
                     −                                                                      g              x̂
                  +/                                                                            ẑlab
    k1              pg                                                                                     ϕ
                                                                                   h                g          ẑ
         t                                                                                 H
    k2              ph
                                                                                                       ŷ
                  h                                                                    g
Fig. 9.2: Left three: Representative diagrams for gg → hg via a top loop. Rightmost one:
the gluon x̂-ŷ-ẑ frame defined in the same way as Eq. (8.15).
     With the Lagrangian in Eq. (9.22), we can calculate the polarization degree of the gluon.
At LO, both gg fusion and q q̄ annihilation contribute via a top loop, as exemplified in Fig. 9.2
(left) for the gg channel. Even though the q q̄ channel can also produce a substantially polar-
ized gluon, its contribution to the total cross section is much smaller and will be neglected.
Parametrizing the helicity amplitudes g(λ1 ) g(λ2 ) → h g(λ3 ) in the partonic c.m. frame in
terms of the gluon’s transverse momentum pT , rapidity yg , and azimuthal angle ϕg , we have
                                                      h                                                 i
       Mλ1 λ2 λ3 (pT , yg , ϕg ) = f abc i(λ1 −λ2 )ϕg
                                        e              κ Aλ1 λ2 λ3 (pT , yg ) + i κ   e
                                                                                  e Aλ1 λ2 λ3 (pT , yg ) ,    (9.23)
with f abc the color factor, and λi the gluon helicities. In Eq. (9.23), A and Ae are the CP -even
and CP -odd helicity amplitudes, respectively, constrained by their CP properties as
                           (A, A)e −λ1 ,−λ2 ,−λ3 (pT , yg ) = (−A, +A)   e λ1 λ2 λ3 (pT , yg ),               (9.24)
given by Eq. (9.15). The gluon density matrix is then determined through
                                  1 X
                                   2
                                                 Mλ1 λ2 λ M∗λ1 λ2 λ′ ≡ ρλλ′ (ξ) |M|2 ,                        (9.25)
                               4 Nc,g     λ1 ,λ2
                                                          385


where the convention of summing over repeated indices is taken, and |M|2 is the unpolarized
squared amplitude, averaged/summed over the spins and colors, with Nc,g = 8. Due to their
CP properties in Eq. (9.24), A and Ae individually only contribute to ξ1 , while it is their
interference that contributes to ξ2 . In terms of the CP phase α, ξ can be expressed as
                       ω + β1 cos 2α                    β2 sin 2α                  β3 sin 2α
                 ξ1 =                   ,   ξ2 =                   ,      ξ3 =                    ,     (9.26)
                       1 + ∆ cos 2α                   1 + ∆ cos 2α              1 + ∆ cos 2α
where we have defined the polarization parameters
                               |A|2 − |A| e2               2(A+ ∗ A∗− + Ae+ ∗ Ae∗− )
                        ∆=                    ,     ω=                                   ,
                               |A|2 + |A| e2                     |A|2 + |A|  e2
           2(A+ ∗ A∗− − Ae+ ∗ Ae∗− )                  4 Re(A+ ∗ Ae∗− )              4 Im(A+ ∗ Ae∗+ )
     β1 =                              ,   β2 =                         ,    β3 =                     , (9.27)
                           e2
                  |A|2 + |A|                                     e2
                                                        |A|2 + |A|                     |A|2 + |A|  e2
with the notations defined similarly to Eq. (8.23),
                      X                                         X
          Aλ ∗ Bλ′ ≡             Aλ1 λ2 λ Bλ1 λ2 λ′ ,    |A|2 ≡                Aλ1 λ2 λ3 A∗λ1 λ2 λ3 .   (9.28)
                          λ1 ,λ2                                    λ1 ,λ2 ,λ3
Parametrizing ξ as in Eq. (9.26) clearly shows that the polarization only depends on the
CP phase α, but not on the coupling strength κt , which only controls the event rate. The
helicity polarization ξ3 requires an imaginary part from the amplitudes so is nonzero only at
√
  ŝ > 2mt . Its value is generally small compared to ξ1 and ξ2 , and will not be discussed in
the following.
                                                        386


9.3.2       Polarized gluon jet function
With the polarization ξ produced from the hard scattering in Eq. (9.25), the gluon then
fragments into a polarized jet. In the partonic c.m. frame, the gluon jet momentum k
defines the jet mass m2J = k 2 and direction ẑ as in Eq. (8.15) and Fig. 9.2 (right). By
                                                 √                     √
defining two lightlike vectors nµ = (1, −ẑ)/ 2 and n̄µ = (1, ẑ)/ 2, we can approximate the
gluon momentum in the hard part to be on shell by only retaining the large component,
                                                                         √
pµg = (k · n)n̄µ , which then defines the rapidity yg and pT = k · n/( 2 cosh yg ). To the leading
power of mJ /pT , the polarized gluon fragmentation is described by the polarized jet function
in Eq. (9.18),
     dJ(ξ, m2J , ϕ)          1        XZ
                    =                        d4 x eik·x [ρλλ′ (ξ)O(ϕ, X)] ελµ (pg ) ε∗λ′ ν (pg )
           dϕ          2πNc,g (k · n)2 X
                                                                                       ρµ
                         × ⟨0|Wac (∞, x; n) nσ Gσν
                                                 c (x)|X⟩ ⟨X|Wab (∞, 0; n) nρ Gb (0)|0⟩ ,        (9.29)
where X denotes the state of the particles within the jet, in accordance with the jet algo-
rithm [Almeida et al., 2009b; Ellis et al., 2010], whose momenta are dominantly along n̄.
Gµνc is the gluon field strength tensor, and
                                                 Z   ∞                               
                                                                 c             adj,c
                    Wab (∞, x; n) = P exp −ig            dλ n · A (x + λn)(Tab       )           (9.30)
                                                    0
is the Wilson line in the adjoint representation from x to ∞ along n, with P denoting the
path ordering and T adj,c the SU(3) generator in the adjoint representation. The repeated
color indices a, b, and c in Eq. (9.29) are summed over. In Eq. (9.29), the gluon polarization
states are projected using the on-shell polarization vectors εµλ (pg ) with helicity λ = ±1, which
are then averaged with the density matrix ρλλ′ (ξ). The resultant azimuthal distribution is
                                                  387


extracted by inserting the observable
                                             1      X
                             O(ϕ, X) = P                pi,T δ(ϕ − ϕi ),                  (9.31)
                                           i∈X pi,T i∈X
where pi,T and ϕi are, respectively, the transverse momentum and azimuthal angle of the
jet constituent i with respect to the x̂-ŷ plane defined in Eq. (8.15) and shown in Fig. 9.2
(right). Such a ϕ distribution is a new jet substructure observable introduced by the linear
polarization. The dependence on ξ3 would vanish due to parity invariance of O(ϕ, X).
    As a result of the pi,T weight, the observable O(ϕ, X) is IR safe, and hence the polarized
gluon jet function is insensitive to hadronization effects and becomes perturbatively calcu-
lable, with a predictable ϕ dependence. Nevertheless, it was noted long before [DeGrand
and Petersson, 1980; Hara and Sakai, 1989] that the gluon polarization information will be
greatly washed out by the cancellation between the g → gg and g → q q̄ channels, which
was also found recently in a similar situation [Chen et al., 2021, 2022; Larkoski, 2022]. It
is possible to mitigate these effects by using jet flavor tagging techniques [Gallicchio and
Schwartz, 2011, 2013; Ferreira de Lima et al., 2017; Frye et al., 2017; Banfi et al., 2006; Gras
et al., 2017; Metodiev and Thaler, 2018; Larkoski et al., 2014; Bhattacherjee et al., 2015;
Kasieczka et al., 2019a; Larkoski and Metodiev, 2019; Bright-Thonney et al., 2022]. For
example, one may recluster the identified gluon jet into two subjets, and only keep those
gluon jets with their two subjets tagged as quarks. At O(αs ), requiring a tagged quark in
the gluon jet leaves g → q q̄ as the only diagram, giving the polarized gluon jet function,
                                                                        
                         dJ (q)    αs TF       1
                                = 2 2 1 + (ξ1 cos 2ϕ + ξ2 sin 2ϕ) ,                       (9.32)
                          dϕ      6π mJ        2
                                               388


where the jet algorithm dependence does not come in at this order to the leading power of mJ .
Eq. (9.32) needs to be multiplied by the tagging efficiency when used in Eq. (9.18). Although
flavor tagging reduces the statistics significantly, it enhances the gluon spin analyzing power
from O(1%) to about 50% [Hara and Sakai, 1989] and will improve the statistical precision.
    It is worth noting that even though we choose to express the event kinematics in the
partonic c.m. frame for simplicity, a boost along the beam direction does not change the
polarization of the gluon jet to the leading power of mJ , due to the special transformation
property of massless particle state in Eq. (7.70). Therefore, we may equally describe each
event in the lab frame, and the azimuthal jet anisotropy structure in Eq. (9.32) stays the
same.
    Before closing this section, we note the difference of the gluon polarization from a quark.
While a transversely polarized light (massless) quark can also be produced from hard scat-
tering processes, as discussed in Ch. 8, its transverse spin cannot be conveyed via the per-
turbative quark jet function due to the chiral symmetry of a massless quark. It is hence
related to chiral symmetry breaking and must require the presence of some non-perturbative
functions [Collins et al., 1994; Collins, 2013; Kang et al., 2020].
9.3.3      Numerical results for the polarization degrees
The parameters (∆, ω, β1 , β2 ) in Eq. (9.27) are functions of pT and yg , as shown in Fig. 9.3(a)
for some benchmark phase-space points. While the parameter ∆, which describes the relative
difference between the CP -even and CP -odd amplitudes squared, stays relatively flat around
−0.4 in the range pT < 10 TeV, the parameters ω, β1 , and β2 , which control the sizes of the
polarizations ξ1 and ξ2 , vary sizably with pT . Based on their pT dependence, we divide the
phase space into three pT regions and discuss them in turn.
                                               389


Fig. 9.3: (a) Polarization parameters ∆, ω, β1 , and β2 , as functions of the gluon pT in the
partonic c.m. frame. Each parameter is shown as a shaded region constrained by |yg | ≤ 0.8,
bounded by a solid curve and a dashed curve, √      corresponding to yg = 0 and |yg | = 0.8,
respectively. The two vertical lines stand for the ŝ = 2mt threshold for yg = 0 (red, solid)
and |yg | = 0.8 (blue, dashed), respectively. The three hatching-shaded regions are the low-pT
region (cyan) for pT < 100 GeV, transition region (blue) for pT ∈ (100, 300) GeV, and high-
pT region (brown) for pT > 300 GeV. (b) ξ1 in the low-pT region with the SM Lagrangian
(α = 0) for three values of yg , where the full one-loop calculation
                                                              √       (solid) is compared with
the EFT result (dashed). The three vertical lines are the ŝ = 2mt threshold for yg = 0
(red), yg = 1.2 (green) and yg = 2 (blue). (c) ξ1 and ξ2 in the transition and high-pT regions,
for CP phase α = 0 and π/4, respectively, at which ξ1 and ξ2 peak respectively.
                                               390


      1. Low-pT region, with pT ≲ 100 GeV. In this region, both |ω| and β1 have large values,
whereas β2 ≃ 0. The linear polarization is thus dominated by ξ1 , with ξ2 ≃ 0. The dominance
of ω over β1 further implies that ξ1 does not depend sensitively on α. Being well below the
√
   ŝ = 2mt threshold, this region can be well approximated by the infinite-top-mass effective
field theory (EFT) [Dawson, 1991; Djouadi et al., 1991],
                                          h                                      
                            LEFT ⊃ −           λhgg Gaµν Gaµν + λehgg Gaµν G eaµν ,               (9.33)
                                         4v
which is matched onto Eq. (9.22) by
                                            2αs            ehgg = − αs TF κ
                                  λhgg = −        TF κ,   λ                  e.                   (9.34)
                                             3π                       π
In Fig. 9.3(b), the SM predictions for ξ1 are shown for both the full one-loop calculation and
the EFT approximation,
                                                     "     2          2                  #
                                            1           mH           pT      λ2hgg − λ e2
                                                                                         hgg
                  ξ1EFT (pT , yg ) = −                          +                              , (9.35a)
                                        U (pT , yg )    pT           mH        2       e
                                                                             λhgg + λhgg 2
                                                     "     2               #
                                            1           pT             ehgg
                                                                 2λhgg λ
                  ξ2EFT (pT , yg ) =−                                           ,                (9.35b)
                                        U (pT , yg )    mH      λ2 + λ  e2
                                                                 hgg     hgg
with
                                                                                 2
                                                   pT                       mH
                           U (pT , yg ) = 2 +          (1 + 2 cosh 2yg ) +           .            (9.36)
                                                  mH                         pT
One can see that ξ1 generally has a large negative value, which means that the produced
gluon is dominantly polarized along the x̂ direction in the production plane, cf. Fig. 9.2
(right). Furthermore, it is not dramatically dependent on the gluon rapidity yg .
      Since the low-pT region contains most of the hg events, it is suitable for testing the gluon
                                                       391


linear polarization phenomenon. Here we expect a significant cos 2ϕ jet anisotropy due to
the dominant ξ1 , as shown in Fig. 9.1(a). Its insensitivity to α also means that this region
can serve as a calibration region for experimentally measuring the linear polarization, which
is important to ensure its viability and to understand the systematic uncertainties of the
measurement since such phenomenon has not been observed before.
    2. Transition region, with 100 GeV ≲ pT ≲ 300 GeV. In this region, β1 and ω rapidly
go to 0 and flip their signs, while |β2 | starts growing to an appreciable value. Hence, the
linear polarization is dominated by ξ2 if α is not too small, as illustrated in Fig. 9.3(c)
for ξ1 at α = 0, and ξ2 at α = π/4, which corresponds to a maximal CP mixing. A
nonzero α would then lead to a linearly polarized gluon jet that features a sin 2ϕ anisotropy,
whose measurement provides a good opportunity for constraining the CP -odd coupling.
                                       √
Furthermore, this region covers the ŝ = 2mt threshold, so the EFT is no longer a good
approximation, as indicated in the right half of Fig. 9.3(b). In this region, both ξ1 and ξ2
are sensitive to yg , and their magnitudes are larger for gluon jets at more central rapidity
region.
    3. High-pT region, with pT ≳ 300 GeV. Here, both β1 and β2 have appreciable negative
magnitude. Their values grow and approach each other as pT increases. Moreover, ω, being
smaller than |β1 |, becomes less important in ξ1 . Qualitatively, we can interpret this region
by taking ω, ∆ → 0 and β1 , β2 → β, which gives (ξ1 , ξ2 ) ∼ β(cos 2α, sin 2α). Then the jet
anisotropy in Eq. (9.32) can be recast as
                                          1
                                     1+     β cos 2(ϕ − α) ,                            (9.37)
                                          2
so that the main axis direction of the jet image gives a direct measure of the CP phase. It
                                               392


can be shown that as ŝ → ∞, this qualitative simplification becomes exact in the one-loop
calculation. The quantitative behavior of ξ1 and ξ2 in the high-pT region is shown in the
right half of Fig. 9.3(c), where we see that they drop rapidly to 0 as |yg | increases, and a
simple kinematic cut |yg | < 0.8 yields the polarization |β1,2 | ≳ 0.05.
9.3.4       Phenomenology
The gluon jet azimuthal anisotropy in Eq. (9.32) can be experimentally measured by simply
constructing the asymmetry observables,
                                   R 2π
                                    0
                                        dϕ (dσw /dϕ) · sgn [Fi (ϕ)]  ξi
                             Ai =         R 2π                      = ,                  (9.38)
                                               dϕ (dσw /dϕ)          π
                                           0
where i ∈ {1, 2}, F1 (ϕ) = cos 2ϕ and F2 (ϕ) = sin 2ϕ. The uncertainties of the asymmetries
                                                          √
A1,2 are dominated by statistical ones, given by 1/ N with N being the number of the
observed events. Now we provide a simple demonstration of the constraining power of the
gluon linear polarization on the CP phase, by confining ourselves to the transition region for
both the HL-LHC at 14 TeV and FCC-hh at 100 TeV, with integrated luminosities 3 ab−1
and 20 ab−1 , respectively.
     The hg cross section in the transition region is estimated for the Lagrangian [Eq. (9.22)]
using CT18NNLO PDFs [Hou et al., 2021] with MG5 aMC@NLO 2.6.7 [Alwall et al., 2014]
by first generating the hg events with pT ∈ [100, 300] GeV and |ηg | ≤ 2.5 in the lab frame,
and then boosting to the partonic c.m. frame with a further cut |yg | ≤ 0.8, which gives
κ2t (0.57 cos2 α + 1.3 sin2 α) pb for the HL-LHC and κ2t (13.7 cos2 α + 30.7 sin2 α) pb for the
FCC-hh. While both κt and α affect the total production rate and can be constrained by
the measurement of the latter, only α determines the polarization. In the following, we take
                                                 393


κt = 1 and consider the constraint on α from the polarization data.
     We are interested in final states where the (fat) gluon jet is composed of a pair of quark
subjets. While it is possible to also discriminate light quark subjets from gluon subjets,
here we only provide a conservative estimate by restricting to the bottom (b) and charm
(c) quark tagging as used in experiments [CMS, 2016; ATL, 2016; Sirunyan et al., 2018;
Aaboud et al., 2018b,a; Aad et al., 2019a,c; Tumasyan et al., 2022; Aad et al., 2022; ATL,
2022b,a]. We estimate the branching fraction fgbb̄ (fgcc̄ ) of g → bb̄ (g → cc̄) through parton
shower simulation using Pythia 8.307 [Sjöstrand et al., 2015], which gives fgbb̄ = 0.013 and
fgcc̄ = 0.019 in the selected kinematic region. Following Refs. [Aad et al., 2019a; Aaboud
et al., 2018a], we take b-tagging efficiency ϵb = 0.7 and c-tagging efficiency ϵc = 0.3. We
consider the diphoton decay channel of the SM Higgs boson and assume a Higgs tagging
efficiency ϵh = 0.002. This then gives about (51 cos2 α + 115 sin2 α) reconstructed events at
the HL-LHC and (8100 cos2 α + 18200 sin2 α) events at the FCC-hh.
Fig. 9.4: Constraining power of the FCC-hh gluon polarization data, in the transition region,
on the CP phase α. ⟨ξ1,2 ⟩ are the average values of ξ1,2 in the specified kinematic region.
Their statistical uncertainties are indicated by the red and blue bands, respectively, around
the SM prediction (with α = 0). The green-hatched region is the α range allowed by the ξ2
measurement.
     In Fig. 9.4, we display the predicted average values of ξ1,2 in the transition region at
                                               394


the FCC-hh as functions of the CP phase α, together with their uncertainty bands around
the SM central values. As expected, it is ξ2 that constrains small values of α, whereas ξ1 is
too small to have an impact in this region. Assuming the SM scenario with ξ2 = 0, we can
project the constraint |α| ≤ 8.6◦ . In this estimate, we have only used the gluon polarization
information with Higgs decaying to diphotons. In order to make a significant impact with
data from the HL-LHC, one will have to include other Higgs decay channels and light quark
flavor tagging in the gluon jets, as well as data from the low-pT and high-pT regions, which
will significantly improve the constraints. A more careful phenomenological study is left for
future.
9.3.5      Summary
A precise understanding of the CP property of the Higgs boson is important both to test the
SM and to probe new physics. In this section, we proposed a novel way of probing the CP
structure of the Higgs-top interaction, by measuring the azimuthal anisotropy substructure
of the gluon jet produced in association with a Higgs boson, which originates from the lin-
ear polarization of the final-state gluon. We have introduced a factorization formalism and
defined a perturbative polarized gluon jet function with insertion of an IR-safe azimuthal ob-
servable. Experimental measurement of the linearly polarized gluon jet will be an important
test of the SM and can also serve as a new tool to search for new physics.
                                               395


9.4      Linearly polarized W boson and boosted top quark
         jet substructure
As we have seen in Sec. 9.3, linearly polarized gluons can be produced from hard partonic
scattering at the unpolarized LHC and induce azimuthally anisotropic jet images which
can serve as useful probes for the hard interaction structures, especially the CP property.
Obviously, similar phenomena exist for heavy gauge bosons like the W and Z. The inherent
parity-violating interactions can give rise to richer phenomenology. Instead of studying their
prompt production from the hard scattering, however, in this section, we examine the linear
polarization of the W boson in the boosted top quark decay. As can be easily inferred from
Sec. 9.3, in the absence of CP -violating interactions, only the linear polarization ξ1 can be
produced, characterizing the difference between the linear polarization degrees along and
perpendicular to the top quark decay plane. Nevertheless, as we will see in the following,
due to the large parity violation in the tbW interaction, ξ1 will be sensitively dependent
on the longitudinal polarization of the top quark, so its measurement will serve as a useful
boosted top quark polarimeter.
9.4.1      Polarization of the W boson in the top rest frame
For the sake of a generic discussion, we work with the Lagrangian
                                   g
                           L ⊃ − √ b̄γ µ (gL PL + gR PR ) t Wµ + h.c.                     (9.39)
                                    2
to describe the t → bW decay. When a boosted top quark is produced in the lab frame, we
construct the coordinate system x̂-ŷ-ẑ as in Eq. (8.15), with ẑ along the top quark momentum
                                                396


direction, and x̂-ŷ quantifying the transverse plane. The decay process is most conveniently
analyzed in the top rest frame, where the amplitude MR                         λt λw λb takes a simple form,
                                                 1/2∗                                  iλt ϕw  1/2
       MR  λt λw λb (θw , ϕw ) = Aλw ,λb Dλt , λw −λb (ϕw , θw , 0) = Aλw ,λb e               dλt , λw −λb (θw ), (9.40)
where θw and ϕw describe the angles of the W with respect to the x̂-ŷ-ẑ coordinate sys-
tem. The constant coefficients Aλw ,λb can be explicitly calculated from Eq. (9.39), with four
independent components,
                 A1,1/2 = iN (gR − fL /r),                                  A−1,−1/2 = iN (gL − fR /r),
                            iN                                                           iN
                 A0,1/2 = √ (gR /r − fL ) ,                                 A0,−1/2 = √ (gL /r − fR ) ,           (9.41)
                               2                                                            2
               p
with N = g         m2t − m2w and r = mw /mt ≃ 0.46. We neglect the b mass throughout the whole
discussion. Importantly, because of the spin-half nature of the top quark, W cannot simul-
taneously have both left- and right-handed polarization states for a given b state, regardless
of whether b is massless or massive. There is thus no interference of those states, implying
a vanishing linear polarization.
    For a general top spin state described by a density matrix ρt = (1 + st · σ)/2 with
st = (b1 , b2 , λt ) being the top spin vector, the unnormalized W density matrix can be obtained
explicitly from Eq. (9.40) by
                                  X
        WλRw λ′w (θw , ϕw ) =                ρtλt λ′t MR          R∗
                                                       λt λw λb Mλ′t λ′w λb
                                 λt ,λ′t ,λb
                                                                397


                                                                                           
                         2                        g2
                     gR    (1 + nw · st )       √R
                                                    2r
                                                        L(θw , ϕw )                0        
                 N 
                   2
                        g2                       2                         2
                                                                             gL
                                                                                            
                                                                                            
              =         √R   L∗ (θw , ϕw )  1              2
                                                   g+ + g− nw · st          √ L(θw , ϕw )  ,         (9.42)
                  2       2r               2 r2                              2r            
                                                  2
                                                                                            
                                                 gL      ∗                  2
                                 0               √ L (θw , ϕw )
                                                  2r
                                                                         gL (1 − nw · st )
where we defined g±  2
                        ≡ gL2 ± gR2 and nw = (sin θw cos ϕw , sin θw sin ϕw , cos θw ), and
   L(θw , ϕw ) ≡ b1 (cos θw cos ϕw + i sin ϕw ) + b2 (cos θw sin ϕw − i cos ϕw ) − λt sin θw .        (9.43)
Compared with Eq. (9.8), we can extract the unnormalized polarization parameters,
                                                                                  
         R      R         R        R                  1            2         1          2
    trW =     W++   +  W00   + W−−           =             + 1 g+ +              − 1 g−   nw · st , (9.44a)
                                                    2 r2                  2 r2
              √           R        R
                                                g+2
        J1 = 2 Re W+0         + W0−          =         · Re [L(θw , ϕw )] ,                         (9.44b)
                                                  r
                √                                   g2
        J2 = − 2 Im W+0     R
                               + W0−  R
                                             = − + · Im [L(θw , ϕw )] ,                             (9.44c)
                                                      r
                R         R                           2      2
        J3 = W++    − W−−                    = −g−       + g+  nw · st ,                            (9.44d)
                √           R         R
                                                g−2
      Qyz = − 2 Im W+0         − W0−         =         · Im [L(θw , ϕw )] ,                         (9.44e)
                                                  r
              √                                     g2
      Qxz =     2 Re W+0  R
                              − W0−R
                                             = − − · Re [L(θw , ϕw )] ,                             (9.44f)
                                                     r                          
                R         R          R                   1          2        1         2
        δL = W++    + W−−     − 2W00         = − 2 − 1 g+ −                   2
                                                                                + 1 g−    nw · st , (9.44g)
                                                        r                   r
                       R
                          
         ξ = 2Re W+−                         = 0,                                                   (9.44h)
                         R
                             
      Qxy = −2Im W+−                         = 0,                                                    (9.44i)
where a common factor N 2 /2 has been omitted. Note that when the top quark is unpolarized
(st = 0), only the diagonal elements in Eq. (9.42) survive, so only J3 and δL are nonzero; if
parity further conserves (gL = gR ), only δL is allowed.
                                                     398


9.4.2      Polarization of the W boson in the boosted top frame
While one can (in principle) always keep the full dynamic information by analyzing the top
decay events in their rest frame and constructing the full W decay distributions, it is more
desirable to analyze boosted top quarks within the boosted frame. Boosted top quarks are
likely to originate from the decay of heavy particles beyond the SM, so both their production
rate and polarization information can serve as useful probes of new physics [Schätzel, 2015].
In this kinematic regime, a top quark decays into collimated particles, exhibiting a cone
signature that can greatly enhance the selection efficiency of boosted top quark events with
respect to the intrinsic W + j background, which compensates for the small production
rate [Abdesselam et al., 2011]. On the other hand, the semileptonic decay mode no longer
retains special advantages over the hadronic mode, and one ought to take the latter into
account to enhance the statistics. Identifying hadronically decayed boosted top quarks first
requires distinguishing them from light QCD jet background.
    Following the tagging procedures, one may identify the top decay products as different
subjets within the fat jet. Because the finite granular size of the detector leads to large
uncertainties of the angular separations (especially in polar angles) among the subjets inside
the top jet, going back to the top rest frame is not a valid choice, and it is more appropriate
to analyze the top quark jet substructure with directly measured kinematic observables. This
makes the azimuthal correlation like Eq. (9.32) a good candidate for such analysis, which
is very suitable for hadronic decay mode since cos 2ϕ and sin 2ϕ do not require identifying
particle types but can be measured from the azimuthal energy deposition anisotropy. In
contrast, cos ϕ and sin ϕ signatures do not have such advantages.
    As we have seen in Sec. 9.3, cos 2ϕ and sin 2ϕ arise from linear polarization of a vector
                                              399


boson. In the top rest frame, the linear W polarization ξ = Qxy = 0 independent of the
interaction details. Now we boost the top decay system along the ẑ direction by Λt = Λ(βt )
to recover the boosted top momentum in the lab frame,
                                      pµt = Λt (mt , 0, 0, 0)T = (Et , 0, 0, pt ),                                 (9.45)
where βt = pt /Et is determined by top momentum pt and energy Et . This does not change
the top helicity state but changes the b and W states, which transform according to their
little groups, such that in the boosted top frame, the t → bW amplitude MB                            λt λw λb is related
to MR  λt λw λb in Eq. (9.40) by
                                                      X
                                      −iλb Θ(Λt ,pb )
          MB  λt λw λb (θw , ϕw ) = e                     Dλ1 w λ′w (W (Λt , pw )) MR
                                                                                    λt λ′w λb (θw , ϕw ),          (9.46)
                                                      λ′w
where we describe each amplitude in terms of the W angles in the top rest frame, but
the helicity of each particle is with respect to the specified frame. The explicit forms of
the little group elements W (Λt , pw ) and Θ(Λt , pb ) have been worked out in Eqs. (7.41) and
(7.70), respectively, with Θ(Λt , pb ) = 0 and W (Λt , pw ) being a rotation Ry (χ) around the ŷ
direction by an angle χ, as determined by the top quark speed βt and W kinematics in the
top rest frame,
                                                                vw + βt cos θw
                         cos χ(βt , θw ) = p                                                      ,                (9.47)
                                                 (1 + βt vw cos θw )2 − (1 − βt2 )(1 − vw2 )
where vw = (1 − r2 )/(1 + r2 ) ≃ 0.64 is the W speed in the top rest frame. Then Eq. (9.46)
                                                           400


becomes
                                                     X                         
                    MB  λt λw λb (θw , ϕw ) =            d1λw λ′w χ(βt , θw ) MR    λt λ′w λb (θw , ϕw ).       (9.48)
                                                     λ′w
   The W density matrix W B in the boosted top frame can be obtained from Eq. (9.48) in
the same way as Eq. (9.42). It is related to W R by a rotation like for a rank-2 tensor,
                                   X                                                   
       WλBw λ′w (θw , ϕw ) =                d1λw λ̄w χ(βt , θw ) d1λ′w λ̄′w χ(βt , θw ) Wλ̄Rw λ̄′w (θw , ϕw ).  (9.49)
                                 λ̄w , λ̄′w
This translates into the transformations of the W polarization parameters,
                           J1′ = J1 cos χ + J3 sin χ,                                                          (9.50a)
                           J3′ = J3 cos χ − J1 sin χ,                                                          (9.50b)
                           J2′ = J2 ,                                                                          (9.50c)
                         Q′yz = Qyz cos χ − Qxy sin χ,                                                         (9.50d)
                         Q′xy = Qxy cos χ + Qyz sin χ,                                                         (9.50e)
                                                         ξ − δL
                         Q′xz = Qxz cos 2χ −                       sin 2χ,                                     (9.50f)
                                                          2                                    
                               ′       3ξ + δL 1                              ξ − δL
                            ξ =                     +      Qxz sin 2χ +                cos 2χ ,                (9.50g)
                                             4         2                         2
                                                                                               
                             ′         3ξ + δL 3                              ξ − δL
                           δL =                     −      Qxz sin 2χ +                cos 2χ .                (9.50h)
                                             4         2                         2
where the primed labels refer to the ones in the boosted top frame. The last three equations
can be rewritten as
                                                                          ξ − δL
                                             Q′xz = Qxz cos 2χ −                  sin 2χ,                      (9.51a)
                                                                            2
                                       ξ ′ − δL′                         ξ − δL
                                                    = Qxz sin 2χ +               cos 2χ,                       (9.51b)
                                            2                               2
                                                              401


                                3ξ ′ + δL′ = 3ξ + δL .                                    (9.51c)
In this way, the polarization parameters mix under the little group rotation. (J1 , J3 ) and
(Qxy , Qyz ) mix among themselves, so that J12 +J32 and Q2xy +Q2yz are two rotational invariants.
Qxz and (ξ−δL )/2 mix with each other, so Q2xz +(ξ−δL )2 /4 is rotationally invariant. Another
two invariants are J2 and 3ξ + δL . These transformations are reminiscent of the quadrupole
interpretations of the polarization parameters in Eqs. (9.12) and (9.13).
    The mixing of those parameters does not alter the physics information of the tbW in-
teraction encoded in the W polarization, but does change the angular distribution of the
W decay. Especially, we see that the linear polarizations ξ and Qxy can generate nonzero
values in the boosted top frame via mixing with other parameters. The resultant cos 2ϕ and
sin 2ϕ distributions can be more easily measured to reveal hidden interaction information.
This mixing is the source of generating new kinds of polarization observables in the boosted
top system, which are absent in the rest frame of top quark. It is due to the massiveness
(or off-shellness) of the W . If we are looking at an on-shell photon or gluon, its helicity will
be conserved under the boost, so that if ξ or Qxy (or ξ1 and ξ2 in the notations for massless
spin-1 particle in Eq. (9.3)) is 0 in a certain frame, it keeps being 0 in any other frame, cf.
Eq. (7.70).
    Using Eqs. (9.50) and (9.44), we can get the explicit forms of the polarization parameters
in the boosted top frame,
                                                       
               Re [L(θw , ϕw )]
    J1′ =  2
          g+                                                2
                                cos χ + (nw · st ) sin χ − g− sin χ,                     (9.52a)
                     r
               Im [L(θw , ϕw )]
    J2′ = −g+2
                                ,                                                        (9.52b)
                      r
                                                   402


                                                                    
                       Re [L(θw , ϕw )]
     J3′    = −g+ 2
                                          sin χ − (nw · st ) cos χ − g−       2
                                                                                 cos χ,                         (9.52c)
                                r
                  Im [L(θw , ϕw )]
   Q′yz = g−   2
                                      cos χ,                                                                   (9.52d)
                            r
               2 Im [L(θw , ϕw )]
  Q′xy = g−                           sin χ,                                                                    (9.52e)
                           r                                                      
                  2 Re [L(θw , ϕw )]                1 + r2                                2 1−r
                                                                                                  2
   Q′xz = −g−                             cos 2χ +          (n w  · s  t ) sin  2χ   −  g+           sin 2χ,    (9.52f)
                                r                     2r2                                    2r2
                                                                                  
          ′       2 Re [L(θw , ϕw )]               1 + r2                     2          2 1−r
                                                                                                 2
       ξ = −g−                            sin 2χ +          (nw  ·  s t )  sin   χ   − g +          sin2 χ,    (9.52g)
                               2r                    2r2                                     2r2
                                                                                         
        ′      2 3 Re [L(θw , ϕw )]                1 + r2                 1 + 3 cos 2χ                    2
                                                                                                2 1 − r 1 + 3 cos 2χ
     δL = g−                             sin 2χ −          (n w  ·  s t )                   − g +                    ,
                              2r                     2r2                         2                   2r2     2
                                                                                                               (9.52h)
which are again expressed in terms of the W angles in the top rest frame, and where we have
omitted an overall factor N 2 /2.
9.4.3          Decay distribution of the top quark
The top quark decays into a b and W , which further decays into a fermion-anti-fermion pair
f f¯′ . In the boosted top frame, this amplitude can be written as
                                            i           X
            MBλt λb λf λf ′ =                                MB                            B          ⋆
                                                                 λt λw λb (θw , ϕw )Mλw λf λf ′ (θf , ϕf ),
                                                                                                           ⋆
                                                                                                                 (9.53)
                               (p2w − m2w )2 + imw Γw λ
                                                           w
where all helicities are defined in the boosted top frame, but the event is described by the
W angles (θw , ϕw ) in the top rest frame and the fermion angles (θf⋆ , ϕ⋆f ) in the W rest frame.
The latter is obtained by boosting along −pw and has the coordinate system x̂w -ŷw -ẑw ,
                                          pw              pt × pw
                                  ẑw =        , ŷw =                  ,    x̂w = ŷw × ẑw ,                   (9.54)
                                         |pw |          |pt × pw |
                                                            403


where all momenta are in the boosted top frame. The kinematics in the boosted top frame
is shown in Fig. 9.6(a), where it is clear that the angle ϕf in the boosted frame is equal
to ϕ⋆f in the W rest frame and characterizes the relative angle between the two successive
decay planes of t → bW and W → f f¯′ , respectively. In Eq. (9.53), we have separated the
propagator of the intermediate W boson and converted its numerator to a polarization sum,
                                                         pµw pνw          X µ∗
                                            −g µν +          2
                                                                   =               ϵλw (pw )ϵνλw (pw ),                                 (9.55)
                                                          mw           λ =±,0
                                                                          w
with pw = pf + pf ′ being the W ’s 4-momentum and Γw its decay width. Averaging the
square of the amplitude in Eq. (9.53) with the top quark spin density matrix gives
           X
                         ρtλt λ′t MB               B∗
                                   λt λb λf λf ′ Mλ′t λb λf λf ′
   λt ,λ′t ,λb ,λf ,λf ′
                                                                                                                                          
                           1                 X         X                                                X
   =                                                            ρtλt λ′t MB λ   λ   λ  MB∗λ′ λ′ λ            MB               B∗
                                                                                                                     λw λf λf ′ Mλ′w λf λf ′
                                                                                                                                             
             2           2  2      2    2                                      t   w  b            b
        (pw − mw ) + mw Γw λw ,λ′ λ ,λ′ ,λ                                                  t w
                                                                                                       λf ,λ ′
                                                 w     t   t  b                                             f
             π                         X                                     X
   ≃                 δ p2w − m2w                 WλBw ,λ′w (θw , ϕw ) ·                MB              B∗            ⋆    ⋆
                                                                                          λw λf λf ′ Mλ′w λf λf ′ (θf , ϕf ),           (9.56)
        mw Γw                            λ ,λ′                               λ ,λ
                                          w    w                              f    f′
where we used narrow width approximation for the W and the definition of the polarization
matrix W B in the last step. Using Eq. (9.56) and integrating over the top decay phase space,
we can express its width as
                                         Z                         Z
                          1 1 − r2           d cos θw dϕw                 d cos θf⋆ dϕ⋆f
              Γt =                                               ·
                       2Et 2mw Γw                 32π 2                       32π 2
                                          X                                  X
                                     ×           WλBw λ′w (θw , ϕw ) ·                MB              B∗            ⋆    ⋆
                                                                                         λw λf λf ′ Mλ′w λf λf ′ (θf , ϕf ),            (9.57)
                                         λw ,λ′w                            λf ,λf ′
where we have set the W on shell using the δ-function in Eq. (9.56).
                                                                          404


     Boosting the W rest frame for the W → f f¯′ system changes the helicity states of f and
f¯′ according to their little group rotations, but does not change the W helicity state. Due
to the sum over λf and λf ′ , such rotation matrices reduce to identity by their unitarity, so
the W decay amplitude square can be equally described in the W rest frame,
                   X                                                    X
                                                               ⋆                                              ⋆     ⋆
                           MB                 B∗           ⋆
                               λw λf λf ′ Mλ′w λf λf ′ (θf , ϕf ) =             MR               R∗
                                                                                   λw λf λf ′ Mλ′w λf λf ′ (θf , ϕf ),      (9.58)
                  λf ,λf ′                                             λf ,λf ′
with R denoting the W rest frame. Then we can use the simple result
                                                                                                   ⋆
                                                                                              iλw ϕf 1∗
      MR                ⋆    ⋆                     1∗            ⋆     ⋆
           λw λf λf ′ (θf , ϕf ) = Cλf ,λf ′ Dλw , λf −λf ′ (ϕf , θf , 0) = Cλf ,λf ′ e              dλw , λf −λf ′ (θf⋆ ), (9.59)
where the constant coefficient Cλf ,λf ′ only has two nonzero components C−1/2, 1/2 ≡ C− and
C1/2, −1/2 ≡ C+ when neglecting the fermion masses. In the SM only the former exists, but
we keep both here for a general discussion. Then the matrix element in Eq. (9.57) (second
line) can be directly obtained from Eq. (9.8) with the boosted polarization parameters in
Eq. (9.52),
      X                               X
             WλBw λ′w (θw , ϕw ) ·            MR                R∗           ⋆    ⋆
                                                   λw λf λf ′ Mλ′w λf λf ′ (θf , ϕf )
     λw ,λ′w                         λf ,λf ′
                                                                                                
          |C+ |2 + |C− |2 N 2 2 2                     1                 2       1
      =                                     g             + 1 + g−                  − 1 nw · st
                   2             2 3 + 2 r2                                   2 r2
                                                δ ′ (θw , ϕw )                         1
                                            − L                   1 − 3 cos2 θf⋆ + ξ ′ (θw , ϕw ) sin2 θf⋆ cos 2ϕ⋆f
                                                        6                                  2
                                                1                                            1
                                            + Q′yz (θw , ϕw ) sin 2θf⋆ sin ϕ⋆f + Q′xz (θw , ϕw ) sin 2θf⋆ cos ϕ⋆f
                                                2                                        2
                                                1
                                            + Q′xy (θw , ϕw ) sin2 θf⋆ sin 2ϕ⋆f
                                                2
                      2          2     2
                                         
              |C+ | − |C− | N
          +                                J1′ (θw , ϕw ) sin θf⋆ cos ϕ⋆f + J2′ (θw , ϕw ) sin θf⋆ sin ϕ⋆f
                        2            2
                                                                  405


                                                              
                                   +  J3′  (θw , ϕw ) cos θf⋆   .                            (9.60)
Inserting this into Eq. (9.57) gives the full angular distribution of the top quark decay.
      Focusing on the azimuthal distribution of the f , which is the same in the W rest frame
and boosted top frame, we integrate out θf⋆ and (θw , ϕw ) in Eq. (9.57). The former kills the
angular components for δL′ , Q′yz , Q′xz , and J3′ , and the latter renders Q′xy and J2′ to vanish.
In the end, we can express the azimuthal distribution as
                                                                                     
                          1 dΓt      1           1 ′              3π      ′
                                 =          1 + ⟨ξ ⟩ cos 2ϕf +       fw ⟨J1 ⟩ cos ϕf ,       (9.61)
                         Γt dϕf     2π           2                 8
where fw ≡ (|C+ |2 − |C− |2 ) / (|C+ |2 + |C− |2 ) describes the parity violation degree in W
decay, which is −1 in the SM, and
                                                    Z
                              ′           2r2           d cos θw dϕw ′
                           ⟨O ⟩ ≡ 2                                  [O (θw , ϕw )] ,        (9.62)
                                   g+ (1 + 2r2 )               4π
is the averaged W polarization. In Eq. (9.61), we have changed ϕ⋆f to the azimuthal angle
ϕf in the boosted top frame since they are the same. Using Eq. (9.52), we have
                   Z  1
    ′        1          d cos θw                                                      
  ⟨ξ ⟩ =                           λt ft r sin θw sin 2χ − 1 + r2 cos θw sin2 χ − 1 − r2 sin2 χ ,
          1 + 2r2   −1      2
                   Z  1
            2r          d cos θw
 ⟨J1′ ⟩ =                        [λt (− sin θw cos χ + r cos θw sin χ) − ft r sin χ] ,       (9.63)
          1 + 2r2   −1      2
                2    2
where ft ≡ g−     /g+   is the parity violation degree in t → bW decay, which is equal to 1 in
the SM. We see that the angle between the two successive decay planes in the top quark
events exhibits cos 2ϕ and cos ϕ distributions. The cos 2ϕ component is due to the linear
                                                       406


polarization of W that is parallel or perpendicular to the t → bW plane, which is generated
due to the boost effect. The cos ϕ component is due to the angular momentum J1 of W ,
which is the interference between WL and WT , and is only present because the W decay
violates parity.
     In the case for antitop quark, we have the same angular correlation as in Eq. (9.61), but
the coefficient ⟨ξ¯′ ⟩ and ⟨J¯1′ ⟩ differ from Eq. (9.63) by λt → −λt due to CP invariance.
9.4.4       Phenomenology of the azimuthal correlation
Since the quantities ⟨ξ ′ ⟩ and ⟨J1′ ⟩ in Eq. (9.63) depend on the top quark energy through
its velocity, this dependence saturates very quickly to infinitely boosted limit, as shown in
Fig. 9.5. As a good approximation, we define top quarks with Et ≳ 500 GeV as being
boosted, and approximate their ⟨ξ ′ ⟩ and ⟨J1′ ⟩ by the infinitely boosted limit (βt = 1), at
which Eq. (9.63) gives simple analytic expressions,
                                                        
       ′            8r2             1 + r2 1
     ⟨ξ ⟩ =                                   ln − 1 (λt ft − 1)       ≃ 0.291 (λt ft − 1) ,
             (1 − r2 )(1 + 2r2 ) 1 − r2 r
                        πr                                          
    ⟨J1′ ⟩ = −                         4f t r 2
                                                + λ t (1 − r)(1 + 3r)  ≃ −0.203ft − 0.305λt , (9.64)
               2(1 + r)2 (1 + 2r2 )
where the second equalities are the numerical expressions with r ≃ 0.46. In the SM, we have
ft = 1.
9.4.5       Azimuthal correlation as a boosted top polarimeter
If the top quark events have been identified and distinguished from the background, then it
serves as a definite prediction for the azimuthal correlation, in which the linear dependence
on λ can be used to measure the longitudinal polarization of boosted top quarks within the
                                                         407


                SM: (gL, gR) = (1, 0)
                                                                        0.2                        SM: (gL, gR) = (1, 0)
         0.0
                                                                        0.0
        - 0.2
 <ξ'>                                                         <J1 >
                                             Et = 200 GeV              - 0.2       Et = 200 GeV
        - 0.4                                Et = 300 GeV                          Et = 300 GeV
                                                                       - 0.4       Et = 500 GeV
                                             Et = 500 GeV
                                             Et = ∞                                Et = ∞
        - 0.6                                                          - 0.6
            - 1.0      - 0.5       0.0       0.5        1.0                - 1.0      - 0.5       0.0      0.5        1.0
                                    λt                                                             λt
Fig. 9.5: ⟨ξ ′ ⟩ (left) and ⟨J1′ ⟩ (right) as functions of λt at different top quark energies for the
SM couplings ft = 1.
boosted frame. This can be done for both semileptonic and fully hadronic decay modes of
top quark. For the semileptonic mode, one needs to first reconstruct the missing neutrino
three-momentum by imposing kinematic constraints of the event [Aad et al., 2013b; Cha-
trchyan et al., 2012a]. Then Eq. (9.61) is exactly the azimuthal correlation for the neutrino
(the fermion, not antifermion). For the fully hadronic decay mode, however, one cannot
distinguish the up-type and down-type quarks in W decay products, so can only use ϕ in
the range of [0, π) and sum over the events with ϕ → ϕ + π. This kills the cos ϕ component
and gives
                                            π dΓ       1
                                Ptj (ϕ) ≡         = 1 + ⟨ξ ′ ⟩ cos 2ϕ,             ϕ ∈ [0, π),                      (9.65)
                                            Γt dϕ      2
where we have thrown the subscript “f ” in ϕf since this angle is now reinterpreted as the
angle between the two successive decay planes, as shown in Fig. 9.6(a). This distribution
is shown in Fig. 9.6(b) for a few different values of λt . We note particularly that for the
unpolarized case (λt = 0), the fluctuation is about 10%, which is significant enough for exper-
imental measurement. By fitting the top event ensemble to the corresponding distribution,
one can determine the value of λt .
                                                            408


                                          W decay plane
                 ŷ   x̂       ŷw x̂
                                                      f                                    x̂w
                                      w                                          ŷw
           ne                                                                          ϕ         ẑw
                      ẑ
         pla
                 t         W
                                          ẑw
                                                          ϕ
      ay                                                      t           W
    ec
  td                                                          λt          b
                           b
                                                    f¯′
                           (a)                                             (b)
Fig. 9.6: (a) The two successive decay planes in t → bW (→ f f¯′ ) decay process. The
coordinate systems of top frame and W frame are shown separately. The x̂w axis of W frame
lies on the t decay plane, while the x̂ axis of the top frame may not. (b) The azimuthal
correlation in the boosted top quark jet is reflected as energy deposition anisotropy, favoring
a rounder top jet image.
   The measurement of top quark polarization is important for testing the SM and exploring
new physics models [Kane et al., 1992; Berger et al., 2011], which is commonly done in the
top rest frame for the semileptonic decay mode [ATL, 2021; Aad et al., 2013a; Jezabek, 1994;
Brandenburg et al., 2002; Sirunyan et al., 2019; Mahlon and Parke, 2010; Schwienhorst et al.,
2011; Aguilar-Saavedra et al., 2017]. For such measurements in the boosted regime, some
methods have been designed [Shelton, 2009; Krohn et al., 2010; Kitadono and Li, 2016;
Godbole et al., 2019] by making use of the energy or polar angular distribution of the decay
products. It is the first time in [Yu and Yuan, 2022a] to employ the cos 2ϕ correlation as a
boosted top polarimeter in the hadronic mode.
   Even though we only performed a LO calculation in the analysis, the cos 2ϕ correlation
arises from the W boson polarization, which is robust against perturbative QCD correc-
tion [Do et al., 2003] and parton showering. In reality, we need to take the latter into
account by defining an infrared safe observable. Note that the energies of W decay products
are not correlated with the azimuthal angle ϕ, and therefore Eq. (9.65) can directly translate
                                                     409


into energy distribution in the transverse plane of the W frame,
                                                              
                                   dE   Etot       1 ′
                                      =         1 + ⟨ξ ⟩ cos 2ϕ ,                     ϕ ∈ [0, 2π),                        (9.66)
                                   dϕ   2π         2
where E can also be taken as the transverse momentum in the W frame, which is equally
infrared safe, and we have extended ϕ to [0, 2π). From Eq. (9.64), we see that ⟨ξ ′ ⟩ is always
negative, so that more energy is deposited perpendicular to the t → bW plane, making the
top quark jet image tend to be round, as shown in Fig. 9.6(b).
      1.3 Azimuthal angular correlation (a)                  80 Azimuthal   Energy Deposit                     (b)         1
                SM prediction                                   104 pp → t t events
                                                             60                   II: 20.40 GeV
      1.2       hadronic top                                    t → bqq'
                                                             40
                                                                     III: 17.16 GeV                        I: 16.90 GeV
       1.1                                                   20                                                            10
                                                                                                                                −1
 P t (φ )
                                                         y    0
            1
                                 λt = 1                      −20
      0.9                        λt = 0.5
                                                             −40                                                           10−2
                                 λt = 0
      0.8                                                    −60
                                 λt = − 0.5
                            t    λ = −1                      −80                           IV: 20.36 GeV
      0.7
         0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1                   −80 −60 −40 −20 0 20 40 60 80
                                φ/ π                                                              x
Fig. 9.7: (a) Azimuthal angular correlation in the decay of boosted top quark for different
values of top longitudinal polarization λt . (b) The transverse momentum distribution of W
decay products in the azimuthal plane of W frame, viewed from the ẑw direction in Fig. 9.6.
The accumulated transverse momentum, averaged over 104 events, has been indicated in
each quadrant.
     The cos 2ϕ distribution leads to an asymmetry of azimuthal energy deposition between
the regions with cos 2ϕ > 0 and cos 2ϕ < 0, which divides the transverse plane into four
quadrants, as shown by the two dashed diagonal lines in Fig. 9.7(b). This consideration
motivates the following method to extract the coefficient ⟨ξ ′ ⟩ that is suitable in experimental
                                                       410


analysis:
  (1) construct the top jet and its four-momentum pµt ;
  (2) use jet substructure technique with b tagging to reconstruct the b subjet with its four-
       momentum pµb ;
  (3) determine the W ’s four-momentum pµw = pµt − pµb ;
  (4) construct the W frame coordinate system (x̂w -ŷw -ẑw ) as in Eq. (9.54) and Fig. 9.6; and
  (5) remove the particles in the b subjet and determine the energy distribution of the rest
       of top quark jet in the transverse plane (x̂w -ŷw ).
This method does not require identifying the quarks or subjets from W decay.                As a
demonstration, in Fig. 9.7(b) we show the transverse energy deposit distributed in the az-
imuthal plane of W frame, which is the average of 104 hadronically decayed top quarks with
                                                                                  √
pT ∈ (500, 600) GeV from the tt̄ pair production in proton-proton collision at      s = 13 TeV.
The decayed events are generated with MG5 aMC@NLO 2.6.7 [Alwall et al., 2014] at leading
order and passed to Pythia 8.307 [Bierlich et al., 2022] for parton showering, with full
initial and final state radiations. Since hadronization is not correlated with the azimuthal
distribution, it will not change the infrared-safely defined azimuthal asymmetry. A similar
argument also holds for the effect of underlying events that cancel in the asymmetry observ-
able. The anti-kT algorithm [Cacciari et al., 2008] implemented in FastJet 3.4.0 [Cacciari
et al., 2012; Cacciari and Salam, 2006] is used for the jet analysis, with a radius parameter
R = 1.0 for finding the top jets and R = 0.2 for reclustering the top jets and identifying
the b-tagged subjets. The energy deposits in the four quadrants are denoted as E1 , · · · , E4 ,
sequentially, which have been indicated in Fig. 9.7(b). Evidently, there are more energy
                                              411


deposits in the ŷw direction, perpendicular to the tbW plane, than the x̂w direction, which
is parallel to the tbW plane. Then we have
                                             (E1 + E3 ) − (E2 + E4 )
                                ⟨ξ ′ ⟩ = π ·                         .                    (9.67)
                                             (E1 + E3 ) + (E2 + E4 )
This gives ⟨ξ ′ ⟩ = −0.282 ± 0.032 in the simulated tt̄ events, which agrees well with analytic
calculation in Eq. (9.64) for top helicity λt = 0. The quoted uncertainty is only of statistical
origin, which is the dominant uncertainty in asymmetry observables [Aad et al., 2019b; CMS,
2021]. When using the same event selection criteria as in [Aad et al., 2023], which yields
17 261 boosted tt̄ events at the LHC Run-2 with 139 fb−1 integrated luminosity, we obtain
an uncertainty δ⟨ξ ′ ⟩ = 0.024. Hence, the azimuthal correlation can already be observed with
                                      √
the Run-2 data. Since δ⟨ξ ′ ⟩ ∝ 1/ Nevents , we can project an uncertainty of 0.016 for 300 fb−1
at the LHC Run-3 and 0.0052 for 3000 fb−1 at the High-Luminosity LHC [Apollinari et al.,
2017]. It is evident that the LHC data allow the precision measurement of such azimuthal
correlation, making the latter a good polarimeter for boosted top quarks.
9.4.6      Azimuthal correlation as a top quark jet tagger
The derivation of Eqs. (9.50) and (9.61) makes it clear that the cos 2ϕ azimuthal correlation
is not only relevant to boosted top quarks, but also to any boosted 1 → 3 decay systems as
long as they are mediated by virtual vector bosons, such as boosted QCD jets with a virtual
gluon, boosted b → sl+ l− decay through a virtual photon or Z boson, or b → cν̄l l− decay
via a virtual W . In more general cases with CP violation, there will also be an additional
sin 2ϕ correlation.
    A particular example is the three-pronged QCD jets, for which the azimuthal angular
                                                  412


correlation Pj (ϕ) = 1 + (⟨ξj ⟩/2) cos 2ϕ has been pointed out for the three-point energy
correlator [Chen et al., 2021]. This is relevant to boosted top quarks because QCD jets can
be a source of background of the latter and needs to be distinguished when studying the
hadronically decayed boosted top quarks. However, there are more diagrams contributing
to the three-point energy correlator of QCD jet that are not mediated by a virtual gluon.
Furthermore, for the diagrams that are mediated by a virtual gluon, the splittings of g ∗ → gg
and g ∗ → q q̄ are not distinguishable if no flavor tagging criterion is imposed, and their
contributions to the cos 2ϕ correlation have opposite signs to each other [Chen et al., 2021;
Hara and Sakai, 1989], as we have seen in the polarized gluon jet in Sec. 9.3.2. As a result, the
⟨ξj ⟩ is rather small. The analytic formula in the collinear limit is given by Eq. (3) of [Chen
et al., 2021]. For an active fermion number nf = 5, ⟨ξj ⟩ is −0.02 for quark jets and −0.012
for gluon jets.
     In fact, since the W only decays only to a fermion pair, Eq. (9.66) exactly resembles the
energy deposition anisotropy for quark-subjet-tagged gluon jets in Eq. (9.32). Without such
(sub)jet flavor tagging, the gluon jet fragmentation is dominated by the g → gg splitting,
which greatly washes out the azimuthal correlation.
     This observation motivates the use of the azimuthal correlation as a new top quark jet
tagger. While there have been many tagging algorithms proposed and applied to discriminate
boosted top quark events from QCD jets and they have reached rather high efficiencies [CMS,
2014; Plehn et al., 2010; Aaboud et al., 2019], the current top quark taggers mainly make use
of the top and W mass conditions and the three-subjet structure, and are highly dependent
on machine learning methods [Kasieczka et al., 2019b; Bhattacharya et al., 2022b; Aad
et al., 2023] trained on Pythia simulated top and QCD jet events. Since Pythia fails to
incorporate the spin correlations in the parton showering, it is very likely that the azimuthal
                                              413


correlation feature is absent in both the top and QCD jet events. It is then not clear whether
machine learning captured the right physics, and there is a mismatch when it is applied to
real data. Therefore, it is worthwhile to feed the additional azimuthal correlation information
to proper machine learning algorithms, either by improving the parton showering simulator,
or by directly examining on real data. We leave this study to future.
      Here, instead of constructing an event-by-event top tagger against QCD jets, we propose
a simpler “tagger” that acts on the whole ensemble of boosted top candidates to determine
the fraction of top quark events. In this ensemble, one can first measure the azimuthal
asymmetry coefficient ξ0 following the same strategy discussed above. This ξ0 is not the
same as the one for pure top quark events, as given in Eqs. (9.66) and (9.64), but is for a
mixture of top and QCD jet events. Then, if the top quark events account for a fraction δt
of the whole ensemble, we should have
                                    ξ0 = δt ⟨ξ ′ ⟩ + (1 − δt ) ⟨ξj ⟩,                     (9.68)
from which we can determine
                                                  ξ0 − ⟨ξj ⟩
                                         δt =                   ,                         (9.69)
                                                 ⟨ξ ′ ⟩ − ⟨ξj ⟩
where ⟨ξj ⟩ is obtained by averaging over the light quark and gluon jet contributions and
only depends on their relative fraction in the boosted QCD jet events. As an example, for
single top quarks produced via s-channel SM-like heavy resonance W ′ with a mass > 1 TeV,
⟨ξ ′ ⟩ ∼ −0.58, while the magnitude of ⟨ξj ⟩ ≲ 0.01. As long as the top quark yield is not more
than an order of magnitude smaller than the QCD jet background rate, δt can be precisely
determined from the measurement of ξ0 to constrain the parameter space of this new physics
model, such as the W ′ -t-b coupling strength.
                                                   414


9.4.7      Conclusion
In this section, we examined the linear W polarization in a boosted top quark decay and
showed that it leads to a nontrivial cos 2ϕ azimuthal correlation between the t → bW and
W → f f¯′ decay planes, which induces a new substructure observable in the boosted top
quark jet. In the hadronic decay mode, this correlation translates into an energy deposition
asymmetry in the azimuthal plane. Interestingly, such linear polarization is not present in the
top rest frame but only emerges under the boost as a result of mixing with other polarization
parameters. This property is due to the massiveness of the W boson, differing from the linear
polarization of gluons in Sec. 9.3. As phenomenological applications, we demonstrated that
such correlation can be used to either measure the longitudinal polarization of a boosted top
quark for testing the SM and probing new physics or distinguish a boosted top quark from
the QCD jet background.
                                            415


Chapter 10
Summary and Outlook
Spin property is a long-studied subject throughout the history of particle physics but re-
mains relatively poorly explored in the context of high-energy unpolarized colliders such as
the LHC. Following the early works, we re-emphasized the importance of transverse polariza-
tion phenomena at the LHC, which correspond to the quantum interference effects between
different helicity states, entail information about the hard scattering that is not probed by
the unpolarized production rate, and can bring out a wealth of new physical observables.
In particular, in the boosted regime, a heavy unstable particle produced with a transverse
polarization can lead to a jet of decay products with a new substructure characterized by
certain azimuthal correlations.
    We have discussed two kinds of single transverse polarization productions. The first kind
is to have the polarized particle directly produced from the hard scattering with a hard
transverse momentum. This applies to both spin-half quarks and spin-one massless gluons
and massive W and Z bosons. For the quark case, chiral symmetry requires a nonzero quark
mass, which strongly suppresses the degree of the transverse spin, except for heavy quarks
like the top quark. No such suppression exists for the linear polarization of vector bosons,
and one generally expects a large degree of linear polarization. At the hadron colliders,
however, the lab frame generally differs from the c.m. frame of the hard scattering by a
longitudinal boost, under which transverse polarizations of massive particles mix with other
                                               416


polarization components but those of massless particles remain invariant. As a result, it is
better to measure the polarization in the partonic c.m. frame. The second kind is for linearly
polarized vector bosons that are not directly produced from the hard scattering, but appear
from the decay of a boosted heavy object. One example is the linear gluon polarization in a
parton showering. The other example, as we discussed in detail, is for the boosted top quark
that decays into a collimated pair of bottom quark and W boson. The linearly polarized W
leads to a nontrivial azimuthal correlation, which is in turn reflected as a new boosted top
quark jet substructure.
     The main idea of linear polarization is the resultant azimuthal correlation caused by
helicity interference effects. Therefore, the subject of this thesis can be readily extended
to much broader physical contexts. A direct application is to use azimuthal correlations
to determine the spin of the mother particle. Further, in the context with new physics
extensions of the SM, possible fermionic tensor interaction can lead to different fermion he-
licity structures from the SM interactions, which can interfere and generate nonzero fermion
transverse spin. Similarly, new physics operators can also generate linearly polarized vector
bosons. In particular, with possible CP -violating new physics interactions, new correlation
functions can appear due to CP -violating transverse polarization components, such as the
sin 2ϕ components in the polarized gluon jet. The measurements of transverse polarizations
thus provide new opportunities to probe possible new physics.
                                             417


                                    BIBLIOGRAPHY
Boosted Top Jet Tagging at CMS. Technical report, CERN, Geneva, 2014. URL http:
  //cds.cern.ch/record/1647419.
Studies of b-tagging performance and√ jet substructure in a high pT g → bb̄ rich sample of
  large-R jets from pp collisions at s = 8 TeV with the ATLAS detector. Technical report,
  CERN, Geneva, 2016.
Identification of double-b quark jets in boosted event topologies. Technical report, CERN,
  Geneva, 2016. URL http://cds.cern.ch/record/2195743.
Measurement of the polarisation of single top quarks √ and antiquarks produced in the t-
  channel collected with the ATLAS detector at s = 13 TeV and bounds on the tW b
  dipole operator. Technical report, CERN, Geneva, Jun 2021. URL http://cds.cern.
  ch/record/2773738.
Measurement of the Drell-Yan forward-backward asymmetry at high dilepton masses. Tech-
  nical report, CERN, Geneva, 2021. URL http://cds.cern.ch/record/2783928.
Measuring the b-jet identification
                                √     efficiency for high pT jets using multijet events in
  proton−proton collisions at s = 13 TeV recorded with the ATLAS detector. Technical
  report, CERN, Geneva, 2022a. URL https://cds.cern.ch/record/2804062.
Graph Neural Network Jet Flavour Tagging with the ATLAS Detector. Technical report,
  CERN, Geneva, 2022b. URL https://cds.cern.ch/record/2811135.
Probing the CP nature of the top-Higgs Yukawa coupling in tt̄H and tH events with H → bb̄
  decays using the ATLAS detector at the LHC. 3 2023.
M. Aaboud et al. Search for the Decay of the Higgs Boson to Charm Quarks with the
  ATLAS Experiment. Phys. Rev. Lett., 120(21):211802, 2018a. doi: 10.1103/PhysRevLett.
  120.211802.
Morad Aaboud et al.  √ Measurements of b-jet tagging efficiency with the ATLAS detector
  using tt events at s = 13 TeV. JHEP, 08:089, 2018b. doi: 10.1007/JHEP08(2018)089.
Morad Aaboud et al. Performance of top-quark and W -boson tagging with ATLAS in Run
  2 of the LHC. Eur. Phys. J. C, 79(5):375, 2019. doi: 10.1140/epjc/s10052-019-6847-8.
Georges Aad et al. Observation of a new particle in the search for the Standard Model
  Higgs boson with the ATLAS detector at the LHC. Phys. Lett. B, 716:1–29, 2012. doi:
  10.1016/j.physletb.2012.08.020.
Georges Aad et al. Measurement  √ of Top Quark Polarization in Top-Antitop Events from
  Proton-Proton Collisions at s = 7 TeV Using the ATLAS Detector. Phys. Rev. Lett.,
  111(23):232002, 2013a. doi: 10.1103/PhysRevLett.111.232002.
                                              418


Georges Aad et al. Search for tt̄ resonances
                                      √         in the lepton plus jets final state with ATLAS
                −1
   using 4.7 fb of pp collisions at s = 7 TeV. Phys. Rev. D, 88(1):012004, 2013b. doi:
   10.1103/PhysRevD.88.012004.
Georges Aad et al. ATLAS b-jet √      identification performance and efficiency measurement
   with tt̄ events in pp collisions at s = 13 TeV. Eur. Phys. J. C, 79(11):970, 2019a. doi:
   10.1140/epjc/s10052-019-7450-8.
Georges Aad et al. Measurement of the cross-section
                                            √             and charge asymmetry of W bosons
   produced in proton–proton collisions at s = 8 TeV with the ATLAS detector. Eur. Phys.
   J. C, 79(9):760, 2019b. doi: 10.1140/epjc/s10052-019-7199-0.
Georges Aad et al. Identification of boosted Higgs bosons decaying into b-quark pairs with
   the ATLAS detector at 13 TeV. Eur. Phys. J. C, 79(10):836, 2019c. doi: 10.1140/epjc/
   s10052-019-7335-x.
Georges Aad et al. CP Properties of Higgs Boson Interactions with Top Quarks in the tt̄H
   and tH Processes Using H → γγ with the ATLAS Detector. Phys. Rev. Lett., 125(6):
   061802, 2020. doi: 10.1103/PhysRevLett.125.061802.
Georges Aad et al.√Measurement of the c-jet mistagging efficiency in tt̄ events using pp
   collision data at s = 13 TeV collected with the ATLAS detector. Eur. Phys. J. C, 82
   (1):95, 2022. doi: 10.1140/epjc/s10052-021-09843-w.
Georges Aad et al. Differential tt cross-section measurements using boosted top quarks in
   the all-hadronic final state with 139 fb−1 of ATLAS data. JHEP, 04:080, 2023. doi:
   10.1007/JHEP04(2023)080.
A. Abdesselam et al. Boosted Objects: A Probe of Beyond the Standard Model Physics.
   Eur. Phys. J. C, 71:1661, 2011. doi: 10.1140/epjc/s10052-011-1661-y.
R. Abdul Khalek et al. Science Requirements and Detector Concepts for the Electron-Ion
   Collider: EIC Yellow Report. 3 2021.
A. Accardi et al. Electron Ion Collider: The Next QCD Frontier: Understanding the glue
   that binds us all. Eur. Phys. J. A, 52(9):268, 2016. doi: 10.1140/epja/i2016-16268-9.
B. Adams et al. Letter of Intent: A New QCD facility at the M2 beam line of the CERN
   SPS (COMPASS++/AMBER). 8 2018.
J. A. Aguilar-Saavedra, J. Boudreau, C. Escobar, and J. Mueller. The fully differential top
   decay distribution. Eur. Phys. J. C, 77(3):200, 2017. doi: 10.1140/epjc/s10052-017-4761-5.
H. Al Ghoul et al. Measurement of the beam asymmetry Σ for π 0 and η photoproduction on
   the proton at Eγ = 9 GeV. Phys. Rev. C, 95(4):042201, 2017. doi: 10.1103/PhysRevC.
   95.042201.
                                               419


Constantia Alexandrou, Krzysztof Cichy, Martha Constantinou, Kyriakos Hadjiyiannakou,
   Karl Jansen, Aurora Scapellato, and Fernanda Steffens. Unpolarized and helicity gener-
   alized parton distributions of the proton within lattice QCD. Phys. Rev. Lett., 125(26):
   262001, 2020. doi: 10.1103/PhysRevLett.125.262001.
Leandro G. Almeida, Seung J. Lee, Gilad Perez, George F. Sterman, Ilmo Sung, and Joseph
   Virzi. Substructure of high-pT Jets at the LHC. Phys. Rev. D, 79:074017, 2009a. doi:
   10.1103/PhysRevD.79.074017.
Leandro G. Almeida, Seung J. Lee, Gilad Perez, Ilmo Sung, and Joseph Virzi. Top Jets at
   the LHC. Phys. Rev. D, 79:074012, 2009b. doi: 10.1103/PhysRevD.79.074012.
Guido Altarelli and G. Parisi. Asymptotic Freedom in Parton Language. Nucl. Phys. B,
   126:298–318, 1977. doi: 10.1016/0550-3213(77)90384-4.
J. Alwall, R. Frederix, S. Frixione, V. Hirschi, F. Maltoni, O. Mattelaer, H. S. Shao,
   T. Stelzer, P. Torrielli, and M. Zaro. The automated computation of tree-level and next-to-
   leading order differential cross sections, and their matching to parton shower simulations.
   JHEP, 07:079, 2014. doi: 10.1007/JHEP07(2014)079.
S. Amor Dos Santos et al. Probing the CP nature of the Higgs coupling in tt̄h events at the
   LHC. Phys. Rev. D, 96(1):013004, 2017. doi: 10.1103/PhysRevD.96.013004.
Kazuya Aoki et al. Extension of the J-PARC Hadron Experimental Facility: Third White
   Paper. 10 2021.
G. Apollinari, O. Brüning, T. Nakamoto, and Lucio Rossi. Chapter 1: High Luminosity
   Large Hadron Collider HL-LHC. High Luminosity Large Hadron Collider HL-LHC. CERN
   Yellow Report, pages 1–19. 21 p, May 2017. doi: 10.5170/CERN-2015-005.1. URL https:
   //cds.cern.ch/record/2120673. 21 pages, chapter in High-Luminosity Large Hadron
   Collider (HL-LHC) : Preliminary Design Report.
D. Azevedo, A. Onofre, F. Filthaut, and R. Gonçalo. CP tests of Higgs couplings in
   tt̄h semileptonic events at the LHC. Phys. Rev. D, 98(3):033004, 2018. doi: 10.1103/
   PhysRevD.98.033004.
Alessandro Bacchetta, Markus Diehl, Klaus Goeke, Andreas Metz, Piet J. Mulders, and
   Marc Schlegel. Semi-inclusive deep inelastic scattering at small transverse momentum.
   JHEP, 02:093, 2007. doi: 10.1088/1126-6708/2007/02/093.
Henning Bahl and Simon Brass. Constraining CP-violation in the Higgs-top-quark in-
   teraction using machine-learning-based inference. JHEP, 03:017, 2022. doi: 10.1007/
   JHEP03(2022)017.
Henning Bahl, Philip Bechtle, Sven Heinemeyer, Judith Katzy, Tobias Klingl, Krisztian
   Peters, Matthias Saimpert, Tim Stefaniak, and Georg Weiglein. Indirect CP probes of the
   Higgs-top-quark interaction: current LHC constraints and future opportunities. JHEP,
   11:127, 2020. doi: 10.1007/JHEP11(2020)127.
                                                420


Andrea Banfi, Gavin P. Salam, and Giulia Zanderighi. Infrared safe definition of jet flavor.
   Eur. Phys. J. C, 47:113–124, 2006. doi: 10.1140/epjc/s2006-02552-4.
Rahool Kumar Barman, Dorival Gonçalves, and Felix Kling. Machine learning the Higgs
   boson-top quark CP phase. Phys. Rev. D, 105(3):035023, 2022. doi: 10.1103/PhysRevD.
   105.035023.
Carola F. Berger, Tibor Kucs, and George F. Sterman. Event shape / energy flow correla-
   tions. Phys. Rev. D, 68:014012, 2003. doi: 10.1103/PhysRevD.68.014012.
Edgar R. Berger, M. Diehl, and B. Pire. Probing generalized parton distributions in pi N
   —> l+ l- N. Phys. Lett. B, 523:265–272, 2001. doi: 10.1016/S0370-2693(01)01345-4.
Edgar R. Berger, M. Diehl, and B. Pire. Time - like Compton scattering: Exclusive photopro-
   duction of lepton pairs. Eur. Phys. J. C, 23:675–689, 2002a. doi: 10.1007/s100520200917.
Edmond L. Berger and Daniel L. Jones. Inelastic Photoproduction of J/psi and Upsilon by
   Gluons. Phys. Rev. D, 23:1521–1530, 1981. doi: 10.1103/PhysRevD.23.1521.
Edmond L. Berger, Jian-Wei Qiu, and Xiao-fei Zhang. QCD factorized Drell-Yan cross-
   section at large transverse momentum. Phys. Rev. D, 65:034006, 2002b. doi: 10.1103/
   PhysRevD.65.034006.
Edmond L. Berger, Qing-Hong Cao, Chuan-Ren Chen, and Hao Zhang. Top Quark Polar-
   ization As A Probe of Models with Extra Gauge Bosons. Phys. Rev. D, 83:114026, 2011.
   doi: 10.1103/PhysRevD.83.114026.
J. C. Bernauer et al. High-precision determination of the electric and magnetic form factors
   of the proton. Phys. Rev. Lett., 105:242001, 2010. doi: 10.1103/PhysRevLett.105.242001.
J. C. Bernauer et al. Electric and magnetic form factors of the proton. Phys. Rev. C, 90(1):
   015206, 2014. doi: 10.1103/PhysRevC.90.015206.
Florian U. Bernlochner, Christoph Englert, Chris Hays, Kristin Lohwasser, Hannes Mildner,
   Andrew Pilkington, Darren D. Price, and Michael Spannowsky. Angles on CP-violation in
   Higgs boson interactions. Phys. Lett. B, 790:372–379, 2019. doi: 10.1016/j.physletb.2019.
   01.043.
Werner Bernreuther, Dennis Heisler, and Zong-Guo Si. A set of top quark spin correlation
   and polarization observables for the LHC: Standard Model predictions and new physics
   contributions. JHEP, 12:026, 2015. doi: 10.1007/JHEP12(2015)026.
V. Bertone, H. Dutrieux, C. Mezrag, H. Moutarde, and P. Sznajder. Deconvolution problem
   of deeply virtual Compton scattering. Phys. Rev. D, 103(11):114019, 2021. doi: 10.1103/
   PhysRevD.103.114019.
Shohini Bhattacharya, Krzysztof Cichy, Martha Constantinou, Jack Dodson, Xiang Gao,
   Andreas Metz, Swagato Mukherjee, Aurora Scapellato, Fernanda Steffens, and Yong Zhao.
   Generalized parton distributions from lattice QCD with asymmetric momentum transfer:
                                              421


  Unpolarized quarks. Phys. Rev. D, 106(11):114512, 2022a. doi: 10.1103/PhysRevD.106.
  114512.
Soham Bhattacharya, Monoranjan Guchait, and Aravind H. Vijay. Boosted top quark tag-
  ging and polarization measurement using machine learning. Phys. Rev. D, 105(4):042005,
  2022b. doi: 10.1103/PhysRevD.105.042005.
Biplob Bhattacherjee, Satyanarayan Mukhopadhyay, Mihoko M. Nojiri, Yasuhito Sakaki,
  and Bryan R. Webber. Associated jet and subjet rates in light-quark and gluon jet dis-
  crimination. JHEP, 04:131, 2015. doi: 10.1007/JHEP04(2015)131.
Christian Bierlich et al. A comprehensive guide to the physics and usage of PYTHIA 8.3. 3
  2022.
Geoffrey T. Bodwin, Eric Braaten, and G. Peter Lepage. Rigorous QCD analysis of inclusive
  annihilation and production of heavy quarkonium. Phys. Rev. D, 51:1125–1171, 1995. doi:
  10.1103/PhysRevD.55.5853. [Erratum: Phys.Rev.D 55, 5853 (1997)].
Daniel Boer, Stanley J. Brodsky, Piet J. Mulders, and Cristian Pisano. Direct Probes of
  Linearly Polarized Gluons inside Unpolarized Hadrons. Phys. Rev. Lett., 106:132001, 2011.
  doi: 10.1103/PhysRevLett.106.132001.
Daniel Boer, Wilco J. den Dunnen, Cristian Pisano, Marc Schlegel, and Werner Vogelsang.
  Linearly Polarized Gluons and the Higgs Transverse Momentum Distribution. Phys. Rev.
  Lett., 108:032002, 2012. doi: 10.1103/PhysRevLett.108.032002.
Blaž Bortolato, Jernej F. Kamenik, Nejc Košnik, and Aleks Smolkovič. Optimized probes of
  CP -odd effects in the tt̄h process at hadron colliders. Nucl. Phys. B, 964:115328, 2021.
  doi: 10.1016/j.nuclphysb.2021.115328.
James Botts and George F. Sterman. Hard Elastic Scattering in QCD: Leading Behavior.
  Nucl. Phys. B, 325:62–100, 1989. doi: 10.1016/0550-3213(89)90372-6.
Fawzi Boudjema, Rohini M. Godbole, Diego Guadagnoli, and Kirtimaan A. Mohan. Lab-
  frame observables for probing the top-Higgs interaction. Phys. Rev. D, 92(1):015019, 2015.
  doi: 10.1103/PhysRevD.92.015019.
R. Boussarie, B. Pire, L. Szymanowski, and S. Wallon. Exclusive photoproduction of a γ ρ
  pair with a large invariant mass. JHEP, 02:054, 2017. doi: 10.1007/JHEP02(2017)054.
  [Erratum: JHEP 10, 029 (2018)].
Arnd Brandenburg, Z. G. Si, and P. Uwer. QCD corrected spin analyzing power of jets
  in decays of polarized top quarks. Phys. Lett. B, 539:235–241, 2002. doi: 10.1016/
  S0370-2693(02)02098-1.
Samuel Bright-Thonney, Ian Moult, Benjamin Nachman, and Stefan Prestel. Systematic
  Quark/Gluon Identification with Ratios of Likelihoods. 7 2022.
                                            422


Joachim Brod, Ulrich Haisch, and Jure Zupan. Constraints on CP-violating Higgs couplings
  to the third generation. JHEP, 11:180, 2013. doi: 10.1007/JHEP11(2013)180.
Stanley J. Brodsky and G. Peter Lepage. Large Angle Two Photon Exclusive Channels in
  Quantum Chromodynamics. Phys. Rev. D, 24:1808, 1981. doi: 10.1103/PhysRevD.24.
  1808.
Stanley J. Brodsky and G. Peter Lepage. Exclusive Processes in Quantum Chromodynamics.
  Adv. Ser. Direct. High Energy Phys., 5:93–240, 1989. doi: 10.1142/9789814503266 0002.
Stanley J. Brodsky, Thomas A. DeGrand, and R. Schwitters. ARE GLUON JETS OBLATE?
  Phys. Lett. B, 79:255–258, 1978. doi: 10.1016/0370-2693(78)90236-8.
Stanley J. Brodsky, L. Frankfurt, J. F. Gunion, Alfred H. Mueller, and M. Strikman. Diffrac-
  tive leptoproduction of vector mesons in QCD. Phys. Rev. D, 50:3134–3144, 1994. doi:
  10.1103/PhysRevD.50.3134.
Matthew R. Buckley and Dorival Goncalves. Boosting the Direct CP Measurement of the
  Higgs-Top Coupling. Phys. Rev. Lett., 116(9):091801, 2016. doi: 10.1103/PhysRevLett.
  116.091801.
G. Bunce et al. Lambda0 Hyperon Polarization in Inclusive Production by 300-GeV Protons
  on Beryllium. Phys. Rev. Lett., 36:1113–1116, 1976. doi: 10.1103/PhysRevLett.36.1113.
Matthias Burkardt. Impact parameter dependent parton distributions and off forward parton
  distributions for zeta —> 0. Phys. Rev. D, 62:071503, 2000. doi: 10.1103/PhysRevD.62.
  071503. [Erratum: Phys.Rev.D 66, 119903 (2002)].
Matthias Burkardt. Impact parameter space interpretation for generalized parton distribu-
  tions. Int. J. Mod. Phys. A, 18:173–208, 2003. doi: 10.1142/S0217751X03012370.
Matteo Cacciari and Gavin P. Salam. Dispelling the N 3 myth for the kt jet-finder. Phys.
  Lett. B, 641:57–61, 2006. doi: 10.1016/j.physletb.2006.08.037.
Matteo Cacciari, Gavin P. Salam, and Gregory Soyez. The anti-kt jet clustering algorithm.
  JHEP, 04:063, 2008. doi: 10.1088/1126-6708/2008/04/063.
Matteo Cacciari, Gavin P. Salam, and Gregory Soyez. FastJet User Manual. Eur. Phys. J.
  C, 72:1896, 2012. doi: 10.1140/epjc/s10052-012-1896-2.
Qing-Hong Cao, Ke-Pan Xie, Hao Zhang, and Rui Zhang. A New Observable for Measuring
  CP Property of Top-Higgs Interaction. Chin. Phys. C, 45(2):023117, 2021. doi: 10.1088/
  1674-1137/abcfac.
John L. Cardy and G. A. Winbow. The Absence of Final State Interaction Corrections to
  the Drell-Yan Formula for Massive Lepton Pair Production. Phys. Lett. B, 52:95–96, 1974.
  doi: 10.1016/0370-2693(74)90729-1.
                                             423


S. Catani and M. Grazzini. Higgs Boson Production at Hadron Colliders: Hard-Collinear
   Coefficients at the NNLO. Eur. Phys. J. C, 72:2013, 2012. doi: 10.1140/epjc/
   s10052-012-2013-2. [Erratum: Eur.Phys.J.C 72, 2132 (2012)].
Chao-Hsi Chang. Hadronic Production of J/ψ Associated With a Gluon. Nucl. Phys. B,
   172:425–434, 1980. doi: 10.1016/0550-3213(80)90175-3.
P. Chatagnon et al. First Measurement of Timelike Compton Scattering. Phys. Rev. Lett.,
   127(26):262501, 2021. doi: 10.1103/PhysRevLett.127.262501.
Serguei Chatrchyan
                 √ et al. Search for Resonant tt̄ Production in Lepton+Jets Events in pp
   Collisions at s = 7 TeV. JHEP, 12:015, 2012a. doi: 10.1007/JHEP12(2012)015.
Serguei Chatrchyan et al. Observation of a New Boson at a Mass of 125 GeV with the CMS
   Experiment at the LHC. Phys. Lett. B, 716:30–61, 2012b. doi: 10.1016/j.physletb.2012.
   08.021.
Hao Chen, Ian Moult, and Hua Xing Zhu. Quantum Interference in Jet Substructure from
   Spinning Gluons. Phys. Rev. Lett., 126(11):112003, 2021. doi: 10.1103/PhysRevLett.126.
   112003.
Hao Chen, Ian Moult, and Hua Xing Zhu. Spinning gluons from the QCD light-ray OPE.
   JHEP, 08:233, 2022. doi: 10.1007/JHEP08(2022)233.
Jiunn-Wei Chen, Huey-Wen Lin, and Jian-Hui Zhang. Pion generalized parton distribution
   from lattice QCD. Nucl. Phys. B, 952:114940, 2020. doi: 10.1016/j.nuclphysb.2020.114940.
K. G. Chetyrkin, Johann H. Kuhn, and M. Steinhauser. RunDec: A Mathematica package
   for running and decoupling of the strong coupling and quark masses. Comput. Phys.
   Commun., 133:43–65, 2000. doi: 10.1016/S0010-4655(00)00155-7.
Xabier Cid Vidal et al. Report from Working Group 3: Beyond the Standard Model physics
   at the HL-LHC and HE-LHC. CERN Yellow Rep. Monogr., 7:585–865, 2019. doi: 10.
   23731/CYRM-2019-007.585.
S. Coleman and R. E. Norton. Singularities in the physical region. Nuovo Cim., 38:438–442,
   1965. doi: 10.1007/BF02750472.
J. C. Collins and T. C. Rogers. The Gluon Distribution Function and Factorization in
   Feynman Gauge. Phys. Rev. D, 78:054012, 2008. doi: 10.1103/PhysRevD.78.054012.
John Collins. Foundations of perturbative QCD, volume 32. Cambridge University Press, 11
   2013. ISBN 978-1-107-64525-7, 978-1-107-64525-7, 978-0-521-85533-4, 978-1-139-09782-6.
John Collins. A new and complete proof of the Landau condition for pinch singularities of
   Feynman graphs and other integrals. 7 2020.
John C. Collins. Fragmentation of transversely polarized quarks probed in transverse momen-
   tum distributions. Nucl. Phys. B, 396:161–182, 1993. doi: 10.1016/0550-3213(93)90262-N.
                                             424


John C. Collins. Proof of factorization for diffractive hard scattering. Phys. Rev. D, 57:
  3051–3056, 1998. doi: 10.1103/PhysRevD.61.019902. [Erratum: Phys.Rev.D 61, 019902
  (2000)].
John C. Collins and Andreas Freund. Proof of factorization for deeply virtual Compton
  scattering in QCD. Phys. Rev. D, 59:074009, 1999. doi: 10.1103/PhysRevD.59.074009.
John C. Collins and Andreas Metz. Universality of soft and collinear factors in hard-
  scattering factorization. Phys. Rev. Lett., 93:252001, 2004. doi: 10.1103/PhysRevLett.
  93.252001.
John C. Collins and George F. Sterman. Soft Partons in QCD. Nucl. Phys. B, 185:172–188,
  1981. doi: 10.1016/0550-3213(81)90370-9.
John C. Collins, Davison E. Soper, and George F. Sterman. Transverse Momentum Distri-
  bution in Drell-Yan Pair and W and Z Boson Production. Nucl. Phys. B, 250:199–224,
  1985. doi: 10.1016/0550-3213(85)90479-1.
John C. Collins, Davison E. Soper, and George F. Sterman. Factorization of Hard Processes
  in QCD. Adv. Ser. Direct. High Energy Phys., 5:1–91, 1989. doi: 10.1142/9789814503266
  0001.
John C. Collins, Leonid Frankfurt, and Mark Strikman. Diffractive hard scattering with a
  coherent pomeron. Phys. Lett. B, 307:161–168, 1993. doi: 10.1016/0370-2693(93)90206-W.
John C. Collins, Steve F. Heppelmann, and Glenn A. Ladinsky. Measuring transversity
  densities in singly polarized hadron hadron and lepton - hadron collisions. Nucl. Phys. B,
  420:565–582, 1994. doi: 10.1016/0550-3213(94)90078-7.
John C. Collins, Leonid Frankfurt, and Mark Strikman. Factorization for hard exclusive
  electroproduction of mesons in QCD. Phys. Rev. D, 56:2982–3006, 1997. doi: 10.1103/
  PhysRevD.56.2982.
Martha Constantinou et al. Parton distributions and lattice-QCD calculations: Toward 3D
  structure. Prog. Part. Nucl. Phys., 121:103908, 2021. doi: 10.1016/j.ppnp.2021.103908.
S. Dawson. Radiative corrections to Higgs boson production. Nucl. Phys. B, 359:283–300,
  1991. doi: 10.1016/0550-3213(91)90061-2.
Thomas A. DeGrand and B. Petersson. WHERE CAN ONE SEE POLARIZED GLUON
  FRAGMENTATION? Phys. Rev. D, 21:3129, 1980. doi: 10.1103/PhysRevD.21.3129.
Carleton E. DeTar, S. D. Ellis, and P. V. Landshoff. Final State Interactions in Large
  Transverse Momentum Lepton and Hadron Production. Nucl. Phys. B, 87:176–188, 1975.
  doi: 10.1016/0550-3213(75)90260-6.
A. Devoto and W. W. Repko. GLUON POLARIZATION IN QUARK - GLUON ELASTIC
  SCATTERING. Phys. Rev. D, 25:904, 1982. doi: 10.1103/PhysRevD.25.904.
                                            425


A. Devoto, J. Pumplin, W. Repko, and Gordon L. Kane. Polarization of High Transverse
  Momentum Single Photons as a Test of QCD. Phys. Rev. Lett., 43:1062, 1979. doi:
  10.1103/PhysRevLett.43.1062. [Erratum: Phys.Rev.Lett. 43, 1540 (1979)].
A. Devoto, J. Pumplin, W. W. Repko, and Gordon L. Kane. Polarization of Gluon Jets in
  Photon - Photon Scattering. Phys. Lett. B, 90:436–438, 1980. doi: 10.1016/0370-2693(80)
  90968-5.
Welathantri G. D. Dharmaratna and Gary R. Goldstein. Gluon Fusion as a Source for
  Massive Quark Polarization. Phys. Rev. D, 41:1731, 1990. doi: 10.1103/PhysRevD.41.
  1731.
M. Diehl. Generalized parton distributions. Phys. Rept., 388:41–277, 2003. doi: 10.1016/j.
  physrep.2003.08.002.
M. Diehl, Th. Feldmann, R. Jakob, and P. Kroll. Generalized parton distributions from nu-
  cleon form-factor data. Eur. Phys. J. C, 39:1–39, 2005. doi: 10.1140/epjc/s2004-02063-4.
Markus Diehl. Introduction to GPDs and TMDs. Eur. Phys. J. A, 52(6):149, 2016. doi:
  10.1140/epja/i2016-16149-3.
Markus Diehl and Thierry Gousset. Time ordering in off diagonal parton distributions. Phys.
  Lett. B, 428:359–370, 1998. doi: 10.1016/S0370-2693(98)00439-0.
A. Djouadi, M. Spira, and P. M. Zerwas. Production of Higgs bosons in proton colliders:
  QCD corrections. Phys. Lett. B, 264:440–446, 1991. doi: 10.1016/0370-2693(91)90375-Z.
H. S. Do, S. Groote, J. G. Korner, and M. C. Mauser. Electroweak and finite width correc-
  tions to top quark decays into transverse and longitudinal W bosons. Phys. Rev. D, 67:
  091501, 2003. doi: 10.1103/PhysRevD.67.091501.
Yuri L. Dokshitzer. Calculation of the Structure Functions for Deep Inelastic Scattering and
  e+ e- Annihilation by Perturbation Theory in Quantum Chromodynamics. Sov. Phys.
  JETP, 46:641–653, 1977.
Matthew J. Dolan, Philip Harris, Martin Jankowiak, and Michael Spannowsky. Constraining
  CP -violating Higgs Sectors at the LHC using gluon fusion. Phys. Rev. D, 90:073008, 2014.
  doi: 10.1103/PhysRevD.90.073008.
G. Duplančić, K. Passek-Kumerički, B. Pire, L. Szymanowski, and S. Wallon. Probing axial
  quark generalized parton distributions through exclusive photoproduction of a γ π ± pair
  with a large invariant mass. JHEP, 11:179, 2018. doi: 10.1007/JHEP11(2018)179.
Goran Duplančić, Saad Nabeebaccus, Kornelija Passek-Kumerički, Bernard Pire, Lech Szy-
  manowski, and Samuel Wallon. Accessing chiral-even quark generalised parton distribu-
  tions in the exclusive photoproduction of a γπ pair with large invariant mass in both fixed-
  target and collider experiments. JHEP, 03:241, 2023a. doi: 10.1007/JHEP03(2023)241.
                                             426


Goran Duplančić, Saad Nabeebaccus, Kornelija Passek-Kumerički, Bernard Pire, Lech Szy-
  manowski, and Samuel Wallon. Probing chiral-even and chiral-odd leading twist quark
  generalized parton distributions through the exclusive photoproduction of a γρ pair. Phys.
  Rev. D, 107(9):094023, 2023b. doi: 10.1103/PhysRevD.107.094023.
M. B. Einhorn and S. D. Ellis. Hadronic Production of the New Resonances: Probing Gluon
  Distributions. Phys. Rev. D, 12:2007, 1975. doi: 10.1103/PhysRevD.12.2007.
M. El Beiyad, B. Pire, M. Segond, L. Szymanowski, and S. Wallon. Photoproduction of a
  pi rhoT pair with a large invariant mass and transversity generalized parton distribution.
  Phys. Lett. B, 688:154–167, 2010. doi: 10.1016/j.physletb.2010.02.086.
John Ellis, Dae Sung Hwang, Kazuki Sakurai, and Michihisa Takeuchi. Disentangling
  Higgs-Top Couplings in Associated Production. JHEP, 04:004, 2014. doi: 10.1007/
  JHEP04(2014)004.
Stephen D. Ellis, Christopher K. Vermilion, Jonathan R. Walsh, Andrew Hornig, and
  Christopher Lee. Jet Shapes and Jet Algorithms in SCET. JHEP, 11:101, 2010. doi:
  10.1007/JHEP11(2010)101.
Christoph Englert, Dorival Goncalves-Netto, Kentarou Mawatari, and Tilman Plehn. Higgs
  Quantum Numbers in Weak Boson Fusion. JHEP, 01:148, 2013. doi: 10.1007/
  JHEP01(2013)148.
Christoph Englert, Peter Galler, Andrew Pilkington, and Michael Spannowsky. Approaching
  robust EFT limits for CP-violation in the Higgs sector. Phys. Rev. D, 99(9):095007, 2019.
  doi: 10.1103/PhysRevD.99.095007.
Darius A. Faroughy, Jernej F. Kamenik, Nejc Košnik, and Aleks Smolkovič. Probing the
  CP nature of the top quark Yukawa at hadron colliders. JHEP, 02:085, 2020. doi: 10.
  1007/JHEP02(2020)085.
Danilo Ferreira de Lima, Petar Petrov, Davison Soper, and Michael Spannowsky. Quark-
  Gluon tagging with Shower Deconstruction: Unearthing dark matter and Higgs couplings.
  Phys. Rev. D, 95(3):034001, 2017. doi: 10.1103/PhysRevD.95.034001.
R. P. Feynman.         Photon-hadron interactions.        Addison-Wesley, 1972.        URL
  http://www-library.desy.de/cgi-bin/spiface/find/hep/www?key=6634834&
  FORMAT=WWWBRIEFBIBTEX. Reading 1972, 282p.
Richard P. Feynman. Very high-energy collisions of hadrons. Phys. Rev. Lett., 23:1415–1417,
  1969. doi: 10.1103/PhysRevLett.23.1415.
Leonid Frankfurt, Werner Koepf, and Mark Strikman. Hard diffractive electroproduction of
  vector mesons in QCD. Phys. Rev. D, 54:3194–3215, 1996. doi: 10.1103/PhysRevD.54.
  3194.
                                            427


Christopher Frye, Andrew J. Larkoski, Jesse Thaler, and Kevin Zhou. Casimir Meets Poisson:
  Improved Quark/Gluon Discrimination with Counting Observables. JHEP, 09:083, 2017.
  doi: 10.1007/JHEP09(2017)083.
Jason Gallicchio and Matthew D. Schwartz. Quark and Gluon Tagging at the LHC. Phys.
  Rev. Lett., 107:172001, 2011. doi: 10.1103/PhysRevLett.107.172001.
Jason Gallicchio and Matthew D. Schwartz. Quark and Gluon Jet Substructure. JHEP, 04:
  090, 2013. doi: 10.1007/JHEP04(2013)090.
F. X. Girod et al. Measurement of Deeply virtual Compton scattering beam-spin asymme-
  tries. Phys. Rev. Lett., 100:162002, 2008. doi: 10.1103/PhysRevLett.100.162002.
Rohini Godbole, Monoranjan Guchait, Charanjit K. Khosa, Jayita Lahiri, Seema Sharma,
  and Aravind H. Vijay. Boosted Top quark polarization. Phys. Rev. D, 100(5):056010,
  2019. doi: 10.1103/PhysRevD.100.056010.
Dorival Gonçalves, Kyoungchul Kong, and Jeong Han Kim. Probing the top-Higgs Yukawa
  CP structure in dileptonic tth with M2 -assisted reconstruction. JHEP, 06:079, 2018. doi:
  10.1007/JHEP06(2018)079.
Dorival Gonçalves, Jeong Han Kim, Kyoungchul Kong, and Yongcheng Wu. Direct Higgs-top
  CP-phase measurement with tth at the 14 TeV LHC and 100 TeV FCC. JHEP, 01:158,
  2022. doi: 10.1007/JHEP01(2022)158.
Philippe Gras, Stefan Höche, Deepak Kar, Andrew Larkoski, Leif Lönnblad, Simon Plätzer,
  Andrzej Siódmok, Peter Skands, Gregory Soyez, and Jesse Thaler. Systematics of
  quark/gluon tagging. JHEP, 07:091, 2017. doi: 10.1007/JHEP07(2017)091.
V. N. Gribov and L. N. Lipatov. Deep inelastic e p scattering in perturbation theory. Sov.
  J. Nucl. Phys., 15:438–450, 1972.
Andrei V. Gritsan, Raoul Röntsch, Markus Schulze, and Meng Xiao. Constraining anomalous
  Higgs boson couplings to the heavy flavor fermions using matrix element techniques. Phys.
  Rev. D, 94(5):055023, 2016. doi: 10.1103/PhysRevD.94.055023.
Andrei V. Gritsan, Jeffrey Roskes, Ulascan Sarica, Markus Schulze, Meng Xiao, and Yaofu
  Zhou. New features in the JHU generator framework: constraining Higgs boson properties
  from on-shell and off-shell production. Phys. Rev. D, 102(5):056022, 2020. doi: 10.1103/
  PhysRevD.102.056022.
Oskar Grocholski, Bernard Pire, Pawel Sznajder, Lech Szymanowski, and Jakub Wagner.
  Collinear factorization of diphoton photoproduction at next to leading order. Phys. Rev.
  D, 104(11):114006, 2021. doi: 10.1103/PhysRevD.104.114006.
Oskar Grocholski, Bernard Pire, Pawel Sznajder, Lech Szymanowski, and Jakub Wagner.
  Phenomenology of diphoton photoproduction at next-to-leading order. Phys. Rev. D, 105
  (9):094025, 2022. doi: 10.1103/PhysRevD.105.094025.
                                             428


S. Groote, J. G. Korner, and J. A. Leyva. Gluon polarization in e+ e- —> t anti-t G. Phys.
   Rev. D, 56:6031–6034, 1997. doi: 10.1103/PhysRevD.56.6031.
S. Groote, J. G. Korner, and J. A. Leyva. Gluon polarization in e+ e- —> t anti-t G: Polar
   angle dependence and beam polarization effects. Eur. Phys. J. C, 7:49–59, 1999. doi:
   10.1007/s100529800981.
Stefan Groote. Polarization effects in e+ e- annihilation processes. 12 2002.
David J. Gross and Frank Wilczek. Ultraviolet Behavior of Nonabelian Gauge Theories.
   Phys. Rev. Lett., 30:1343–1346, 1973. doi: 10.1103/PhysRevLett.30.1343.
M. Guidal and M. Vanderhaeghen. Double deeply virtual Compton scattering off the nucleon.
   Phys. Rev. Lett., 90:012001, 2003. doi: 10.1103/PhysRevLett.90.012001.
Yuxun Guo, Xiangdong Ji, and Kyle Shiells. Generalized parton distributions through
   universal moment parameterization: zero skewness case. JHEP, 09:215, 2022. doi:
   10.1007/JHEP09(2022)215.
Keith Hamilton, Alexander Karlberg, Gavin P. Salam, Ludovic Scyboz, and Rob Verheyen.
   Soft spin correlations in final-state parton showers. JHEP, 03:193, 2022. doi: 10.1007/
   JHEP03(2022)193.
Tao Han and Yingchuan Li. Genuine CP-odd Observables at the LHC. Phys. Lett. B, 683:
   278–281, 2010. doi: 10.1016/j.physletb.2009.12.047.
Yasuo Hara and Sunao Sakai. Polarization of Gluons. Phys. Lett. B, 221:67–69, 1989. doi:
   10.1016/0370-2693(89)90193-7.
Hadi Hashamipour, Muhammad Goharipour, and Siamak S. Gousheh. Determination of
   generalized parton distributions through a simultaneous analysis of axial form factor and
   wide-angle Compton scattering data. Phys. Rev. D, 102(9):096014, 2020. doi: 10.1103/
   PhysRevD.102.096014.
Hadi Hashamipour, Muhammad Goharipour, K. Azizi, and S. V. Goloskokov. Determination
   of the generalized parton distributions through the analysis of the world electron scattering
   data considering two-photon exchange corrections. Phys. Rev. D, 105(5):054002, 2022. doi:
   10.1103/PhysRevD.105.054002.
Hadi Hashamipour, Muhammad Goharipour, K. Azizi, and S. V. Goloskokov. Generalized
   parton distributions at zero skewness. Phys. Rev. D, 107(9):096005, 2023. doi: 10.1103/
   PhysRevD.107.096005.
Kenneth J. Heller et al. Polarization of Lambdas and anti-Lambdas Produced by 400-GeV
   Protons. Phys. Rev. Lett., 41:607, 1978. doi: 10.1103/PhysRevLett.41.607. [Erratum:
   Phys.Rev.Lett. 45, 1043 (1980)].
F. Henyey and Robert Savit. Final State Interactions in the Parton Model and Massive
   Lepton Pair Production. Phys. Lett. B, 52:71–73, 1974. doi: 10.1016/0370-2693(74)
   90722-9.
                                              429


R. Hofstadter and R. W. McAllister. Electron Scattering From the Proton. Phys. Rev., 98:
  217–218, 1955. doi: 10.1103/PhysRev.98.217.
Robert Hofstadter. Electron scattering and nuclear structure. Rev. Mod. Phys., 28:214–254,
  1956. doi: 10.1103/RevModPhys.28.214.
Tie-Jiun Hou et al. New CTEQ global analysis of quantum chromodynamics with high-
  precision data from the LHC. Phys. Rev. D, 103(1):014013, 2021. doi: 10.1103/PhysRevD.
  103.014013.
Tor Jacobsen and Haakon A. Olsen. OBSERVATION OF GLUON POLARIZATION. Phys.
  Scripta, 42:513–514, 1990. doi: 10.1088/0031-8949/42/5/001.
M. Jezabek. Top quark physics. Nucl. Phys. B Proc. Suppl., 37(2):197, 1994. doi: 10.1016/
  0920-5632(94)90677-7.
Xiang-Dong Ji. Gauge-Invariant Decomposition of Nucleon Spin. Phys. Rev. Lett., 78:
  610–613, 1997a. doi: 10.1103/PhysRevLett.78.610.
Xiang-Dong Ji. Deeply virtual Compton scattering. Phys. Rev. D, 55:7114–7125, 1997b. doi:
  10.1103/PhysRevD.55.7114.
Xiangdong Ji. Parton Physics on a Euclidean Lattice. Phys. Rev. Lett., 110:262002, 2013.
  doi: 10.1103/PhysRevLett.110.262002.
H. S. Jo et al. Cross sections for the exclusive photon electroproduction on the proton and
  Generalized Parton Distributions. Phys. Rev. Lett., 115(21):212003, 2015. doi: 10.1103/
  PhysRevLett.115.212003.
Gordon L. Kane, J. Pumplin, and W. Repko. Transverse Quark Polarization in Large p(T)
  Reactions, e+ e- Jets, and Leptoproduction: A Test of QCD. Phys. Rev. Lett., 41:1689,
  1978. doi: 10.1103/PhysRevLett.41.1689.
Gordon L. Kane, G. A. Ladinsky, and C. P. Yuan. Using the Top Quark for Testing Standard
  Model Polarization and CP Predictions. Phys. Rev. D, 45:124–141, 1992. doi: 10.1103/
  PhysRevD.45.124.
Zhong-Bo Kang, Yan-Qing Ma, Jian-Wei Qiu, and George Sterman. Heavy Quarkonium
  Production at Collider Energies: Factorization and Evolution. Phys. Rev. D, 90(3):034006,
  2014. doi: 10.1103/PhysRevD.90.034006.
Zhong-Bo Kang, Kyle Lee, and Fanyi Zhao. Polarized jet fragmentation functions. Phys.
  Lett. B, 809:135756, 2020. doi: 10.1016/j.physletb.2020.135756.
Alexander Karlberg, Gavin P. Salam, Ludovic Scyboz, and Rob Verheyen. Spin correlations
  in final-state parton showers and jet observables. Eur. Phys. J. C, 81(8):681, 2021. doi:
  10.1140/epjc/s10052-021-09378-0.
                                              430


Gregor Kasieczka, Nicholas Kiefer, Tilman Plehn, and Jennifer M. Thompson. Quark-Gluon
  Tagging: Machine Learning vs Detector. SciPost Phys., 6(6):069, 2019a. doi: 10.21468/
  SciPostPhys.6.6.069.
Gregor Kasieczka, Tilman Plehn, Anja Butter, Kyle Cranmer, Dipsikha Debnath, Barry M.
  Dillon, Malcolm Fairbairn, Darius A. Faroughy, Wojtek Fedorko, Christophe Gay, Loukas
  Gouskos, Jernej F. Kamenik, Patrick T. Komiske, Simon Leiss, Alison Lister, Sebastian
  Macaluso, Eric M. Metodiev, Liam Moore, Ben Nachman, Karl Nordström, Jannicke
  Pearkes, Huilin Qu, Yannik Rath, Marcel Rieger, David Shih, Jennifer M. Thompson,
  and Sreedevi Varma. The Machine Learning landscape of top taggers. SciPost Phys., 7:
  014, 2019b. doi: 10.21468/SciPostPhys.7.1.014. URL https://scipost.org/10.21468/
  SciPostPhys.7.1.014.
Yoshio Kitadono and Hsiang-nan Li. Jet substructures of boosted polarized hadronic top
  quarks. Phys. Rev. D, 93(5):054043, 2016. doi: 10.1103/PhysRevD.93.054043.
K. Koller, K. H. Streng, T. F. Walsh, and P. M. Zerwas. Quarkonium Decays: Testing the
  Three Gluon Vertex. Nucl. Phys. B, 193:61–84, 1981. doi: 10.1016/0550-3213(81)90518-6.
Jurgen G. Korner and D. H. Schiller. HELICITY DESCRIPTION OF e+ e- —> q anti-q
  g AND e+ e- —> Q anti-Q (1–) —> g g g ON AND OFF THE Z0: QUARK, GLUON
  AND BEAM POLARIZATION EFFECTS. 7 1981.
David Krohn, Jessie Shelton, and Lian-Tao Wang. Measuring the Polarization of Boosted
  Hadronic Tops. JHEP, 07:041, 2010. doi: 10.1007/JHEP07(2010)041.
C. S. Lam and Wu-Ki Tung. A Systematic Approach to Inclusive Lepton Pair Production
  in Hadronic Collisions. Phys. Rev. D, 18:2447, 1978. doi: 10.1103/PhysRevD.18.2447.
L. D. Landau. On analytic properties of vertex parts in quantum field theory. Nucl. Phys.,
  13(1):181–192, 1959. doi: 10.1016/B978-0-08-010586-4.50103-6.
P. V. Landshoff. Model for elastic scattering at wide angle. Phys. Rev. D, 10:1024–1030,
  1974. doi: 10.1103/PhysRevD.10.1024.
P. V. Landshoff and J. C. Polkinghorne. Two high energy processes involving detected final
  state particles. Nucl. Phys. B, 33:221–238, 1971. doi: 10.1016/0550-3213(72)90244-1.
  [Erratum: Nucl.Phys.B 36, 642 (1972)].
Andrew J. Larkoski. General analysis for observing quantum interference at colliders. Phys.
  Rev. D, 105(9):096012, 2022. doi: 10.1103/PhysRevD.105.096012.
Andrew J. Larkoski and Eric M. Metodiev. A Theory of Quark vs. Gluon Discrimination.
  JHEP, 10:014, 2019. doi: 10.1007/JHEP10(2019)014.
Andrew J. Larkoski, Jesse Thaler, and Wouter J. Waalewijn. Gaining (Mutual) Information
  about Quark/Gluon Discrimination. JHEP, 11:129, 2014. doi: 10.1007/JHEP11(2014)129.
                                           431


G. Peter Lepage and Stanley J. Brodsky. Exclusive Processes in Perturbative Quantum
  Chromodynamics. Phys. Rev. D, 22:2157, 1980. doi: 10.1103/PhysRevD.22.2157.
Hsiang-nan Li and George F. Sterman. The Perturbative pion form-factor with Sudakov
  suppression. Nucl. Phys. B, 381:129–140, 1992. doi: 10.1016/0550-3213(92)90643-P.
Jinmian Li, Zong-guo Si, Lei Wu, and Jason Yue. Central-edge asymmetry as a probe of
  Higgs-top coupling in tt̄h production at the LHC. Phys. Lett. B, 779:72–76, 2018. doi:
  10.1016/j.physletb.2018.02.009.
Huey-Wen Lin. Nucleon Tomography and Generalized Parton Distribution at Physical
  Pion Mass from Lattice QCD. Phys. Rev. Lett., 127(18):182001, 2021. doi: 10.1103/
  PhysRevLett.127.182001.
Huey-Wen Lin. Nucleon helicity generalized parton distribution at physical pion mass from
  lattice QCD. Phys. Lett. B, 824:136821, 2022. doi: 10.1016/j.physletb.2021.136821.
L. N. Lipatov. The parton model and perturbation theory. Yad. Fiz., 20:181–198, 1974.
Tianbo Liu, W. Melnitchouk, Jian-Wei Qiu, and N. Sato. Factorized approach to radiative
  corrections for inelastic lepton-hadron collisions. Phys. Rev. D, 104(9):094033, 2021a. doi:
  10.1103/PhysRevD.104.094033.
Tianbo Liu, W. Melnitchouk, Jian-Wei Qiu, and N. Sato. A new approach to semi-inclusive
  deep-inelastic scattering with QED and QCD factorization. JHEP, 11:157, 2021b. doi:
  10.1007/JHEP11(2021)157.
Gregory Mahlon and Stephen J. Parke. Spin Correlation Effects in Top Quark Pair Produc-
  tion at the LHC. Phys. Rev. D, 81:074024, 2010. doi: 10.1103/PhysRevD.81.074024.
Michelangelo Mangano and Michelangelo Mangano. Physics at the FCC-hh, a 100 TeV pp
  collider. CERN Yellow Reports: Monographs. CERN, Geneva, Jun 2017. doi: 10.23731/
  CYRM-2017-003. URL http://cds.cern.ch/record/2270978.
L. Mankiewicz, G. Piller, and T. Weigl. Hard leptoproduction of charged vector mesons.
  Phys. Rev. D, 59:017501, 1999. doi: 10.1103/PhysRevD.59.017501.
Till Martini, Ren-Qi Pan, Markus Schulze, and Meng Xiao. Probing the CP structure of
  the top quark Yukawa coupling: Loop sensitivity versus on-shell sensitivity. Phys. Rev.
  D, 104(5):055045, 2021. doi: 10.1103/PhysRevD.104.055045.
Eric M. Metodiev and Jesse Thaler. Jet Topics: Disentangling Quarks and Gluons at Col-
  liders. Phys. Rev. Lett., 120(24):241602, 2018. doi: 10.1103/PhysRevLett.120.241602.
M. Mihovilovič et al. First measurement of proton’s charge form factor at very low Q2 with
  initial state radiation. Phys. Lett. B, 771:194–198, 2017. doi: 10.1016/j.physletb.2017.05.
  031.
                                              432


M. Mihovilovič et al. The proton charge radius extracted from the initial-state radi-
  ation experiment at MAMI. Eur. Phys. J. A, 57(3):107, 2021. doi: 10.1140/epja/
  s10050-021-00414-x.
Nicolas Mileo, Ken Kiers, Alejandro Szynkman, Daniel Crane, and Ethan Gegner. Pseu-
  doscalar top-Higgs coupling: exploration of CP-odd observables to resolve the sign ambi-
  guity. JHEP, 07:056, 2016. doi: 10.1007/JHEP07(2016)056.
Eric Moffat, Adam Freese, Ian Cloët, Thomas Donohoe, Leonard Gamberg, Wally Mel-
  nitchouk, Andreas Metz, Alexei Prokudin, and Nobuo Sato. Shedding light on shadow
  generalized parton distributions. 3 2023.
P. J. Mulders and J. Rodrigues. Transverse momentum dependence in gluon distribution
  and fragmentation functions. Phys. Rev. D, 63:094021, 2001. doi: 10.1103/PhysRevD.63.
  094021.
Pavel M. Nadolsky, C. Balazs, Edmond L. Berger, and C. P. Yuan. Gluon-gluon contributions
  to the production of continuum diphoton pairs at hadron colliders. Phys. Rev. D, 76:
  013008, 2007. doi: 10.1103/PhysRevD.76.013008.
Gouranga C. Nayak, Jian-Wei Qiu, and George F. Sterman. Fragmentation, NRQCD and
  NNLO factorization analysis in heavy quarkonium production. Phys. Rev. D, 72:114012,
  2005. doi: 10.1103/PhysRevD.72.114012.
H. A. Olsen, P. Osland, and I. Overbo. Polarized Gluon Bremsstrahlung as a Test of QCD.
  Phys. Lett. B, 89:221–224, 1980. doi: 10.1016/0370-2693(80)90015-5.
H. A. Olsen, P. Osland, and I. Overbo. Gluon Bremsstrahlung in e+ e− Annihilation. 2.
  Gluon Polarization. Nucl. Phys. B, 192:33–60, 1981. doi: 10.1016/0550-3213(81)90191-7.
Oyvind E. Olsen and Haakon A. Olsen. GLUON BREMSSTRAHLUNG IN CHARGED
  LEPTON - NUCLEON COLLISIONS. 2. GLUON LINEAR POLARIZATION. Phys.
  Scripta, 29:12, 1984. doi: 10.1088/0031-8949/29/1/003.
Riley Patrick, Andre Scaffidi, and Pankaj Sharma. Top polarisation as a probe of CP-
  mixing top-Higgs coupling in tjh signals. Phys. Rev. D, 101(9):093005, 2020. doi: 10.
  1103/PhysRevD.101.093005.
A. Pedrak, B. Pire, L. Szymanowski, and J. Wagner. Hard photoproduction of a diphoton
  with a large invariant mass. Phys. Rev. D, 96(7):074008, 2017. doi: 10.1103/PhysRevD.
  96.074008. [Erratum: Phys.Rev.D 100, 039901 (2019)].
B. Petersson and B. Pire. Photoproduction With Polarized Photons as a Source of Polarized
  Gluon Jets. Phys. Lett. B, 95:119–122, 1980. doi: 10.1016/0370-2693(80)90414-1.
Tilman Plehn, Michael Spannowsky, Michihisa Takeuchi, and Dirk Zerwas. Stop Recon-
  struction with Tagged Tops. JHEP, 10:078, 2010. doi: 10.1007/JHEP10(2010)078.
                                            433


H. David Politzer. Reliable Perturbative Results for Strong Interactions? Phys. Rev. Lett.,
  30:1346–1349, 1973. doi: 10.1103/PhysRevLett.30.1346.
Maxim V. Polyakov and Peter Schweitzer. Forces inside hadrons: pressure, surface tension,
  mechanical radius, and all that. Int. J. Mod. Phys. A, 33(26):1830025, 2018. doi: 10.1142/
  S0217751X18300259.
Jian-Wei Qiu and George F. Sterman. Power corrections in hadronic scattering. 1. Leading
  1/Q**2 corrections to the Drell-Yan cross-section. Nucl. Phys. B, 353:105–136, 1991a. doi:
  10.1016/0550-3213(91)90503-P.
Jian-Wei Qiu and George F. Sterman. Power corrections to hadronic scattering. 2. Factor-
  ization. Nucl. Phys. B, 353:137–164, 1991b. doi: 10.1016/0550-3213(91)90504-Q.
Jian-Wei Qiu and Zhite Yu. Exclusive production of a pair of high transverse momentum
  photons in pion-nucleon collisions for extracting generalized parton distributions. JHEP,
  08:103, 2022. doi: 10.1007/JHEP08(2022)103.
Jian-Wei Qiu and Zhite Yu. Single diffractive hard exclusive processes for the study of
  generalized parton distributions. Phys. Rev. D, 107(1):014007, 2023a. doi: 10.1103/
  PhysRevD.107.014007.
Jian-Wei Qiu and Zhite Yu. Extraction of the x-dependence of generalized parton distribu-
  tions from exclusive photoproduction. 5 2023b.
Jian-Wei Qiu, Marc Schlegel, and Werner Vogelsang. Probing Gluonic Spin-Orbit Corre-
  lations in Photon Pair Production. Phys. Rev. Lett., 107:062001, 2011. doi: 10.1103/
  PhysRevLett.107.062001.
Jian-Wei Qiu, Ted C. Rogers, and Bowen Wang. Intrinsic Transverse Momentum and
  Evolution in Weighted Spin Asymmetries. Phys. Rev. D, 101(11):116017, 2020. doi:
  10.1103/PhysRevD.101.116017.
A. V. Radyushkin. Nonforward parton distributions. Phys. Rev. D, 56:5524–5557, 1997. doi:
  10.1103/PhysRevD.56.5524.
Jie Ren, Lei Wu, and Jin Min Yang. Unveiling CP property of top-Higgs coupling with graph
  neural networks at the LHC. Phys. Lett. B, 802:135198, 2020. doi: 10.1016/j.physletb.
  2020.135198.
R. W. Robinett. Final state gluon polarization in large transverse momentum quarkonium
  production. Z. Phys. C, 51:89–92, 1991. doi: 10.1007/BF01579563.
A. D. Sakharov. Violation of CP Invariance, C asymmetry, and baryon asymme-
  try of the universe. Pisma Zh. Eksp. Teor. Fiz., 5:32–35, 1967. doi: 10.1070/
  PU1991v034n05ABEH002497.
Sebastian Schätzel. Boosted Top Quarks and Jet Structure. Eur. Phys. J. C, 75(9):415,
  2015. doi: 10.1140/epjc/s10052-015-3636-x.
                                             434


Reinhard Schwienhorst, C. P. Yuan, Charles Mueller, and Qing-Hong Cao. Single top quark
  production and decay in the t-channel at next-to-leading order at the LHC. Phys. Rev.
  D, 83:034019, 2011. doi: 10.1103/PhysRevD.83.034019.
Jessie Shelton. Polarized tops from new physics: signals and observables. Phys. Rev. D, 79:
  014032, 2009. doi: 10.1103/PhysRevD.79.014032.
G. G. Simon, C. Schmitt, F. Borkowski, and V. H. Walther. Absolute electron Proton Cross-
  Sections at Low Momentum Transfer Measured with a High Pressure Gas Target System.
  Nucl. Phys. A, 333:381–391, 1980. doi: 10.1016/0375-9474(80)90104-9.
A. M. Sirunyan et al. Identification of heavy-flavour jets with the CMS detector in pp
  collisions at 13 TeV. JINST, 13(05):P05011, 2018. doi: 10.1088/1748-0221/13/05/P05011.
Albert M Sirunyan et al. Measurement of the top quark polarization
                                                             √        and tt̄ spin correlations
  using dilepton final states in proton-proton collisions at s = 13 TeV. Phys. Rev. D, 100
  (7):072002, 2019. doi: 10.1103/PhysRevD.100.072002.
Albert M Sirunyan et al. Measurements of tt̄H Production and the CP Structure of the
  Yukawa Interaction between the Higgs Boson and Top Quark in the Diphoton Decay
  Channel. Phys. Rev. Lett., 125(6):061801, 2020. doi: 10.1103/PhysRevLett.125.061801.
Albert M Sirunyan et al. Constraints on anomalous Higgs boson couplings to vector bosons
  and fermions in its production and decay using the four-lepton final state. Phys. Rev. D,
  104(5):052004, 2021. doi: 10.1103/PhysRevD.104.052004.
Torbjörn Sjöstrand, Stefan Ask, Jesper R. Christiansen, Richard Corke, Nishita Desai, Philip
  Ilten, Stephen Mrenna, Stefan Prestel, Christine O. Rasmussen, and Peter Z. Skands.
  An introduction to PYTHIA 8.2. Comput. Phys. Commun., 191:159–177, 2015. doi:
  10.1016/j.cpc.2015.01.024.
Davison E. Soper. Diffraction in DIS and elsewhere. AIP Conf. Proc., 407(1):147, 1997. doi:
  10.1063/1.53586.
Peng Sun, Bo-Wen Xiao, and Feng Yuan. Gluon Distribution Functions and Higgs Boson
  Production at Moderate Transverse Momentum. Phys. Rev. D, 84:094005, 2011. doi:
  10.1103/PhysRevD.84.094005.
Armen Tumasyan et al. A new calibration     √ method for charm jet identification validated
  with proton-proton collision events at s =13 TeV. JINST, 17(03):P03014, 2022. doi:
  10.1088/1748-0221/17/03/P03014.
Armen Tumasyan et al. Search for CP violation
                                           √        in ttH and tH production in multilepton
  channels in proton-proton collisions at s = 13 TeV. JHEP, 07:092, 2023. doi: 10.1007/
  JHEP07(2023)092.
W. K. Tung. GROUP THEORY IN PHYSICS. 1985.
                                             435


Steven Weinberg. The Quantum theory of fields. Vol. 1: Foundations. Cambridge Uni-
  versity Press, 6 2005. ISBN 978-0-521-67053-1, 978-0-511-25204-4. doi: 10.1017/
  CBO9781139644167.
Xin-Kai Wen, Bin Yan, Zhite Yu, and C. P. Yuan. Single Transverse Spin Asymmetry as a
  New Probe of SMEFT Dipole Operators. 7 2023.
Eugene P. Wigner. On Unitary Representations of the Inhomogeneous Lorentz Group. An-
  nals Math., 40:149–204, 1939. doi: 10.2307/1968551.
R. L. Workman et al. Review of Particle Physics. PTEP, 2022:083C01, 2022. doi: 10.1093/
  ptep/ptac097.
W. Xiong et al. A small proton charge radius from an electron–proton scattering experiment.
  Nature, 575(7781):147–150, 2019. doi: 10.1038/s41586-019-1721-2.
Zhite Yu and C. P. Yuan. Azimuthal Angular Correlation as a Boosted Top Jet Substructure.
  Phys. Rev. Lett., 129(11):112001, 2022a. doi: 10.1103/PhysRevLett.129.112001.
Zhite Yu and C. P. Yuan. Azimuthal Angular Correlation as a Boosted Top Jet Substructure.
  Phys. Rev. Lett., 129(11):112001, 2022b. doi: 10.1103/PhysRevLett.129.112001.
Zhite Yu, Kirtimaan A. Mohan, and C. P. Yuan. Determining the CP Property of htt̄
  Coupling via a Novel Jet Substructure Observable. 11 2022.
X. Zhan et al. High-Precision Measurement of the Proton Elastic Form Factor Ratio
  µp GE /GM at low Q2 . Phys. Lett. B, 705:59–64, 2011. doi: 10.1016/j.physletb.2011.10.002.
                                            436