CP VIOLATION IN SINGLE TOP QUARK PRODUCTION
By
Weigang Geng

A DISSERTATION
Submitted to
Michigan State University
in partial fulﬁllment of the requirements
for the degree of
DOCTOR OF PHILOSOPHY
Physics
2012

ABSTRACT
CP VIOLATION IN SINGLE TOP QUARK PRODUCTION
By
Weigang Geng
We present a search for CP violation in single top quark production with the
DØ experiment at the Tevatron proton-antiproton collider. CP violation in the top
electroweak interaction results in diﬀerent single top quark production cross sections
for top and antitop quarks. We perform the search in the single top quark ﬁnal state
using 5.4 fb−1 of data, in the s-channel, t-channel, and for both combined. At this
time, we do not see an observable CP asymmetry.

To Ni.

iii

TABLE OF CONTENTS

List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

x

Chapter 1 Introduction . . . . . . . . . . . . . . . .
1.1 CP Violation . . . . . . . . . . . . . . . . . . . .
1.2 Motivation for searching for CP violation in single
1.3 Samples selection and single top analysis overview

. . .
. . .
top
. . .

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

1
2
3
4

Chapter 2 Theory . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.1 The Standard Model . . . . . . . . . . . . . . . . . . . . . . .
2.1.1 Fermions . . . . . . . . . . . . . . . . . . . . . . . . . .
2.1.2 Gauge Bosons . . . . . . . . . . . . . . . . . . . . . . .
2.1.3 Higgs Boson . . . . . . . . . . . . . . . . . . . . . . . .
2.1.4 Limits of the Standard Model . . . . . . . . . . . . . .
2.2 The Top Quark . . . . . . . . . . . . . . . . . . . . . . . . . .
2.2.1 Properties . . . . . . . . . . . . . . . . . . . . . . . . .
2.2.2 Production . . . . . . . . . . . . . . . . . . . . . . . .
2.2.3 Decay . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.3 The single top quark . . . . . . . . . . . . . . . . . . . . . . .
2.3.1 Single top production and decay at the Tevatron . . . .
2.3.2 Motivations for searching for CP Violation in single top
2.4 Outline of CP violation measurement in single top . . . . . . .

.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.

6
7
9
11
12
13
14
14
15
15
16
16
26
28

Chapter 3 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . 30
3.1 The Accelerator Complex and Tevatron . . . . . . . . . . . . . . . . . 31
3.2 Luminosity and Cross Sections . . . . . . . . . . . . . . . . . . . . . . 35
iv

3.3

The DØ detector . . . . . . . . . . . . . . . . . . . .
3.3.1 Particle Signatures . . . . . . . . . . . . . . .
3.3.2 Coordinate System and Pseudorapidity . . . .
3.3.3 The Central Tracking System . . . . . . . . .
3.3.3.1 Silicon Micro-strip Tracker (SMT) .
3.3.3.2 Central Fiber Tracker (CFT) . . . .
3.3.4 The Central and Forward Preshower Detectors
3.3.5 The Calorimeters and ICD . . . . . . . . . . .
3.3.5.1 Calorimeters . . . . . . . . . . . . .
3.3.5.2 Intercryostat Detector . . . . . . . .
3.3.6 The Muon System . . . . . . . . . . . . . . .
3.3.7 The Luminosity Monitor . . . . . . . . . . . .
3.3.8 The DØ Trigger System . . . . . . . . . . . .
3.3.8.1 The Level 1 Trigger . . . . . . . . .
3.3.8.2 The Level 2 Trigger . . . . . . . . .
3.3.8.3 The Level 3 Trigger . . . . . . . . .

Chapter 4 Event Reconstruction
4.1 Tracks . . . . . . . . . . . . .
4.2 Primary Vertices . . . . . . .
4.3 Electrons . . . . . . . . . . . .
4.4 Muons . . . . . . . . . . . . .
4.5 Jets . . . . . . . . . . . . . .
4.6 b Jets . . . . . . . . . . . . . .
4.7 Missing Transverse Energy . .

and
. . .
. . .
. . .
. . .
. . .
. . .
. . .

Chapter 5 Event Selection . . . . . .
5.1 Strategy . . . . . . . . . . . . . .
5.2 Event selection cuts . . . . . . . .
5.2.1 General selection . . . . .
5.2.2 Electron channel selection
5.2.3 Muon channel selection . .
5.2.4 b-tagging selection . . . .

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

Object Identiﬁcation
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

. 60
. 61
. 63
. 65
. 66
. 67
. 69
. 71

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

38
40
43
44
46
46
47
48
48
51
51
52
53
54
55
58

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

. 73
. 73
. 74
. 74
. 77
. 77
. 79

Chapter 6 Data and Simulation Samples .
6.1 Data samples . . . . . . . . . . . . . . .
6.1.1 Signal data samples . . . . . . . .
6.1.2 Multijets data samples . . . . . .

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

. 80
. 80
. 80
. 81

v

.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.
.

Chapter 7 Event Yields and Data/Simulation Comparison
7.1 Event yields . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.2 Signal acceptance . . . . . . . . . . . . . . . . . . . . . . . .
7.3 Data and Monte Carlo comparison . . . . . . . . . . . . . .

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

. 91
. 91
. 98
. 99

Chapter 8 Multivariate Analysis . . . . . . . . . . . . . . . . . .
8.1 Split samples . . . . . . . . . . . . . . . . . . . . . . . . . . .
8.2 Output transformation . . . . . . . . . . . . . . . . . . . . . .
8.3 Key for plots . . . . . . . . . . . . . . . . . . . . . . . . . . .
8.4 Boosted Decision Trees (BDT) . . . . . . . . . . . . . . . . . .
8.5 Bayesian Neural Networks (BNN) . . . . . . . . . . . . . . . .
8.6 Neuroevolution of augmented topologies (NEAT) . . . . . . .
8.7 A combined BNN using BDT, BNN and NEAT discriminants

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

104
105
107
109
110
115
120
125

Chapter 9 Statistical Analysis . . . .
9.1 Poisson distribution likelihood . . .
9.2 Bayesian analysis . . . . . . . . . .
9.3 Posterior probability density for the

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

132
132
134
140

6.2

6.3

6.1.2.1 Multijets in the electron channel . . . .
6.1.2.2 Multijets in the muon channel . . . . . .
Simulation samples . . . . . . . . . . . . . . . . . . . . .
6.2.1 Signal simulation samples . . . . . . . . . . . . .
6.2.2 Background simulation samples . . . . . . . . . .
6.2.3 Simulation samples corrections and normalization
6.2.3.1 Simulation eﬃciencies corrections . . . .
6.2.3.2 W +jets and multijets backgrounds . . .
Lepton charge mis-ID measurement samples . . . . . . .

. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
CP asymmetry

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.
.
.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

82
83
83
83
84
84
86
88
89

Chapter 10 Systematic Uncertainties . . . . . . . . . . . . . . . . . . . 142
10.1 Uncertainties aﬀecting normalization only . . . . . . . . . . . . . . . 143
10.2 Uncertainties aﬀecting normalization and shape . . . . . . . . . . . . 150
Chapter 11 CP Violation Measurement
11.1 Cross section measurement . . . . . .
11.2 Expected results . . . . . . . . . . .
11.3 Observed results . . . . . . . . . . .
vi

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

153
153
157
162

Chapter 12 Results and Conclusions . . . . . . . . . . . . . . . . . . . . 167
Appendix A Plots After Splitting by the Lepton Charge . . . . . . . . 169
Appendix B SAM Deﬁnitions of Dataset Used in This Analysis . . . 183
Appendix C Plots for discriminants after the binning transformation 185
Appendix D Lepton Charge Mis-Identiﬁcation
D.1 Electron Charge Mis-ID Rate . . . . . . . .
D.2 Muon Charge Mis-ID Rate . . . . . . . . . .
D.3 Conclusion . . . . . . . . . . . . . . . . . . .

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

.
.
.
.

190
193
195
198

Appendix E W Asymmetry Check . . . . . . . . . . . . . . . . . . . . . . 199
Appendix F b/¯ Jet Energy Scale (JES) Check . . . . . . . . . . . . . . 202
b
Appendix G Production and decay asymmetries . . . . . . . . . . . . . 212
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216

vii

LIST OF TABLES

Table 2.1

Standard Model gauge bosons . . . . . . . . . . . . . . . . .

12

Table 2.2

Single top quark production cross sections [1]. . . . . . . . .

17

Table 6.1

Integrated luminosity for the datasets used in this analysis. .

81

Table 6.2

The Monte Carlo samples event samples . . . . . . . . . . . .

85

Table 7.1

One b tagged event yields . . . . . . . . . . . . . . . . . . . .

94

Table 7.2

Two b tagged event yields . . . . . . . . . . . . . . . . . . . .

95

Table 7.3

Expected and observed yields in “positive” samples . . . . .

96

Table 7.4

Expected and observed yields in “negative” samples . . . . .

97

Table 7.5

Combined (Electron+Muon, Run IIa+Run IIb) signal acceptances. 99

Table 8.1

BDT discriminant used in the current analysis. . . . . . . . . 112

Table 11.1

Expected results for CP asymmetry in s, t, s+t channels. . . 161

Table 11.2

Observed results for CP asymmetry in s, t, s+t channels. . . 166

Table D.1

Overall Charge mis-ID Rates. . . . . . . . . . . . . . . . . . . 198
viii

Table F.1

Event yields for signals in “positive” samples . . . . . . . . . 203

Table F.2

Event yields for signals in “positive” samples b/bbar JES . . 203

Table F.3

Event yields for signals in “negative” samples . . . . . . . . . 204

Table F.4

Event yields for signals in “negative” samples b/bbar JES . . 204

Table F.5

Ratio of integrals for variables before and after b/¯ JES . . . 205
b

ix

LIST OF FIGURES

Figure 1.1

Flow chart of the stages of the analysis overview . . . . . . .

5

Figure 2.1

The Standard Model of elementary particles . . . . . . . . .

8

Figure 2.2

Feynman diagram for the quark-antiquark annihilation

. . .

16

Figure 2.3

Feynman diagram for single top s-channel production. . . . .

18

Figure 2.4

Feynman diagram for single top t-channel production. . . . .

19

Figure 2.5

Decay branching ratio of a top quark . . . . . . . . . . . . .

20

Figure 2.6

Feynman diagram for single top s-channel . . . . . . . . . . .

20

Figure 2.7

Feynman diagram for single top t-channel . . . . . . . . . . .

21

Figure 2.8

The transverse momentum for s-channel . . . . . . . . . . . .

22

Figure 2.9

The transverse momentum for t-channel . . . . . . . . . . . .

23

Figure 2.10

The pseudorapidity for s-channel . . . . . . . . . . . . . . . .

24

Figure 2.11

The pseudorapidity for t-channel . . . . . . . . . . . . . . . .

25

Figure 2.12

Feynman diagram for W ′ boson decaying into single top. . .

27

x

Figure 2.13

FCNC via exchange of a gluon in single top production. . . .

28

Figure 3.1

Fermilab’s accelerater chain. . . . . . . . . . . . . . . . . . .

32

Figure 3.2

Fermilab’s Tevatron Tunnel . . . . . . . . . . . . . . . . . . .

36

Figure 3.3

Side view of the DØ detector . . . . . . . . . . . . . . . . . .

39

Figure 3.4

A schematic of a hermetic detector . . . . . . . . . . . . . . .

41

Figure 3.5

High energy particle signatures left in the detector components 42

Figure 3.6

The Tracking System at DØ. . . . . . . . . . . . . . . . . . .

45

Figure 3.7

The disk/barrel design of the silicon microstrip tracker. . . .

47

Figure 3.8

Illustration of the calorimeter systems. . . . . . . . . . . . . .

49

Figure 3.9

Intersecting view of a quarter of the DØ calorimeter . . . . .

50

Figure 3.10

Trigger-related data ﬂow in L1 and L2 trigger systems.

. . .

58

Figure 4.1

Illustration of a primary vertex. . . . . . . . . . . . . . . . .

64

Figure 4.2

Illustration of jet formation. . . . . . . . . . . . . . . . . . .

68

Figure 4.3

Illustration of a secondary vertex . . . . . . . . . . . . . . . .

70

Figure 7.1

A pie chart illustration of backgrounds and signals . . . . . .

92

Figure 7.2

Lepton transverse momentum . . . . . . . . . . . . . . . . . 100

Figure 7.3

HT for positive and negative samples . . . . . . . . . . . . . 101

Figure 7.4

The Lepton Eta . . . . . . . . . . . . . . . . . . . . . . . . . 101
xi

Figure 7.5

Single top mass distribution . . . . . . . . . . . . . . . . . . 102

Figure 7.6

Top Mass distribution for positive and negative samples . . . 102

Figure 7.7

W Transverse mass distribution . . . . . . . . . . . . . . . . 103

Figure 8.1

Flowcharts of the MVA analyses. . . . . . . . . . . . . . . . . 106

Figure 8.2

Color scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . 109

Figure 8.3

Decision tree illustration . . . . . . . . . . . . . . . . . . . . 111

Figure 8.4

BDT output for s+t channel . . . . . . . . . . . . . . . . . . 113

Figure 8.5

BDT output for s-channel

Figure 8.6

BDT output for t-channel . . . . . . . . . . . . . . . . . . . . 115

Figure 8.7

An illustration of a simple Neural Network. . . . . . . . . . . 116

Figure 8.8

BNN output for s+t channel . . . . . . . . . . . . . . . . . . 118

Figure 8.9

BNN output for s channel

Figure 8.10

BNN output for t channel . . . . . . . . . . . . . . . . . . . . 120

Figure 8.11

NEAT output for s+t channel . . . . . . . . . . . . . . . . . 122

Figure 8.12

NEAT output for s channel . . . . . . . . . . . . . . . . . . . 123

Figure 8.13

NEAT output for t channel . . . . . . . . . . . . . . . . . . . 124

Figure 8.14

Combined BNN output for s+t channel . . . . . . . . . . . . 126

Figure 8.15

Combined BNN output for s channel . . . . . . . . . . . . . 127
xii

. . . . . . . . . . . . . . . . . . . 114

. . . . . . . . . . . . . . . . . . . 119

Figure 8.16

Combined BNN output for t channel . . . . . . . . . . . . . . 128

Figure 8.17

s+t channel discriminant plots . . . . . . . . . . . . . . . . . 129

Figure 8.18

Zoomed s+t channel discriminant plots . . . . . . . . . . . . 129

Figure 8.19

t channel discriminant plots . . . . . . . . . . . . . . . . . . . 130

Figure 8.20

Zoomed t channel discriminant plots . . . . . . . . . . . . . . 130

Figure 8.21

s channel discriminant plots . . . . . . . . . . . . . . . . . . 131

Figure 8.22

Zoomed s channel discriminant plots . . . . . . . . . . . . . . 131

Figure 9.1

Illustration of ﬂat probability density . . . . . . . . . . . . . 135

Figure 9.2

Illustration of cross section measurement . . . . . . . . . . . 136

Figure 9.3

Illustration of two dimensional posterior probability density . 139

Figure 11.1

Illustration of a 2d posterior . . . . . . . . . . . . . . . . . . 155

Figure 11.2

Illustration of measuring CP violation . . . . . . . . . . . . . 156

Figure 11.3

Expected CP Violation results in s + t-channel . . . . . . . . 158

Figure 11.4

Expected CP Violation results in t-channel . . . . . . . . . . 159

Figure 11.5

Expected CP Violation results in s-channel . . . . . . . . . . 160

Figure 11.6

Observed CP Violation results in s + t-channel . . . . . . . . 163

Figure 11.7

Observed CP Violation results in t-channel . . . . . . . . . . 164

Figure 11.8

Observed CP Violation results in s-channel . . . . . . . . . . 165
xiii

Figure A.1

Color scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . 170

Figure A.2

Transverse momentum of lepton for both samples . . . . . . 171

Figure A.3

Centrality for All Jets for both samples . . . . . . . . . . . . 171

Figure A.4

Delta Phi between Lepton MET for both samples . . . . . . 172

Figure A.5

HT for both samples . . . . . . . . . . . . . . . . . . . . . . 172

Figure A.6

HT-AllJets-MinusBTaggedJet for both samples . . . . . . . . 173

Figure A.7

Invariant Mass of two leading jets for both samples . . . . . . 173

Figure A.8

InvariantMass-LightQuarkJets1-2 for both samples . . . . . . 174

Figure A.9

Leading b Tagged Jet for both samples . . . . . . . . . . . . 174

Figure A.10 LeadingBTaggedJetEta for both samples . . . . . . . . . . . 175
Figure A.11 LeadingBTaggedJetLeptonDeltaPhi for both samples . . . . . 175
Figure A.12 LeadingBTaggedJetPt for both samples . . . . . . . . . . . . 176
Figure A.13 LeadingLightQuarkJetBTagNN for both samples . . . . . . . 176
Figure A.14 LeadingLightQuarkJetEta for both samples . . . . . . . . . . 177
Figure A.15 LeadingLightQuarkJetLeptonDeltaPhi for both samples . . . 177
Figure A.16 LeadingLightQuarkJetPt for both samples . . . . . . . . . . 178
Figure A.17 LeptonEta for both samples . . . . . . . . . . . . . . . . . . 178
Figure A.18 METPt for both samples . . . . . . . . . . . . . . . . . . . . 179
xiv

Figure A.19 QTimesEta for positive and negative samples . . . . . . . . . 179
Figure A.20 SecondLightQuarkJetPt for positive and negative samples . . 180
Figure A.21 Single top mass for positive and negative samples

. . . . . . 180

Figure A.22 SpinCorr for positive and negative samples . . . . . . . . . . 181
Figure A.23 TopMassMinChiSqr for positive and negative samples . . . . 181
Figure A.24 WT for positive and negative samples . . . . . . . . . . . . . 182
Figure C.1

s+t channel discriminant plots . . . . . . . . . . . . . . . . . 187

Figure C.2

Zoomed s+t channel discriminant plots . . . . . . . . . . . . 187

Figure C.3

t channel discriminant plots . . . . . . . . . . . . . . . . . . . 188

Figure C.4

Zoomed t channel discriminant plots . . . . . . . . . . . . . . 188

Figure C.5

s channel discriminant plots for . . . . . . . . . . . . . . . . 189

Figure C.6

Zoomed s channel discriminant plots . . . . . . . . . . . . . . 189

Figure D.1

Muon charge mis-ID rate as a function of pT . . . . . . . . . 197

Figure E.1

Expected posterior density for W + jets asymmetry . . . . . 200

Figure E.2

Observed posterior density for W + jets asymmetry . . . . . 201

Figure F.1

HT for all jets before and after b/¯ JES
b

Figure F.2

Leading jet transverse momentum before and after b/¯ JES
b

Figure F.3

Single top mass before and after b/¯ JES . . . . . . . . . . . 208
b
xv

. . . . . . . . . . . 206
207

Figure F.4

HT for all jets before and after b/¯ JES
b

Figure F.5

Leading jet transverse momentum before and after b/¯ JES
b

Figure F.6

Single top mass before and after b/¯ JES . . . . . . . . . . . 211
b

xvi

. . . . . . . . . . . 209
210

Chapter 1
Introduction
This thesis gives an overview of the ﬁrst search for CP violation in single top quark
production at the DØ experiment at the Fermilab Tevatron Collider. Chapter 1 gives
an introduction to CP violation and the motivation for this search in the single top
process; Chapter 2 sketches the theory background; Chapter 3 gives an introduction
of the experimental facilities used in this analysis; Chapter 4 describes how to identify
basic physics objects; Chapter 5 presents the data and simulation samples used
in this analysis; Chapter 6 shows the event selection cuts and describes how the
background samples are normalized; Chapter 7 presents the multivariate methods
used; Chapter 8 describes how the statistical methods are implemented; Chapter 9
discusses systematic uncertainties encountered in this analysis; Chapter 10 shows the
speciﬁc methods used to measure the CP asymmetry and the results; Chapter 11
1

concludes this thesis by summarizing the results of the analysis and giving an outlook
for future analyses.

1.1

CP Violation

Charge conjugation and Parity (CP) symmetry states that the laws of physics should
be the same if a particle were interchanged with its antiparticle (C symmetry), and
left and right were swapped (P symmetry). CP-symmetry was proposed in 1957 by
Lev Landau as the true symmetry between matter and antimatter [2]. CP violation
occurs when there is a diﬀerence in the way that particles and anti-particles interact.
CP violation was ﬁrst discovered in 1964, in the decays of neutral kaons by J. Cronin
and V. Fitch [3]. Their discovery showed that weak interactions violate not only the
charge-conjugation symmetry C between particles and antiparticles and the parity
P, but also their combination. The violation of CP-symmetry plays an important
role both in the attempts to explain the dominance of matter over antimatter in the
universe, and in the study of weak interactions in particle physics.
The Standard Model Lagrangian contains two ways to break CP-symmetry. The
ﬁrst of these is in the QCD strong sector, which has yet not been found experimentally. The second way, involving the weak force, has been experimentally veriﬁed,
but only accounts for a small portion of the CP violation required to explain baryogenesis. Here we present a search for CP violation involving the top quark.
2

1.2

Motivation for searching for CP violation in
single top

The top quark was discovered in 1995 at the Fermilab Tevatron by the CDF and D0
collaborations[4]. It is the heaviest elementary particle found to date. Its large mass
and corresponding coupling strength to the Higgs boson suggests that CP violation
in the top sector might be accessible [5].
CP violation processes can exist within the Standard Model (SM) through a
CP-violating phase in the Cabibbo-Kobayashi-Maskawa (CKM) matrix. However,
it is estimated that the amount of CP-violation in the SM is not enough to explain
baryogenesis, suggesting the possibility that new physics is needed to account for all
CP-violating processes [6].
CP violation is a very rare phenomenon that was conﬁrmed in the decay of KL mesons [3]. However, the amount of CP violation observed in these systems can
be accommodated within the SM [7]. More interesting are systems involving Bmesons where the expected CP-violating eﬀects within the SM are much larger [8].
On the other hand, the SM predicts very low CP violation for top physics because
of the Glashow-Iliopoulos-Maiani (GIM) mechanism which suppresses changes in
strangeness by 2 in weak interactions, making any large CP-violation eﬀect direct
evidence of physics beyond the SM [6, 8].
3

The single top process provides a direct measurement of the CKM Matrix element
|Vtb |. One of the assumptions in measuring |Vtb | from single top is that the W tb
vertex is CP-conserving, and we test that assumption here.

1.3

Samples selection and single top analysis overview

This analysis is based on the single top cross section measurement in s+t, s and t
channels. (See Section 2.3 for deﬁnitions of these channels.) The data and simulation
samples went through various selection cuts and processing stages before multivariate
and statistical analysis. Sketch of the analysis is shown in Figure 1.1. These items
will be discussed in detail in Chapters 4 through 11.

4

$

!
"

#

%
"

#

Figure 1.1: Flow chart of the stages of CP violation analysis in single top. For
interpretation of the references to color in this and all other ﬁgures, the reader is
referred to the electronic version of this dissertation.

5

Chapter 2
Theory
Elementary particles are the building blocks of the world around us and their interactions have been successfully described by the Standard Model of Elementary
Particles [9, 10, 11, 12]. High energy experimentalists conduct experiments either to
test the SM’s predictions in great detail, or to search for new interactions that are not
predicted by the SM (Beyound SM). At the energy frontier, high energy accelerators
can produce heavy particles such as top quarks to probe new physics.
The top quark is the most massive observed elementary particle. It was postulated
in 1973 by Makoto Kobayashi and Toshihide Maskawa to explain CP violation in
kaon decays [7]. The top quark was discovered at the Tevatron collider at Fermilab
in 1995, with the production of a top-antitop pair via strong interactions [4].
C.-P. Yuan et al [13] and S. Willenbrock [14] predicted a top quark production
6

mode by means of the electroweak interaction, in which the top quark is produced
singly, thus also called “Single Top” quark production. This production mode has
been observed at the Tevatron [15, 16] in 2009 by the DØ and CDF collaborations.
Single top is sensitive to many new physics models, and CP violation would be
a signiﬁcant manifestation of them. Moreover, this measurement is unique in the
Tevatron (p¯ collider), and is not easily measured at the Large Hadron Collider
p
(LHC). The last section of this chapter explains how CP Violation in single top
manifest itself and how we intend to measure it.

2.1

The Standard Model

The Standard Model includes two major theories – quantum electroweak theory and
quantum chromodynamics. It describes the interactions between all known particles
(electromagnetic, weak, and strong interactions) in terms of one internally consistent
framework, the quantum ﬁeld theory (QFT). The particles in the Standard Model
are divided into two groups – fermions and bosons.
The Standard Model has had extensive success in explaining a wide variety of
experimental results. Among other examples, the Standard Model predicted the
existence of the W and Z bosons, gluon, and the top and charm quarks before
these particles were observed. Their predicted properties were also experimentally
conﬁrmed with good precision.
7

Three Generations
of Matter (Fermions)

mass→ 2,4 MeV
charge→ ⅔

u

spin→ ½

4,8 MeV

-⅓
½

d

0

½

171,2 GeV

c

⅔
½

t

⅔
½

charm

-⅓
½

4,2 GeV

s

b

-⅓
½

<0,17 MeV

<15,5 MeV

1

photon
0

νe νμ ντ

1

gluon
91,2 GeV

0

½

1

tau
neutrino

0,511 MeV

105,7 MeV

1,777 GeV

½

electron

Z

0

½

e

g

0

0

muon
neutrino

-1

γ

0

bottom

strange

electron
neutrino

Leptons

0

top

104 MeV

down
<2,2 eV

III

1,27 GeV

up

name→

Quarks

II

-1

½

μ

-1

½

τ
tau

muon

0

Z boson
80,4 GeV

W

±1

1

±

W boson

Figure 2.1: The Standard Model of elementary particles. [17]

8

Gauge Bosons

I

The Standard Model depends on 19 parameters, whose numerical values are established by experiment.

2.1.1

Fermions

The Standard Model includes 12 elementary particles of spin 1 known as fermions,
2
which observe the Pauli exclusion principle [18]. There are the six quarks (up,
down, charm, strange, top, bottom), and six leptons (electron, electron neutrino,
muon, muon neutrino, tau, tau neutrino). Each quark/lepton has a corresponding
antiparticle with the same mass but opposite electric charge.
The fermions are the building blocks of matter. They are further grouped into
three generations, where the lightest are in the ﬁrst generation and the heaviest in the
third. Each generation contains a charged lepton with charge −1, a charge-neutral
neutrino, and up-type and down-type quarks with charges + 2 and − 1 respectively.
3
3
All ordinary (baryonic) matter is made of ﬁrst generation charged particles (electrons,
and up and down quarks), since these do not decay. On the other hand, the two
higher generation charged particles decay within very short half lives.
Some properties of quarks and leptons can be seen in Figure 2.1. Note by massenergy equivalence, we usually express mass in units of eV/c2 , where c is the speed
of light in a vacuum, and we set c2 = 1 in this thesis, and then leave out c from
equations. The masses of hadrons are mostly of the order of 1 GeV/c2 , which makes
9

the GeV (gigaelectronvolt, or a billion electronvolt) a very convenient unit of mass.
For example the proton has a mass of 0.938 GeV.

The quarks carry color charge (red, green, blue), and hence interact through the
strong interaction. Quarks also carry electric charge and weak isospin (a component
of the weak hypercharge, which uniﬁes weak with electromagnetic interactions.), so
they can interact with each other both elctromagnatically and via the weak interaction. At lower energy, or larger distance, the strong interaction becomes very
strong, resulting in a phenomena called color conﬁnement, which results in quarks
being bound to one another, forming color-neutral composite particles called hadrons.
Hadrons are strongly interacting particles built from two types of quark combination:
¯
Baryon = QQQ (3 quarks), Meson = QQ (quark-antiquark pair). The familiar proton (uud) and the neutron (udd) are the two baryons composed of quarks with the
smallest masses.

The electrons, muons, and taus do not carry color charge, but carry an electric
charge, and so they all interact electromagnetically. The neutrinos do not carry color
charge or electric charge, so they only interact with the other particles through the
weak force, which makes them very diﬃcult to detect.
10

2.1.2

Gauge Bosons

In the Standard Model, gauge bosons are force carriers that mediate the strong, weak,
and electromagnetic fundamental interactions. The gauge bosons of the Standard
Model all have spin 1 (thus the name bosons).
Photons mediate the electromagnetic force between particles that carry electric charges. The photon is massless and is described by quantum electrodynamics
(QED) [11].
The W + , W − , and Z gauge bosons mediate the weak interactions between
particles of diﬀerent ﬂavors (all quarks and leptons). They are massive, with the Z
boson the heaviest. These three gauge bosons along with the photons are grouped
together, as collectively mediating the electroweak interaction.
The eight gluons mediate the strong interactions between color-charged particles
(the quarks). Gluons are massless. The eightfold multiplicity of gluons is labeled
by a combination of color and anticolor charge (e.g. red-antigreen). Because the
gluon has an eﬀective color charge, they can also interact among with each other,
which is diﬀerent from photons, which do not carry electric charge and therefore do
not interact with each other. The gluons and their interactions are described by the
theory of quantum chromodynamics (QCD) [11].
Table 2.1 gives a summary of the properties of the gauge bosons in the SM.
11

Table 2.1: Standard Model gauge bosons
Gauge Bosons spin = 1
Force

Mass [GeV ]

Electric Charge

Strong

Gluon (g)

0

0

Weak

W±

80.398 ± 0.023

±1

Weak

Z

91.1876 ± 0.0021

0

EM

2.1.3

Gauge boson

Photon (γ)

0

0

Higgs Boson

The Standard Model also predicts the existence of another boson, the Higgs boson,
which gives mass to all elementary particles, including the massive W/Z bosons and
the lighter fermions, i.e. quarks and leptons, and even the Higgs boson itself. In this
way the Higgs boson is a key building block in the Standard Model.
The Higgs particle was theorized by R. Brout, F. Englert, P. Higgs, G. Guralnik,
C. R. Hagen, and T. Kibble in 1964 [19, 20, 21]. It has no intrinsic spin (spin 0), and
for that reason is classiﬁed as a boson (like the gauge bosons integer spin 1). Because an exceptionally large amount of energy and beam luminosity are theoretically
required to observe a Higgs boson in high energy colliders, it is the only fundamental
particle predicted by the Standard Model that has yet to be observed. It is hoped
that the LHC at CERN will conﬁrm the existence of the Higgs boson.
12

2.1.4

Limits of the Standard Model

In spite of extensive experimental successes, the Standard Model is not a complete
theory of fundamental interactions. It has its own intrinsic defects, and there are
numerous phenomena that the SM can not explain. This leads to various Beyond
the Standard Model (BSM) searches. Listed below are some of the limitations of the
SM:
• It does not incorporate the physics of dark energy, which is the reason that
the universe is expanding at an accelerating rate, and accounts for 73% of the
total mass-energy of the universe [22].
• It does not explain gravitation, and it can’t describe general relativity consistently in terms of quantum ﬁeld theory.
• It does not correctly account for neutrino oscillations and their non-zero masses
observed by the oscillation experiments [23]. Although mass terms for the
neutrinos can be added to the standard model by hand, the speciﬁcs of neutrino
mass are still unclear.
• It depends on 19 numerical constants, whose values are obtained from experiment, but the origin of the values is unknown.
• It has several apparently unnatural properties giving rise to puzzles like the
strong CP problem, where QCD strong interaction is CP-symmetric, and the
13

hierarchy problem, such as why the weak force is 1032 times stronger than
gravity [24].
• It does not contain any dark (i.e. nonluminous) matter particle that possesses
all of the required properties deduced from cosmology observations [25].
• It is also diﬃcult to accommodate the predominance of matter over antimatter
(matter/antimatter asymmetry).

2.2

The Top Quark

The existence of the top quark was postulated (together with bottom quark) in 1973
by Makoto Kobayashi and Toshihide Maskawa to explain the observed CP violations
in kaon decay [7]. In hadron colliders, it can be produced via both the strong and
electroweak interactions. The top quark was discovered in 1995 by the DØ and
CDF Collabrations at Fermilab via strong interactions [4]. The production of single
tops via weak interaction was discovered also at Fermilab by the DØ and CDF
experiments in 2009 [15, 16] .

2.2.1

Properties

The top quark is an up-type quark with spin 1/2 and charge +2/3e. It has the largest
mass of any known elementary particles. Its mass is measured 172.9 ± 1.5 GeV [26].
14

Due to its large mass, it needs high energy colliders to be created, such as Tevatron
or LHC. It participates in strong, weak and electromagnetic processes.

2.2.2

Production

In hadron collisions, top quarks are produced dominantly in pairs through the QCD
¯
¯
processes q q → tt and gg → tt. Approximately 85% of the production cross sec¯
tion at the Tevatron is from q q annihilation, with the remainder from gluon-gluon
¯
fusion. Smaller cross sections are expected from electroweak single-top productions
mechanisms, mediated by virtual s-channel and t-channel W bosons, respectively. A
virtual particle is one having mass not equal to that of the free particles, also called
“oﬀ mass shell”.
A Feynman diagram for top pair production is shown in Fig. 2.2

2.2.3

Decay

Top quarks decay almost exclusively into a W boson and a b quark, with a branching
ratio B(t → W + b) ≈ 99.8%. Thus the ﬁnal decay products are determined by the
W boson decay.
The fraction of top events containing b quarks is expected to be close to 100%,
and the quarks to be rather energetic, since they come from the decay of a massive
object.
15

Figure 2.2: Leading order Feynman diagram for the production of top quark pairs in
strong forces, where a quark-antiquark pair form a very energetic gluon, which then
decays into a top-antitop pair. From here and below, for the Feynman diagrams, the
time axis is always from left to right, and the space axis is from bottom up, unless
otherwise noted.
Because of its enormous mass, the top quark has an extremely short lifetime of
only 5 × 10−25 s, which is about 20 times smaller than the time scale of the strong
interaction (10−23 s). As a result, unlike other quarks, the top quark is expected
to decay before top-ﬂavored hadrons or tt-quarkonium bound states can form. This
provides a unique opportunity to study directly the decay of quarks.

2.3
2.3.1

The single top quark
Single top production and decay at the Tevatron

Electroweak top quark production is usually referred to as single top quark production because only one top quark is produced per event. At hadron colliders,
16

there are three Standard Model single top quark production modes, the s-channel
p
b
(p¯ → t¯ + X) 2.3 and t-channel (p¯ → tq¯ + X) 2.4 exchanges of a virtual W , and
p
b
the tW production (p¯ → tW + X). The next to leading order (NLO) cross sections
p
with NNLL corrections for these processes at the Tevatron at 1.96 TeV are listed in
Table 2.2. The central values of the cross sections for single top are for estimated
172.5 GeV top quark mass. The uncertainties include components from the choice
of scale and parton distribution functions, but not for the top quark mass.
Table 2.2: Single top quark production cross sections [1].
Process

Cross Section (pb)

t-channel

2.26 ± 0.12

s-channel

1.04 ± 0.04

At the Tevatron, the two dominant single top quark production channels are the
s-channel and t-channel. In the s-channel, an intermediate virtual time-like W -boson
decays into a top and antibottom quark, as shown in Figure 2.3. The ﬁnal state is
therefore a W boson and two bottom jets. In the t-channel, a bottom quark (originating from gluon radiation) transforms to top quark by exchanging a virtual space-like
W boson with an up or down quark, as shown in Figure 2.4. The accompanying
bottom quark is typically at large pseudorapidity (See deﬁnition in Equation 3.6)
and low transverse momentum, deﬁned as pT =
17

p2 + p2 . Thus they usually esx
y

cape detection, and are often not reconstructed in the analysis. The t-channel ﬁnal
state results also dominantly in W +2 jets, with just one of the jets coming from a
bottom quark. Occasionally, the initial or ﬁnal-state quark can radiate a hard gluon
or quark at a large angle. That can be detected as additional jets. Together with
the light quark, this will lead to W +3 and W +4 jets events. The number of jets
reconstructed in the detectors depends on the decay kinematics. Although as shown
in Figure 2.4 is a 2 → 3 process, it contributes similar large cross section in the NLO
calculation to s-chanel [27]. The tW production mode, where a single top quark is
created in association with a real (on-shell) W boson, has too small of a cross section
(0.14±0.03 pb [1]) at the Tevatron energy, so it’s not included in our search channels.

Figure 2.3: Feynman diagram for single top s-channel production.

As mentioned in the top quark section 2.2, the top quark decays to a W boson
18

Figure 2.4: Feynman diagram for single top t-channel production.

and a b quark. The W boson further decays to two jets or to a lepton plus neutrino
(Figure 2.5). The all-jets channel is immersed in a large QCD background. In
our analysis, the electron and muon leptonic ﬁnal states are the most useful. The
Feynman diagrams for the single top production and decay modes for s- and tchannel are shown in Figure 2.6 and Figure 2.7 separately.
Due to its large mass, the top quark is produced almost at rest, and the b quark
from the top quark decay tends to be central and have large transverse momentum,
as shown in Figure ??. For the decay products of W boson, the lepton has a softer
pT spectrum than that of the neutrino. This is due to the W boson polarization,
so the lepton from the W boson decay preferentially points in the same direction
as the incident proton or antiproton. The lepton appears as an isolated track of
19

Figure 2.5: Decay branching ratio of a top quark. Source: CP Yuan class notes.

Figure 2.6: Feynman diagram for single top s-channel production and decay.

20

Figure 2.7: Feynman diagram for single top t-channel production and decay.
high momentum in the cetnral tracking detector, matched with a shower in the
electromagnetic segments of the calorimeter or the muon chamber. The transverse
momenta of neutrinos from the W boson decay are reconstructed from the imbalance
in transverse energy measured in each event (missing ET):

ET =−
/

i
ET .

(2.1)

(Other quarks involved in the collision generate secondary hadrons, but they are
mostly low pT .)
Several single top parton level kinematics distributions for both channels are
shown in Figure 2.8 to Figure 2.11 .
The light quark produced in the t-channel has reasonably large pT , but its most
distinguishing feature is the asymmetric Q(l) × η distribution, where Q(l) is the
21

dσ/dpT [fb/5 GeV]

2 (a)
1.8
1.6
1.4
1.2
1
0.8
0.6
0.4
0.2
0
0

b from top
b with top
lepton
neutrino

50

100

150
pT [GeV]

Figure 2.8: The transverse momentum for s-channel

22

dσ/dpT [fb/5 GeV]

b from top (b)
light quark
b from gluon
lepton
neutrino

5
4
3
2
1
0
0

50

100

150
pT [GeV]

Figure 2.9: The transverse momentum for t-channel

23

dσ/dη [fb/0.2]

b from top
b with top
lepton

16 (c)
14
12
10
8
6
4
2
0
-4

-2

0

2

Figure 2.10: The pseudorapidity for s-channel

24

4
η

dσ/dη [fb/0.2]

b from top
light quark
b from gluon
lepton

35
30
25

(d)

20
15
10
5
0
-4

-2

0

2

Figure 2.11: The pseudorapidity for t-channel

25

4
η

charge of the lepton in the event, which is used to keep the multiplication positive.
This asymmetry arises because the ﬁnal state light quark produced during the single
top production most often is a d quark that moves in the same direction as the proton
(antiproton). The light quark η will hence tend to have the same sign as the charge
of the lepton from the top decay.
Event selection therefore requires a high pT electron or muon, two to four jets,
one of which must be identiﬁed as originating from a bottom quark, and missing ET
(Details of event selection see Chapter 5).

2.3.2

Motivations for searching for CP Violation in single
top

The main signiﬁcance of measuring the cross sections for single-top production is that
it is directly proportional to the |Vtb |2 component of the CKM matrix. Moreover,
no assumption is needed on the number of families or on the unitarity of the CKM
matrix to measure |Vtb |. If there is CP violation, it would show up for amplitude in
|Vtb |.
Single top quark production oﬀers excellent opportunities to probe physics beyond
the standard model. There are several ways new physics could produce a single
top quark diﬀerently for top and anti-top through processes such as new exchange
particles, ﬂavor changing neutral currents (FCNC), anomalous W tb couplings or
26

other processes.
Single top can be produced by the decay of a resonance, for example, a W ′
boson, which have the same ﬁnal states as the SM single top production, as shown
in Figure 2.12. If this resonance couples diﬀerently to top and anti-top, this could
produce diﬀerent cross sections.

Figure 2.12: Feynman diagram for W ′ boson decaying into single top.

Non-SM interactions, such as ﬂavor changing neutral currents (FCNC), can also
enhance the total single top cross section for t-channel, and possibly introduce a
diﬀerence in top and anti-top production rates. The Feynman diagrams for FCNC
via exchange of a gluon are shown in Figure 2.13.
In summary, the s-channel (tb) single top production is most sensitive to new,
heavy charged bosons. The t-channel (tqb) single top quark production would similarly be enhanced by ﬂavor-changing neutral currents (FCNC). Either could produce
27

Figure 2.13: FCNC via exchange of a gluon in single top production.
a CP asymmetry eﬀect, providing sensitivity to new physics.

2.4

Outline of CP violation measurement in single
top

Combining the CP symmetry with simultaneous time reversal (T) produces a combined symmetry called CPT symmetry. As of now, no violations of CPT symmetry
have been observed. CPT symmetry must be preserved in any Lorentz invariant
local quantum ﬁeld theory with a Hermitian Hamiltonian.
We exploit that Tevatron is a p¯ collider, meaning that the initial state is a CP
p
eigenstate and therefore, any diﬀerence between the cross sections for p¯ → tX and
p
¯
p¯ → tX would be a clear indication of CP violation [8, 28].
p
28

The purpose of this measurement is to measure the top and anti-top quark production separately. The theoretical cross section for each of them is half that of the
total single top cross section.
The asymmetry AP in the single top production can be expressed as follows:
AP =

¯
σ(p¯ → tX) − σ(p¯ → tX)
p
p
.
¯
σ(p¯ → tX) + σ(p¯ → tX)
p
p

(2.2)

where σ denotes the production rate.
¯
The strategy of this analysis is to identify the production of single t or t quarks
using the charge of the lepton when the W boson decays leptonically and then
¯
measure the individual single t and t production cross sections. We assume that no
signiﬁcant CP violation is present in background samples.
This analysis uses the same event selection and signal and background modeling
as the measurement of the single top quark production cross section using 5.4 fb−1
of DØ Run IIa and Run IIb data [29]. We split by lepton charge the discriminant
that was used to measure the single top quark production cross section [30], and
use it to measure top and antitop production cross sections separately. Therefore all
the systematic uncertainties in the single top quark cross section measurement are
applied to this analysis [29, 30]. A new systematic uncertainty is added to account
for the misidentiﬁcation of the lepton charge, discussed in detail in Appendix D.

29

Chapter 3

Experimental Setup

The Fermi National Accelerator Laboratory (Fermilab) [31, 32] is located in the west
surburbs of Chicago, Illinois in the USA. Fermilab is the only facility in the world
where high energy protons and anti-protons collide at a center-of-mass energy of
√

s = 1.96 TeV [32, 33]. The Tevatron completed its physics program in September

2011 and stopped running. All the other elements in the accelerator chain except the
Tevatron and recycler are still being used for neutrino physics and other experiments.
A complex detection system is required to “see” the details of these energetic
collisions, and study interesting physics processes such as single top quark production. The two main detectors at Fermilab are known as DØ and CDF. This chapter
presents a description of accelerator complex and the DØ detector at Fermilab.
30

3.1

The Accelerator Complex and Tevatron

Fermilab uses a series of accelerators to create the high energy proton and antiproton
beams that collide in the Tevatron collider. The proton beams are created starting
with hydrogen gas from a linac at the beginning of the accelerator cycle. A portion
of the proton beams are taken from the Main Injector to create antiprotons. Once
enough antiprotons are accumulated, they are loaded into the Tevatron, where they
are accelerated to the ﬁnal highest energy stage. The accelerated protons and antiprotons are grouped into a number of bunches, and they collide at two points where
the CDF (B0) and DØ (D0) detectors are located. Figure 3.1 is a picture illustrating
Fermilab’s accelerator chain. Below we talk about each part separately.
• Cockcroft-Walton Pre-accelerator
Producing negatively charged hydrogen ions (H − ) is the ﬁrst step in creating proton and antiproton beams. This takes place in the Cockcroft-Walton
generator. Hydrogen gas is converted into H − ions by introducing it into a
container lined with molybdenum electrodes. The ions are accelerated out of
the container by a 750 keV electrostatic ﬁeld applied by the Cockcroft-Walton
generator.
• Linear Accelerator
The Fermilab Linac is a 500-foot long linear particle accelerator that accelerates
31

Figure 3.1: Fermilab’s accelerater chain.

32

the negative hydrogen ions H − from the Cockcroft-Walton pre-accelerator to
400 MeV before injecting them into the Booster accelerator. The H − ions
pass through a carbon foil, which removes electrons from the hydrogen ions,
creating positively charged protons or H + .
• Booster
Located nearly 20 feet below ground, the Booster is a circular accelerator with a
diameter of 500 feet that uses magnets to bend electrically charged particles in
a circular path. Boosting means the protons coming from the linac experience
a repeated acceleration from electric ﬁelds during each revolution, increasing
their energy by a small amount each time. The protons travel around the
Booster about 20,000 times, and their energy is boosted to 8 GeV by the end
of the acceleration cycle. Then the Booster sends the protons to the Main
Injector.
In the synchrotron, the ﬁnal beam energy is determined by the ring radius and
the maximum value of ﬁeld:
p = 0.3Bρ,

(3.1)

where p is the proton momentum, B is the magnetic ﬁeld in Tesla, and ρ is the
ring radius in meters.
• Main Injector The Main Injector (MI) is the next link in the accelerator chain.
33

The MI performs three functions: it accelerates protons and antiprotons for
injection into the Tevatron, it delivers protons for antiproton production, and it
transfer antiprotons between antiproton storage rings and from the antiproton
storage rings to the Tevatron. Protons from the Main Injector are also delivered
to several neutrino targets.

• Antiproton Source
Antiprotons do not exist freely in nature like normal matter. To produce antiprotons, a 120 GeV beam of protons taken from the main injector is steered
onto the side of a drum shaped nickel target. The drum is rotated quickly to
prevent the beam from hitting the same spot and thus preventing the destruction of the target. The collisions produce many secondary particles including
antiprotons. The aniprotons then enter a beamline in which they are captured
and focused, before being injected into a storage ring. After accumulating
a suﬃcient number of antiprotons in the storage ring, they are sent to the
Recycler before they are injected into the Tevatron.

• The Recycler
The Fermilab Recycler is a 8 GeV permanent magnet storage ring for the
accumulation of antiprotons from the antiproton source, and the recovery of
the antiprotons remaining at the end of a Tevatron store. It is located in the
34

Main Injector tunnel directly above the Main Injector beamline.

• The Tevatron
The Tevatron is the most powerful proton-antiproton accelerator in the world.
The Tevatron ceased operations on September 30th, 2011. It accelerates beams
of protons and antiprotons to nearly the speed of light at an energy of 0.98
TeV around a four-mile circumference vacuum pipe mostly surrounded by superconducting electromagnets. 774 niobium-titanium superconducting dipole
magnets cooled in liquid helium are used to produce a magnetic ﬁled of 4.2 Teslas. Another 240 NbTi quadrupole magnets are used to focus the beam [31].
The collisions reproduce conditions in the early universe (≈ 10 ps after the Big
Bang, in comparison to 1 ps for LHC) and probe the structure of matter at a
very small distance. Picture 3.2 shows the Tevatron tunnel.

3.2

Luminosity and Cross Sections

In particle physics, collision rates are measured in terms of instantaneous luminosity,
L. The luminosity is an important value to characterize the performance of an
accelerator. Luminosity is deﬁned as the number of particles per unit area per unit
time, usually expressed in cgs units, cm−2 s−1 .
For an intersecting storage ring collider, the instantaneous luminosity can be
35

Figure 3.2: Fermilab’s Tevatron Tunnel. The upper magnets are not in use. The
lower magnets are part of the Tevatron Collider. Protons and antiprotons travel
through a vacuum pipe located inside the string of magnets. On the roof are the
cables and cooling pipes.

36

characterized by the parameters:

N N
L = fn 1 2
A

(3.2)

where
f is the revolution frequency,
n is the number of bunches in one beam in the storage ring,
Ni is the number of particles in each bunch,
and A is the cross area of the beam.
The higher the luminosity, the greater the chance that a proton will collide with
an antiproton. The Tevatron had been able to deliver luminosities up to 4 × 1032
cm−2 s−1 [34]. At the interaction point, the beams are squeezed very narrowly to
increase the chance of hard collisions.
The rate of a physics process is given by

dN
= Lσ
dt

(3.3)

where σ is the total cross section of the process, N is the count of physics events,
and L is the (instantaneous) luminosity.
The total number of collisions which result in a given physics process in a period
37

of time can thus be written as

N =σ

where L ≡

Ldt = σL

(3.4)

Ldt, the integrated luminosity, is deﬁned as the integral of the instan-

taneous luminosity with respect to time.
Therefore, the collision count can be calculated by multiplying the integrated luminosity by the cross-section for that collision process. The total number of collisions
is directly proportional to the integrated luminosity over this time.
The integrated luminosity has the unit of inverse area, usually expressed as inverse
picobarns pb−1 or inverse femtobarns f b−1 (1 barn = 10−24 cm2 ; 1 fb = 10−15
barns = 10−39 cm2 ).
This analysis uses 5.4 f b−1 of data collected at DØ.

3.3

The DØ detector

Detectors are our “eyes” to record and identify the useful events to reveal the nature
of fundamental interactions. DØ is one of two detectors used to study collisions produced in the Tevatron. Proton-antiproton collisions create showers of new particles
at the center of both CDF and DØ detectors at a rate of more than 2 million Hz.
We cannot record all those events. The trigger system decides whether an event
38

is interesting enough to be worth keeping. About 20 events per second are stored
on computer tape. For these interesting events, the detectors record each particle’s
ﬂight path, energy, momentum and electric charge.
DØ is a typical high energy particle detector. It measures thirty feet tall and
ﬁfty feet long and consists of four major parts: a tracking system, calorimeter, muon
system, trigger system. Figure 3.3 gives a sketch of the DØ detector components.

Figure 3.3: Side view of the DØ detector. The proton/anti-proton beam travels from
left/right to right/left.

39

3.3.1

Particle Signatures

Stable particles such as p, p, e± , γ will show up in the detector as energy deposits in
¯
hadronic and electromagnetic calorimeters or charged tracks in the tracking system.
Muons have a long lifetime and interact less with matters in the detector, which
means they pass through the entire detector before they decay, and leave tracks
in the muon detectors. Most of the particles produced in high-energy collisions
are short-lived. In order to observe all possible decay products of an collision, one
needs a hermetic detector and electronic system to record the events for further
oﬄine analysis. Besides, since the presence of a neutrino is inferred from the missing
energy, it’s important that the calorimeters have 4π solid angle coverage, so that
missing energy is not due to any failure to detect all produced particles. Figure 3.4
shows a typical cylindrical modern detector:
Figure 3.5 shows what signatures particles will leave in the components of the
detector.

• Most charged particles, like electrons and protons, are detected both in the
tracking chamber and the electromagnetic calorimeter.

• Muons have a long life time, and have small interactions with matters in
calorimeter, so they are detected in all parts of the detector. They pass through
the entire detector, and leave tracks in the muon chamber.
40

Figure 3.4: A schematic of the basic components of a hermetic detector. I.P. refers
to the region containing the interaction point. This is a cross section of the typical
cylindrical design. [35].

41

Figure 3.5: High energy particle signatures left in the detector components [36].
• Neutral particles, like neutrons and photons, are not detectable in the tracking
chamber.

• Photons are detected by the electromagnetic calorimeter, while neutrons are
evidenced by the energy they deposit in the hadron calorimeter.

• Neutrinos rarely interact with matter, and will escape from detection by the
detector. Their presence can only be inferred using missing energy by looking
at the momentum balance (Eq. 2.1).

An event consists of all data pertaining to a single p¯ interaction as measured in
p
the detector.
42

3.3.2

Coordinate System and Pseudorapidity

For the convenience of the DØ detector description and data analysis, we use a righthanded coordinate system, in which the z-axis is deﬁned along the beam pipe in the
direction of the proton beam, and the y-axis is upward, and thus x-axis directed into
the paper by right-handed rule (Figure3.3). The (x, y)-plane is usually referred to
as the transverse plane, which is vertical to the beam direction (z-axis). The polar
angle θ is the angle relative to the positive z-axis, and the azimuthal angle φ is the
angle from the positive x-axis on the transverse plane.
The rapidity of a particle of momentum pµ is deﬁned as

y=

1
ln
2

|p| + pz
|p| − pz

,

(3.5)

where pz is the component of the momentum along the beam axis.
In the Tevatron, the particles are traveling close to the speed of light, thus the
mass of the particle is neglectable, and E ≈ |p| establishes. In this limit, the rapidity
becomes equal to the pseudorapidity:

y=

1
ln
2

E + pz
E − pz

θ
where η ≡ − ln tan 2

1
1 − cos θ
= − ln tan
2
1 + cos θ

= − ln tan

θ
2

≡ η,

(3.6)

is called the pseudorapidity, and θ is the angle between

the particle momentum p and the beam axis.
43

We use the term “forward” to describe the regions that are close to the beam
axis at large |η| (e.g. 2.5), and “central” means the regions with small |η| (e.g. 1.5).
Pseudorapidity depends only on the polar angle of the trajectory of a particle, and
not on its energy as seen from Eq. 3.6.
The detector pseudorapidity η det is deﬁned as the pseudorapidity of the calorimeter cluster (electron, jet, or central track muon) with respect to the geometric center
θ
of the DØ detector, η det ≡ − ln tan det
2

, where θdet is relative to the center

of the detector.

3.3.3

The Central Tracking System

The point where the beams collide is surrounded by tracking detectors to record the
tracks (trajectories) of high energy particles produced in the collision. The Tracking
System is closest to the interaction point. It includes a silicon micro-strip tracker
(SMT) and a central ﬁber tracker (CFT) right outside the silicon detector. The
whole tracker is immersed in a 2 T superconducting solenoidal magnet, thus particle
tracks are curved; from the curvature we can deduce their momentum. The SMT
and CFT are optimized for tracking and vertexing at pseudorapidities |η| < 3 and
|η| < 2.5, respectively. A schematic view of the central tracking system is shown in
Figure 3.6.
The Tracking System is used to identify and measure the momentum of electrons
44

Figure 3.6: The Tracking System at DØ.

45

and muons, and to determine the position of the primary interaction vertex. The two
tracking detectors are able to locate the primary interaction vertex with a resolution
of about 35 µm along the beamline. They are also used to identify jets originating
from b quarks, which is an important decay product of single top.

3.3.3.1

Silicon Micro-strip Tracker (SMT)

The silicon micro-strip tracker (SMT) is located closest around the interaction region.
It gives very precise information, but silicon is expensive, so it is installed closest to
the beam where they cover a smaller area. The SMT is used for both tracking and
vertexing.
The SMT is designed of barrel modules interspersed with disks in the center, and
disks in the forward region (Figure 3.7). The barrel detectors primarily measure the
r − φ coordinate and the disk detectors measure r − z and r − φ. Vertices of particles
in the forward region are reconstructed in three dimensions by the disks, and vertices
of particles in the central region are measured in the barrels, and the central ﬁber
tracker. The detector has in total six barrels and sixteen disks.

3.3.3.2

Central Fiber Tracker (CFT)

Outside the silicon, DØ has an outer tracker made of scintillating ﬁbers, which
produce photons of light when a particle passes through. The CFT consists of scintillating ﬁbers mounted on eight concentric support cylinders and occupies r from
46

Figure 3.7: The disk/barrel design of the silicon microstrip tracker.
20 to 52 cm from the center of the beampipe. The ﬁbers are double clad and are
835 microns in diameter. The two innermost cylinders are 1.66 m long; the outer six
cylinders are 2.52 m long. The outer cylinder provides coverage for |η| < 1.7 (See
Fig. 3.6).
Each cylinder supports two layers of ﬁber doublets. One doublet layer of ﬁbers
is oriented along the beam direction (z) and a second doublet layer is oriented at a
stereo angle in +3◦ or −3◦ .
The scintillating ﬁbers are coupled to clear ﬁber waveguides which carry the
scintillation light to visible light photon counters for read out. The entire detector
has 71,680 channels.

3.3.4

The Central and Forward Preshower Detectors

The central and forward preshower detectors are located between the solenoid and
the calorimeters. The preshower detectors aid in electron identiﬁcation and back47

ground rejection during both triggering and oﬄine reconstruction. They function as
calorimeters as well as tracking detectors, enhancing the spatial matching between
tracks and calorimeter showers.
The central preshower detector (CPS) covers the region |η| < 1.3. The CPS
consists of three concentric cylindrical layers, each having 1280 triangular scintillator
strips. The resolution of the CPS is about 6 mm. The two forward preshower
detectors (FPS) cover 1.5 < |η| < 2.5.

3.3.5

The Calorimeters and ICD

The DØ calorimeter system consists of three uranium/liquid-argon calorimeters and
an intercryostat detector (ICD).

3.3.5.1

Calorimeters

The calorimeters are located outside the tracking system. They use uranium metal
bathed in liqueﬁed argon to capture particles and measure their energies. The
calorimeters provide energy measurements for electrons, photons, and jets, as well
as assist in identiﬁcation of electrons, photons, jets, and muons and measure the
transverse energy imbalance in events (missing transverse energy E T ). They play
/
crucial role in this analysis. The calorimeter systems are illustrated in Figure 3.8
below.
48

Figure 3.8: Illustration of the calorimeter systems.

49

As can be seen in Figure 3.8, each calorimeter contains an electromagnetic section
closest to the interaction region followed by ﬁne and coarse hadronic sections.
As shown in Figure 3.9, the calorimeter has a central section (CC) covering
pseudorapidities up to |η| ≈ 1.1, and two end calorimeters (EC) that extend coverage
to very forward region |η| ≈ 4.2. All three calorimeters are housed in separate
cryostats that maintain a temperature of approximately 90 K.

Figure 3.9: Intersecting view of a quarter of the DØ calorimeter, showing the trigger
towers and pseudorapidity coverage. The shading pattern indicates groups of cells
ganged together for signal readout. The rays indicate pseudorapidity intervals from
the center of the detector. The ICD is also denoted in the ﬁgure.

The signals generated by particle interactions in the calorimeters are grouped
into showers. Calorimeter readout cells form pseudo-projective towers as shown
50

in Figure 3.9, which are called Trigger Towers, which are essential for L1 and L2
triggers. There are 55,296 calorimeter electronic channels to be read out in total.

3.3.5.2

Intercryostat Detector

There are two issues with the calorimeters:
• The three calorimeters provide incomplete coverage in pseudorapidity.
• There is substantial material in the region we do not analyze, thus degrading
the energy resolution.
To address these problems, additional single-cell structure layers of sampling have
been added between the central and end cryostats. This Intercryostat detector (ICD)
consists of scintillator sampling that is attached to the exterior surfaces of the end
cryostats. It covers the region 1.1 < |η| < 1.4. Its rough location is shown in
Figure 3.9.

3.3.6

The Muon System

Surrounding all of the rest of the detector, the muon system resides at the most
outside layer beyond the calorimeter, and as the name implies, detects muons. Unlike
most common particles, muons don’t get absorbed in the calorimeter; they interact
small with the calorimeter and live long enough to leave the detector. By putting
51

detectors outside, we can identify muons relatively easily. High energy muons are
quite rare and a sign of interesting collisions.
The central muon system uses proportional drift tubes (PDTs), central scintillation counters. The PDTs and scintillators are emerged in toroidal magnets in order
to bend charged particles. The scintillators are also used for triggering. The central
muon system provides coverage for |η| < 1.0.
The forward muon system extends muon detection to |η| ≈ 2.0, and uses mini
drift tubes (MDTs) rather than PDTs, and includes trigger scintillation counters and
beam pipe shielding.

3.3.7

The Luminosity Monitor

Luminosity at the DØ interaction region is measured by plastic scintillator arrays
placed in front of the EC cryostats. A dedicated Luminosity Monitor (LM) is used
to detect inelastic pp collisions. The LM also serves to measure beam halo rates and
as a fast measurement of the z coordinate of the interaction vertex.
The LM detector consists of two arrays of twenty-four plastic scintillation counters
with Photomultiplier tubes (PMTs) readout. The arrays are located in front of the
end calorimeters and occupy the radial region between the beam pipe and the forward
preshower detector. They cover the pseudorapidity range 2.7 < |η| < 4.4.
52

3.3.8

The DØ Trigger System

Proton-antiproton collisions delivered by the Tevatron happen inside the detector
2.5 million times every second. We cannot record all those events on computer tape.
Instead, trigger and data acquisition systems are designed to accommodate the high
interaction rates. The trigger is a system of fast electronics and computers that
decides whether an event is interesting enough to be kept using information from a
small subset of the DØ detector signals. The task of the trigger system is to suppress
background events as eﬃciently as possible while not losing precious physics events.
Three distinct levels form this trigger system with each succeeding level examining
fewer events but in greater detail.

• The ﬁrst stage (Level 1 or L1) comprises a collection of hardware trigger elements that provide a trigger accept rate of about 2 kHz.
• In the second stage (Level 2 or L2), hardware engines and embedded microprocessors associated with speciﬁc subdetectors provide information to a global
processor (L2GBL) to construct a trigger decision based on individual objects
as well as object correlations. Once there is a L2 accept, the complete detector
information is read out. The L2 system has an accept rate of approximately 1
kHz.
• Events passed by L1 and L2 are sent to a farm of Level 3 (L3) microprocessors;
53

sophisticated algorithms reduce the rate to about 50 Hz to be recorded for
oﬄine reconstruction.

The overall coordination and control of DØ triggering is handled by the software
package COOR running on the online host. COOR interacts directly with the trigger
framework (for L1 and L2 triggers) and with the data acquisition supervising systems
(for the L3 triggers).

3.3.8.1

The Level 1 Trigger

The Level 1 Trigger is implemented in hardware and programmable ﬁrmware. The
calorimeter trigger (L1Cal) inputs consist of electromagnetic and hadronic trigger tower energies made up of sums over depth and transverse coordinates (in a
∆η × ∆φ = 0.2 × 0.2 region). The central track trigger (L1CTT) does a fast track
identiﬁcation and momentum estimate using FPGAs and ﬁrmware. The muon system trigger (L1Muon) looks for patterns consistent with muons using hits from muon
wire chambers, muon scintillation counters, and tracks from the L1CTT. The L1 forward proton detector trigger (L1FPD) selects diﬀractively-produced events in which
the outgoing protons or antiprotons are scattered at very small angles. The trigger
framework (TFW) gathers digital information from each of these L1 trigger devices
and determines whether a particular event is to be accepted for further examination
by L2, or rejected. The L1 trigger makes a decision in 3.5 µs.
54

3.3.8.2

The Level 2 Trigger

The L2 trigger provides detector-speciﬁc preprocessors (L2Cal, L2Muon, L2PS, L2CTT,
L2STT) and a global stage (L2Global) to reﬁne objects coming from L1 and test for
correlations in physics signatures across detector subsystems. L2 includes preprocessors for each detector subsystem and a global processor for integration of the
data. Preprocessor subsystems include tracking, calorimeter, preshower, and muon
systems. The subsystems work in parallel and trigger decisions are made in the
L2Global stage based on physics objects reconstructed in the preprocessors.
L2 preprocessors collect data from the front-ends and L1 trigger system and
analyze these data to form physics objects. L2 can also combine data across detectors
to form higher quality physics objects and examine event-wide correlations in all L2
physics objects. The full detector is read out for events passing L2 and further
analysis in the L3 trigger.
Data arrive at the L2 system via three transmission protocols. Calorimeter and
tracker data and signals from the TFW are transmitted by 1.3 Gbit/s serial GLinks on optical ﬁbers [37]. The muon system uses 160 Mbit/s Cypress Hotlink
transmitters on coaxial cables or standard CAT/6 cables, unshielded twisted pair
(UTP) Hotlinks [38].
• L2Cal The calorimeter preprocessor system identiﬁes jets and electrons/ photons and calculates event E T for the global processor. Each processor uses the
/
55

ET data from the 2560 calorimeter trigger towers, including both electromagnetic towers and the electromagnetic and hadronic tower sums.
• L2Muon L2Muon receives the L1Muon output and data from approximately
150 front-end modules (from the PDTs, MDTs, and the scintillation counters).
The muon candidates contain the track pT , η and φ coordinates, and quality
and timing information.
• L2PS L2PS is used for electron reﬁnement at the trigger level by providing
evidence for early shower development and by giving a good spatial point for
comparison with calorimeter clusters or tracks. At L2, the CPS and FPS are
treated as separate detectors and their data are processed independently.
• L2STT The L2STT performs online pattern recognition in the data from the
SMT. It reconstructs charged particle tracks found in the CFT at L1 with
increased precision by utilizing the much ﬁner spatial resolution of the SMT.
The L2STT improves the momentum measurement of charged particle tracks at
the trigger level. Requiring hits in the SMT helps reject spurious L1 triggers
from accidental track patterns in the CFT. L2STT is able to measure the
impact parameter of tracks precisely enough to tag the decays of long-lived
particles, speciﬁcally B hadrons.
• L2CTT The L2CTT preprocessor takes inputs from the L1CTT and the
56

L2STT. It operates in two diﬀerent modes: i) with input tracks straight from
L1CTT and ii) with input tracks from L2STT which receives input from the
L1CTT and SMT barrels. The pT or impact parameter sorted list of L2 tracks
are passed on to L2Global.
• L2Global L2Global is the ﬁrst level of the trigger to examine correlations
across all detector systems. L2Global is responsible for making trigger decisions
based on the objects identiﬁed by the L2 preprocessors. Trigger decisions are
made by creating global physics objects. L2Global imposes requirements on
the global physics objects according to conﬁguration information it receives
from the Trigger Control Computer (TCC) based on the downloaded trigger
menu from COOR. Level 2 Trigger Control Computer (L2TCC or TCC2) is a
computer that runs L2 Relay Software (or L2RS). It provides an interface to
COOR and monitor servers, and conﬁgures and controls all L2 crates.
After an L1Accept is issued, the TFW sends a trigger decision mask to L2Global,
and the SCL sends notiﬁcation of an L1 accepted event to every geographic
sector. For each event, L2Global uses the 128 L1 trigger decision bits (each
trigger decision bit represents a pass or fail of a certain trigger condition) and
preprocessor inputs to make a trigger decision, and returns this decision to the
TFW. The trigger list speciﬁes which trigger conditions L2 will impose for each
run. The trigger list is downloaded to L2Global by the TCC, which receives
57

its instructions from COOR.
A block diagram of the L1 and L2 trigger systems is shown in Figure 3.10.

Detector Level1
CAL

L1CAL

CPS
FPS

Level2
L2CAL
L2PS

L1CTT
L2CTT

CFT

L2STT

SMT
MUO

L1MUO

FPD

L1FPD

Lumi

Trigger
Framework

L2MUO

Global
L2

Figure 3.10: Trigger-related data ﬂow in L1 and L2 trigger systems.

3.3.8.3

The Level 3 Trigger

The L3 trigger provides additional rejection at a higher level through a fully programmable software trigger. L3 has the full readout information available from all
detector elements. L3 performs a limited reconstruction of events, reducing a nomi58

nal 1 kHz input rate to 50 Hz for oﬄine analysis. Its decisions are based on complete
physics objects as well as on the relationships between such objects. The trigger list
includes blocks of ﬁlter scripts that specify one or more ﬁlters. Only when all ﬁlters
in a script are satisﬁed, is the trigger satisﬁed and the event sent out to be recorded.

59

Chapter 4
Event Reconstruction and Object
Identiﬁcation
This chapter describes how the detector subsystems are used in order to identify
physics objects, such as jets and electrons, which are created from a p¯ collision.
p
Particle identiﬁcation is essential for many physics analyses at the DØ experiment.
The hits with a corresponding position and measured energy are clustered to form
either a track or a calorimeter energy cluster, depending on their locations in the
detector. From these physics objects are created: vertices, electrons, muons, jets and
E T . The DØ oﬄine reconstruction software [39] is responsible for reconstructing
/
those objects that are used to perform all DØ physics analyses.
The reconstructed objects are related to each other. By extrapolating all tracks
60

to the beam line, the crossing point with the most tracks is used as the vertex.
Diﬀerent detector systems help in separating physics objects from each other. For
example, muon will have signal at muon detector, while electron does not, which
will discriminate from electron. Jets have a bigger and wider shower shape, while
electron are isolated signals in calorimeter, and usually have tracks in the tracking
system. MET is the sum of all energies measured in calorimeter. Since there will
be some energy missing to get the balance in each direction, we assume there is a
neutrino, which can not be detected. Even though physics objects are reconstructed
separately, objects are not allowed to overlap and share hits or energy or tracks.

4.1

Tracks

Charged particles are identiﬁed in the CTT, which is discussed at Section 3.3.3. It
can be used to measure the particle momentum. Matching with calorimeter energy
deposits or muon hits, we can determine the particle trajectories more precisely.
In the plane perpendicular to the direction of the magnetic ﬁeld, the trajectory
of a charged particle is a circle. Trajectories originating from the interaction point
can be characterized by two parameters (ρ, φ), where φ is the direction of the track
at the point of closest approach to (0, 0), and the curvature:

ρ=

qB
pt

61

(4.1)

where q is charge, B is magnetic ﬁled, and pt is transverse momentum.
Due to the large number of hits in the event, pattern recognition in the tracking
system can not be achieved by conventional combinational approach. One must
reduce the combinations of track candidates before feeding them to the ﬁnal Kalman
ﬁlter based local road ﬁnders, which is highly CPU consuming. In this analysis two
preselection pattern recognition algorithms are used:

• Histogram Track Finding (HTF): divides the detector into slices in (φ, ρ)
and uses the Hough transform [40] to preselect a initial number of combinations
of groups of hits of relatively small size [41].
• Alternative Algorithm (AA): starts from combinations of hits in at least
three SMT super-layers. Track candidates are extended to CFT detectors [42].

Of these two methods, AA is better at low pT and high impact parameter and
has a lower fake rate. HTF has higher eﬃciency at high pT, and has higher eﬃciency
in diﬃcult η regions like the overlap region (region with partial CFT coverage), and
is less aﬀected by high luminosity.
Having done these preselections, a Kalman ﬁlter based global track-ﬁnding reconstructor is applied to these track candidates to ﬁnd the ﬁnal global tracks in the
event, and write a list of found tracks back into the event. A Kalman ﬁlter gives
a weighted average of the predicted and measured track values, with more weight
62

to the ones with smaller uncertainty. The estimates produced by the Kalman ﬁlter
method tend to be closer to the true values than the original measurements because
the weighted average has a better estimated uncertainty. As implied by their name,
the global tracks are not speciﬁc to any one detector, instead to all the tracking
detector subsystems (CFT, SMT, Muon).

4.2

Primary Vertices

The real or physics location of the hard scatter interaction point is named primary
vertex (PV). See ﬁgure 4.1 for an example. In DØ primary vertices are reconstructed
by means of an adaptive primary vertex algorithm [43]. This algorithm ﬁrst selects
tracks coming from diﬀerent interactions by clustering them according to their zposition; in the second step, the location and width of the beam is determined and
then used to re-ﬁt tracks and ﬁnally each cluster of tracks is associated with a vertex
using the “adaptive” technique of giving all tracks a weight and reiterating the ﬁt;
the third and last step consists of choosing the vertex that has the lowest probability
of coming from a minimum bias interaction (event with no trigger requirements).
63

Figure 4.1: Illustration of a primary vertex.

64

4.3

Electrons

Electrons are deﬁned as clusters of energy depositions in the electromagnetic section
of the calorimeter, matched with a track from the Central Tracking System. We
require electrons to be within the central calorimeter with |η det | < 1.1 (CC) 3.3.2.
This analysis uses three types of electrons:

• Loose isolated electron
The fraction of cluster energy deposited in the electromagnetic section of the
calorimeter, over the total energy deposited in both the electromagnetic and
hadronic sections of the calorimeter, is deﬁned as the EM fraction. For electrons, we require its EM fraction > 0.9. The electron must also meet shape
requirements. The energy deposition in the calorimeter must be matched to a
charged particle track from the tracking detectors with pT > 5 GeV and originating from the primary vertex. The electron is also required to be isolated
(not within a jet), and the transverse momentum pT (e) > 15 GeV.
• Tight isolated electron
The deﬁnition of a loose isolated electron is used as the basis for the deﬁnition
of a tight isolated electron. A tight isolated electron must pass all the loose
isolated electron requirements and have a value of EM-likelihood L > 0.85.
Electron likelihood is the probability to discriminate a real electron from faked
65

ones, using a discriminant based on several electron identiﬁcation variables.
• Multijets modeling electron
A multijet electron is deﬁned as a Loose no-track electron. It must have EM
fraction > 0.9, satisfy shape requirement, isolation < 0.2, and pT (e) > 15 GeV.
There are no requirements for a matching track. We use thus deﬁned electrons
to model the multijet background sample as shown in Section 6.1.2.1.

4.4

Muons

Muons are identiﬁed by combining tracks in the muon spectrometer ( η det < 2.0)
with central detectors tracks. We use the following deﬁnitions in this analysis:
• Loose isolated muon
Loose muons must hit 3 layers in the muon section and pass the cosmic ray rejection requirements. The track reconstructed in the muon system must match
a track reconstructed in the central tracker with medium track quality and
match requirements. The track must be close to the primary vertex: z(track,
PV) < 1 cm. The muon’s transverse momentum must be pT (µ) > 15 GeV,
and it must not be in a jet, determined by requiring ∆R(µ, jet) > 0.5.
• Tight isolated muon
The deﬁnition of a loose isolated muon is used as the basis of the tight isolated
66

muon. Tight isolated muons are loose muons with the additional isolation
criterion that: (a) the momenta of all tracks in a cone of radius R < 0.5
around the muon direction, except the track matched to the muon, add up to
less than 20% of the muon pT ; and (b) the energy deposited in an annular
cone of radius 0.1 < R < 0.4 around the muon direction is less that 20% of the
muon pT .
• Multijets modeling muon
The multijet muon is deﬁned as one which passes all selection cuts (before
b tagging), but no isolation or ∆R(muon, jet) requirements are made on the
muon. Any jets close to the muon are removed from the event, and E T is
/
recalculated to regain momentum balance. Thus deﬁned muons are used to
model the multijet background samples.

4.5

Jets

A jet is a narrow cone of hadrons and other particles produced by the hadronization of
a quark or gluon. Because of QCD conﬁnement, when two quarks reach a separation
distance of around 10−15 m (diameter of a hadron), they fragment into jets of mesons
and baryons, that are actually recorded by the detector. The jets are measured
and studied through jet reconstruction, correction and ﬂavor tagging, in order to
67

determine the properties of the original quark. Figure 4.2 shows how the quark
evolves to a jet in the detector.

Figure 4.2: Illustration of jet formation.
In DØ, jets are reconstructed using the RunII cone algorithm, with a cone size of
0.5. Jet energy scale corrections [44] are applied to jets in the data and MC to convert
jet energies from reconstruction to particle-level. Jets containing a muon within
∆R(µ, jet-axis) < 0.5 are considered to originate from a semileptonic b-quark decay
68

and are corrected for the momentum carried away by the muon and the neutrino.
For this correction, it is assumed that the neutrino carries the same momentum as
the muon.
Monte Carlo jets need to be corrected for diﬀerences in the reconstruction and
identiﬁcation eﬃciency between data and simulation, and for diﬀerent calorimeter
responses. The “jet shifting, smearing, and removal algorithm” (JSSR) [45] is applied.
Jets are accepted in the analysis if they satisfy the Jet-ID selection criteria [46],
and have pT > 15 GeV and |η| < 3.4. For RunIIb only, we require that tracks
associated with jets originate from the primary vertex.

4.6

b Jets

b jets are jets originating from bottom quarks. They are identiﬁed (“tagged”) by the
b-tagging algorithm. Because top quarks nearly always decay to a bottom quark,
identifying bottom quarks helps to identify the decays of top quarks. The most important feature of b jets is that they have suﬃcient lifetime and travel some distance
before decaying, resulting in a secondary vertex, as show in ﬁgure 4.3.
Given that single top quark events have at least one b jet in the ﬁnal state, we
use a neural network (NN) based b jet tagger [47] to identify jets originating from b
quarks. Jets are ﬁrst required to be “taggable” and then are “tagged”. “taggable”
69

Figure 4.3:
Illustration of the secondary vertex of a b jet.
http://www-d0.fnal.gov/Run2Physics/top/singletop_observation/.
d0
is distance between the secondary track and primary vertex. A displaced track is
one with large d0 . The 2D distance between the primary vertex and the secondary
vertex is the transverse decay length, which we denote Lxy .

70

means a jet has enough tracking information that the b-tagging will be reliable. If a
jet has no tracks, or is too forward where there is no tracker, then b-tagging makes
no sense so we can’t use it. We only try to b-tag jets which are taggable. This is
done by looking at the secondary vertex information and seeing if it’s consistent with
a b hadron decay.
Both taggability and tagging are applied directly on the Monte Carlo samples,
and the eﬃciencies are corrected by a scale factor derived from comparing MC to
data. We select events that either have one b jet that has b-tag NN output > 0.775,
or two b jets with b-tag NN output > 0.5.

4.7

Missing Transverse Energy

In the Tevatron, the initial momentum of the colliding partons along the beam axis
is unknown (parton distribution). However, the initial momentum transverse to the
beam axis is zero. Even though neutrinos escape detection, the component of their
momentum in the transverse plane (“missing transverse energy”) can be inferred from
conservation of momentum in the collision. The sum of the transverse momenta
of undetected neutrinos is therefore equal to the negative sum of the transverse
momenta of all particles observed in the detector. In practice, we compute the
missing transverse energy by adding up vectorially the transverse energies in all cells
of the electromagnetic and ﬁne hadronic calorimeters. Cells in the coarse hadronic
71

calorimeter are only added if they are part of a good jet. This raw quantity is
then corrected for the energy corrections applied to the reconstructed objects, and
the momenta of all muons in the event are corrected for their energy loss in the
calorimeter.

72

Chapter 5
Event Selection
At the Tevatron, single top quark production is a very rare process relative to its
major backgrounds (cross section ≈ 3 pb−1 ). A general event selection strategy is
implemented to maximize signal acceptance, and thereafter a multivariate technique
is used to separate signal from background, which will be discussed in the next
chapter.

5.1

Strategy

The experimental signal for single top events is one isolated high transverse momentum central lepton, and missing transverse energy from the decay of W boson from
the top quark decay, accompanied by a b jet from the top decay, and a second jet
which is sometimes also from the decay of a b hadron. Since there may be signiﬁcant
73

initial- or ﬁnal- state radiation, we include our search events with between two to
four jets.
The composition of the background components is diﬀerent for events with different jet multiplicities (2, 3, 4) and lepton ﬂavors (electron, muon). The signal to
background ratio and signal acceptance in each subsample set also diﬀer signiﬁcantly.
Thus the data is divided into many orthogonal sub-samples (i.e. channels) according to jet and b-tag multiplicities. The event selection criteria in this analysis are
applied separately for the diﬀerent channels, and then several channels are combined
to increase statistics.

5.2

Event selection cuts

We apply a loose event selection to ﬁnd events containing an isolated lepton, missing
transverse energy, and two to four jets with high transverse momentum. Then we
make a series of topological cuts to reduce the multijet background in each channel
to an acceptable low level.

5.2.1

General selection

These selection cuts are applied for both electron and muon channels.

• Good quality (for data).
74

• Instantaneous luminosity > 0.
• Pass Mega-OR trigger requirement, in which at least one of the selected “reasonable” triggers has to ﬁre. For details of which triggers are included in the
Mega-OR trigger and their eﬃciencies, see Appendix ??.
• Good primary vertex: |zPV | < 60 cm with at least three tracks attached.
• Two, three, or four good jets with pT > 15 GeV and |η det | < 3.4. For Run
IIb jets are required to be vertex-conﬁrmed.
• The leading jet pT > 25 GeV.
• For Run IIb only, the second leading jet pT cut is raised to 20 GeV in events
with exactly three good jets.
• For Run IIb only, the second leading jet pT cut is raised to 25 GeV in events
with exactly four good jets.
• Jet triangle cut of |∆φ(leading jet, E T )| vs. E T :
/
/
• |∆φ| from 1.5 to π rad when E T = 0 GeV, and E T from 0 to 35 GeV
/
/
when |∆φ = π rad.
• Missing transverse energy
• 20 < E T < 200 GeV in events with exactly two good jets.
/
75

• 25 < E T < 200 GeV in events with three or more good jets.
/
• Scalar sum of the transverse energies of all good jets
• HT (alljets) > 50 (55) GeV in events with exactly two good jets for Run IIa
(Run IIb) period.
• HT (alljets) > 75 (80) GeV in events with exactly three good jets for
Run IIa (Run IIb) period.
• HT (alljets) > 100 (110) GeV in events with exactly four good jets for
Run IIa (Run IIb) period .
• Jet triangle cut of second leading jet pT vs. HT (alljets) for Run IIb in events
with exactly two good jets.
• second leading jet pT from 0 to 27.5 GeV when HT (alljets) = 0 GeV, and
HT (alljets) from 0 to 165 GeV when second leading jetpT = 0 GeV.
• Scalar sum of the transverse energies of all good jets, the lepton transverse
momentum, and the missing transverse energy
• HT (alljets, lepton, E T ) > 120 GeV in events with exactly two good jets.
/
• HT (alljets, lepton, E T ) > 140 GeV in events with exactly three good jets.
/
• HT (alljets, lepton, E T ) > 160 GeV in events with exactly four good jets.
/
76

5.2.2

Electron channel selection

These selection cuts are applied for electron channels only.
• Only one tight electron with |η det | < 1.1 and pT > 15 (20) GeV in events with
2 (3 or more) good jets.
• No additional loose electron with pT > 15 GeV.
• No tight isolated muon with pT > 15 GeV and within |η det | < 2.0.
• Electron coming from the primary vertex: |∆z(e, PV)| < 1 cm.
• Electron triangle cuts of |∆φ(e, E T )| vs. E T
/
/
• |∆φ| from 2 to 0 rad when E T = 0 GeV, and E T from 0 to 40 GeV when
/
/
|∆φ| = 0 rad.
• |∆φ| from 1.5 to 0 rad when E T = 0 GeV, and E T from 0 to 50 GeV
/
/
when |∆φ| = 0 rad.
• |∆φ| from 2 to π rad when E T = 0 GeV, and E T from 0 to 24 GeV
/
/
when |∆φ| = π rad.

5.2.3

Muon channel selection

These selection cuts are applied for muon channels only.
• Only one tight muon with pT > 15 GeV and |η det | < 2.0.
77

• No additional loose muons with pT > 4 GeV.
• No loose electron with pT > 15 GeV and within |η det | < 2.5.
• Muon coming from the primary vertex: |∆z(µ, PV)| < 1 cm.
• Muon triangle cuts of |∆φ(µ, E T )| vs. E T
/
/
• |∆φ| from 1.2 to 0 rad when E T = 0 GeV, and E T from 0 to 85 GeV
/
/
when |∆φ| = 0 rad.
• |∆φ| from 2.5 to π rad when E T = 0 GeV, and E T from 0 to 30 GeV
/
/
when |∆φ| = π rad.
• Muon track curvature signiﬁcance cuts of |TrackCurvSig| vs. |∆φ(µ, E T )|, where
/
|TrackCurvSig| = |

q/pT
|, and q and pT are the charge and transverse moσ(1/pT )

mentum of the charged track associated with the muon.
• |∆φ| from 0.875π to π rad when |TrackCurvSig| = 0, and |TrackCurvSig|
from 0 to 4 (6) when |∆φ| = π rad for Run IIa (Run IIb) period.
• |∆φ| from 2 to π rad when |TrackCurvSig| = 0, and |TrackCurvSig| from
0 to 2 (3) when |∆φ| = π rad for Run IIa (Run IIb) period.
These cuts are needed to reject events with poorly measured muons that cause
an excess in data over background model in the ∆φ distributions.
78

• Transverse momentum of the leading jet within the ICD region of the detector:
leading jet pT > 30 GeV when 1.0 < |leading jet η det | < 1.5.
• Transverse momentum of the second leading jet within the ICD region of the
detector: leading jet pT > 25 GeV when 1.0 < |leading jet η det | < 1.5 only
for Run IIb period with exactly three good jets.

5.2.4

b-tagging selection

To be able to extract single top events we also look for the presence of one or two btagged jets. We select events that have exactly one jet with b-tag NN output > 0.775
and no other jet with b-tag NN output > 0.5, or two jets with b-tag NN output >
0.5. This ensures that there is no overlap between the one tag and two tags samples.

79

Chapter 6
Data and Simulation Samples
In this chapter, the data and simulation samples used for this analysis are presented.
The corrections to multijet and MC simulation samples are discussed.

6.1

Data samples

The data sample was collected from the p¯ collisions in the Tevatron at a center of
p
mass energy of 1.96 TeV, using the DØ detector.

6.1.1

Signal data samples

The data sample was collected between August 2002 and February 2006 (Run IIa)
and between June 2006 and June 2009 (RunIIb), corresponding to 1.08 + 4.28 =
80

5.35 fb−1 of good quality data in each of the e and µ channels, as shown in Table 6.1.
The datasets used in this analysis are listed in Appendix B.
Table 6.1: Integrated luminosity for the datasets used in this
analysis.
Trigger Version

Integrated Luminosity [pb−1 ]

Run IIa

v8 – v14

1078.81

Run IIb

v15a

534.44

v15b

688.02

v15c

397.31

v16

2661.90

v8-16

5360.48

Data sample

Total

We require events to pass the Mega-OR trigger requirement, in which at least
one of the selected “reasonable” triggers has to ﬁre. We measure that the Mega-OR
trigger eﬃciency for the events that pass our selection is 100% within the assigned
systematic uncertainties and thus no additional trigger eﬃciency correction factors
are needed in our MC samples. Details of trigger eﬃciencies measurement are described in Appendix ??.

6.1.2

Multijets data samples

Multijet events form the second largest background after W +jets. We use data
events to model it. We select those events that pass all our event selection criteria
81

except the lepton requirements.

6.1.2.1

Multijets in the electron channel

The multijets background in the electron channel is modeled with data events that
pass all selection cuts (before b tagging) except that the EM object is not required
to have a matching track. In addition, the EM object is required to fail the electron
likelihood cut.
This sample suﬀers from a kinematic bias that can be corrected by reweighting
the events with a kinematic correction factor [48]. The kinematic correction factor
is deﬁned as
QCDWeight = εfake−e /(1 − εfake−e ),

(6.1)

which depends on the parameter εfake−e , which is the eﬃciency for a multijet faked
electron (See deﬁnition in Section 4.3.) to pass the tight electron criteria.
εfake−e is measured as the ratio of the number of events with a tight electron over
the number of events with an multijet faked electron in the E T < 10 GeV region.
/
Even though the low- E T region is dominated by multijet events, there is a non/
negligible contamination from real electrons, in particular for the tight sample. This
signal contamination originates mostly from Z+jets events and to some extent also
from W +jets events. The expected contributions to the multijet and tight electron
samples originating from W +jets and Z+jets events are estimated from Monte Carlo
82

and subtracted from the number of events in data before computing εfake−e .

6.1.2.2

Multijets in the muon channel

The multijets background in the muon channel is modeled with data events that pass
all selection cuts (before b tagging), but no isolation or ∆R(muon, jet) requirements
are made on the muon. In order to make the jets in these events match those in
the signal data, any jets close to the muon are removed from the event, and E T is
/
recalculated to regain momentum balance.

6.2
6.2.1

Simulation samples
Signal simulation samples

Simulated single top event samples are generated with the comphep-singletop [49]
Monte Carlo event generator. singletop produces events whose kinematic distributions match those from NLO calculations. The top quark mass is 172.5 GeV, and
this applies to all top quark samples. The PDF set is CTEQ6M [50] and the scales
are m2 for the s-channel and (mt /2)2 for the t-channel. The top quarks and the
t
W bosons from the top quark decays are decayed in comphep-singletop to ensure
the spins are properly transferred. pythia [51] is used to add the underlying event
and initial- and ﬁnal-state radiation. evtgen [52] is used to decay b hadrons.
83

6.2.2

Background simulation samples

¯
The W +jets, Z+jets, and tt background samples for this analysis are generated using
alpgen [53] (version 2.11). Zero-bias overlay events are used to better model the
high instantaneous luminosity. All MC simulated events have an additional number
of zero bias p¯ events superimposed, dependent on the luminosity proﬁle.
p
Table 6.2 shows the cross sections, branching fractions, and initial numbers of
events for the Monte Carlo samples. The W +jets and Z+jets cross sections are from
alpgen.

6.2.3

Simulation samples corrections and normalization

The expected number of Monte Carlo events coming from the simulated backgrounds
samples are obtained in three steps:
• First, apply scale factors such that the MC reconstruction and selection eﬃciencies match those found in data, as explained in Section 6.2.3.1 below.
¯
• Second, the tt, Z+jets, dibosons, and single top samples are normalized to
the integrated luminosity of our dataset using the cross sections and branching
fractions listed in Table 6.2. The normalization weight is calculated as:

w=

σT heory × Integrated Luminosity
.
Number of events produced by MC
84

(6.2)

Table 6.2: The cross sections, branching fractions, and initial numbers of events in
the Monte Carlo event samples. The symbol ℓ stands for electron plus muon plus
tau decays. We only include tau decays to leptons.
The Monte Carlo Event Sets
Cross Section

Branching

No. of p17

No. of p20

[pb]

Fraction

Events

Events

1.04+0.04
−0.04

0.324 ± 0.003

0.6M

0.5M

0.6M

0.5M

+0.16
3.30−0.16

0.324 ± 0.003
0.324 ± 0.003

1.2M

1.0M

7.46+0.48
−0.67

0.438 ± 0.004

1.5M

1.3M

1.5M

1.3M

+0.48
7.46−0.67

0.105 ± 0.001
0.543 ± 0.005

3.0M

1.6M

90.5

0.324 ± 0.003

2.7M

3.0M

W c¯ → ℓνcc
c

260

0.324 ± 0.003

2.7M

3.0M

W jj → ℓνjj

23, 831

0.324 ± 0.003

55M

97M

W+jets total

24, 182

0.324 ± 0.003

60.4M

103M

Zb¯ → ℓℓbb
b

38.7

0.0337 ± 0.0000

0.7M

0.7M

Zc¯ → ℓℓcc
c

106

0.0337 ± 0.0000

0.7M

0.7M

Zjj → ℓℓjj

7, 032

0.0337 ± 0.0000

14M

4.0M

Z+jets total

7, 177

0.0337 ± 0.0000

15.4M

5.4M

W W → anything

11.6 ± 0.4

1.0 ± 0.0

2.0M

0.7M

W Z → anything

3.25 ± 0.11

1.0 ± 0.0

1.0M

0.6M

ZZ → anything

1.33 ± 0.04

85 1.0 ± 0.0

1.0M

0.5M

Diboson total

16.2 ± 0.6

1.0 ± 0.0

4.0M

1.8M

Event Type
Signals
tb → ℓ+jets
tqb → ℓ+jets
Signal total
Backgrounds
¯
tt → ℓ+jets
¯
tt → ℓℓ
Top pairs total
W b¯ → ℓνbb
b

2.26+0.12
−0.12

7.46+0.48
−0.67

• Finally, the W +jets background is corrected for the reconstruction eﬃciencies,
but normalized to the data before tagging, as explained in Section 6.2.3.2 below.

6.2.3.1

Simulation eﬃciencies corrections

The eﬃciencies to reconstruct, identify and select objects in the MC are higher than
in data, so the following scale factors are used to correct for those diﬀerences:

• Primary vertex position reweighting
We reweight the position of the primary vertices along the beamline in z to
make the distribution in MC events match data.
• Instantaneous luminosity reweighting
We reweight the instantaneous luminosity distributions of the MC events to
match those in Run IIa and Run IIb data.
• Z pT distribution reweighting
We reweight the inclusive Z boson pT spectrum to match the theory model.
• EM ID eﬃciency correction factor
We correct each event in the electron channel with a scale factor that accounts
for the diﬀerences in electron cluster ﬁnding and identiﬁcation eﬃciency [54]
[55]. For multijet background model electrons, the MC to data scale factor is
consistent with one.
86

• Muon ID and isolation eﬃciency correction factor
We correct each event in the muon channel for the muon ID eﬃciency, the
track match, and the isolation. The ID scale factor is parametrized in (ηdet ,φ);
track match is parametrized in track-z and ηCFT ; and the isolation in η. The
parametrizations derived in Z → µµ data and MC, as described in Ref. [56]:
There is an additional mismodeling correction applied to the muon-ID as a
function of ηdet in W +jets events.

• Jet shifting, smearing, and removal (JSSR)
We use the JSSR processor [45] for the W +jets and Z+jets samples (including
W bb, W cc, Zbb and Zcc), and dibosons.

• Taggability and b-tagging eﬃciency correction factor
The taggability and tagging algorithms are directly applied to both data samplesand the simulated samples (direct tagging), and the eﬃciencies corrected
by a scale factor derived from comparing MC to data. Random tagging is used
for W +light and W + c samples to ensure proper MC statistics after tagging.
The random tagging algorithm replaces the standard b-tag NN output with
one that is chosen randomly between a probability p = 1/2 and p = 1/24. This
causes many more of the W +light jet events to fall into the one and two tagged
samples.
87

6.2.3.2

W +jets and multijets backgrounds

The W +jets background is taken from Monte Carlo samples and corrected for reconstruction eﬃciencies as described in Section 6.2.3.1 above. The samples are weighted
by the leading-log (LL) cross sections (including branching fractions) from alpgen
and the integrated luminosity. Comparisons of LL and NLO calculations to data [57]
show the need for K ′ -factors to scale alpgen cross sections for W jj, W cj (j rep′
resents light partons, including u, d, s), W b¯ and W c¯, and also KHF -factors for
b,
c
W b¯ and W c¯ (HF represents Heavy Flavor partons, which include b and c). Similar
b
c
cross section correction is done to the Z+Jets samples. The ﬂavor composition of
W +jets and Z+jets samples are normalized to NLO.
The multijets background is modeled with the orthogonal data samples described
in Section 6.1.2. The multijets and the W +jets backgrounds are normalized using
an iterative Kolmogorov-Smirnov-test normalization method (IKS).
In the IKS method, two anti-correlated scale factors are applied to the W +jets
and multijets samples to get the proper normalization to data. The scale factors
are determined by measuring the agreement between data and background model
on three distributions sensitive to the W +jets and multijets normalizations: lepton
pT , E T , and W transverse mass. The ﬁnal W +jets and multijets scale factors are
/
the ones that maximize the product of the Kolmogorov-Smirnov (KS) test values of
each distribution. The KS-test tries to determine if two datasets diﬀer signiﬁcantly.
88

It has the advantage of making no assumption about the distribution of data.
In order to deal with the mismodeling observed for some angular variables, the
W +jets and Z+jets samples are reweighted to reproduce the jet η distributions in
data. The reweighting is done by comparing the W +jets and Z+jets samples to data
after subtraction of all other backgrounds. It is done successively on the leading and
second leading jet η, the ∆R between them, and on the third and fourth jet η when
applicable. The reweighting is done with a ﬁtted function that excludes the ICR
region and preserves the overall normalization.
It is found that the normalization is preserved after applying b-tagging, and no
further IKS normalization is needed.

6.3

Lepton charge mis-ID measurement samples

Charge measurement is important for this analysis since the CP violation can be
diluted by the mis-identiﬁcation (mis-ID) of the lepton charge. We rely on the track
charge to determine whether the selected lepton carries positive or negative charge.
We select Z → ee or Z → µµ events to measure the charge mis-ID rate. For
this purpose, we use samples of events which have two electrons or muons with
high transverse momentum in them. The charge mis-ID rate is deﬁned as the ratio
between the number of same sign events and the total number of di-lepton events.
We make the same lepton and jet requirements to select this sample as in the single
89

top analysis. We make no MET cut.
Details of the sample deﬁnitions can be found in Appendix D.

90

Chapter 7
Event Yields and Data/Simulation
Comparison
In this chapter we present data and simulation sample event yields after the event
selection described in Chapter 5.

7.1

Event yields

The samples after selection, which we call “pretagged”, “onetagged” and “twotagged”
¯
samples, are dominated by W +jets events, with some tt contributions that become
more signiﬁcant for higher jet multiplicities, and smaller contributions from multijets,
Z+jets and diboson events. Figure 7.1 gives a pie chart illustration of the composition
91

2 jets

3 jets

4 jets

pretag

1 tag

2 tag
Figure 7.1: A pie chart illustration of the components of each background and signal
sample in each jet channel in pretag, one tag, and two tag bins.

of each background and signal sample in each jet channel after this selection.
We use the term “yield” to mean the number of events of the signal or background
in question predicted to be in the nearly 5.4 fb−1 of data analyzed here. Thus, the
event yield for a speciﬁc sample is deﬁned as:

Y = σ × L = σ × 5.4 fb−1

(7.1)

where, σ is the cross section for that sample, and L = 5.4 fb−1 is the integrated
luminosity used in this analysis. On the other hand, the event yield for a MC sample
92

can also be expressed as:

Y=

Nselected

ε(i)corrections

(7.2)

i

where Nselected is the number of events remaining after selection in that MC sample,
and ε(i)corrections is the overall eﬃciency correction factor for the ith event after
selection to account for diﬀerences between data and MC from particle ID, vertex
conﬁrmation, taggability and b-tagging, that is:

ε(i)corrections = ε(i)particle ID × ε(i)vertex conﬁrmation ×
ε(i)taggability × ε(i)b−tagging × N σ
MC

(7.3)

, where σ is cross section and NMC total number of events in MC sample. For
data, the event yield is equal to the number of events.
Tables 7.1 and 7.2 show the yields for all signals and backgrounds for events with
one and two b-tagged jets respectively.
For the CP violation measurement, an extra cut applied is to split the samples by
lepton charge. Tables 7.3 and 7.4 show these yields for all signals and backgrounds
after splitting samples by lepton charge (“positive” and “negative” samples). The
sum of the positive and negative yields is identical to the total. We do not redo the
normalization of W +jets and QCD separately for positive and negative samples.
93

From these tables we can see that in all of the sum of yields of all MC samples
equal to the number of events in data. b tagging improves signal to background ratio
radically, however, we also lose event statistics in the 1-tag and 2-tag samples. Note
that the yield values shown in these and subsequent tables have been rounded for
clarity, so that the sums of the components do not always equal exactly the values
given for these sums, however all calculations have been done with full-precision
values.
Table 7.1: One b tagged event yields with statistical uncertainty for each jet multiplicity and for all analysis channels combined (Electron+Muon, Run IIa+Run IIb).
2 jets

3 jets

4 jets

All Channels

198 ± 0.91

85 ± 0.63

26 ± 0.36

309 ± 1.2

¯
tt

313 ± 1.8

545 ± 2.8

499 ± 2.8

1,356 ± 4.4

W +jets

3,222 ± 20

961 ± 9.9

239 ± 4.0

4,421 ± 23

Z+jets & dibosons

357 ± 6.8

123 ± 4.3

29 ± 1.7

510 ± 8.2

Multijets

258 ± 2.2

117 ± 1.9

37 ± 0.83

412 ± 3.1

Background Sum

4,150 ± 22 1,745 ± 11 804 ± 5.2

6,699 ± 25

Backgrounds+Signals

4,348 ± 22 1,830 ± 11 830 ± 5.3

7,008 ± 25

Data

4,284 ± 65 1,772 ± 42

851 ± 29

6,907 ± 83

1:31

1:22

Source
Signals
tb+tqb
Backgrounds

S:B

1:21

1:21

94

Table 7.2: Two b tagged event yields with statistical uncertainty for each jet multiplicity and for all analysis channels combined (Electron+Muon, Run IIa+Run IIb).
Source

2 jets

3 jets

4 jets

All Channels

46 ± 0.37

31 ± 0.34

13 ± 0.22

90 ± 0.55

¯
tt

121 ± 0.97 285 ± 1.8

361 ± 2.2

767 ± 3.0

W +jets

338 ± 5.1

139 ± 3.2

45 ± 1.7

522 ± 6.3

Z+jets & dibosons

42 ± 1.7

18 ± 1.1

5.9 ± 0.53

66 ± 2.1

Multijets

19 ± 0.62

13 ± 0.64 6.3 ± 0.33

39 ± 0.95

Background Sum

520 ± 5.5

456 ± 3.9

418 ± 2.8

1,394 ± 7.3

Backgrounds+Signals

566 ± 5.5

487 ± 3.9

431 ± 2.8

1,484 ± 7.3

Data

597 ± 24

535 ± 23

432 ± 21

1,564 ± 40

S:B

1:11

1:15

1:33

1:16

Signals
tb+tqb
Backgrounds

95

Table 7.3: Number of expected yields and observed data counts in ”positive” samples
after b-tagging.
2 jets

3 jets

4 jets

All Channels

tb

53 ± 7.6

22 ± 3.7

6.4 ± 1.7

81 ± 13

tqb

70 ± 5.6

37 ± 4.4

13 ± 2.9

119 ± 13

tb+tqb

122 ± 13

59 ± 8.1

19 ± 4.6

200 ± 26

¯
tt

218 ± 41

418 ± 66

429 ± 77

1,066 ± 183

1,773 ± 199

549 ± 67

149 ± 42

2,471 ± 307

Z+jets & dibosons

197 ± 28

73 ± 21

16 ± 9.0

286 ± 57

Multijets

133 ± 12

63 ± 5.8

21 ± 2.0

217 ± 20

Source

W +jets

Total prediction

2,443 ± 292 1,162 ± 166 635 ± 134

4,239 ± 592

Data

2,386 ± 49

1,161 ± 34

645 ± 25

4,192 ± 65

S:B

1:19

1:19

1:32

1:20

96

Table 7.4: Number of expected yields and observed data counts in “negative” samples
after b-tagging.
2 jets

3 jets

4 jets

All Channels

tb

52 ± 7.6

22 ± 3.7

6.5 ± 1.6

80 ± 13

tqb

71 ± 5.5

36 ± 4.2

13 ± 2.9

120 ± 13

tb+tqb

123 ± 13

58 ± 7.9

20 ± 4.5

200 ± 25

¯
tt

217 ± 42

415 ± 65

433 ± 78

1,065 ± 185

1,789 ± 191

551 ± 86

135 ± 40

2,474 ± 316

Z+jets & dibosons

203 ± 29

68 ± 19

18 ± 9.6

289 ± 56

Multijets

144 ± 13

67 ± 6.2

22 ± 2.1

233 ± 21

Source

W +jets

Total prediction

2,475 ± 287 1,157 ± 183 628 ± 133

4,261 ± 603

Data

2,495 ± 50

1,146 ± 34

638 ± 25

4,279 ± 65

S:B

1:19

1:19

1:31

1:20

97

7.2

Signal acceptance

The signal acceptance is deﬁned as:

A=

B

Ninitial

εcorrections
Nselected

where B is the branching fraction for the MC sample, Ninitial is the initial number of events in each MC sample, Nselected is the number of MC events remaining
after selection, and εcorrections are eﬃciency correction factors to account for diﬀerences between data and MC from particle ID, vertex conﬁrmation, taggability and
b-tagging.

Table 7.5 shows the percentage of each single top quark signal that remain after
selection in each channel before and after b-tagged jets have been required. The
entries in this table are calculated using the yields shown in Section 7.1 and the
following equation:
A=

Yield
σ×L

in which σ is the predicted cross section (for tb or tqb) and L is the integrated
luminosity for each sample.
98

Table 7.5: Combined (Electron+Muon, Run IIa+Run IIb) signal acceptances.
2 jets

3 jets

4 jets

All Channels

tb

3.0%

1.3%

0.39%

4.7%

tqb

2.6%

1.2%

0.41%

4.3%

tb+tqb

2.8%

1.2%

0.40%

4.4%

tb

1.9%

0.78%

0.23%

2.9%

tqb

1.2%

0.60%

0.21%

2.0%

tb+tqb

1.4%

0.65%

0.22%

2.3%

PreTag

1,2 Tags

The ﬁnal combined signal acceptances are:

s − channel tb = (2.9 ± 0.4)%
t − channel tqb = (2.0 ± 0.3)%
s + t channels tb + tqb = (2.3 ± 0.3)%

7.3

Data and Monte Carlo comparison

We examine various important kinematic variables to make sure the Monte Carlo
simulated samples model the data. To judge whether a variable is well modeled or
not, the variable distribution for data is compared with the sum of the signal and
99

backgrounds. We normalize each sample in the model to the integrated luminosity
of data (5.4 ifb), and overlay their sum to the data.
The data-MC comparison plots after the selection cuts in Section 5.2 for both
pretagged and tagged samples are shown in Appendix ??.
The data-MC comparison plots after splitting by the lepton charge are shown in
Appendix A. The data-MC comparison plots for the combined BNN discriminants
after splitting are shown in Appendix C.
Good agreement is seen between the prediction and data in all channels. Selected

DØ 5.4 fb -1
Data
s+t-channel
W+bb
W+cc
W+lp
Z+jets
Diboson
tt
Multijets

1000

500

0

Yield [Events/10GeV]

Yield [Events/10GeV]

plots after splitting by the lepton charge are shown below.

50

100

DØ 5.4 fb -1
Data
s+t-channel
W+bb
W+cc
W+lp
Z+jets
Diboson
tt
Multijets

1000

150
200
Lepton pT [GeV]

500

0

50

100

150
200
Lepton pT [GeV]

Figure 7.2: The transverse momentum of the lepton in the electron+muon channel
for positive (left) and negative (right) samples.

100

Yield [Events/20GeV]

Yield [Events/20GeV]

DØ 5.4 fb -1
600

400

200

0
100

200

300

DØ 5.4 fb -1
600

400

200

0
100

400
HT [GeV]

200

300

400
HT [GeV]

Yield [Events/0.25]

Yield [Events/0.25]

Figure 7.3: The HT in the electron+muon channel for positive (left) and negative
(right) samples.

DØ 5.4 fb -1
400

200

0

DØ 5.4 fb -1

400

200

-2

-1

0

1

0

2
Lepton η

-2

-1

0

1

2
Lepton η

Figure 7.4: The LeptonEta in the electron+muon channel for positive (left) and
negative (right) samples.

101

Yield [Events/5GeV]

Yield [Events/5GeV]

DØ 5.4 fb -1

(d)
600

400

200

DØ 5.4 fb -1

(d)
600

400

200

0

140

160

0

180
200
220
Top Quark Mass [GeV]

140

160

180
200
220
Top Quark Mass [GeV]

Yield [Events/20]

Yield [Events/20]

Figure 7.5: Reconstructed top mass distribution in the electron+muon channel for
positive (left) and negative (right) samples.

DØ 5.4 fb -1

1000

1000

500

0

DØ 5.4 fb -1

500

0

100

200

χ2

300
400
of top mass

0

0

100

200

300
400
χ2 of top mass

Figure 7.6: Top mass minimum Chi squared distribution in the electron+muon channel for positive (left) and negative (right) samples. TopMassMinChiSqr gives the
minimum value of the diﬀerences between constructed top mass and theory top mass
for all possible reconstructed top quarks.

102

Yield [Events/10GeV]

Yield [Events/10GeV]

DØ 5.4 fb -1

1000

500

0

0

DØ 5.4 fb -1

1000

500

0

50
100
150
W Boson Transverse Mass [GeV]

0

50
100
150
W Boson Transverse Mass [GeV]

Figure 7.7: The W Transverse mass distribution in the electron+muon channel for
positive (left) and negative (right) samples.

103

Chapter 8
Multivariate Analysis
Given the increasing complexity of modern HEP experiments, multivariate analysis
(MVA) techniques have been applied to extract the maximum information content
out of the data. Multivariate classiﬁcation is one of the techniques used for “data
mining”, which is automatic or semi-automatic analysis of large amount of data in
order to extract previously unknown interesting patterns. MVA involves analysis of
multiple statistical variables at a time.
Three diﬀerent MVA techniques are used in the single top analysis: (i) Bayesian
neural networks (BNN), (ii) boosted decision trees (BDT), and (iii) neuroevolution
of augmented topologies (NEAT). A combined new BNN discriminant is formed by
further combining those three methods.
A big advantage of multivariate techniques compared with a conventional cut
104

based analysis is that events which fail an individual selection criterion will continue
to be considered by the algorithm. In a multivariate technique, the separation power
of the many discriminating variables x is combined into a single discriminant D(x).
This way we do not cut out signal events, while obtaining an improved separation
power beyond that of any of the single variables alone. This combined single variable
corresponds to a one-to-one function of the probability P r(S|x) that an event characterized by the variables x belongs to the signal class S. For each MVA, a background
event is more likely to give an output in the low-discriminant region (D(x) ≈ 0 ),
while a signal event will tend to output to the high-discriminant region (D(x) ≈ 1),
thus separating signal from the backgrounds. The cross section and other info can
be derived from the D(x) distributions observed in data using Bayesian calculations
which will be discussed in Chapter 9.
Figure 8.1 shows how each of the MVA’s is applied in the ﬂow chart of the single
top analysis.

8.1

Split samples

The signal and background samples are divided into three independent subsets, which
are used by the three MVA’s.
• the ﬁrst subset (“training”) contains one fourth of the samples and is used to
created the multivariate ﬁlters for (i.e. train) BDT, BNN and NEAT;
105

Figure 8.1: Flowcharts of the MVA analyses.

106

• the second subset (“testing”) also contains one fourth of the samples and is
used to created the multivariate ﬁlter (i.e. train) the combination BNN of the
three diﬀerent multivariate analyses (BDT, BNN and NEAT) used in the single
top analysis;
• the last subset (“yield”) contains the remaining half of the samples and is used
to verify the BNNs convergence, do the ﬁnal measurements with all the MVAs
(BDT, BNN, NEAT, and combination BNN) and make all the plots.
The original MC samples are split by putting the ﬁrst event in the testing, the
second event in the training and the 3rd and 4th event in the yield sample, and
repeating the process until the end of the sample was reached. All three resulting
samples are normalized back to the full dataset: testing/training weights multiplied
by 4.0 and yield weights multiplied by 2.0, where the actual factors are calculated
as yield in the original over yield in the subsample.

8.2

Output transformation

The MVA output is a number between 0 and 1, with background-like (signal-like)
events populating mostly in the low (high) discriminant region. In the high discriminant region there may be some bins in which there are some signal but no background
events. To avoid this, we apply a monotonic transformation to the discriminant that
107

ensures a minimum number of background events in each bin. For details see Appendix D of DØ Note 5811 [58]. All the discriminant plots shown in this chapter are
after applying this binning transformation, unless otherwise noted.

108

8.3

Key for plots

For each MVA output plot, the signal and background distributions use the color
scheme illustrated in Figure 8.2.

Figure 8.2: Illustration of the color scheme used in plots of signals and backgrounds
in the single top analyses.

109

8.4

Boosted Decision Trees (BDT)

Decision trees (DT) originated in the ﬁelds of data mining and pattern recognition.
Several methods were developed to improve the DT classiﬁcation power by creating
an ensemble (forest) of decision trees. One of these extensions is boosting decision
trees (BDT).
In a decision tree, the internal nodes contains various tests, and terminal nodes
are called leaves. Each leaf has an assigned decision tree output value. An event will
start from the root node, follow a unique path through the decision tree, and end up
at a leaf and the classiﬁcation of the event is the decision tree output value of this
leaf. All nodes continue to be split until they become leaves.
Figure 8.3 shows an example of a decision tree.
A powerful technique to improve the performance of any weak classiﬁer (deﬁned
as any classiﬁer that does a little better than random guessing) is boosting [59]. The
basic principle of boosting is to create a tree, calculate an associated error function
then regenerate a tree with a smaller error function by re-weighting the misclassiﬁed
events. The boosting algorithm used in DØ’s single top quark search is adaptive
boosting (AdaBoost) [59].
In this analysis we ﬁnd that boosting typically improves the performance by 20%.
The increase in performance saturates in the region of 25 boosting cycles, therefore
30 boosts were used in this analysis. The full set of parameters used in this analysis
110

Figure 8.3: An example decision tree. Nodes with associated splitting test are shown
as blue circles and terminal nodes with purity output are shown as green leaves. An
event (observation) deﬁned by variables xi of which HT < 242, and mtop > 162 will
return D(xi ) = 0.82. An event with variables xj of which HT ≥ 242 and pT ≥ 27.6
will result a decision tree output value 0.12.

111

Table 8.1: BDT discriminant used in the current analysis.
parameter

value

Number of Boosting Cycles:

30

Ada Boost Parameter:

0.20

Minimal Leaf Size:

100

Impurity Measure:

Gini

Boosting Purity Limit:
Number of Bags:
Bagging Fraction:
Random Forest Fraction:

0.5
1
0.75
1.0

is as shown in Table 8.1.
Figs. 8.4, 8.5, and 8.6 show the distributions for BDT, for the 6 analysis channels
and the combination, for s+t-, s- and t- channels, respectively.

112

Event Yield

BDT OUTPUT FOR s+t CHANNEL

e+µ-channel
1-2 tags
2-4 jets

1000

500

00

0.2

0.4

0.6

0.8
1
tbtqb DT output

Figure 8.4: BDT output for s+t channel for the six analysis channels combined.

113

Event Yield

BDT OUTPUT FOR s CHANNEL

e+µ-channel
1-2 tags
2-4 jets

1000

500

00

0.2

0.4

0.6

0.8
1
tb DT output

Figure 8.5: BDT output for s-channel for the six analysis channels combined.

114

Event Yield

BDT OUTPUT FOR t CHANNEL

e+µ-channel
1-2 tags
2-4 jets

1000

500

00

0.2

0.4

0.6

0.8
1
tqb DT output

Figure 8.6: BDT output for t-channel for the six analysis channels combined.

8.5

Bayesian Neural Networks (BNN)

Artiﬁcial “neural networks” (NN) are widely used as ﬂexible models for classiﬁcation.
Bayesian methods can improve the power of these models, so that they can be safely
115

exploited when training data is limited. Figure 8.7 shows an example of a simple
Neural Network.

Figure 8.7: An illustration of a simple Neural Network.

The BNN analysis proceeds as follows.

1. The BNN analysis starts from the lepton and jet 4-vectors variables with good
discrimination between signal and background for 6 channels.
2. The training of the networks is performed and the discriminant D(x) is built.
We then check that the performance of the BNN is adequate.
3. Compute the expected and observed posterior densities of the single top cross
section for each of the 6 analysis channels and its combinations using binned
116

likelihoods formed from the BNN output distributions of the 6 analysis channels, and determine the cross sections.
For each of the 6 analyses channels we built a training set T by combining 10,000
signal and 10,000 background events, which were sampled proportionally to their
weights from the signal and background in the MC training samples. From the
training set T we constructed a posterior density p(w|T ) over the network parameter
space. A sample of K = 100 networks pertaining to the last 100 of 300 epochs
was drawn from the posterior density using a Markov Chain Monte Carlo (MCMC)
technique [60, 61] and used to approximate the discriminant, D.
Figures 8.8, 8.9, and 8.10 show the BNN output distributions for the 6 analysis
channels and the combination, for s+t-, s- and t- channels, respectively.

117

Event Yield

BNN OUTPUT FOR s+t CHANNEL

-1
DØ Run II, 5.4fb

e+µ channel
1-2 b-tags
2-4 jets

1000

500

00

0.2

0.4

0.6

0.8
1
tbtqb BNN Output

Figure 8.8: BNN output for s+t channel for the six analysis channels combined.

118

Event Yield

BNN OUTPUT FOR s CHANNEL

-1
DØ Run II, 5.4fb

e+µ channel
1-2 b-tags
2-4 jets

1000

500

00

0.2

0.4

0.6

0.8
1
tb BNN Output

Figure 8.9: BNN output for s-channel for the six analysis channels combined.

119

Event Yield

BNN OUTPUT FOR t CHANNEL

-1
DØ Run II, 5.4fb

e+µ channel
1-2 b-tags
2-4 jets

1000

500

00

0.2

0.4

0.6

0.8
1
tqb BNN Output

Figure 8.10: BNN output for t-channel for the six analysis channels combined.

8.6

Neuroevolution of augmented topologies (NEAT)

NeuroEvolution of Augmenting Topologies (NEAT) is a method for evolving artiﬁcial
neural networks with a genetic algorithm. It is most eﬀective to start evolution
120

with small, simple networks and allow them to become increasingly complex over
generations, just as organisms in nature increased in complexity since the ﬁrst cell.
The training for NEAT is based on a genetic algorithm which evolves a population of possible discriminators. The decriminators of NEAT are deﬁned by neural
networks (NN) that are similar to those used in BNN. An initial population of NNs
is created with zero hidden neurons and randomly assigned weight connections. This
population is then evolved by a series of operations that can be classify as mutations,
crossovers and selection. Mutations change the weight of existing connections or add
new hidden nodes to the network. Crossovers create a new NN by combining two
diﬀerent parent NNs, allowing the combination of those features that potentially can
create better discriminators. After these operations, NNs are selected to be part of
the next generation depending on the “ﬁtness” of each discriminator.
Figs. 8.11, 8.12, and 8.13 show the NEAT output distributions for the 6 analysis
channels and the combination, for s+t-, s- and t- channels, respectively [62].

121

Event Yield

NEAT OUTPUT FOR s+t CHANNEL

D0 Run II, 5.4 fb-1
e+µ-channel
1-2 b-tags
2-4 jets

1000

800

600

400

200

0
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7 0.8 0.9
1
tbtqb NEAT output

Figure 8.11: NEAT output for s+t channel for the six analysis channels combined.

122

Event Yield

NEAT OUTPUT FOR s CHANNEL

D0 Run II, 5.4 fb-1
e+µ-channel
1-2 b-tags
2-4 jets

1200

1000

800

600

400

200

0
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8 0.9
1
tb NEAT output

Figure 8.12: NEAT output for s-channel for the six analysis channels combined.

123

NEAT OUTPUT FOR t CHANNEL

Event Yield

1200

D0 Run II, 5.4 fb-1
e+µ-channel
1-2 b-tags
2-4 jets

1000

800

600

400

200

0
0

0.1

0.2

0.3

0.4

0.5

0.6

0.7 0.8 0.9
1
tqb NEAT output

Figure 8.13: NEAT output for t-channel for the six analysis channels combined.

124

8.7

A combined BNN using BDT, BNN and NEAT
discriminants

Ensemble tests with datasets containing contributions only from backgrounds and
SM signals, shows a 70% correlation among the outputs of the individual MVA
methods above. Thus, to achieve the maximum sensitivity, a second level of BNN is
used to construct a combined discriminant for each channel, for tb, tqb, and tb+tqb
events, using BDT, BNN and NEAT discriminants as the inputs.
In this combination BNN, we have two sets of trainings. The combined tb and
tqb discriminants take as inputs the three discriminant outputs of BDT, BNN, and
NEAT, and they are trained by assuming tb or tqb as signals, respectively. The
combined tb+tqb discriminant takes inputs from the six discriminant outputs of BDT,
BNN, and NEAT that are trained separately for the tb and the tqb signal. The
training for the combined tb+tqb BNN treats the combined tb+tqb contribution as
signal with production rates ratio predicted by SM. In both trainings, we followed the
same procedure as in the non-combination BNN training, described in Section 8.5.
The CP violation analysis uses this combined BNN output as the basis.
Figs. 8.14, 8.15, and 8.16 show the NEAT output distributions for the 6 analysis
channels and the combination, for s+t-, s- and t- channels, respectively.

125

Event Yield

BNN Combination OUTPUT FOR s+t CHANNEL

e/µ+jets
1-2 b-tags
2-4 jets

1000

500

0
0

0.2
0.4
0.6
0.8
1
tbtqb BNNComb Output

Figure 8.14: Combined BNN output for s+t channel for the six analysis channels
combined.

126

Event Yield

BNN Combination OUTPUT FOR s CHANNEL

e/µ+jets
1-2 b-tags
2-4 jets

1000

500

0
0

0.2

0.4
0.6
0.8
1
tb BNNComb Output

Figure 8.15: Combined BNN output for s channel for the six analysis channels
combined.

127

Event Yield

BNN Combination OUTPUT FOR t CHANNEL

e/µ+jets
1-2 b-tags
2-4 jets

1000

500

0
0

0.2

0.4
0.6
0.8
1
tqb BNNComb Output

Figure 8.16: Combined BNN output for s+t channel for the six analysis channels
combined.

Figures C.1 to C.6 show the combined discriminants distributions for positive
and negative samples separately after splitting by lepton charge.

128

400

DØ 5.4 fb -1
Data
tb+tqb
W+jets
tt
Multijets

200

Yield [Events/0.04]

Yield [Events/0.04]

(e)

(e)

400
300

DØ 5.4 fb -1
Data
tb+tqb
W+jets
tt
Multijets

200
100

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

(f)

DØ 5.4 fb -1

10

5

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Yield [Events/0.04]

Yield [Events/0.04]

Figure 8.17: The s+t discriminant plots in the electron+muon channel for positive
(left) and negative (right) events.

(f)

DØ 5.4 fb -1

10

5

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Figure 8.18: The BNNcomb6SortZoom plots in the electron+muon channel for positive (left) and negative (right) events.

129

400
300
200

DØ 5.4 fb -1
Data
tqb
tb
W+jets
tt
Multijets

Yield [Events/0.04]

Yield [Events/0.04]

(e)

DØ 5.4 fb -1
Data
tqb
tb
W+jets
tt
Multijets

400

200

100

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

(f)

DØ 5.4 fb -1

6

4

2

0
0.8

Yield [Events/0.04]

Yield [Events/0.04]

Figure 8.19: The t discriminant plots in the electron+muon channel for positive (left)
and negative (right) events.

8

DØ 5.4 fb -1

6

4

2

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Figure 8.20: The BNNcombTSortZoom plots in the electron+muon channel for positive (left) and negative (right) events.

130

(e)

400

DØ 5.4 fb -1
Data
tb
tqb
W+jets
tt
Multijets

Yield [Events/0.04]

Yield [Events/0.04]

600

200

(e)
600

400

DØ 5.4 fb -1
Data
tb
tqb
W+jets
tt
Multijets

200

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

(f)

DØ 5.4 fb -1

6

4

15

(f)

DØ 5.4 fb -1

10

5

2

0
0.8

Yield [Events/0.04]

Yield [Events/0.04]

Figure 8.21: The s discriminant plots in the electron+muon channel for positive
(left) and negative (right) events.

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Figure 8.22: The BNNcombSSortZoom plots in the electron+muon channel for positive (left) and negative (right) events.

131

Chapter 9
Statistical Analysis
In order to obtain a measurement of the CP asymmetry, the top and antitop cross
sections need to be measured simultaneously. The cross sections are determined using
the signal and background binned distributions of the combination BNN outputs.
The cross section calculations are performed employing Bayesian statistics with the
top statistics [63] package.

9.1

Poisson distribution likelihood

The basic assumption of particle physics is that event counts follow a Poisson distribution, that is, if the expected number of events is d, the probability to observe a
132

count of D, is given by:
L(D|d) =

e−d dD
,
D!

(9.1)

The expected number of events d come from signal and background sources:
N

N

d = αLσ +
i=1

bi ≡ aσ +

bi ,

(9.2)

i=1

where α is the signal acceptance, L is the integrated luminosity, σ is the signal cross
section (the quantity to be measured), bi the expected event count for background
source i, and a ≡ αL is deﬁned as the eﬀective luminosity for the signal. N is the
total number of background sources.

We measure the likelihood for each individual histogram bin and regard them as
independent measurement channels. The total likelihood for each analysis channel
is thus a product of the likelihood of each histogram bin:
M
L(D|d) ≡ L(D|σ, a, b) =

i=1

L(Di |di ),

(9.3)

where D and d represent vectors of observed data counts and expected signal plus
background event counts, and a and b are vectors of eﬀective luminosity and backgrounds. M is the total number of bins in the channel.
133

9.2

Bayesian analysis

Bayes’ theorem links the uncertainty of a probability model before and after observing
the modelled system. From Bayes’ theorem, we obtain:

p(σ, a, b|D) =

L(D|σ, a, b)π(σ, a, b)
,
p(D)

(9.4)

We regard the probability to observe a given number of counts for data p(D) is
a constant. Further integrating over the parameters a and b gives the probability
density for the signal cross section given observed event counts D:

p(σ|D) =

1
N

L(D|σ, a, b)π(σ, a, b)da, db.

(9.5)

σ
where N is an overall normalization factor from the requirement 0 max p(σ|D)dσ =
1, where the integral upper bound σmax is chosen as when the posterior probability density p(σ|D) is suﬃciently close to zero. The function p(σ|D) is called the
posterior probability density, which is the probability density of observing a certain
cross section assuming event counts D is observed. The function π(σ, a, b) is called
the prior probability density. L(D|σ, a, b) is the Poisson likelihood detailed in last
section.
In general, our prior knowledge of a and b is independent of that of cross section
134

σ. Thus,
π(σ, a, b) = π(a, b|σ)π(σ) = π(a, b)π(σ)

(9.6)

We assume that the prior probability density of signal cross section is evenly
distributed, which is equivalent to a ﬂat prior for the signal cross section: π(σ) =
1/σmax , as shown in the Figure 9.1 below.

Figure 9.1: An illustrative plot of a ﬂat prior probability density for the signal cross
section.
Thus, the posterior probability density for the signal cross section Eq. 9.5 can be
written as:
p(σ|D) =

1
N σmax

L(D|σ, a, b)π(a, b)da, db.
135

(9.7)

We choose the peak of the posterior density distribution as the measured cross
seciton, and the 68% conﬁdence interval about the peak as an estimate of its uncertainty, as shown in Figure 9.2.

Figure 9.2: An illustrative plot demonstrating cross section measurement from a
posterior probability density.

To measure top and antitop quark cross sections, we generalize the method above
to measure simultaneously the cross sections of two signals, σ+ and σ− , by building
a two-dimensional posterior probability density. The expected event count in this
case will be:
N
d = a+ σ+ + a− σ− +
136

bi ,
i=1

(9.8)

where a+ and a− are the eﬀective luminosities of the two signals respectively.
As before, we assume a ﬂat prior for each of the two signals:

π(σ+ ) = 1/σ1,max

(9.9)

π(σ− ) = 1/σ2,max

(9.10)

where σ1,max and σ2,max is a suﬃciently high upper bound for the ﬁrst and second
signal cross section to be close to zero.

Finally we get a two dimensional posterior probability density for diﬀerent values
of σ+ and σ− :
1
p(σ+ , σ− |D) =
N σ1,max σ2,max

L(D|σ+ , σ− , a, b)π(a, b)da, db.

(9.11)

Replacing the likelihood with the Poisson distribution, we obtained

L(D|σ+ , σ− , a, b) =

Dp
e−dp dp
p=1

Γ(Dp + 1)

n=1

e−dn dDn
n
Γ(Dn + 1)

(9.12)

where the dp and dn is the expected number of events with positive and negative
137

leptons respectively in each bin:

dp = σ+ ap + bp
dn = σ− an + bn .

(9.13)

Here ap and an represent the eﬀective luminosities for single top and antitop, and bp
and bn are the background yields for events with leptons with positive and negative
charges, respectively.
π(a, b) encodes our knowledge of the eﬀective signal luminosities and background
yields. The posterior density distribution is obtained by creating a large number
of samples of systematic shifts in a and b. A separate likelihood distribution is
obtained for each sample, and the ﬁnal posterior is the sum over all of the individual
likelihoods.
From this likelihood the 2d posterior probability density Eq. 9.11 is rewritten as:
Dp
e−dp dp

1

p(σ+ , σ− ) =
N σ+,max σ−,max

p=1

Γ(Dp + 1)

n=1

e−dn dDn
n π(a, b)dadb.
(9.14)
Γ(Dn + 1)

We can then deﬁne a contour plot of equal probability that encloses a volume CL
around the peak of the posterior density, as illustrated in Figure 9.3.
The one dimensional posterior probability density for each positive and negative
signal can be derived by integrating the two-dimensional posterior density Eq. 9.11
138

Figure 9.3: An illustrative plot demonstrating a contour plot derived from a two
dimensional posterior probability density.

139

over the other signal:

p(σ+ |D) =

σ−,max

0

p(σ+ , σ− |D)dσ− ,

(9.15)

p(σ+ , σ− |D)dσ+ .

(9.16)

and
p(σ− |D) =

σ+,max
0

Then we can determine the cross section measurements for positive and negative
signals each in the same way as in the one dimensional case, as shown in Figure 9.2.

9.3

Posterior probability density for the CP asymmetry

To obtain the posterior probability density for the asymmetry, we utilize the relationships (Eq. 2.2):
σ − σ−
; σ = σ+ + σ−
A= +
σ+ + σ−

(9.17)

where A is the deﬁnition of the top production asymmetry, and σ the total production
cross section.
140

Combining with Eq. 9.14, we obtain

p(σ, A) =

σ
2N σmax

L(D|σ(1 + A)/2, σ(1 − A)/2, a, b)π(a, b)dadb.

(9.18)

The posterior probability density for the asymmetry p(A) thus can be computed
by integrating over the total cross section σ:

p(A) =

p(σ, A)dσ

141

(9.19)

Chapter 10

Systematic Uncertainties

Systematic uncertainties expresses our inability to model the detector accurately.
Systematic errors are often diﬃcult to assess, because they do not show up as ﬂuctuations in the results of repeated measurements, as statistical uncertainties do. It is
important to think about possible sources of systematic errors and to try to correct
them or rule them out.

Systematic uncertainties enter the single top measurements in two ways: as uncertainty on the normalization of the signal and background samples, and as eﬀects
that change the shapes of the distributions of those samples and the shapes of the
expected signal distributions, i.e. the combination BNN outputs.
142

10.1

Uncertainties aﬀecting normalization only

These systematic uncertainties aﬀects only the normalization of the signal and background samples.
• Integrated luminosity
¯
The 6.1% uncertainty on the luminosity estimate aﬀects the signal, tt, Z+jets,
and diboson yields.
• Theory cross sections
¯
The uncertainties on the single top and tt cross sections come from Refs. [1]
and [64], and are calculated for 3 top quark mass points: 170 GeV, 172.5 GeV,
and 175 GeV. The resulting values for the uncertainties for 172.5 GeV are
¯
±3.8% for s-channel tb, ±5.3% for t-channel tqb, and +6.4%, −9.0% for tt. For
tb+tqb combined, the uncertainty is ±4.8%. (NB, the uncertainty on the signal
cross section does not of course enter the experimental measurement of it.) The
uncertainties on the Z+jets cross section is set to 3.3% as per reference [65].
The uncertainties on the diboson cross sections is set to 7% in accordance with
the Higgs group current practice [66].
• Branching fractions
From the Particle Data Book, the branching fractions for a W boson to decay
to an electron, muon, or tau lepton, have an average uncertainty of 1.5% and
143

we include this in the MC normalization uncertainties.
• Parton distribution functions
We have estimated the eﬀect of changing the parton distribution functions on
the signals and assign a systematic uncertainty of 2% to the signal acceptances
from this source. PDF uncertainties on the acceptance can be calculated by
reweighting the MC events before selection using 40 CTEQ PDF sets and then
calculating the acceptance for each of the eigen sets [50]. The square root of
the sum of the squares of the diﬀerence between the PDF eigen sets with the
nominal PDF set is quoted as uncertainty.
• Trigger eﬃciency
We use an OR of many trigger conditions which give us a trigger eﬃciency of
close to 100%. We assign an uncertainty of 5% to the trigger eﬃciency in all
channels and treat it as uncorrelated between p17 and p20.
• Instantaneous luminosity reweighting
We reweight the instantaneous luminosity distributions of all MC samples to
make them match Run IIa or Run IIb data distributions as appropriate. The
uncertainty on this reweighting is 1.0%.
• Primary vertex modeling and selection
We reweight the distribution of the primary vertices along the beamline in MC
144

to match that in data [67], [68]. The uncertainty on this reweighting is 0.05%
(negligible). The uncertainty on the diﬀerence in primary vertex selection
eﬃciency between data and MC is 1.4%.

• Color Reconnection
We estimate the systematics due to color reconnections by taking the diﬀerence in signal eﬃciency between a sample generated with PYTHIA TuneAPro
and PYTHIA TuneACPro [69]. The relative diﬀerence on preselection and btagging eﬃciency between both simulations is found to be 1.0% and is assigned
¯
as an additional uncertainty to the central eﬃciencies for tt background and
signal MC.

• Relative b/light jet response
This uncertainty takes into account the diﬀerence between the nominal inclusive jet response and the response for B- hadrons. To estimate this diﬀerence,
¯
particle jets in a tt lepton+jets sample were constructed and classiﬁed as b- or
light jets. Single particle response curves for both data and MC were then
applied to the particle jets to predict the energy of a reconstructed jet in the
calorimeter. From these reconstructed energies, the ratio pdata /pMC was
T
T
then calculated separately for both b-jets and light jets and the double ratio
145

evaluated:

pdata /pMC
T
T

b-jet

pdata /pMC
T
T
light jet
resulting in a diﬀernce of 1.8% [70]. The systematics uncertainty is then obtained by shifting down by 1.8% the nominal response of b-jets (identiﬁed based
on the MC truth information).
• Electron reconstruction and identiﬁcation eﬃciency
The electron scale factor uncertainty is taken from Ref. [55], but adapted to
the electron acceptance cuts used in this analysis in accordance with [71]. The
uncertainty includes contributions from the ﬁt used to evaluate the background
in the Z boson sample, as well as dependencies on the luminosity, distance to
closest jet, jet multiplicity, and electron pT . The resulting total uncertainty is
2.8% for p17, and 3.8% for p20 data.
• Muon reconstruction and identiﬁcation eﬃciency
The muon scale factor uncertainties for muon reconstruction (1.4%), muon
identiﬁcation (1.2%) and muon isolation (0.4%) are taken from Table 6, 4 and
8 of Ref. [72]. They were estimated by the muon ID group, for the tag/probe
method, background subtraction, and limited statistics in the parameterization.
An additional uncertainty of 1% is added in quadrature to take into account the
latest muon ID as explained in Appendix 5.1. The assigned total uncertainty
146

is 2.1%.
• Jet Fragmentation and higher-order eﬀects
We have measured the systematic uncertainty due to jet fragmentation by com¯
paring the acceptance of the nominal tt events generated with alpgen+pythia
(as used in the analysis) to ones generated with alpgen+herwig. The resulting uncertainty of 0.7%, 3.7% and 4.7% for events with 4, 5 and 6 jets, is
applied to the signal samples in the 2, 3 and 4 jet bin, respectively.
We have also measured the systematic uncertainty due to a combination of jet
fragmentation and higher order eﬀects (We take the variations observed in the
top pair cross section analysis described in DØ Note 6025, averaged over run
¯
period and lepton type.) comparing the acceptance of the nominal tt sample to
ones generated with MC@NLO+HERWIG. The resulting uncertainty is 1.6%
for events with 2 jets, 1.7% for events with 3 jets, and 7.0% for events with
4 jets. The variation is anti-correlated between events with 2 or 3 jets and
¯
events with 4 jets. This uncertainty is applied exclusively to the tt sample and
accounts for both jet fragmentation and higher order eﬀects.
• Initial-state and ﬁnal-state radiation
¯
We use tt samples that were generated varying the ISR/FSR settings and we
have evaluated this uncertainty in those samples as follows: for signal MC,
¯
which have 2,3, or 4 jets, we take the uncertainties from the tt events with 4,
147

¯
¯
5, 6 jets. For the tt background, we take numbers from the tt samples with
2, 3, and 4 jets directly. All uncertainties are averaged over the electron and
muon channels, and are half the diﬀerence between the ISR and FSR samples.
We ﬁnd an uncertainty ranging from 0.8% to 10.9% and apply it to signal and
¯
tt samples.

• b-jet fragmentation
¯
The size of the uncertainty from the b-jet modeling was evaluated in the tt
pairs cross section analysis following the method described in Ref. [70]. The
uncertainty arises from the diﬀerence between the fragmentation parameterizations preferred by SLD vs. LEP data. A 2.0% value is measured and applied
¯
to signal, tt and Zbb samples.

• Taggability
The uncertainty associated with b-tagging in MC events includes components
for the taggability and the tagging eﬃciency. The taggability uncertainties are
evaluated by raising and lowering the taggability scale factor values by one
standard deviation on each MC sample and repeating the entire analysis. The
taggability uncertainties aﬀect the normalization of the MC samples only and
the values are between 3.1% and 21.5%.

• W+jets heavy-ﬂavor scale factor correction
148

The heavy-ﬂavor scale factor correction, λHF , for W b¯ and W c¯ is measured
b
c
in a sample that is orthogonal to the one in the analysis and found to be
consistent with 1, with a 12% uncertainty. Two other methods based on the
candidate sample also conﬁrm that λHF is consistent with one.
• Z+jets heavy-ﬂavor scale factor correction
′
The heavy-ﬂavor scale factor KHF for Zb¯ and Zc¯ is determined from NLO
b
c
calculations, and we set the correction to it, λHF to be 1.0 (since we cannot
normalize to data to check it) with an uncertainty of 12%, taken from the λHF
for W +jets events.

• W+jets and multijets normalization
We use a ﬁt to the pretagged data to determine the W +jets and multijets
background normalizations. The uncertainties are estimated to be 30% for the
multijets backgrounds (40% for the Run IIb MU channel), and 1.8% for the
W +jets backgrounds.

• Lepton Charge Mis-Identiﬁcation
The CP violation measurement needs to separate the samples into positive and
negative subsets, according to the lepton charge in the ﬁnal state. We select
the Z → ee or Z → µµ events to measure the charge mis-ID rate. We require
each event to have exactly two electrons or two muons, and the invariant mass
149

of the leptons between 80 and 100 GeV (Z peak). We measured the fake rate
below 1%. And since we are not sensitive to eﬀects of the order of 1% compared
to the existing much larger systematic uncertainties, we decide to simply add a
systematic uncertainty of 1% anti-correlated between the positive and negative
charge samples. Details see Appendix D.
• Sample statistics
The MC and data samples we use to estimate the signal and background shapes
are limited in size. As In particular the multijets background samples have low
statistics after b tagging. We take the background sample statistics into account
for each sample in each bin of the ﬁnal discriminant distribution.

10.2

Uncertainties aﬀecting normalization and

shape
The following systematics aﬀects the shapes of the distributions of the signal
and backgrounds, and the shapes of the combination BNN outputs.
• Jet reconstruction and identiﬁcation
The eﬃciency to reconstruct and identify jets has an uncertainty varying from
0.04% to 3.7% for all MC samples.
150

• Jet energy resolution
We assign a normalization and shape-changing uncertainty ranging from 0.2%
to 11.6% as the jet energy resolution uncertainty for all signals and backgrounds.

• Jet energy scale
The JES correction is raised and lowered by one standard deviation on each
MC sample and the whole analysis repeated, which produces a shape-changing
uncertainty, and an overall normalization uncertainty. The normalization part
ranges from 0.3% to 14.6% for all MC samples.

• Vertex conﬁrmation
The eﬃciency to identify vertex-conﬁrmed jets has an uncertainty varying from
0.1% to 9.6% for all MC samples.

• b-tagging
The uncertainty associated with b-tagging in MC events includes scale factor
uncertainties (SF) in the case of applying direct tagging and tag rate (TRF)
uncertainties in the case of using random tagging. This is done simultaneously
for heavy ﬂavor and light jets, as it was done in the single top observation analysis. The SF and TRF uncertainties originate from several sources: statistical
errors of MC event sets; the assumed fraction of heavy ﬂavor in the multijets
151

MC events for the mistag rate determination; and the parameterizations. The
SF and TRF uncertainties aﬀect both shape and normalization of the MC samples. The values range from 4.3% (5.8%) to 14.0% (11.2%) for single-tagged
(double-tagged) samples.
• V +jets angular corrections
We reweight the alpgen W and Z+jets background based on several pretagged
data distributions. The uncertainty from these reweightings aﬀects the shapes
of the W and Z+jets background components. The reweighting procedure does
not change the normalization of the W and Z+jets samples before b-tagging.
After b-tagging, small normalization variations (≈ 0.3%) are introduced by
taggability and b-tagging scale factors.
• Response to b and ¯ quarks
b
The measurement of the top-antitop cross section diﬀerence could be aﬀected
by a diﬀerent response of the calorimeter to quark and antiquark jets. Such a
bias could come from a diﬀerent calorimeter response to the b and the ¯ decays,
b
respectively. To estimate this uncertainty, we rescale the pT (the 4-vectors
accordingly) of the b and ¯ by a factor of 0.9971 and 1.0021 respectively [73],
b
depending on b or ¯ and then re-apply the MVA ﬁlters. The measured eﬀect
b,
is small, and we add another 1% ﬂat systematic uncertainty to account for it.
See Appendix F for details.
152

Chapter 11
CP Violation Measurement
This chapter discusses the ﬁnal cross section measurements for both top and antitop
quarks, and the expected and observed CP asymmetry measurement results for all
single top production channels, i.e. s-channel, t-channel and s + t-channel.

11.1

Cross section measurement

As can be seen from Section 2.4, to measure the CP Violation in single top, we must
measure the cross sections separately both for top and antitop quarks. The cross
section is determined in a Bayesian approach using the top statistics software
package [63]. The inputs are the BNN combined output distributions shown in
Section 8.7.
These split MVA outputs are further fed into the top statistics package to
153

obtain the desired posteriors.

The cross section is measured by forming a two-dimensional (2d) posterior as a
function of both the positive and the negative top cross sections. In this approach,
the 2d posterior is constructed with no theoretical assumption on either the positive
or negative cross sections. From this 2d posterior, we extract the individual positive
(negative) cross section by integrating over the negative (positive) axis. Figure 11.1
shows an example of a 2d posterior as a function of both the positive and the negative
top cross sections.

The procedure is done separately for s+t (considering s+t as signal and all other
contributions as background), s (considering s as signal and adding the t contribution
to the backgrounds), and for t (considering t as signal and adding the s contribution
to the backgrounds), in order to obtain a posterior density curve as function of the
cross section asymmetry (deﬁned in Section 2.4) for each individual channel. The
central value of the asymmetry is taken from the peak (mode) in the posterior density
function, and the 68% interval about the peak gives the uncertainty. One can see an
example of this measurement from the illustrative Figure 11.2.

The next two chapters show the expected and observed results of the CP asymmetry measurement. Appendix ?? gives more supplement plots and results.
154

Figure 11.1: An illustrative example demonstrating a 2d posterior as a function of
both the positive and the negative top cross sections. This plot is obtained using
the top statistics package. The red small square denotes the Standard Model
prediction (no CP violation). The black dot is the peak of the posterior which
is close to our ﬁnal results of top-antitop quark cross sections. The 68%, 90%, and
95% Conﬁdence Limit (C.L.) are shown in contours of various colors. We also draw a
diagonal line to visually show, how far away the measured peak is from CP symmetry,
and whether it is shifted to the positive or negative side.

155

Figure 11.2: An illustrative example demonstrating how to measure the CP violation from an asymmetry posterior. This plot is obtained using the top statistics
package. We take the peak of the plot as the central value of the CP violation. The
yellow area denotes the 68% CL interval, which is taken as the uncertainty of the
CP violation measurement. We also draw a 95% CL area with the light blue color.

156

11.2

Expected results

The expected cross section is measured by setting the number of data events in
each analysis channel to the expected number of background events plus the number
of signal events, predicted when using the SM cross section value of 0.52 pb for
s-channel, 1.13 pb for t-channel, and 1.65 pb for s + t-channel (for a top mass of
172.5 GeV). The expected CP asymmetries are further calculated from the cross
section measurements.
The posterior density functions for the 2d, single top production asymmetry, and
cross sections, separately for top and antitop quarks, are shown in Figs. 11.3 to 11.5
taking into account all systematics.
From all the plots, we can see in each analysis channel, the expected cross section
is very close to the standard model prediction, and thus the expected CP asymmetry
is close to zero. This means the systematics do not bias our measurement.
Expected results for CP asymmetry in s, t, s+t channels for diﬀerent systematics
are shown in Table 11.1.
The expected values are zero within large errors. The error is smallest in the s+t
channel, and largest in the s channel.

157

Figure 11.3: Expected posterior density distributions and measurements of top and
anti-top cross sections in the s + t-channel. The top left plot shows the 2d posterior
of top and antitop production cross sections; top right plot shows the posterior of the
top-antitop production cross section assymmetry Atop ; bottom left plot shows the
posterior of the top production cross section; bottom right plot shows the posterior
of the antitop production cross section. The top (antitop) cross section are extracted
by integrating over the antitop (top) axis respectively from the 2d posterior.
158

Posterior density

σt [pb]

h2

DØ 5.4 fb -1
3

Expected Peak
68% C.L.
90% C.L.
95% C.L.
SM

2

3

DØ Run II, 5.4 fb

-1

2.5

Expected
Asymmetry

2

+0.19

= 0.01-0.19
1.5
1

1

0.5

1

2

1.2

DØ Run II, 5.4 fb

-1

1

Expected
Cross Section

0.8

0
-1 -0.8 -0.6 -0.4 -0.2 0

3
σt [pb]

+0.41

= 1.25 -0.37 pb

0.2 0.4 0.6 0.8

Cross section asymmetry A

Posterior density

Posterior density

0
0

DØ Run II, 5.4 fb

1.2
1

1
top

-1

Expected
Cross Section

0.8

+0.40

= 1.16 -0.36 pb

0.6

0.6

0.4

0.4

0.2

0.2

0
0

0.5

1

1.5

2

2.5

3

3.5

4

t cross section [pb]

0
0

0.5

1

1.5

2

2.5

3

3.5

4

t cross section [pb]

Figure 11.4: Expected posterior density distributions and measurements of top and
anti-top cross sections in the t-channel. The top left plot shows the 2d posterior of
top and antitop production cross sections; top right plot shows the posterior of the
top-antitop production cross section assymmetry Atop ; bottom left plot shows the
posterior of the top production cross section; bottom right plot shows the posterior
of the antitop production cross section. The top (antitop) cross section are extracted
by integrating over the antitop (top) axis respectively from the 2d posterior.
159

Posterior density

σt [pb]

h2

DØ 5.4 fb -1

2

Expected Peak
68% C.L.
90% C.L.
95% C.L.
SM

1.5

1

3

DØ Run II, 5.4 fb

-1

2.5

Expected
Asymmetry

2

+0.27

= -0.05 -0.27
1.5
1

0.5
0.5

1.4

0.5

1

1.5

DØ Run II, 5.4 fb

1.2

-1

Expected
Cross Section

1

+0.34

= 0.55 -0.29 pb

0.8

0
-1 -0.8 -0.6 -0.4 -0.2 0

2
σt [pb]

0.2 0.4 0.6 0.8

Cross section asymmetry A

Posterior density

Posterior density

0
0

1.4

DØ Run II, 5.4 fb

1.2

1
top

-1

1

Expected
Cross Section

0.8

= 0.58 -0.29 pb

+0.35

0.6

0.6

0.4

0.4

0.2

0.2

0
0

0.2 0.4 0.6 0.8

1

1.2 1.4 1.6 1.8

2

t cross section [pb]

0
0

0.2 0.4 0.6 0.8

1

1.2 1.4 1.6 1.8

2

t cross section [pb]

Figure 11.5: Expected posterior density distributions and measurements of top and
anti-top cross sections in the s-channel. The top left plot shows the 2d posterior of
top and antitop production cross sections; top right plot shows the posterior of the
top-antitop production cross section assymmetry Atop ; bottom left plot shows the
posterior of the top production cross section; bottom right plot shows the posterior
of the antitop production cross section. The top (antitop) cross section are extracted
by integrating over the antitop (top) axis respectively from the 2d posterior.
160

Table 11.1: Expected results for CP asymmetry in s, t, s+t channels.
Discriminant

CP Asymmetry

tb

−0.05+0.28
−0.27

tqb
tb+tqb

0.02+0.19
−0.19

−0.02+0.14
−0.13

161

11.3

Observed results

The approach described in Sec. 11.2 is repeated using the data events to obtain the
observed cross sections. The observed CP asymmetries are further calculated from
the cross section measurements.
The posterior density functions for the 2d, single top production asymmetry,
and cross sections separately for top and antitop quarks, are shown in Fig. 11.6 to
Fig. 11.8 taking into account of all systematics. The colors show the 68%, 90%, and
95% CL regions.
Unlike the expected results, we can see the measured peak is shifted farther away
from the Standard Model prediction, mostly around one standard deviation away.
The s-channel deviates more than the t-channel, which is again larger than s + t
combined. The measured asymmetries all deviate in the same direction, i.e. to the
negative part. This might be due to data ﬂuctuation. We expect this bias will disappear as we have more statistics of events. Our analysis currently is still background
dominated (S:B approximately 1:20). The 2d plots show that the measured values
are about one standard deviation away from the SM, while the asymmetry plot shows
they are more than one standard deviation away. This is due to the integration in
top statistics, when converting the 2d plot into 1d asymmetry plot. We need at
least three standard deviation to claim evidence, so we don’t observe CP violation
yet at this point.
162

Posterior density

σt [pb]

h2

DØ 5.4 fb -1
Measured Peak
68% C.L.
90% C.L.
95% C.L.
SM

4

3

3

DØ Run II, 5.4 fb

-1

2.5

Observed
Asymmetry

2

+0.16

= -0.24 -0.16
1.5

2
1

1

0.5

1

2

3

DØ Run II, 5.4 fb

1.2
1

-1

Observed
Cross Section

0.8

= 1.11 +0.38 pb
-0.36

0.6

0.2 0.4 0.6 0.8

Cross section asymmetry A

DØ Run II, 5.4 fb

1
top

-1

1

Observed
Cross Section

0.8

+0.41

= 1.75 -0.40 pb
0.6

0.4

0.4

0.2

0.2
0
0

0
-1 -0.8 -0.6 -0.4 -0.2 0

4
σt [pb]

Posterior density

Posterior density

0
0

0.5

1

1.5

2

2.5

3

3.5

4

t cross section [pb]

0
0

0.5

1

1.5

2

2.5

3

3.5

4

t cross section [pb]

Figure 11.6: Observed posterior density distributions and measurements of top and
anti-top cross sections in the s + t-channel. The top left plot shows the 2d posterior
of top and antitop production cross sections; top right plot shows the posterior of the
top-antitop production cross section assymmetry Atop ; bottom left plot shows the
posterior of the top production cross section; bottom right plot shows the posterior
of the antitop production cross section. The top (antitop) cross section are extracted
by integrating over the antitop (top) axis respectively from the 2d posterior.
163

Posterior density

σt [pb]

h2

DØ 5.4 fb -1
3

Measured Peak
68% C.L.
90% C.L.
95% C.L.
SM

2

3

DØ Run II, 5.4 fb

-1

2.5

Observed
Asymmetry

2

+0.18

= -0.25 -0.18
1.5
1

1

0.5

1

2

DØ Run II, 5.4 fb

-1

1.4
1.2

Observed
Cross Section

1

0
-1 -0.8 -0.6 -0.4 -0.2 0

3
σt [pb]

+0.32

= 0.86 -0.29 pb
0.8

0.2 0.4 0.6 0.8

Cross section asymmetry A

Posterior density

Posterior density

0
0

1.4

DØ Run II, 5.4 fb

1
top

-1

1.2
1

Observed
Cross Section

0.8

= 1.39 -0.31 pb

+0.35

0.6

0.6
0.4

0.4

0.2

0.2

0
0

0.5

1

1.5

2

2.5

3

3.5

4

t cross section [pb]

0
0

0.5

1

1.5

2

2.5

3

3.5

4

t cross section [pb]

Figure 11.7: Observed posterior density distributions and measurements of top and
anti-top cross sections in the t-channel. The top left plot shows the 2d posterior of
top and antitop production cross sections; top right plot shows the posterior of the
top-antitop production cross section assymmetry Atop ; bottom left plot shows the
posterior of the top production cross section; bottom right plot shows the posterior
of the antitop production cross section. The top (antitop) cross section are extracted
by integrating over the antitop (top) axis respectively from the 2d posterior.
164

Posterior density

σt [pb]

h2

DØ 5.4 fb -1

2

Measured Peak
68% C.L.
90% C.L.
95% C.L.
SM

1.5

1

3

DØ Run II, 5.4 fb

-1

2.5

Observed
Asymmetry

2

+0.29

= -0.43 -0.35
1.5
1

0.5
0.5

0.5

1

1.5

DØ Run II, 5.4 fb

-1

2.5

Observed
Cross Section

2

+0.19

= 0.12 -0.12 pb

1.5

0
-1 -0.8 -0.6 -0.4 -0.2 0

2
σt [pb]

0.2 0.4 0.6 0.8

Cross section asymmetry A

Posterior density

Posterior density

0
0

1.8

DØ Run II, 5.4 fb

1
top

-1

1.6

Observed
Cross Section

1.4

+0.26

1.2

= 0.51-0.23 pb

1
0.8

1

0.6
0.4

0.5

0.2
0
0

0.2 0.4 0.6 0.8

1

1.2 1.4 1.6 1.8

2

t cross section [pb]

0
0

0.2 0.4 0.6 0.8

1

1.2 1.4 1.6 1.8

2

t cross section [pb]

Figure 11.8: Observed posterior density distributions and measurements of top and
anti-top cross sections in the s-channel. The top left plot shows the 2d posterior of
top and antitop production cross sections; top right plot shows the posterior of the
top-antitop production cross section assymmetry Atop ; bottom left plot shows the
posterior of the top production cross section; bottom right plot shows the posterior
of the antitop production cross section. The top (antitop) cross section are extracted
by integrating over the antitop (top) axis respectively from the 2d posterior.
165

The observed results for CP asymmetry in s, t, s + t channels for diﬀerent systematics are summarized in Table 11.2.

Table 11.2: Observed results for CP asymmetry in s, t, s+t channels.
Discriminant

CP Asymmetry

tb

−0.43+0.29
−0.35

tqb
tb+tqb

−0.25+0.18
−0.18

−0.24+0.18
−0.16

166

Chapter 12
Results and Conclusions
In this thesis we present a search for CP violation using the single top quark ﬁnal state
with the DØ detector at the Fermilab Tevatron. We perform the search separately
in the s-, t- and s + t-channel using 5.4 fb−1 of data. This analysis is an extension
of the single top cross section measurement, and is the ﬁrst analysis searching for
CP violation in single top quark production.
The measured asymmetry A for s-, t- and s+t-channel are −0.43+0.29 , −0.25+0.18
−0.35
−0.18

and −0.24+0.16 respectively.
−0.16

167

APPENDICIES

168

Appendix A
Plots After Splitting by the
Lepton Charge
Figures A.2 to A.24 show various kinematic distributions in the tagged ﬁnal samples
after being splitted by the lepton charge. The plots are shown on the left for the positive samples and on the right for the negative samples. The legends are only shown
in the ﬁrst pair of plots (Fig. A.2). All other plots assume the same legend. The
signal and background distributions use the color scheme illustrated in Figure A.1.

169

Figure A.1: Illustration of the color scheme used in plots of signals and backgrounds
in the single top analyses.

170

Yield [Events/10GeV]

Yield [Events/10GeV]

DØ 5.4 fb -1
Data
s+t-channel
W+bb
W+cc
W+lp
Z+jets
Diboson
tt
Multijets

1000

500

0

50

100

DØ 5.4 fb -1
Data
s+t-channel
W+bb
W+cc
W+lp
Z+jets
Diboson
tt
Multijets

1000

500

0

150
200
Lepton pT [GeV]

50

100

150
200
Lepton pT [GeV]

Yield [Events/0.08]

Yield [Events/0.08]

Figure A.2: The transverse momentum of the lepton in the electron+muon channel
for positive (left) and negative (right) samples.

DØ 5.4 fb -1
600

400

200

0

DØ 5.4 fb -1
600

400

200

0

0.2

0.4

0.6

0

0.8
Centrality

0

0.2

0.4

0.6

0.8
Centrality

Figure A.3: The Centrality for All Jets in the electron+muon channel for positive
(left) and negative (right) samples.

171

Yield [Events/0.2]

Yield [Events/0.2]

500

DØ 5.4 fb -1

400
300

DØ 5.4 fb -1
400
300

200

200

100

100

0

0

1

0

2
3
∆φ(Lepton,MET)[Rad]

0

1

2
3
∆φ(Lepton,MET)[Rad]

Yield [Events/20GeV]

Yield [Events/20GeV]

Figure A.4: The Delta Phi between Lepton MET in the electron+muon channel for
positive (left) and negative (right) samples.

DØ 5.4 fb -1
600

400

200

0
100

200

300

DØ 5.4 fb -1
600

400

200

0
100

400
HT [GeV]

200

300

400
HT [GeV]

Figure A.5: The HT in the electron+muon channel for positive (left) and negative
(right) samples.

172

Yield [Events/10GeV]

Yield [Events/10GeV]

DØ 5.4 fb -1
600

400

200

0

0

50

DØ 5.4 fb -1
600

400

200

0

100
150
200
HT light jets [GeV]

0

50

100
150
200
HT light jets [GeV]

Yield [Events/20GeV]

Yield [Events/20GeV]

Figure A.6: The HT-AllJets-MinusBTaggedJet in the electron+muon channel for
positive (left) and negative (right) samples.

DØ 5.4 fb -1

800
600
400
200
0

DØ 5.4 fb -1
800
600
400
200

0

100

200
300
Minv jets [GeV]

0

0

100

200
300
Minv jets [GeV]

Figure A.7: The Invariant Mass for the two leading jets in the electron+muon channel
for positive (left) and negative (right) samples.

173

Yield [Events/20GeV]

Yield [Events/20GeV]

400

DØ 5.4 fb -1

300

200

100

0

DØ 5.4 fb -1

400
300
200
100

0

100

0

200
300
Minv light jets [GeV]

0

100

200
300
Minv light jets [GeV]

3000

Yield [Events/0.2]

Yield [Events/0.2]

Figure A.8: The InvariantMass-LightQuarkJets1-2 in the electron+muon channel for
positive (left) and negative (right) samples.

DØ 5.4 fb -1

3000

2000

2000

1000

0

DØ 5.4 fb -1

1000

0

0.8
0.85
0.9
Leading tagged jet NN output

0.8
0.85
0.9
Leading tagged jet NN output

Figure A.9: The Leading b Tagged Jet in the electron+muon channel for positive
(left) and negative (right) samples.

174

Yield [Events/0.25]

Yield [Events/0.25]

DØ 5.4 fb -1
400

200

0

-2

-1

DØ 5.4 fb -1
400

200

0

0
1
2
Leading tagged jet η

-2

-1

0
1
2
Leading tagged jet η

500

Yield [Events/0.2]

Yield [Events/0.2]

Figure A.10: The LeadingBTaggedJetEta in the electron+muon channel for positive
(left) and negative (right) samples.

DØ 5.4 fb -1

400
300

DØ 5.4 fb -1
500
400
300

200

200

100

100

0

0

0
1
2
3
∆φ(Lepton,Leading tagged jet)[Rad]

0
1
2
3
∆φ(Lepton,Leading tagged jet)[Rad]

Figure A.11: The LeadingBTaggedJetLeptonDeltaPhi in the electron+muon channel
for positive (left) and negative (right) samples.

175

Yield [Events/10 GeV]

Yield [Events/10 GeV]

800

DØ 5.4 fb -1

600
400
200
0

800

DØ 5.4 fb -1

600
400
200
0

50
100
150
200
Leading tagged jet p [GeV]
T

50
100
150
200
Leading tagged jet p [GeV]
T

105

Yield [Events/0.2]

Yield [Events/0.2]

Figure A.12: The LeadingBTaggedJetPt in the electron+muon channel for positive
(left) and negative (right) samples.

DØ 5.4 fb -1

4

10

103
102

105

DØ 5.4 fb -1

4

10

103
102

10

10

1

1

10-1 0

10-1 0

0.2
0.4
0.6
0.8
1
Leading tagged jet NN output

0.2
0.4
0.6
0.8
1
Leading tagged jet NN output

Figure A.13: The LeadingLightQuarkJetBTagNN in the electron+muon channel for
positive (left) and negative (right) samples.

176

Yield [Events/0.4]

Yield [Events/0.4]

DØ 5.4 fb -1

600

400

200

0

DØ 5.4 fb -1

600

400

200

-4

-2

0

0
2
4
Leading tagged jet η

-4

-2

0
2
4
Leading tagged jet η

Yield [Events/0.2]

Yield [Events/0.2]

Figure A.14: The LeadingLightQuarkJetEta in the electron+muon channel for positive (left) and negative (right) samples.

DØ 5.4 fb -1
400
300
200

300
200
100

100
0

DØ 5.4 fb -1

400

0

0

1
2
3
∆φ(Lepton,Leading light jet)[Rad]

0

1
2
3
∆φ(Lepton,Leading light jet)[Rad]

Figure A.15: The LeadingLightQuarkJetLeptonDeltaPhi in the electron+muon
channel for positive (left) and negative (right) samples.

177

Yield [Events/10 GeV]

Yield [Events/10 GeV]

DØ 5.4 fb -1

800
600
400
200

DØ 5.4 fb -1

800
600
400
200

0

50

0

100
150
200
Leading light jet p [GeV]

50

T

100
150
200
Leading light jet p [GeV]
T

Yield [Events/0.25]

Yield [Events/0.25]

Figure A.16: The LeadingLightQuarkJetPt in the electron+muon channel for positive (left) and negative (right) samples.

DØ 5.4 fb -1
400

200

0

DØ 5.4 fb -1

400

200

-2

-1

0

1

0

2
Lepton η

-2

-1

0

1

2
Lepton η

Figure A.17: The LeptonEta in the electron+muon channel for positive (left) and
negative (right) samples.

178

Yield [Events/10 GeV]

Yield [Events/10 GeV]

DØ 5.4 fb -1

1000

500

0

50

100

DØ 5.4 fb -1

1000

500

0

150
200
Missing E [GeV]

50

T

100

150
200
Missing E [GeV]
T

600

(c)

Yield [Events/0.4]

Yield [Events/0.4]

Figure A.18: The METPt in the electron+muon channel for positive (left) and negative (right) samples.

DØ 5.4 fb -1

400

200

0
-4

600

(c)

DØ 5.4 fb -1

400

200

0
-4

-2
0
2
4
Q(lepton) X η(light-quark jet)

-2
0
2
4
Q(lepton) X η(light-quark jet)

Figure A.19: The QTimesEta in the electron+muon channel for positive (left) and
negative (right) samples.

179

Yield [Events/5GeV]

Yield [Events/5GeV]

DØ 5.4 fb -1
400

200

0

20

DØ 5.4 fb -1
400

200

0

40
60
80
Second light jet p [GeV]

20

T

40
60
80
Second light jet p [GeV]
T

Yield [Events/5GeV]

Yield [Events/5GeV]

Figure A.20: The SecondLightQuarkJetPt in the electron+muon channel for positive
(left) and negative (right) samples.

DØ 5.4 fb -1

(d)
600

400

200

0

DØ 5.4 fb -1

(d)
600

400

200

140

160

0

180
200
220
Top Quark Mass [GeV]

140

160

180
200
220
Top Quark Mass [GeV]

Figure A.21: The SigTopMass in the electron+muon channel for positive (left) and
negative (right) samples.

180

Yield [Events/0.2]

Yield [Events/0.2]

300

DØ 5.4 fb -1

200

100

DØ 5.4 fb -1

300

200

100

0
-1

0
-1

-0.5
0
0.5
1
Top Quark Spin Correlation

-0.5
0
0.5
1
Top Quark Spin Correlation

Yield [Events/20]

Yield [Events/20]

Figure A.22: The SpinCorr in the electron+muon channel for positive (left) and
negative (right) samples.

DØ 5.4 fb -1

1000

1000

500

0

DØ 5.4 fb -1

500

0

100

200

χ2

300
400
of top mass

0

0

100

200

300
400
χ2 of top mass

Figure A.23: The TopMassMinChiSqr in the electron+muon channel for positive
(left) and negative (right) samples.

181

Yield [Events/10GeV]

Yield [Events/10GeV]

DØ 5.4 fb -1

1000

500

0

0

DØ 5.4 fb -1

1000

500

0

50
100
150
W Boson Transverse Mass [GeV]

0

50
100
150
W Boson Transverse Mass [GeV]

Figure A.24: The WT in the electron+muon channel for positive (left) and negative
(right) samples.

182

Appendix B
SAM Deﬁnitions of Dataset Used
in This Analysis
The dataset is obtained from the MUinclusive and EMinclusive common sample
group (CSG) skims:
• CSG CAF skim PASS3 p18.13.01 for RunIIa
• CSG CAF skim PASS2 p21.10.00 for Run IIb1
• CSG CAF skim PASS4 p21.10.00 p20.12.00-04 and
CSG CAF skim PASS4 p21.12.00 p20.12.05 allﬁx for RunIIb2
and the following VJets deﬁnitions
• VJets EM/MUinclusive Moriond09 RunIIa v1 for Run IIa
183

• VJets EMinclusive Summer09 extended RunIIb v3 for Run IIb electrons
• VJets MUinclusive Summer09 extended allﬁx RunIIb for RunIIb muons

184

Appendix C
Plots for discriminants after the
binning transformation
Figures C.1 to C.6 show various discriminants distributions in the tagged ﬁnal samples after the binning transformation applied to the discriminants [58, 30]. When
trying to calculate the cross section, in the high discriminant region there may be
some bins in which there are some signal but no background events. To avoid this,
we applied a binning transformation to MVA outputs that ensures that there is a
minimum amount of eﬀective background events in each bin. We followed the same
procedure as described in detail in Appendix D of Ref. [58] and applied the same
binning transformation function derived for the BNNComb discriminants in Ref. [30]
to both positive and negative charge samples. The signal region with discriminant
185

output from 0.8 to 1 are zoomed to show that there are in fact signals in that region.
The “Sort” or “Ranked” plots sorted each bin according to their S/B ratios from left
to right.

186

400

Yield [Events/0.04]

Yield [Events/0.04]

(e)

DØ 5.4 fb -1
Data
tb+tqb
W+jets
tt
Multijets

200

(e)

400
300

DØ 5.4 fb -1
Data
tb+tqb
W+jets
tt
Multijets

200
100

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

(f)

Yield [Events/0.04]

Yield [Events/0.04]

Figure C.1: The s+t discriminant plots in the electron+muon channel for positive
(left) and negative (right) events.

DØ 5.4 fb -1

10

5

0
0.8

(f)
10

5

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

DØ 5.4 fb -1

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Figure C.2: The BNNcomb6SortZoom plots in the electron+muon channel for positive (left) and negative (right) events.

187

400
300
200

Yield [Events/0.04]

Yield [Events/0.04]

(e)

DØ 5.4 fb -1
Data
tqb
tb
W+jets
tt
Multijets

DØ 5.4 fb -1
Data
tqb
tb
W+jets
tt
Multijets

400

200

100

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

(f)

Yield [Events/0.04]

Yield [Events/0.04]

Figure C.3: The t discriminant plots in the electron+muon channel for positive (left)
and negative (right) events.

DØ 5.4 fb -1

6

4

2

0
0.8

8

DØ 5.4 fb -1

6

4

2

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Figure C.4: The BNNcombTSortZoom plots in the electron+muon channel for positive (left) and negative (right) events.

188

(e)

400

Yield [Events/0.04]

Yield [Events/0.04]

600

DØ 5.4 fb -1
Data
tb
tqb
W+jets
tt
Multijets

200

(e)
600

400

DØ 5.4 fb -1
Data
tb
tqb
W+jets
tt
Multijets

200

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

0

0.2
0.4
0.6
0.8
1
Ranked BNNcomb6 discriminant

(f)

Yield [Events/0.04]

Yield [Events/0.04]

Figure C.5: The s discriminant plots in the electron+muon channel for positive (left)
and negative (right) events.

DØ 5.4 fb -1

6

4

(f)

DØ 5.4 fb -1

10

5

2

0
0.8

15

0
0.8

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

0.85
0.9
0.95
1
Ranked BNNcomb6 discriminant

Figure C.6: The BNNcombSSortZoom plots in the electron+muon channel for positive (left) and negative (right) events.

189

Appendix D
Lepton Charge Mis-Identiﬁcation
The charge measurement is important for this analysis since CP violation can be
diluted by the mis-identiﬁcation (mis-ID) of the lepton charge. We rely on the track
charge to determine whether the selected lepton carries positive or negative charge.
We select Z → ee or Z → µµ events to measure the charge mis-ID rate. We
require each event to have exactly two electrons or two muons, and the invariant
mass of the leptons to be between 80 and 100 GeV (Z peak). We assume that all
same sign events (deﬁned as the events containing two same sign leptons) under the
Z peak must have one lepton with the wrong charge, and that opposite sign events
(deﬁned as the events containing two opposite sign leptons) have correctly measured
lepton charges. Therefore the charge mis-ID rate is deﬁned as the ratio between the
number of same sign events and the total number of di-lepton events.
190

We measured the lepton charge mis-ID rate dependences on lepton detector eta
and lepton transverse momentum using CSG 2EMhighpt and 2MUhighpt data samples. The luminosity of the data is 1.1 f b−1 for p17 and 4.3 f b−1 for p20.
We also measured the mis-ID rate from MC samples as a cross check.
The data and simulation samples used in this analysis are listed below. Electron
Channel data samples:
• CSG CAF 2EMhighpt PASS3 p18.13.01 for RunIIa
• CSG CAF 2EMhighpt PASS4 p21.10.00 p20.12.00 for RunIIb
• CSG CAF 2EMhighpt PASS4 p21.10.00 p20.12.01 for RunIIb
• CSG CAF 2EMhighpt PASS4 p21.10.00 p20.12.02 for RunIIb
• CSG CAF 2EMhighpt PASS4 p21.10.00 p20.12.05 for RunIIb
• CSG CAF 2EMhighpt PASS2 p21.10.00 for RunIIb
Electron Channel MC samples:
• AFB-Zee-60-130GeV-p21 for RunIIb
• AFB-Zee-60-130GeV-NoExtraSmear for RunIIa
Muon Channel data samples:
• CSG CAF 2MUhighpt PASS3 p18.14.00 for RunIIa
191

• CSG CAF 2MUhighpt PASS4 p21.10.00 p20.12.00 for RunIIb
• CSG CAF 2MUhighpt PASS4 p21.10.00 p20.12.01 for RunIIb
• CSG CAF 2MUhighpt PASS4 p21.10.00 p20.12.02 for RunIIb
• CSG CAF 2MUhighpt PASS4 p21.10.00 p20.12.03 for RunIIb
• CSG CAF 2MUhighpt PASS4 p21.10.00 p20.12.04 for RunIIb
• CSG CAF 2MUhighpt PASS4 p21.10.00 p20.12.05 summer2009 for RunIIb
• CSG CAF 2MUhighpt PASS2 p21.10.00 for RunIIb
Muon Channel MC samples:

• Zmumu-weigang-60-130GeV-p20 2 for RunIIb
• Zmumu-weigang-60-130GeV-p17 for RunIIa
For both channels we used the selection cuts as close as possible to the single top
cross section measurement, except that we have 2 leptons instead of 1 (same isolation
requirements for both leptons.), and we used the 2 jet inclusive bin while single top
analysis uses 2, 3, 4 jet exclusive bins. Also there is no MET requirement.
192

D.1

Electron Charge Mis-ID Rate

Electrons are deﬁned as clusters of energy depositions in the electromagnetic section
of the calorimeter, consistent in shape with an electromagnetic shower. We used
these cuts for electrons in the charge mis-ID measurement:

• We require invariant mass 80 < M12 < 100 GeV
• We require electrons to be within the central calorimeter with |η det | < 1.1
(CC).
• electron transverse momentum pT > 15 GeV
• z(track, primaryvertex) < 1 cm
• At lease 90% of the energy of the cluster must be contained in the electromagnetice section of the calorimeter. EMF> 90%
• The isolation requirement is ISO< 0.15
• EM-likelihood> 0.85
• 7 × 7 H-matrix χ2 < 50 HMx7cut = 50.
• Track pT > 5 GeV
• RDCA< 2. (= no rdca requirement.)
193

• Track match changed from spatial match only to also EOP match included
(track match spatialchi2prob() => track match chi2prob()).

This further

lowers electron misid rate from 0.6 to 0.3 percent.
The mis-ID rate is less than 0.5% for both MC and data, in all bins.

194

D.2

Muon Charge Mis-ID Rate

Muons are identiﬁed by combining tracks in the muon spectrometer ( η det < 2.0)
with central detectors tracks. We used these cuts for muons in the charge mis-ID
measurement:
• We require Invariant Mass 80 < M12 < 100 GeV
• We require muons to match the central calorimeter with |η det | < 2.0
• muon pT > 15 GeV
• z(track, primaryvertex) < 1 cm
• We require muon not in a jet: ∆R(muon, jet) > 0.5
• Momenta of all tracks are within R < 0.5 except muon track < 20% of muon
pT: etTrkCone5/pT< 0.2
• energy in cone 0.1 < R < 0.4 less than 20% of muon pT: etHalo/pT< 0.2
• pass 3 layers of muon scintilators: |nseg| = 3
• not from cosmic rays
• has a central track
• track Chi2Ndf () < 4.0
195

• when there is no SMT hits, require DCA < 0.2 (track match muon in η, φ.)
Figure D.1 shows the muon charge mis-ID rate as a function of lepton pT for
Run IIa (left) and Run IIb (right).
The mis-ID rate is less than 1% for both MC and data, in all bins.

196

p17 muon mis ID rate over muon transverse momentum
Entries 69208

Lepton mis-ID rate

0.07

Mean
RMS

0.06

66.7
17.39

0.05
0.04

data
mc

0.03
0.02
0.01

0
20 30 40 50 60 70 80 90 100
Transverse momentum of lepton [GeV]
p20 muon mis ID rate over muon transverse momentum
Entries 116916

Lepton mis-ID rate

0.08
Mean
73.58
RMS
20.74
0.07
0.06
0.05
data
0.04
mc
0.03
0.02
0.01
0
20 30 40 50 60 70 80 90 100
Transverse momentum of lepton [GeV]

Figure D.1: The muon charge mis-ID rate as a function of lepton pT for Run IIa
(left) and Run IIb (right).

197

D.3

Conclusion

In both MC and data, the fake rate is always below 1%, as shown in Table D.1. And
since we are not sensitive to eﬀects of the order of 1% compared to the existing much
larger systematic uncertainties, we simply add a systematic uncertainty of 1% anticorrelated between the positive and negative charge samples. One may note that the
data charge mis-ID rate is signiﬁcantly higher than MC. This is because MC does
not simulate our detector well. As long as they are all very small, it shouldn’t aﬀect
the ﬁnal result.
Table D.1: Overall Charge mis-ID Rates.

Overall charge mis-ID rates
Run IIa

Run IIb

Electron

Muon

Electron

Muon

Data

0.22%(1 ± 0.11)

0.52%(1 ± 0.053)

0.32%(1 ± 0.074)

0.50%(1 ± 0.041)

MC

0.12%(1 ± 0.036) 0.15%(1 ± 0.016)

198

0.068%(1 ± 0.054) 0.18%(1 ± 0.0068)

Appendix E
W Asymmetry Check
We performed a cross-check, for measuring the W + and W − asymmetry in the
W +jets sample. This check was done by using W +jets samples as the signal and
tbtqb sample as a part of the backgrounds in the 2-jet and 1-tag bin. The expected
and observed posterior density funcions are shown in Fig. E.1 to Fig. E.2, taking
into account all systematics. The W asymmetry is found to be consistent with 0 (as
expected) and compared to the single top asymmetry we are trying to measure. This
check is sensitive to possible bias from the detector eﬀect, and we ﬁnd that there are
none.

199

Posterior density

20

DØ Run II, 5.4 fb -1

18
16
14

Expected
Asymmetry

12

= -0.011+0.025
-0.025

10
8
6
4
2
0
-0.2 -0.15 -0.1 -0.05

0

0.05 0.1 0.15 0.2

Cross section asymmetry Aw
Figure E.1: Expected posterior density distributions for W + jets asymmetry in the
s + t-channel 2-jet 1-tag bin.

200

Posterior density

20

DØ Run II, 5.4 fb -1

18
16
14

Observed
Asymmetry

12

= -0.025 +0.026
-0.024

10
8
6
4
2
0
-0.2 -0.15 -0.1 -0.05

0

0.05 0.1 0.15 0.2

Cross section asymmetry Aw
Figure E.2: Observed posterior density distributions for W + jets asymmetry in the
s + t-channel 2-jet 1-tag bin.

201

Appendix F
b/¯ Jet Energy Scale (JES) Check
b
The measurement of the top-antitop cross section diﬀerence could be aﬀected by a
diﬀerent response of the calorimeter to quark and antiquark jets. Such a bias could
come from a diﬀerent calorimeter response to the b and the ¯ decays, respectively.
b
In principle, c and c responses in the calorimeter could aﬀect b/¯ JES also, but it’s
¯
b
an even smaller eﬀect, thus we only check b and ¯ responses check.
b
We rescale the pT (the 4-vectors accordingly) of the b and ¯ by a factor of 0.9971
b
and 1.0021 respectively [73], depending on b or ¯ and then re-apply the MVA ﬁlters,
b,
for all MC samples. Then, we evaluate the diﬀerence in the CP violation extracted
from the original and modiﬁed samples. We only do this for the signal, which we
expect will have the largest eﬀect, because our main measurements come from the
signal region of the signal asymmetry posteriors.
202

The Jet Energy Scale (JES) is applied before the event selection and can introduce
changes not only to event yields, but also because the jet kinematics are changed. It
can change the jet multiplicity distribution of the events, and thus aﬀect the number
of events passing the selection. Tables F.1 to F.4 shows the expected event yields for
the signals after b-tagging before and after applying b/bbar JES for both “positive”
and “negative” samples. We can see that within errors the event yields do not change.
Table F.1: Number of expected yields for signals in “positive” samples after btagging.
Source

2 jets

3 jets

4 jets

All Channels

tb

53 ± 7.6

22 ± 3.7

6.4 ± 1.7

81 ± 13

tqb

70 ± 5.6

37 ± 4.4

13 ± 2.9

119 ± 13

tb+tqb

122 ± 13 59 ± 8.1

19 ± 4.6

200 ± 26

Table F.2: Number of expected yields for signals in “positive” samples after b-tagging
for b/bbar JES.
Source

2 jets

3 jets

4 jets

All Channels

tb

53 ± 0.84 22 ± 0.54 6.5 ± 0.26

81 ± 1.5

tqb

70 ± 1.2

36 ± 0.95

13 ± 0.53

120 ± 2.4

tb+tqb

123 ± 1.9

58 ± 1.4

19 ± 0.74

200 ± 3.9

To check the normalization eﬀects, we compare a few variables before and after
203

Table F.3: Number of expected yields for signals in “negative” samples after btagging.
Source

2 jets

3 jets

4 jets

All Channels

tb

52 ± 7.6

22 ± 3.7

6.5 ± 1.6

80 ± 13

tqb

71 ± 5.5

36 ± 4.2

13 ± 2.9

120 ± 13

tb+tqb

123 ± 13 58 ± 7.9

20 ± 4.5

200 ± 25

Table F.4: Number of expected yields for signals in “negative” samples after btagging for b/bbar JES
Source

2 jets

3 jets

4 jets

All Channels

tb

52 ± 0.84 22 ± 0.54 6.4 ± 0.26

80 ± 1.5

tqb

71 ± 1.2

36 ± 0.94

13 ± 0.52

120 ± 2.4

tb+tqb

123 ± 1.9

58 ± 1.4

19 ± 0.73

200 ± 3.9

204

b/¯ JES. To obtain a numerical estimate, we integrate from the peak bin, and look
b
into the ratios of the integrals before and after this JES. This is done for tb + tqb
combined for the one tag two jet bin for positive and negative samples separately.
We looked into several variables that are sensitive to this b/¯ JES: HT for all jets,
b
Leading jet transverse momentum, Single top mass. The results are summarized in
Table F.5. The plots are show in Figure F.1 to Figure F.6.
Table F.5: Ratio of integrals for several kinematic variables before and after b/¯ JES:
b
HT for all jets, Leading jet transverse momentum, Single top mass, in the positive
and negative tbtqb samples in one tag two jet bin. The integral is from the peak
bin.
positive

negative

HTAllJets

0.987

0.993

P TJet1

0.986

0.994

Mtop

0.990

0.982

Based on these ﬁndings, we apply another 1% anti-correlated ﬂat uncertainty on
single top and ttbar and Wbb samples, i.e. 0.01 for the top ﬁles and -0.01 for the
antitop ﬁles.

205

HT : All Jets
Entries 42545

5

Mean
RMS

4

114.6
50.3

Before b/ b JES
After b/ b JES

3
2
1
0
0

100 200 300 400 500 600 700 800
H : All Jets [GeV]

Figure F.1: HT for all jets before and after b/¯ JES for positive tbtqb samples in
b
one tag two jet bin.

206

Jet1 pT
Entries 42545

7

Mean
RMS

6

72.51
30.87

Before b/ b JES

5

After b/ b JES

4
3
2
1
0 20 40 60 80 100120140160180200220240
0
Jet1
p [GeV]
Figure F.2: Leading jet transverse momentum before and after b/¯ JES for positive
b
tbtqb samples in one tag two jet bin.

207

Top mass corresponding to minimal significance

14

Entries 42545
Mean

10

173.7

RMS

12

21.49

Before b/ b JES

8

After b/ b JES

6
4
2
0
0

50 100 150 200 250 300 350 400 450 500
Top mass

Figure F.3: Single top mass before and after b/¯ JES for positive tbtqb samples in
b
one tag two jet bin.

208

HT : All Jets
Entries 42648
Mean

4

115.1

RMS

5

52.1

Before b/ b JES
After b/ b JES

3
2
1
0
0

100 200 300 400 500 600 700 800
H : All Jets [GeV]

Figure F.4: HT for all jets before and after b/¯ JES for negative tbtqb samples in
b
one tag two jet bin.

209

8

Jet1 pT
Entries 42648

7

Mean
RMS

6

72.72
31.24

Before b/ b JES

5

After b/ b JES

4
3
2
1
0 20 40 60 80 100120140160180200220240
0
Jet1
p [GeV]
Figure F.5: Leading jet transverse momentum before and after b/¯ JES for negative
b
tbtqb samples in one tag two jet bin.

210

Top mass corresponding to minimal significance
Entries 42648

14

Mean
RMS

12
10

173.9
21.76

Before b/ b JES
After b/ b JES

8
6
4
2
0
0

50 100 150 200 250 300 350 400 450 500
Top mass

Figure F.6: Single top mass before and after b/¯ JES for negative tbtqb samples in
b
one tag two jet bin.

211

Appendix G
Production and decay asymmetries
The asymmetry in single top production can be expressed as follows:

AP =

¯
σ(p¯ → tX) − σ(p¯ → tX)
p
p
.
¯
σ(p¯ → tX) + σ(p¯ → tX)
p
p

(G.1)

¯
The asymmetry can also originate from the top decay t → bW + or t → ¯ − ,
bW
quantiﬁed by the partial rate asymmetry:

AD =

¯ bW
Γ(t → bW + ) − Γ(t → ¯ − )
¯ bW
Γ(t → bW + ) + Γ(t → ¯ − )

(G.2)

where Γ denotes the decay rate.
It is customary when measuring a cross section to assume that the top quark
decays exclusively in t → bW and ignore any possible CP violation contribution from
212

the decay process. However, in the more general case, the measured asymmetry A
is equal to

A=

¯
¯ bW
σ(p¯ → tX)B(t → bW + ) − σ(p¯ → tX)B(t → ¯ − )
p
p
.
¯
¯ bW
σ(p¯ → tX)B(t → bW + ) + σ(p¯ → tX)B(t → ¯ − )
p
p

(G.3)

where B denotes the branching ratio. Note the asymmetry deﬁned in this way is
always smaller than one.

The decay asymmetry can be written as a function of Γ(t → bW + ) = Γt B(t →
¯
bW + ) and Γ(t → ¯ − ) = Γt B(t → ¯ − ) where Γt and Γt are the total decay
bW
bW
¯
¯ ¯
¯
width for t and t quark. Because of the conservation Γt = Γt = Γ, the measured
¯
asymmetry can be expressed in terms of the partial decay rates
¯
¯ bW
σ(p¯ → tX)Γ(t → bW + ) − σ(p¯ → tX)Γ(t → ¯ − )
p
p
A=
¯
¯ bW
σ(p¯ → tX)Γ(t → bW + ) + σ(p¯ → tX)Γ(t → ¯ − )
p
p

(G.4)

Therefore the measured asymmetry contains contributions from both production and
partial rate asymmetries. In particular, it can be shown that

A=

AP + AD
1 + AP AD

(G.5)

If one assumes that CP violation only happens in the top production or AD is
negligible [5], then we have AD ≈ 0, A = AP . On the other hand, if one assumes
213

that AD = AP , then A =
A

2AP
. In this analysis, we only measure the asymmetry
1+A2
P

214

BIBLIOGRAPHY

215

BIBLIOGRAPHY
[1] N. Kidonakis. Single top quark production at the tevatron: threshold resummation and ﬁnite-order soft gluon corrections. Phys. Rev. D, 74:114012, 2006.
[2] L. D. Landau. On the Conservation Laws in Weak Interactions . Zh. Eksp.
Teor. Fiz., 32:405, 1957.
[3] V.L. Fitch J. Christenson, J.W. Cronin and R. Turlay. Evidence for the 2π
Decay of the K20 Meson . Phys. Rev. Lett., 13:138.
[4] S. Abachi et al. Observation of the top quark. Phys. Rev. Lett., 74:2632, 1995.
[5] D Atwood S. Bar-Shalom and A. Soni. CP Violation in Single Top Production
and Decay Via pbarp → tbarb + X → W + b¯ + X Within the MSSM: A possible
b
Application for Measuring arg(At ) at Hadron Colliders . Phys. Rev. D, 57:1495.
[6] G. Eilan A. Soni D. Atwood, S. Bar-Shalom. CP nonconservation in p¯ → tbX
p
at the Fermilab Tevatron . Phys. Rev. D, 54:5412.
[7] Makoto Kobayashi and Toshihide Maskawa. CP Violation in the Renormalizable
Theory of Weak Interaction. Prog. Theor. Phys., 49:652–657, 1973.
[8] G. Eilan A. Soni D. Atwood, S. Bar-Shalom. CP Violation in Top Physics .
Phys. Rept., 1:347.
[9] S. F. Novaes. Standard model: An Introduction. hep-ph/0001283, 1999.
216

[10] V. D. Barger and R. J. N. Phillips. Collider Physics (Frontiers in Physics).
Addison Wesley Publishing Company, 1987.
[11] D. Griﬃths. Introduction to elementary particles. Weinheim, USA: Wiley-VCH,
1987.
[12] M. Kaku. Quantum Field Theory, a modern introduction. Oxford University
Press, 1993.
[13] C.-P. Yuan. A New Method to Detect a Heavy Top Quark at the Tevatron.
Phys. Rev., D41:42, 1990.
[14] S. Willenbrock and D. Dicus. Production of Heavy Quarks from W Gluon
Fusion. Phys. Rev., D34:155, 1986.
[15] V. M. Abazov et al.
arXiv:0903.0850, 2009.

Observation of Single Top Quark Production.

[16] T. Aaltonen et al. First Observation of Electroweak Single Top Quark Production. arXiv:0903.0885, 2009.
[17] Wikipedia. Standard model — wikipedia, the free encyclopedia, 2009. [Online;
accessed 8-July-2009].
¨
[18] W. Pauli. Uber den Zusammenhang des Abschlusses der Elektronengruppen
im Atom mit der Komplexstruktur der Spektren (¿On the Connexion between
the Completion of Electron Groups in an Atom with the Complex Structure of
Spectra¿) . Z. Phys., 31:765, 1925.
[19] R. Brout F. Englert. Broken Symmetry and the Mass of Gauge Vector Mesons.
Phys. Rev. Lett., 13(9):321–323, 1964.
[20] P. W. Higgs. Broken Symmetries and the Masses of Gauge Bosons. Phys. Rev.
Lett., 13(16):508–509, 1964.
217

[21] T.W.B. Kibble G.S. Guralnik, C.R. Hagen. Global Conservation Laws and
Massless Particles. Phys. Rev. Lett., 13(20):585–587, 1964.
[22] NASA. Seven-year wilson microwave anisotropy probe (wmap) observations:
Sky maps, systematic errors, and basic results, 2010. [nasa.gov. Retrieved 201012-02.].
[23] F. P. An. Observation of electron-antineutrino disappearance at Daya Bay.
arXiv:1203.1669v2, 2012.
[24] B. A. Brak. The Hierarchy Problem in the Standard Model and Little Higgs
Theories (Chapter 5). 2004.
[25] V. Trimble. Existence and nature of dark matter in the universe. Annual
Review of Astronomy and Astrophysics, 25:425–472, 1987.
[26] C. Amsler et al. Review of particle physics. Phys. Lett., B667:1, 2008.
[27] F. Maltoni F. Tramontano J. M. Campbell, R. Frederix. NLO predictions for
t-channel production of single top and fourth generation quarks at hadron colliders. arXiv:0907.3933, 2009.
[28] C.-P. Yuan. Strategies for Probing CP Properties in the Top Quark System at
e− e+ and Hadron Colliders . Mod. Phys. Lett. A, 10:627.
[29] The Single Top Working Group. Single top quark production in 5.4 fb−1 of
data — signal and background modeling and event selection. DØ Note, 6099,
2010.
[30] The Single Top Working Group. Measurement of the single top quark production
cross section in 5.4 fb−1 of data. DØ Note, 6128, 2010.
[31] (Ed. ) F.T. Cole, (Ed. ) E.L. Goldwasser, and (Ed. ) R.R. Wilson. National Accelerator Laboratory design report January 1968. 1968. FERMILAB-DESIGN1968-01.
218

[32] Fermilab National Accelerator Laboratory, 2008. http://www.fnal.gov.
[33] Pushpalatha C. Bhat and William J. Spalding. Fermilab collider Run II: Accelerator status and upgrades. AIP Conf. Proc., 753:30–41, 2005.
[34] Fermilab. Tevatron luminosity, 2002. [Online; accessed 28-January-2012].
[35] Wikimedia
Commons.
A
schematic
of
the
sic
components
of
a
hermetic
detector,
http://upload.wikimedia.org/wikipedia/commons/e/eb/Hermetic.png.

ba2012.

[36] T. Han. Collider Phenomenology - Basic Knowledge and Techniques. 2005.
[37] G-Links Co. G-links, 2012. http://www.g-links.com.tw/.
[38] Cypress Semiconductor
http://www.cypress.com/.

Corporation.

Cypress

[39] Fermilab
National
Accelerator
Laboratory.
rithms
group.
how
to
run
reco,
2008.
d0.fnal.gov/computing/algorithms/howto/howtoreco.html.

hotlinks,

2012.

DØ
algohttp://www-

[40] Wikipedia. Hough transform, 2012. [Online; accessed February-2012].
√
¯
[41] Alexander Khanov. Measurement of the tt Production Cross-Section at s =
1.96 T eV UsingLif etimeT agging. PhD thesis, University of Rochester, 2004.
[42] G. Borissov. Ordering a chaos or .. technical details of aa tracking All
DØ Meeting. 2003. http://www-d0.fnal.gov/atwork/adm/d0 private/2003-0228/adm talk.ps.
[43] A. Schwartzman and C. Tully. Primary vertex reconstruction by means of adaptive vertex ﬁtting. DØ Note, 4918, 2005.
219

[44] Preliminary p17 jes for data and mc (jetcorr v07-01-02), 2004. http://wwwd0.fnal.gov/phys id/jes/d0 private/certiﬁed/certiﬁed jes.html.
[45] N. Makovec and J.-F. Grivaz. Shifting, smearing and removing simulated jets.
DØ Note, 4914, 2005.
[46] A. Harel and J. Kvita. p20 jetid eﬃciencies and scale factors. DØ Note, 5634,
2008.
[47] M. Anastasoaie, S. Robinson, and T. Scanlon. Performance of the nn b-tagging
tool on p17 data. DØ Note, 5213, 2006.
[48] E. Barberis et. al. The matrix method and its error calculation. DØ Note, 4564,
2004.
[49] E. Boos, V. Bunichev, L. Dudko, V. Savrin, and V. Sherstnev. Method for
simulating electroweak top-quark production events in the nlo approximation:
Singletop event generator. Physics of Atomic Nuclei, 69(8):1317, 2006.
[50] J. Pumplin et. al. New generation of parton distributions with uncertainties
from global QCD analysis. J. High Energy Physics, 07:012, 2002.
[51] T. Sjostrand, S. Mrenna, and P. Skands. PYTHIA 6.4 Physics and Manual. J.
High Energy Physics, 05:026, 2006.
[52] D.J. Lange. The EvtGen particle decay simulation package. Nucl. Instrum.
Meth., A462:152–155, 2001.
[53] M. Mangano, M. Moretti, F. Piccinini, R. Pittau, and A.D. Polosa. ALPGEN, a
generator for hard multiparton processes in hadronic collisions. J. High Energy
Physics, 07:001, 2003.
[54] J. Hays et al. Single electron eﬃciencies in p17 data and monte-carlo using
p18.05.00 d0correct. DØ Note, 5105, 2006.
220

[55] J. Atramentov et al. Electron and photon identiﬁcation with p20 data. DØ Note,
5761, 2008.
[56] P. Calfayan et al. Muon identiﬁcation certiﬁcation for p17 data. DØ Note,
5157, 2006.
[57] S. Muanza. A compilation of mcfm v5.1 cross section. DØ Note, 5406, 2007.
[58] The Single Top Working Group. Observation of single top quark production in
2.3 fb− 1 of data using boosted decision trees. DØ Note, 5811, 2009.
[59] Y. Freund and R.E. Schapire. Experiments with a new boosting algorithm.
Machine Learning: Proceedings of the Thirteenth International Conference, :148,
1996.
[60] B. A. Berg. Markov Chain Monte Carlo Simulations and Their Statistical Analysis. World Scientiﬁc Singapore, 2004.
[61] B. J. Pendleton S. Duane, A. D. Kennedy and D. Roweth. The boosting approach to machine learning an overview. MSRI Workshop on Nonlinear Estimation and Classiﬁcation, 2002, 2002.
[62] The Single Top Working Group. Measurement of the single top quark production
cross section in 5.4 fb− 1 of data. DØ Note, 6128, 2011.
[63] R. Schwienhorst et. al. Statistical methods implemented in the package
top statistics. DØ Note, 5817, 2008.
[64] S. Moch and journal = PRD volume = 78 pages = 034003 year = 2008 P. Uwer,
title = Theoretical status and prospects for top-quark pair production at hadron
colliders.
[65] F. Deliot et. al. Z/γ ∗ cross section computation at nnlo using the fewz code.
DØ Note, 6050, 2010.
221

[66] T. Nunnemann.
The mcfm (n)nlo calculations of the diboson cross
sections and their uncertainties, MCFM version 3.4.5.
http://wwwclued0.fnal.gov/∼nunne/cross-sections/mcfm cross-sections.html.
[67] H. Schellman. The longitudinal shape of the luminous region at DØ. DØ Note,
5142, 2006.
[68] H. Schellman. Run iib longitudinal beam shape. DØ Note, 5540, 2007.
[69] F. Deliot et. al. Systematic uncertainties in top quark measurements. DØ Note,
6024, 2010.
[70] Y. Peters, M. Begel, K. Hamacher, , and D. Wicke. Reweighting of the fragmentation function for the DØ monte carlo. DØ Note, 5325, 2007.
[71] V.M. Abazov et al. Measurement of the t anti-t production cross section in p
anti-p collisions at s**(1/2)=1.96-tev using kinematic characteristics of lepton
+ jets events. Phys. Lett. B, 626:45, 2005.
[72] O. Brandt et. al. Muon identiﬁcation certiﬁcation for the summer 2009 extended
dataset (run iib-1 and -2). DØ Note, 6109, 2010.
[73] O. Brandt et. al. Measurement of the mass diﬀerence between top and antitop
quarks in the lepton+jets channel using the matrix element method on 3.6 fb-1
of run iib data. DØ Note, 6105, 2011.

222