SYMMETRY BREAKING AND CLOCK MODEL INTERPOLATION IN 2D CLASSICAL
𝑂 (2) SPIN SYSTEMS

By

Leon Hostetler

A DISSERTATION

Submitted to
Michigan State University
in partial fulfillment of the requirements
for the degree of

Physics—Doctor of Philosophy
Computational Mathematics, Science and Engineering—Dual Major

2023

ABSTRACT

The field of two-dimensional classical spin systems has been studied for many decades using

a variety of analytical and numerical methods. This field of study contains many interesting

models including the O(2) model, which has an infinite-order Berezinskii-Kosterlitz-Thouless

(BKT) transition and the class of 𝑞-state clock models which have second-order phase transitions

when 𝑞 = 2, 3, 4 but BKT transitions when 𝑞 ≥ 5. Motivated by attempts to quantum simulate

lattice models with continuous Abelian symmetries using discrete approximations, we study an

extended-O(2) model that differs from the ordinary O(2) model by the addition of an explicit

symmetry breaking term. Its coupling allows us to smoothly interpolate between the O(2) model

(zero coupling) and a 𝑞-state clock model (infinite coupling). In the latter case, a 𝑞-state clock

model can also be defined for non-integer values of 𝑞. Thus, such a limit can also be considered

as an analytic continuation of an ordinary 𝑞-state clock model to non-integer 𝑞.

In the infinite

coupling limit, the extended-O(2) model can be simplified, and so we start by establishing the

phase diagram in that case. Using Monte Carlo and tensor methods, we show that for non-integer

𝑞, there is a second-order phase transition at low temperature and a crossover at high temperature.

Next we establish the phase diagram at finite values of the coupling again using both Monte Carlo

and tensor methods. We show that for non-integer 𝑞, the second-order phase transition at low

temperature and crossover at high temperature persist to finite coupling. For integer 𝑞 = 2, 3, 4,

there is a second-order phase transition at infinite coupling (i.e. the clock models). At intermediate

coupling, there are second-order phase transitions, but the critical exponents vary with the coupling.

At small coupling, the second-order phase transitions may turn into BKT transitions.

ACKNOWLEDGMENTS

My time as a Ph.D. student at MSU has come to a close, and I am grateful that I have had this

opportunity to do something that I truly love—to learn about and to study the basic laws of the

universe—as my full-time job. The years spent at MSU were at times difficult, but they were also

the most fulfilling years of my life so far.

A thesis like this, and the five years of research that led to it, is not the product of a single

person.

I have many people to thank for helping me. First and foremost, I’d like to thank my

supervisor Alexei Bazavov who guided me throughout my Ph.D. education and without whom this

research would not have happened. Second, I’d like to thank Yannick Meurice who went above and

beyond coauthor and PI. Next, I want to thank the other members of my Ph.D. committee—Dean

Lee, Huey-Wen Lin, Mohammad Maghrebi, and Andreas von Manteuffel. In addition to benefiting

from their guidance on my research, I have had the privilege of taking and/or auditing courses from

many of these exceptional educators.

A deep thanks to the many collaborators and coauthors that I have worked with including

many of the above names but also Judah Unmuth-Yockey, Giovanni Pederiva, Andrea Shindler, and

Brandon Henke. Without you, I would not be where I am today. A particularly deep thanks to two

specific collaborators—Jin Zhang and Ryo Sakai—whose work with tensor methods was especially

valuable to my research and plays a prominent role in this thesis.

A special thanks to Johannes H. Weber, Yannis Trimis, Thomas (TC) Chuna, Daniel Hoying,

Yash Mandlecha, and Alejandro Salas for all the interesting and thought-provoking discussions. I

also want to acknowledge the members of the QuLAT Collaboration for the excellent questions and

general discussions.

This research was a project of the QuLAT Collaboration, which studies the foundations of

quantum computing for gauge theories and quantum gravity. Monte Carlo results presented in this

dissertation were obtained using computational resources and services provided by the Institute

for Cyber-Enabled Research (ICER) at Michigan State University (MSU). Tensor renormalization

group results were computed on Syracuse University HTC Campus Grid and at the National Energy

iii

Research Scientific Computing Center (NERSC) a U.S. Department of Energy Office of Science

User Facility located at Lawrence Berkeley National Laboratory, operated under Contract No.

DE-AC02-05CH11231 using NERSC awards HEP-ERCAP0020659 and HEP-ERCAP0023235.

The research presented in this dissertation was supported in part by NSF award ACI-1341006 and

supported in part by the Quantum Information Science Enabled Discovery (QuantISED) program of

the U.S. Department of Energy (DOE) under Awards No. DE-SC0010113 and No. DE-SC0019139.

The author was personally supported during the first and final years of his Ph.D. by Michigan State

University via a University Distinguished Fellowship (UDF).

Finally, I would like to thank my parents. Although they inhabit a different world, they are

curious about my work and always there for support. Last, but certainly not least, a loving thanks

to my wife Esther for her support and for her patience with me as I put in long hours on esoteric

work.

–Leon Hostetler, Aug. 17, 2023

iv

TABLE OF CONTENTS

LIST OF ABBREVIATIONS .

LIST OF SYMBOLS .

.

.

.

.

.

.

.

.

.

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

vi

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii

CHAPTER 1

INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . .

.

1

CHAPTER 2

CLASSICAL SPIN SYSTEMS AND CRITICAL PHENOMENA ON
THE LATTICE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.

4
.
.
4
. 14
.
. 21
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

2.1
. .
Ising Model
2.2 Potts Models . .
2.3 𝑂 (𝑛) Models
.
2.4 Clock Models .

. .
. .
.
.
. .

CHAPTER 3

MCMC SIMULATIONS AND THEIR ANALYSIS . . . . . . . . . . . . 38
3.1 Markov Chain Monte Carlo . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.2 Updating Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
3.3 Autocorrelations . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.4
Jackknife Error Analysis
.
.
3.5 Reweighting .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
3.6 Curve Fitting .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
3.7 Finite Size Scaling .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
3.8 MCMC in Practice .

.
.
. .

.

CHAPTER 4

Introduction . .

THE EXTENDED-O(2) MODEL . . . . . . . . . . . . . . . . . . . . . 76
4.1
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
4.2 The ℎ𝑞 → ∞ Phase Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
4.3 The Extended-𝑂 (2) Model at Finite-ℎ𝑞 . . . . . . . . . . . . . . . . . . . . . . 116
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
4.4 Future Questions

. .

.

.

.

.

CHAPTER 5

.

.

. .

Introduction .

QUANTUM FIELD THEORIES ON THE LATTICE . . . . . . . . . . . 167
5.1
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
5.2 QCD in the Continuum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
5.3 Lattice Regularization .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
5.4 MCMC .
. . . . . . . . . . . . 183
5.5 Quantum Simulation of Lattice Quantum Field Theories

.

.

.

.

.

.

.

CHAPTER 6

SUMMARY .

BIBLIOGRAPHY .

. .

.

.

.

.

.

.

.

.

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194

APPENDIX A

ADDITIONAL DETAILS ON CLASSICAL SPIN SYSTEMS . . . . . 209

APPENDIX B

TECHNICAL DETAILS OF SIMULATING THE EXTENDED-𝑂 (2)
MODEL .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

.

. 218

v

LIST OF ABBREVIATIONS

BKT

Berezinskii-Kosterlitz-Thouless

BMHA Biased Metropolis Heatbath Algorithm

CDF

HMC

Cumulative distribution function

Hybrid Monte Carlo

HOTRG Higher-order tensor renormalization group

LQFT

LQCD

lattice quantum field theory

lattice quantum chromodynamics

MCMC Markov chain Monte Carlo

MC

NISQ

PDF

QCD

QFT

RG

RNG

SVD

TRG

XY

Monte Carlo

Noisy intermediate scale quantum

Probability density function

Quantum chromodynamics

Quantum field theory

Renormalization group

Random number generator

Singular value decomposition

Tensor renormalization group

I.e. the 𝑋𝑌 model also called the 𝑂 (2) model in two dimensions

vi

⟨· · · ⟩

Ensemble average taken over many lattice configurations

LIST OF SYMBOLS

𝜶

𝜷

𝜷𝒄

𝑪

𝝌, 𝝌𝑴

𝝌2

𝑫

𝜹

𝑬

𝜼

Specific heat critical exponent

Inverse temperature or magnetic critical exponent

Inverse temperature at critical point

Specific heat

Magnetic susceptibility

Chi-square function or statistic

Number of dimensions

Magnetic critical exponent

Internal energy

Correlation function critical exponent

𝑭( 𝒑)

Structure factor

𝜸

𝑯

𝒉, (cid:174)𝒉

𝒉𝒒

Susceptibility critical exponent

Hamiltonian

External magnetic field

Symmetry breaking strength in Extended-O(2) model

𝑱, 𝑱𝒊, 𝑱𝒊 𝒋 Coupling strength

𝒌 𝑩

𝑳

Boltzmann constant

The length of the lattice along one/each dimension

𝑴, (cid:174)𝑴

Magnetization

𝑵

𝝂

𝑶 (𝒏)

𝒒

Number of lattice sites (typically) i.e. the volume of the system

Correlation length critical exponent

The orthogonal group in dimension 𝑛

Number of “clock” states in clock models

vii

𝑸

𝑺𝒊, (cid:174)𝑺𝒊

𝝈

𝑻

𝝉𝒊𝒏𝒕

𝜽 𝒊, 𝝋𝒊

𝑼𝑴

𝝃

𝒁

A goodness-of-fit measure

Spin variable at lattice site 𝑖

Standard deviation

Temperature

Integrated autocorrelation time

Angle of spin vector at site 𝑖

Binder cumulant

Correlation length

Partition function

viii

CHAPTER 1

INTRODUCTION

Quantum chromodynamics (QCD), a key component of the Standard Model of particle physics,

is the theory of the strong interaction that occurs between color-charged particles like quarks and

gluons. The expectation value of observables is expressed in terms of infinite-dimensional path

integrals. Evaluating these path integrals is not easy. One approach is that of perturbative QCD

where the path integrals are expanded in powers of the coupling constant. On the other hand,

lattice QCD is an ab initio nonperturbative approach where the theory is regularized on a lattice

and a Wick rotation is performed so that the path integrals can be interpreted as classical partition

functions. This allows us to apply the extensive machinery of classical statistical mechanics to

QCD. In particular, equilibrium expectation values can be estimated by Monte Carlo simulation.

Lattice QCD using Markov Chain Monte Carlo (MCMC) methods is our most powerful tool

for extracting predictions from quantum chromodynamics (QCD) in the non-perturbative regime,

however, there are some serious limitations and challenges. The distribution weight for the gauge

fields is proportional to the determinant of a very large matrix resulting from integrating over the

fermionic degrees of freedom, which makes even equilibrium calculations difficult. The challenges

become even more severe when we want to study non-equilibrium QCD or QCD at non-zero baryon

density. In both cases, we run into sign problems which make our current approaches impractical.

Quantum simulation may offer a way forward.

At its simplest, the idea of quantum simulation is to investigate quantum field theories by

studying the behavior of simpler analogues in a laboratory setting.

In a digital simulation, the

Hamiltonian is mapped to a quantum machine, such as a universal gate-based quantum computer,

which is time-evolved stroboscopically in discrete steps. This approach seems promising, however,

the circuit depth of these algorithms and the lack of error correction available on current hardware,

means this is more of a long-term approach. If we want useful results in the near-term, it seems

that analog simulation may be the best path forward. In an analog simulation, the Hamiltonian of

the field theory is mapped to a simpler quantum system, such as a lattice of cold atoms, which is

1

allowed to evolve continuously in real-time. Such real-time evolution would completely bypass the

sign problem that occurs when we try to simulate real-time physics using the traditional MCMC

approach.

Whether one pursues analog or digital quantum simulation, we will need to start with toy

models. One example is the Schwinger model, which is quantum electrodynamics (QED) in 1+1

dimensions. The Schwinger model exhibits some features of QCD such as confinement, however,

it is much easier to study since it is an Abelian model in 1+1 dimensions. Even simpler is the

Abelian-Higgs model (i.e. scalar QED), which in 1+1 dimensions is the Schwinger model with the

electron replaced by a complex scalar field. It has been shown that in principle the Abelian-Higgs

model can be mapped to a Rydberg ladder [1–4]. In this proposed approach, the Abelian-Higgs

Hamiltonian is mapped to an angular momentum Hamiltonian, which in turn is truncated to a finite

number (e.g. 5) of angular momentum states, which in principle can be simulated on a Rydberg

ladder. In the limit of infinite self-coupling and zero gauge coupling, the Abelian-Higgs model

reduces to the classical O(2) spin model.

In this dissertation, we study classical O(2)-like spin systems in two dimensions. There are

a number of reasons why these kinds of models are interesting to those working towards the

quantum simulation of quantum field theories. The first is that the classical O(2) spin model in two

dimensions is a particular limit of the Abelian-Higgs model, which itself is a toy model for QCD.

The implementation of such O(2)-like models on an analog simulator may therefore be a first step

toward the simulation of more complicated models like the Abelian-Higgs model or the Schwinger

model. With the O(2) model, we can add a symmetry-breaking term that allows us to break the O(2)

symmetry down to Z𝑞 in a controlled way. This allows us to study the role of symmetry in such

systems, and it allows us to study Z𝑞 approximations of U(1) or O(2) symmetry. This is relevant

for the “field digitization” of gauge theories. If we want to put gauge fields on a quantum simulator,

whether analog or digital, we need some way to handle the infinite Hilbert spaces of the gauge

fields with only a finite set of qubits. One way to handle this is to approximate the gauge fields,

by representing them with a discrete subgroup. This is the idea of field digitization. For example,

2

for a U(1) gauge theory such as Abelian-Higgs or the Schwinger model, one could approximate

the gauge group by Z𝑞. With the O(2) model extended by this symmetry breaking term, we can

continuously interpolate between different values of 𝑞.

With this dissertation, we have tried to present the material in a self-contained manner. This

meant starting with an introduction to classical spin systems and critical phenomena on the lattice.

This is the given in Chapter 2. In Chapter 3, we introduce MCMC simulations and their statistical

analysis. The goal was for the first two chapters to be a pedagogical introduction to these topics.

Then in Chapter 4, we present the bulk of the author’s research. This is the chapter on the Extended-

O(2) model. In Chapter 5, we come full circle back to QCD and the quantum simulation of QCD.

In Chapter 6 we summarize our findings.

3

CLASSICAL SPIN SYSTEMS AND CRITICAL PHENOMENA ON THE LATTICE

CHAPTER 2

2.1

Ising Model

2.1.1

Introduction

The Ising model was developed to model ferromagnetism. Despite its simplicity, in two and

higher dimensions it possesses features that appear also in the quantum field theories which describe

the Standard Model of particle physics. These mathematical models begin with the discretization

of space or spacetime as a lattice. The fields of the physical theory are defined on the lattice sites

and/or on the links between the sites, and the fundamental mechanics of the theory are encoded

in the way in which the field variables at neighboring sites and/or links interact. Monte Carlo

simulation of such lattice models provides a framework for studying these theories even in regimes

where other methods fail. Techniques originally derived to study Ising-like systems are used today

to solve problems in quantum field theory (QFT). As such, understanding these classical lattice spin

systems and the tools used to study them can be a useful first step to understanding lattice quantum

field theories.

In the Ising model, we define discrete “atomic spins” on a lattice of 𝑁 sites with periodic

boundary conditions. The Hamiltonian energy function, which gives the energy of a particular

lattice configuration or microstate, is

𝐻 = −

∑︁

⟨𝑖, 𝑗⟩

𝐽𝑖 𝑗 𝑆𝑖𝑆 𝑗 − 𝜇

𝑁
∑︁

𝑗=1

ℎ 𝑗 𝑆 𝑗 ,

(2.1)

where the first sum is over nearest neighbor pairs, 𝑆𝑖 = ±1 is the spin-variable sitting at the lattice

site 𝑖, 𝐽𝑖 𝑗 is the interaction strength between the spin variables at sites 𝑖 and 𝑗, 𝜇 is the magnetic

moment, and ℎ 𝑗 is the interaction strength between the spin variable at site 𝑗 and an external

magnetic field. Typically, the interaction strength for site variables is taken to be the same between

all nearest neighbor pairs, i.e. 𝐽𝑖 𝑗 = 𝐽, and the external magnetic field is taken to be constant,

4

ℎ 𝑗 = ℎ. The Ising Hamiltonian becomes

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

𝑆𝑖𝑆 𝑗 − ℎ

𝑁
∑︁

𝑗=1

𝑆 𝑗 ,

(2.2)

where 𝜇 has been absorbed in the new ℎ. The spin-variables in this model take the values +1 and

−1. Alternatively, one can think of the spin-variables as unit vectors pointing in some reference

direction e.g. + ˆ𝑦 or the opposite direction − ˆ𝑦. Then the magnetic field with strength ℎ biases the

spin variables toward the positive (if ℎ > 0) or negative direction (if ℎ < 0).

A mechanical system seeks to minimize its free energy. More precisely, at constant temperature

𝑇, the Helmholtz free energy 𝐹 = 𝐸 − 𝑇 𝑆 of a statistical mechanical system is minimized at

equilibrium. This involves the interplay of two quantities—the internal energy 𝐸 and the entropy 𝑆.

For the Ising model, the states with the lowest internal energy are those in which the spins are aligned

with each other, so states in which most or all spins are pointing in the same direction dominate

when the temperature is low. On the other hand, when the temperature is high, the free energy is

minimized when the entropy is large. High entropy states correspond to those with a lot of disorder,

i.e., the spins are pointing in random directions. As the temperature of the system is increased, at

some point it transitions from a mostly ordered phase to a mostly disordered phase. This transition

can be thought of as governed by the competition between the propensity for energy minimization

(which is ultimately due to conservation of energy) and the propensity for maximization of entropy

(which is ultimately due to the statistical fact that the system is more likely to find itself in a more

common configuration). The two-dimensional Ising model is of great theoretical interest because

it is the simplest system in statistical mechanics to show a phase transition. It is also one of the

few classical spin models whose exact solution is known in one and two dimensions. Furthermore,

exact results can be computed on two-dimensional lattices. See Appendix A.1.

At the microscopic level, the behavior of the individual spin variables is governed by a local

Hamiltonian—each spin-variable is affected only by its immediate neighbors. Nevertheless, at the

macroscopic level, the system shows sudden and dramatic changes in thermodynamic quantities—

e.g.

the specific heat shows an infinite discontinuity in an otherwise smooth function—as the

5

temperature of the system crosses the critical temperature associated with the phase transition.

Evidently, even well-behaved Hamiltonians governing the microscopic behavior can lead to non-

analytic behavior in the thermodynamic limit.

The partition function of the Ising model is

where the sum is over all possible lattice configurations, and

𝑍 =

∑︁

𝑒−𝛽𝐻𝑖 ,

𝑖

𝛽 =

1
𝑇

,

(2.3)

(2.4)

is the inverse temperature in units where the Boltzmann constant is 𝑘 𝐵 = 1. The Helmholtz free

energy is

𝐹 = −

1
𝛽

ln 𝑍 = 𝐸 −

1
𝛽

𝑆.

(2.5)

2.1.2 Thermodynamics and Observables

The results in this section were produced using Markov chain Monte Carlo (MCMC) with a

heatbath updating algorithm. MCMC methods and their statistical analysis will be introduced in

Chapter 3. The MCMC code that was used to produce the Ising results in this section was adapted

from publicly available Fortran code developed by Bernd Berg [5].

When working with lattice systems, it is often instructive to look at equilibrium lattice con-

figurations at different temperatures.

In two dimensions, this is particularly helpful, as lattice

configurations can easily be visualized as images. For example, one can use imshow in matplotlib

to display a lattice of numbers as a lattice of colors. In Figure 2.1, we show examples of equilibrium

Ising configurations at different temperatures. These results were obtained after performing MCMC

simulations on 400 × 400 lattices for different values of 𝛽. One can immediately see how the level

of order changes for equilibrium states at different 𝛽. At high temperatures (i.e. small 𝛽), the

equilibrium configurations are highly disordered. At low temperatures (large 𝛽), the equilibrium

configurations are highly ordered i.e.

they are almost fully magnetized in one direction. The

images in Figure 2.1 are from independent simulations performed at different fixed 𝛽. They do

6

(a) 𝛽 = 0

(b) 𝛽 = 0.2

(c) 𝛽 = 0.3

(d) 𝛽 = 0.4

(e) 𝛽 = 0.4407

(f) 𝛽 = 0.5

Figure 2.1 Example Ising configurations at different 𝛽 (inverse temperature) from MCMC simula-
tions performed on 400×400 lattices. At 𝛽 = 0, the system is at infinite temperature and completely
disordered/random. As 𝛽 is increased, the system cools and increasingly large domains develop.
Near 𝛽 = 0.4407, the system undergoes a second-order phase transition in the infinite-volume
limit. At larger values of 𝛽, the system is “frozen” with a single domain having taken over the entire
lattice. Note that these images are obtained from independent MCMC simulations at fixed 𝛽.

not illustrate how a single lattice would vary as 𝛽 is changed. Notice the appearance of magnetic

“domains” where multiple regions of different magnetizations coexist.

The internal energy is

𝐸 = −

𝜕
𝜕 𝛽

ln 𝑍 = ⟨𝐻⟩,

(2.6)

where 𝑍 is the partition function. For a set of lattice configurations, the internal energy is calculated

by computing 𝐻 for each configuration and then taking the average. To normalize the energy across

different lattice sizes, one typically reports the energy density 𝐸/𝑁 i.e. the energy per lattice site.

7

The specific heat is defined as

𝐶 = −

𝛽2
𝑁

𝜕𝐸
𝜕 𝛽

𝛽2
𝑁

=

(cid:16)

⟨𝐸 2⟩ − ⟨𝐸⟩2(cid:17)

,

(2.7)

where, as usual, 𝑁 is the volume or the number of sites in the lattice. The quantity on the right1

can be computed on a sample of equilibrium lattice configurations by computing 𝐸 and 𝐸 2 on each

configuration and then averaging over the sample to get ⟨𝐸⟩ and ⟨𝐸 2⟩. Note that the variance of a

random variable 𝑋 can be written as Var[𝑋] = ⟨𝑋 2⟩ − ⟨𝑋⟩2. Thus, the specific heat is proportional

to the variance or the fluctuations of the internal energy 𝐸.

The magnetization is formally defined as the derivative of the log of the partition function with

respect to the ordering field

𝑀′ =

1
𝛽

𝜕
𝜕ℎ

ln 𝑍 =

(cid:42) 𝑁
∑︁

(cid:43)

𝑆𝑖

,

𝑖=1

(2.8)

where the sum is over the sites of the lattice and the angle brackets indicate an average over an

ensemble of lattices. Typically, one reports the magnetization density 𝑀/𝑁 i.e. the magnetization

per lattice site. For a completely disordered lattice, 𝑀/𝑁 = 0 in the 𝑁 → ∞ limit. For a completely

ordered lattice, 𝑀/𝑁 = −1 or 𝑀/𝑁 = +1. Since the lattice should be equally likely to magnetize

in the +1 or the −1 direction, these will cancel out in the limit of infinite statistics. Therefore, it is

generally more useful to use a proxy magnetization such as

𝑀 =

(cid:42)(cid:12)
𝑁
(cid:12)
∑︁
(cid:12)
(cid:12)
(cid:12)

𝑖=1

𝑆𝑖

(cid:43)

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

.

(2.9)

On a two-dimensional lattice with no external magnetic field, 𝑀 is a nonzero quantity for 𝛽 > 𝛽𝑐.

This is the phenomenon of spontaneous magnetization. For the two-dimensional Ising model in

infinite volume, the exact value is known to be

𝑀 (𝛽) =

(cid:20)

1 −

1
sinh4(2𝛽)

(cid:21) 1/8

.

(2.10)

This formula was presented without proof by Onsager during a pair of conferences in 1948 [6] and

proved by Yang in 1952 [7].

1It is easy to show that 𝜕𝐸/𝜕 𝛽 = −⟨𝐸 2⟩ + ⟨𝐸⟩2 by starting with the definition ⟨𝐸⟩ = (cid:205)𝑖 𝐸𝑖𝑒−𝛽𝐸𝑖 /(cid:205)𝑖 𝑒−𝛽𝐸𝑖 ,

differentiating both sides, and then simplifying using ⟨𝑂⟩ = (cid:205)𝑖 𝑂𝑖𝑒−𝛽𝐸𝑖 /(cid:205)𝑖 𝑒−𝛽𝐸𝑖 for a general observable 𝑂.

8

The magnetic susceptibility, i.e. the fluctuations of the magnetization, can be formally defined

as

𝜒𝑀 ′ =

1
𝛽𝑁

𝜕 𝑀′
𝜕ℎ

=

1
𝑁

(cid:16)

⟨𝑀′2⟩ − ⟨𝑀′⟩2(cid:17)

.

We use the proxy magnetization Eq. (2.9) instead of Eq. (2.8) and hence use

𝜒𝑀 =

1
𝑁

(cid:16)

⟨𝑀 2⟩ − ⟨𝑀⟩2(cid:17)

.

(2.11)

(2.12)

In Figure 2.2, we show the internal energy density, specific heat, magnetization per site, and the

magnetic susceptibility all versus 𝛽 for lattice volumes going from 4 × 4 up to 64 × 64. The Ising

model has a second-order phase transition at 𝛽𝑐 = ln(1 +

√

2)/2 ≈ 0.440687 in the infinite-volume

limit, and each of these thermodynamic functions shows a particular kind of response near the

critical point. The specific heat and magnetic susceptibility, for example, diverge with increasing

lattice size near the critical point.

2.1.3 Symmetry Considerations

In the Ising model with no external field, there is a global Z2 symmetry—that is, one can

transform the spin variables 𝑆 → −𝑆 without changing the energy function Eq. (2.2). This

symmetry is spontaneously broken when at low energy, the system chooses either the configuration

with most spins oriented up or the configuration with most spins oriented down. These two

configurations have the same energy, however, there is a very large energy barrier between them,

and it is very unlikely for the overall magnetization to flip orientations in the low energy sector.

See Figure 2.3.

Since the two low-energy possibilities have the same energy and spin-flip probabilities, one

can say that there are two “thermodynamically equivalent” low-energy sectors. A Monte Carlo

simulation at very low energy (i.e.

large 𝛽) may spend all of its time “frozen” in one of these

sectors. The likelihood of “tunneling” to the other sector is very low since almost all spins would

have to flip directions. In such a case, the Monte Carlo simulation is stuck in one sector and is

not sampling the entire space.

In the Ising model, this is not a problem since the two sectors

are thermodynamically equivalent, and all measurements will be the same whether it’s stuck in

9

Figure 2.2 Here we show several thermodynamical functions versus 𝛽 for the Ising model on several
different lattice sizes. From top to bottom, we have the energy density, specific heat, magnetization
per site, and the magnetic susceptiblity. Notice how these functions behave near the critical point
𝛽𝑐 = ln(1 +

2)/2 ≈ 0.4407 where there is a second-order phase transition.

√

10

2.01.51.00.50.0E/N4×48×816×1632×3264×640.00.51.01.52.02.53.0C0.00.20.40.60.81.0M/N0.00.10.20.30.40.50.60.70.80255075100125150MFigure 2.3 In the Ising model with no external field, there is a global Z2 symmetry—one can
transform the spin variables 𝑆 → −𝑆 without changing the energy function Eq. (2.2). This
symmetry is spontaneously broken when at low energy, the system chooses either the configuration
with most spins oriented up or the configuration with most spins oriented down. These two
configurations have the same energy, however, there is a very large energy barrier between them,
and it is very unlikely for the overall magnetization to flip orientations in the low energy sector.

one sector or the other. In a later chapter, we will see examples of models with explicitly-broken

symmetries which result in thermodynamically distinct low-energy sectors, which cause a lot of

trouble for the MCMC approach.

2.1.4 Topological Defects: Domain Walls

In ferromagnetism, a magnetic domain is a region in which the magnetization is in a uniform

direction. When cooled below the Curie temperature, the magnetization of a ferromagnetic material

spontaneously divides into many magnetic domains. An analogous phenomenon occurs in the Ising

model. Near the critical point, many neighboring spins align with each other forming “domains”

of aligned spins. This can be seen very clearly in panels (d) and (e) of Figure 2.1. The boundaries

between domains of oppositely-oriented spins are called “domain walls”.

In the Ising model, magnetic domains are a consequence of the competition between energy-

minimization and entropy maximization. Consider the Ising model in one dimension for some

finite 𝛽. When all 𝑁 spins are aligned

· · · + + + + + + + + + + + + + + + + + + + + + + + + · · ·

11

high energylow energythen the internal energy 𝐸 is at its absolute minimum, but the entropy is zero. In contrast, consider

the same model but now with a domain wall created by flipping all spins to the right of some

arbitrary point

· · · + + + + + + +

(cid:12)
(cid:12)
(cid:12)

− − − − − − − − − − − − − − − · · ·

The internal energy is increased by 2, but the entropy is increased by 𝑘 𝐵 ln(𝑁 − 1) since there are

𝑁 − 1 places to put the wall. Thus, the free energy changes by an amount Δ𝐹 = 2 − ln(𝑁 − 1)/𝛽.

Evidently, for finite 𝛽 and for 𝑁 → ∞, the creation of a domain wall lowers the free energy. This

implies that there cannot be spontaneous magnetization in the one-dimensional Ising model, and

thus no phase transition at finite temperature.

In the two-dimensional Ising model, an argument from domain walls can be used to show the

existence of spontaneous magnetization (and therefore a phase transition) [8–10]. Also, the low-

temperature expansion of the two-dimensional Ising model can be expressed in terms of domain

walls. See for example [11].

2.1.5 Second-order Phase Transitions

In the old Ehrenfest classification, phase transitions are labeled by the lowest derivative of the

free energy that is discontinuous at the transition. In this scheme, a “first-order” transition exhibits a

discontinuity in the first derivative of the free energy with respect to some thermodynamic variable.

A “second-order” phase transition is continuous in the first derivative but discontinuous in the

second derivative. The internal energy, Eq. (2.6), and the specific heat, Eq. (2.7), are the first and

second derivatives, respectively, of the free energy with respect to 𝛽. The magnetization, Eq. (2.9),

and magnetic susceptibility, Eq. (2.12), are formally the first and second derivatives of the free

energy with respect to the external field ℎ. In the modern classification, phase transitions are also

divided into “first” and “second” order phase transitions, which mostly coincide with the Ehrenfest

classification.

The Ising model has a second-order phase transition at the critical point 𝛽𝑐 = ln(1 +

√

2)/2 ≈

0.4407. At the phase transition, the internal energy and magnetization are continuous, but the

specific heat and magnetic susceptibility diverge. On a finite lattice, true divergences cannot occur,

12

Table 2.1 Here we show (for reference) the critical exponents of the Ising model in two dimensions.
The final column gives the exact infinite-volume critical point.

𝛼

0

𝛽

1/8

𝛾

7/4

𝛿

15

𝜂

1/4

𝜈

1

𝛽𝑐
1
2 ln(1 +

√

2)

however as seen in Figure 2.2, we do see peaks in the specific heat and magnetic susceptibility, which

grow with increasing lattice size. Here we see that the specific heat diverges “logarithmically” with

volume, i.e. the peak height increases by a constant value each time the lattice size is doubled. The

magnetic susceptibility diverges faster.

Near a phase transition, the thermodynamic functions exhibit power law divergences described

by a set of critical exponents: 𝛼, 𝛾, 𝜈, 𝛽, 𝛿, 𝜂. For example, if one defines the reduced temperature

𝜏 = (𝑇 − 𝑇𝑐)/𝑇𝑐 = (𝛽𝑐 − 𝛽)/𝛽, then the specific heat near the phase transition diverges as 𝐶 ∝ 𝜏−𝛼

with some fixed value for the critical exponent2 𝛼. The magnetic susceptibility diverges as 𝜒 ∝ 𝜏−𝛾,

the correlation length diverges as 𝜉 ∝ 𝜏−𝜈, the ordering field and the magnetization are related as

ℎ ∝ 𝑀 𝛿, and the spin-spin correlator for spins separated by a distance 𝑟 diverges as ⟨𝑆0𝑆𝑟⟩ ∝ 𝑟 −𝑑+2−𝜂.

To list them nicely in one place, we repeat them here:

𝐶 ∝ 𝜏−𝛼

𝜒 ∝ 𝜏−𝛾

𝜉 ∝ 𝜏−𝜈

𝑀 ∝ 𝜏 𝛽

ℎ ∝ 𝑀 𝛿

⟨𝑆0𝑆𝑟⟩ ∝ 𝑟 −𝑑+2−𝜂

(2.13)

(2.14)

(2.15)

(2.16)

(2.17)

(2.18)

A fixed set of critical exponents defines a universality class. For example, in Table 2.1, we list the

exponents of the 2D Ising universality class.

2More completely, one can define two exponents 𝛼 and 𝛼′ to allow for different behavior on either side of the

critical point.

13

2.2 Potts Models

2.2.1

Introduction

The Potts model is a generalization of the Ising model. Whereas each Ising spin variable takes

on the two possible values ±1, in the Potts model, the spin variable takes on one of 𝑞 possible

values. In terms of notation, we follow mostly [5]. The Potts model is also called the “standard

Potts model” or sometimes the “Ashkin-Teller-Potts model”. The “vector” or “planar” Potts model,

on the other hand, refers to what is more commonly called the “clock model”, which we will look

at in Section 2.4.

As with the Ising model, we consider a 𝑑-dimensional lattice with 𝑁 sites and periodic boundary

conditions. For the 𝑞-state Potts model, the 𝑖th lattice site contains a spin-variable 𝑞𝑖, which can

take on the possible values 𝑞𝑖 = 0, 1, . . . 𝑞 − 1. Each spin interacts only with its nearest neighbors.

The Hamiltonian energy function is

𝐻 = −2 ∑︁
⟨𝑖, 𝑗⟩

𝐽𝑖 𝑗 (𝑞𝑖, 𝑞 𝑗 )𝛿𝑞𝑖 𝑞 𝑗 +

2𝑑𝑁
𝑞

−

2
𝛽

ℎ

𝑁
∑︁

𝑖=1

𝛿0𝑞𝑖 ,

(2.19)

where the first sum is over nearest neighbor pairs of lattice sites and 𝐽𝑖 𝑗 (𝑞𝑖, 𝑞 𝑗 ) parameterizes the

interaction between adjacent lattice sites 𝑖 and 𝑗. In principle, 𝐽𝑖 𝑗 could be different for each pair

of adjacent sites and thus would be an array. Typically, 𝐽𝑖 𝑗 = 𝐽 is a constant, and a common choice

is 𝐽 = 1. The second term is a convenient normalization term. The third term accounts for the

possibility of an external magnetic field. With this choice of the magnetic field term, the external

field interacts only with site variables in the state 𝑞𝑖 = 0. When 𝐽 = 1 and 𝑞 = 2, the Potts model is

equivalent to the Ising model.

For the rest of this section, we will assume 𝐽 = 1 and ℎ = 0, in which case the Hamiltonian

function simplifies to

𝐻 = −2 ∑︁
⟨𝑖, 𝑗⟩

𝛿𝑞𝑖 𝑞 𝑗 +

2𝑑𝑁
𝑞

.

The canonical partition function takes the familiar form

𝑍 =

∑︁

𝑒−𝛽𝐻𝑖 ,

where the sum is over lattice configurations.

𝑖

14

(2.20)

(2.21)

(a) 𝑞 = 2, 𝛽 = 0.4407

(b) 𝑞 = 3, 𝛽 = 0.5025

(c) 𝑞 = 4, 𝛽 = 0.5493

(d) 𝑞 = 5, 𝛽 = 0.5872

(e) 𝑞 = 6, 𝛽 = 0.6191

(f) 𝑞 = 10, 𝛽 = 0.7130

Figure 2.4 Example 𝑞-state Potts model configurations at different 𝑞 from MCMC simulations
performed on 400 × 400 lattices near their critical points.

2.2.2 Thermodynamics and Observables

The results in this section were produced using MCMC with a heatbath updating algorithm.

The MCMC code was adapted from publicly available Fortran code developed by Bernd Berg [5].

In Figure 2.4 we show example equilibrium configurations for the 𝑞-state Potts model at different

𝑞 from MCMC simulations performed on 400 × 400 lattices near their critical points.

As usual, the internal energy3 is defined as

𝐸 = −

𝜕
𝜕 𝛽

ln 𝑍 = ⟨𝐻⟩,

(2.22)

At 𝛽 = 0, the system is completely disordered and all possible values of 𝑞𝑖 are equally likely. In

that case, the expected contribution to the energy of site 𝑖 is 1/𝑞, and so the sum in Eq. (2.20) is

3Sometimes it is more convenient to consider the “action” 𝑆 = ⟨(cid:205)

⟨𝑖, 𝑗 ⟩ 𝛿𝑞𝑖 𝑞 𝑗 ⟩ since for the Potts model, this quantity
is a non-negative integer. For a given configuration, the action takes the possible values 𝑆 = 0, 1, 2, . . . , 𝑑𝑁. The
largest possible action, 𝑆𝑚𝑎𝑥 = 𝑑𝑁, occurs when all spins have the same value.

15

canceled out by the normalization term 2𝑑𝑁/𝑞, resulting in

The minimum energy density, which occurs when 𝛽 → ∞ and all spins are aligned, is easily

𝐸
𝑁

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝛽=0

= 0.

(2.23)

calculated to be

For the specific heat, we take the usual definition

𝐸
𝑁

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝛽→∞

= 2𝑑

(cid:19)

.

− 1

(cid:18) 1
𝑞

𝐶 = −

𝛽2
𝑁

𝜕𝐸
𝜕 𝛽

𝛽2
𝑁

=

(cid:16)

⟨𝐸 2⟩ − ⟨𝐸⟩2(cid:17)

.

(2.24)

(2.25)

In two dimensions, the Potts model has a second order phase transition for 𝑞 ≤ 4 and a first

order phase transition for 𝑞 ≥ 5 [12]. The infinite-volume critical point is [5]

𝛽𝑐 =

1
𝑇𝑐

1
2

=

ln(1 +

√

𝑞),

𝑞 = 2, 3, . . .

(2.26)

Formally, the magnetization is the derivative of the log of the partition function with respect to

the ordering field

𝑀′ =

𝜕
𝜕ℎ

(cid:42)

ln 𝑍 =

2

(cid:43)

𝛿0𝑞𝑖

.

𝑁
∑︁

𝑖=1

(2.27)

However, in MCMC simulations without an ordering field, this quantity does not signal critical

behavior, so we turn to proxy definitions of the magnetization. Typically, one wants the “magneti-

zation” to be a measure of the amount of order in the lattice. For a system with a phase transition,

it should be some order parameter whose derivative shows a divergence at the critical point. We

start by defining the “occupation numbers”

𝑁𝑞0 =

𝑁
∑︁

𝑖=1

𝛿𝑞𝑖,𝑞0,

𝑞0 = 0, 1, . . . , 𝑞 − 1.

(2.28)

That is, 𝑁𝑞0 gives the number of lattice sites with value 𝑞0. These numbers satisfy 𝑁0 + 𝑁1 + · · · +

𝑁𝑞−1 = 𝑁. One approach, as in Ref. [5], is to define the Potts magnetization density with respect

to 𝑞0 as

𝑀𝑞0 =

(cid:10)𝑁𝑞0

(cid:11) .

1
𝑁

16

(2.29)

where the angle brackets as usual represent the ensemble average taken over many lattices at the

same value of 𝛽. This magnetization can be understood as the average fraction of spin variables

with spin value 𝑞 = 𝑞0. For example, in a 3-state Potts model with all spins in the state 𝑞 = 1, we

would have 𝑀1 = 1 and 𝑀0 = 𝑀2 = 0. For the 2-state Potts model (𝑞𝑖 ∈ {0, 1}), the conventional

definition of the Ising magnetization, Eq. (2.9), is the difference 𝑚 = ⟨|𝑁0 − 𝑁1|⟩. For example,

in a 2-state Potts model with 75% of the spins in the state 𝑞 = 1, the Potts magnetizations are

𝑀1 = 0.75 and 𝑀0 = 0.25, and the Ising magnetization density is 𝑚/𝑁 = |0.25 − 0.75| = 0.50. In

the infinite-time limit, the expectation of the Potts magnetization, when there is no external field, is

𝑀𝑞0 =

1
𝑞

,

(2.30)

independent of 𝛽. Thus, the Potts magnetization does not signal critical behavior and cannot be

used as an order parameter. However, the Potts magnetization squared

𝑀 2

𝑞0 ≡ 𝑞

(cid:42) 𝑁 2
𝑞0
𝑁 2

(cid:43)

,

(2.31)

does signal critical behavior. According to [5], a finite size scaling analysis would reveal a

singularity at 𝛽𝑐 in 𝑀 2
𝑞0

for first-order transitions and in the derivative of 𝑀 2
𝑞0

for second-order

transitions.

In this work, we follow others (e.g. [13, 14]) in defining magnetization in the 𝑞-state Potts

model. Now we are interested in the largest occupation number 𝑁 ∗ ≡ max(𝑁0, 𝑁1, · · · , 𝑁𝑞−1) on

a per-configuration basis. For example, if a particular 10 × 10 configuration of the 3-state clock

model has thirty-three sites with 𝑞𝑖 = 0, twenty-one sites with 𝑞𝑖 = 1, and the remaining forty-six

sites with 𝑞𝑖 = 2, then 𝑁 ∗ = 46 for that configuration. We then define the magnetization for the

𝑞-state Potts model as

which is normalized such that the magnetization per site is

𝑀 =

𝑞⟨𝑁 ∗⟩ − 1
𝑞 − 1

,

𝑀
𝑁

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝛽=0

= 0,

𝑀
𝑁

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝛽→∞

= 1.

17

(2.32)

(2.33)

For the magnetic susceptibility, we take the usual definition

𝜒𝑀 =

1
𝑁

(cid:16)

⟨𝑀 2⟩ − ⟨𝑀⟩2(cid:17)

.

(2.34)

In Figure 2.5, we look at these thermodynamic quantities as functions of 𝛽 for different values

of 2 ≤ 𝑞 ≤ 10. These results are from MCMC simulations performed on 32 × 32 lattices. The top

panel in this figure shows the energy density. As shown in Eq. (2.23), the energy density is zero

at 𝛽 = 0. This reflects the normalization chosen such that a fully disordered lattice corresponds to

zero energy. The inflection points in the energy density curves occur near the critical points given

by Eq. (2.26). As 𝑞 increases, the critical point shifts toward larger 𝛽. Notice that for 𝑞 ≥ 5, the

energy density seems to become discontinuous at the critical point. This is a reflection of the phase

transition being first-order for 𝑞 ≥ 5. On an infinite lattice, there would be true discontinuities here.

As 𝛽 becomes large, the energy density asymptotically approaches the value given by Eq. (2.24).

In the second panel, we plot the specific heat defined in Eq. (2.25). The specific heat shows a

prominent peak near the critical point, and we see that this peak becomes taller and sharper at larger

𝑞. In the third panel, we show the magnetization defined in Eq. (2.32). In the disordered regime at

small 𝛽, the magnetization is 𝑀 ∼ 0, whereas in the ordered regime at large 𝛽, 𝑀 ∼ 1. As with the

energy, the magnetization also seems to become discontinuous at the critical point for 𝑞 ≥ 5. In

the fourth and final panel, we plot the magnetic susceptibility defined in Eq. (2.34). This quantity

shows peaks near the critical point similar to those seen in the specific heat.

2.2.3 First-order Phase Transitions

The 𝑞-state Potts model is known to have a second-order phase transition for 𝑞 = 2, 3, 4 and a

first-order phase transition for 𝑞 ≥ 5 [12]. In the infinite volume limit, this implies for 𝑞 = 2, 3, 4, a

continuous energy and magnetization but a discontinuous specific heat and magnetic susceptibility.

For 𝑞 ≥ 5, it implies a discontinuous energy and magnetization and also of course discontinuous

specific heat and susceptibility. Numerical studies are performed on finite lattices which cannot

show true discontinuities, so it is at times tricky to discern the nature of a phase transition. Typically,

numerical simulations have to be done on different size lattices and then one tries to extrapolate the

18

Figure 2.5 The energy density, specific heat, magnetization per site, and magnetic susceptibility
of the 𝑞-state Potts model on a 32 × 32 lattice. These were obtained from low statistics MCMC
simulations using a heatbath algorithm. Notice how the energy is continuous at 𝑞 = 2, 3, 4, but
becomes discontinuous at larger 𝑞. This is an illustration of the Potts model having a second-order
phase transition at small 𝑞, but a first-order phase transition at larger 𝑞.

19

3.53.02.52.01.51.00.50.0E/Nq = 2q = 3q = 4q = 5q = 6q = 7q = 8q = 9q = 10050100150200250C0.00.20.40.60.81.0M/N0.20.30.40.50.60.70.8020406080100120140MFigure 2.6 The specific heat (left) and magnetic susceptibility (right) of the 8-state Potts model
both diverge sharply at the first-order phase transition near 𝛽 ≈ 0.671227. On a finite lattice, these
divergences are finite, but indicate the presence of a phase transition in the infinite volume model.
These examples were obtained from small-statistics MCMC simulations using a heatbath updating
algorithm.

Figure 2.7 Histograms of the energy density at different values of 𝛽 in the vicinity of a phase
transition. In the left panel, histograms of the energy density near the second-order phase transition
of the 2-state Potts model (i.e. the Ising model). In the right panel, histograms of the energy density
near the first-order phase transition of the 5-state Potts model. Notice that the distribution becomes
bimodal near a first-order critical point. These results were obtained on 100 × 100 lattices using an
MCMC heatbath algorithm.

behavior to the infinite-volume limit. In Figure 2.6, we show the specific heat (left panel) and the

magnetic susceptibility (right panel) for the 8-state Potts model on lattices of different size. These

quantities are clearly diverging with volume.

Distinguishing between first and second order phase transitions is not always easy. The specific

20

0.620.640.660.680.700100200300400C4x48x816x1632x3264x640.620.640.660.680.700100200300400500M1.61.51.41.3E/N0200000400000600000=0.4370=0.4380=0.4390=0.4400=0.44102.62.42.22.01.81.6E/N050000100000150000200000=0.5850=0.5860=0.5867=0.5870=0.5890heat and magnetic susceptibility diverge with volume in both cases. However, for first-order

transitions, the energy and magnetization become discontinuous at the critical point. For second-

order transitions, they do not. We see hints of these energy and magnetization discontinuities

for 𝑞 ≥ 5 in Figure 2.5. For example, for 𝑞 = 10, we see that as 𝛽 → 𝛽𝑐 from the left, the

energy approaches 𝐸/𝑁 ≈ −1.5, but as 𝛽 → 𝛽𝑐 from the right, the energy approaches 𝐸/𝑁 ≈ −3.

Histogramming the energy values from an MCMC simulation performed very close to the critical

point 𝛽𝑐 results in a bimodal distribution as seen in the right panel of Figure 2.7. In fact, one way

to find the critical point 𝛽𝑐 of a first-order phase transition is to vary 𝛽 until the two peaks in such

a histogram have the same height. A similar histogram of the energies near the critical point of a

second-order phase transition, shows a single peak as seen in the left panel of Figure 2.7.

2.3 𝑂 (𝑛) Models

2.3.1

Introduction

We turn our attention now to our first spin system with continuous symmetry. The Hamiltonian

energy function of the classical 𝑂 (𝑛) model (aka 𝑛-vector model) on a lattice with 𝑁 sites is

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

(cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 −

𝑁
∑︁

𝑖=1

(cid:174)ℎ · (cid:174)𝑆𝑖,

(2.35)

where the first sum is over nearest neighbor pairs, 𝐽 gives the coupling strength, (cid:174)𝑆𝑖 is an 𝑛-component

unit vector at the site 𝑖, and (cid:174)ℎ is the external magnetic field.

We will focus on the case 𝑛 = 2, which is commonly called the “𝑋𝑌 model”. In this case,

we can parameterize the spin variable using a single angle 𝜃𝑖 ∈ [0, 2𝜋) as (cid:174)𝑆𝑖 ≡ (cos 𝜃𝑖, sin 𝜃𝑖)𝑇 .

Assuming a constant and uniform magnetic field, we can choose (cid:174)ℎ = ℎ ˆ𝑥. Then the Hamiltonian

simplifies to

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

cos(𝜃𝑖 − 𝜃 𝑗 ) − ℎ

𝑁
∑︁

𝑖=1

cos 𝜃𝑖.

(2.36)

For the rest of this section, we will assume 𝐽 = 1 and no external field so that ℎ = 0, in which case

𝐻 = −

∑︁

⟨𝑖, 𝑗⟩

cos(𝜃𝑖 − 𝜃 𝑗 ).

21

(2.37)

Furthermore, we will focus on the 𝑂 (2) model in two dimensions.

In the previous models we

considered, there was a discrete symmetry, but here we have a model with continuous 𝑂 (2)/𝑈 (1)

symmetry.

The 𝑋𝑌 model has been studied analytically [15], using Monte Carlo methods [16–23], and

using tensor methods [24]. Unless otherwise noted, numerical results and plots presented in this

section were obtained using a lattice codebase developed in-house.

The canonical partition function must now take into account that the degrees of freedom are

continuous. Schematically, we can write

∫

𝑍 =

𝑑𝑈 𝑒−𝛽𝐻 (𝑈),

(2.38)

where the integration is over lattice configurations 𝑈.

2.3.2 Thermodynamics and Observables

The results in this section were produced using a Biased Metropolis Heatbath Algorithm

(BMHA) in an MCMC codebase developed by the author.

The internal energy and specific heat for the 𝑋𝑌 model are defined in the usual way.

We can define the magnetization vector for a given spin configuration as

(cid:174)𝑀 = (cid:0)𝑀𝑥, 𝑀𝑦(cid:1) =

(cid:32) 𝑁
∑︁

𝑖=1

cos 𝜃𝑖,

(cid:33)

sin 𝜃𝑖

=

𝑁
∑︁

𝑖=1

𝑁
∑︁

𝑖=1

(cid:174)𝑆𝑖,

(2.39)

where 𝑁 is the number of lattice sites, and (cid:174)𝑆𝑖 is the spin vector at site 𝑖 which makes an angle

𝜃𝑖 with some reference direction. With this definition, the ensemble average matches the formal

definition of the magnetization as the derivative of the logarithm of the partition function

⟨ (cid:174)𝑀⟩ =

1
𝛽

𝜕
𝜕 (cid:174)ℎ

ln 𝑍 =

(cid:42) 𝑁
∑︁

(cid:43)

(cid:174)𝑆𝑖

.

𝑖=1

(2.40)

However, in MCMC simulations with zero external field (i.e. (cid:174)ℎ = 0) and no explicit symmetry

breaking, the spontaneous magnetization is uninteresting:

⟨ (cid:174)𝑀⟩ = 0. So we use instead the

magnitude of the vector magnetization

𝑀 = | (cid:174)𝑀 | =

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

𝑁
∑︁

𝑖=1

(cid:174)𝑆𝑖

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

=

(cid:118)(cid:117)(cid:117)(cid:116)(cid:32) 𝑁
∑︁

(cid:33) 2

cos 𝜃𝑖

+

𝑖=1

22

(cid:33) 2

sin 𝜃𝑖

,

(cid:32) 𝑁
∑︁

𝑖=1

(2.41)

as a proxy magnetization. This serves as an order parameter to signal critical behavior. Other

quantities that may be used as an order parameter include ⟨𝑀⟩ and ⟨𝑀𝑥⟩ or ⟨𝑀𝑦⟩ or even ⟨𝑀 2⟩ =

⟨𝑀 2

𝑥 ⟩ + ⟨𝑀 2

𝑦 ⟩. In the thermodynamic limit, the Mermin-Wagner theorem requires that ⟨𝑀⟩ = 0,

however, on any finite lattice, this quantity is nonzero at low temperature and may be used as an

order parameter.

By the Mermin-Wagner theorem, a continuous symmetry cannot be spontaneously broken in a

two-dimensional system with short-range interactions at any finite temperature. The 𝑋𝑌 model has

a continuous 𝑈 (1) symmetry, and so this theorem implies that long-range order (i.e. magnetization)

cannot occur. Physically, this can be understood as follows. In a low-dimensional system with a

continuous symmetry such as the 𝑋𝑌 model, long-range order is more fragile and vulnerable to

thermal fluctuations, and the long-range order that one might expect at low temperature ends up

being destroyed by long wavelength spin-wave excitations [25]. On the other hand, the Mermin-

Wagner theorem applies to the infinite 𝑋𝑌 model. Magnetization can and does occur at sufficiently

low temperature in even the largest realizable finite-size implementation of the 𝑋𝑌 model. This

effectively makes the true thermodynamic limit of the 2D 𝑋𝑌 model inaccessible [26–29].

Once the order parameter 𝑀 is chosen, one defines the magnetic susceptibility in the usual

manner.

In Figure 2.8, we show several thermodynamic quantities versus 𝛽 for the 𝑋𝑌 model at different

lattice sizes. In the top panel, we show the energy density, and in the second panel the specific heat.

In the third panel, we show the magnetization per lattice site, and in the bottom panel, we show the

magnetic susceptibility. Near the critical point, the magnetic susceptibility diverges with volume,

but the specific heat does not. This is in contrast to the behavior of the specific heat near a first or

second-order transition.

2.3.3 Topological Defects: Vortices

In the two-dimensional 𝑋𝑌 model, where the spin degrees-of-freedom are vectors that can

take angles 𝜃 ∈ [0, 2𝜋), we see a new kind of topological excitation—vortices, antivortices, and

vortex-antivortex pairs.

23

Figure 2.8 Here we show several thermodynamic quantities versus 𝛽 for the 𝑋𝑌 model at different
lattice sizes. In the top panel, we show the energy density, and in the second panel the specific heat.
In the third panel, we show the magnetization per lattice site, and in the bottom panel, we show the
magnetic susceptibility. Near the critical point, the magnetic susceptibility diverges with volume,
but the specific heat does not. This is in contrast to the behavior of the specific heat near a first or
second-order transition.

24

1.751.501.251.000.750.500.250.00E/N4×48×816×1632×3264×640.000.250.500.751.001.251.50C0.00.20.40.60.8M/N0.000.250.500.751.001.251.501.752.0001020304050MWe start by defining the winding number as in [25]

𝑘 =

1
2𝜋

∑︁

Γ

Δ𝜑𝑖𝑖′,

where Γ is an oriented closed loop and

Δ𝜑𝑖𝑖′ = 𝜑𝑖′ − 𝜑𝑖 ∈ [−𝜋, 𝜋),

(2.42)

(2.43)

is the angular difference between spin 𝜑𝑖′ and its loop neighbor 𝜑𝑖. If an oriented closed loop has

𝑘 > 0, it is a (positive) vortex of strength 𝑘, and if it has 𝑘 < 0, it is a “negative vortex” or an

“antivortex” of strength 𝑘. The most common value is 𝑘 = 0. In practice, one can choose Γ to be

an elementary plaquette measured in the counter-clockwise manner.

There are really two varieties of vortices that appear in these kinds of models. In a Monte Carlo

simulation, “dynamical” vortices occur when a system is quenched from a very high temperature

to a low temperature. The vortices and antivortices that appear during the equilibration period are

large and prominent and make nice pictures as in Figure 2.9. In this figure we start with a system at

infinite temperature and cool it very quickly to a cold temperature. At first there are a large number

of tiny vortices and antivortices which quickly annihilate leaving a handful of vortex-antivortex

pairs that last for a long time (in terms of Monte Carlo steps) before they too annihilate—leaving a

quasi-ordered lattice. Dynamics are explored in [16, 30–32].

One can also study “equilibrium” vortices, by appropriately sampling equilibrium lattice con-

figurations and identifying the vortices therein. In this case, one is not interested in the dynamical

evolution of the vortex through Monte Carlo time. Rather, one may be interested in the density

of vortices and their role in the BKT transition as one approaches the critical point 𝛽𝑐. Exam-

ple equilibrium vortices are shown in Figure 2.10. Further discussion of vortices can be found

in [18, 29, 33–35].

2.3.4 BKT Phase Transition

Typically, a phase transition in a classical spin system occurs between a disordered phase at

high temperature and a broken symmetry phase with long-range order at low temperature. The 𝑋𝑌

model, which has continuous unbroken symmetry, cannot have long-range order. For this reason,

25

(a) 𝑡 𝑀𝐶 = 0

(b) 𝑡 𝑀𝐶 = 1

(c) 𝑡 𝑀𝐶 = 10

(d) 𝑡 𝑀𝐶 = 100

(e) 𝑡 𝑀𝐶 = 1000

(f) 𝑡 𝑀𝐶 = 10000

Figure 2.9 Here we show an example of the evolution of a 32 × 32 lattice in the 𝑋𝑌 model with
periodic boundary conditions when it is started at infinite temperature and immediately quenched
to nearly zero temperature. We start in panel (a) with random spins. This corresponds to an
infinite temperature lattice. At this temperature, tiny vortices (black squares) and antivortices (red
squares) proliferate. Vortex positions are identified by computing the winding number Eq. (2.42)
of each plaquette. A winding number of +1 corresponds to a vortex, and a winding number of
−1 corresponds to an antivortex. Panel (b) shows the same lattice after one BMHA sweep with
the low temperature 𝛽 = 100. Already many of the vortices and antivortices have annihilated and
some order is starting to appear. Panel (c) shows the lattice after 10 BMHA sweeps with 𝛽 = 100.
Panels (d) and (e) show the lattice after 100 and 1000 BMHA sweeps. Now only two or three
vortex-antivortex pairs remain. But they are large and annihilate slowly. By the 10000th BMHA
sweeps (panel (f)), no vortex-antivortex pairs remain, and the lattice is in a state of quasi-long-range
order.

such a model was believed to have no phase transition. However, the work of Berezinskii, Kosterlitz,

and Thouless [15, 36, 37], showed that the 𝑋𝑌 model has a new kind of phase transition—one that

separates a region with quasi-long-range order from one where correlations decay exponentially.

The BKT transition is not accompanied with any symmetry-breaking, making it a rather special

26

(a) 𝛽 = 0.80

(b) 𝛽 = 1.0

(c) 𝛽 = 1.3

Figure 2.10 A few examples of equilibrium vortices from independent Monte Carlo simulations of
the 𝑋𝑌 model performed on 32 × 32 lattices with periodic boundary conditions. Vortex positions
are identified by computing the winding number Eq. (2.42) of each plaquette. A winding number
of +1 corresponds to a vortex, and a winding number of −1 corresponds to an antivortex. In panel
(a), an equilibrium configuration is shown for 𝛽 = 0.8, which is in the disordered phase. The
vortex-antivortex pair density is high and many are unbound. Panel (b) comes from a simulation
performed at 𝛽 = 1.0, which is not far from the critical point of the 𝑋𝑌 model. Panel (c) shows
an equilibrium configuration at 𝛽 = 1.3, which is in the quasi-ordered phase. Here the vortex-
antivortex pair density is small, and they are tightly bound.

case of a phase transition.

The phase transition in the 𝑋𝑌 model can be understood in terms of vortices [16, 38]. In the

low temperature quasi-ordered phase, vortex-antivortex pairs are tightly bound by a logarithmic

potential. As the temperature increases, they become unbound, and in the disordered phase, vortices

and antivortices behave like a plasma of free charges. This is analogous to the phenomenon of

confinement and deconfinement in QCD.

The low temperature phase (i.e. 𝛽 > 𝛽𝑐) of the 𝑋𝑌 model goes by many names including

“quasi-ordered phase”, “topological phase”, “critical phase”, “spin-wave phase”, “condensed BKT

phase”, and “vortex-antivortex phase”.

In this phase, the correlation length is infinite, and the

spin-spin correlation function decays as a power of the distance. There is no true long-range order

in this phase—thus the Mermin-Wagner theorem is satisfied—however, the system is quasi-ordered

and on a finite lattice the system is magnetized in this phase. Vortices exist as vortex-antivortex

pairs which are tightly bound by a logarithmic potential. Such tightly-bound vortex pairs are

unimportant since their effect is short-ranged. The important large-scale fluctuations in this phase

27

Figure 2.11 Here we show the allowed “clock” angles for several different values of 𝑞. In the 𝑞-state
clock model, the spin-variables sitting at lattice sites can only take the allowed values 𝜃 = 2𝜋𝑘
𝑞 with
𝑘 = 0, 1, . . . 𝑞 − 1.

are spin waves.

At the critical point 𝛽𝑐 = 1.11995(6) [39], there is an “infinite-order” BKT transition. As

the temperature is increased, bound vortex-antivortex pairs dissociate at the critical temperature—

leaving free vortices and antivortices.

The high temperature phase (i.e. 𝛽 < 𝛽𝑐) goes by several names including “disordered phase”

and “paramagnetic phase”. In this phase, the spin-spin correlation function decays exponentially

with distance and free vortices and antivortices behave like a plasma of free charges.

The BKT transition can be distinguished from first and second-order transitions by the behavior

of the specific heat. The specific heat typically diverges at first-order (see Figure 2.6) and second-

order (see Figure 2.2) phase transitions but not at a BKT transition (see Figure 2.8). However, this

check may not be sufficient. If the critical exponent 𝛼 is negative, then the specific heat plateaus

also for second-order transitions. See for example, [40]. For more on distinguishing BKT from

first- and second-order transitions, see [40, 41].

2.4 Clock Models

2.4.1

Introduction

We now look at the 𝑞-state clock model, which is also called the “planar Potts model”, the

“vector Potts model”, or the “Z𝑞 model”. The Hamiltonian energy function is the familiar

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

(cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 −

𝑁
∑︁

𝑖=1

(cid:174)ℎ · (cid:174)𝑆𝑖,

(2.44)

28

220221q=2230231232q=3240241242243q=4250251252253254q=5where the first sum is over nearest neighbor pairs, 𝐽 gives the coupling strength, (cid:174)𝑆𝑖 is a 2-component

unit vector at the site 𝑖, the second sum is over the 𝑁 lattice sites, and (cid:174)ℎ is the external magnetic

field. This is precisely, the energy function of the 𝑂 (2) or 𝑋𝑌 model, however, the spins are now

restricted to take the “clock” angles

𝜃𝑖 =

2𝜋𝑘
𝑞

,

𝑘 = 0, 1, . . . 𝑞 − 1.

(2.45)

See Figure 2.11. We parameterize the spin variables as (cid:174)𝑆𝑖 ≡ (cos 𝜃𝑖, sin 𝜃𝑖)𝑇 . Assuming a constant

and uniform magnetic field, we can choose (cid:174)ℎ = ℎ ˆ𝑥. Then the Hamiltonian simplifies to

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

cos(𝜃𝑖 − 𝜃 𝑗 ) − ℎ

𝑁
∑︁

𝑖=1

cos 𝜃𝑖.

(2.46)

For the rest of this section, we will assume 𝐽 = 1 and no external field so that ℎ = 0, in which case

𝐻 = −

∑︁

⟨𝑖, 𝑗⟩

cos(𝜃𝑖 − 𝜃 𝑗 ).

(2.47)

The 𝑞-state clock model is a rich model with connections to several other spin systems. For

𝑞 = 2, the clock angles are 𝜃𝑖 ∈ {0, 𝜋}, and so




This is equivalent to the Ising model. For 𝑞 = 3, the clock angles are 𝜃𝑖 ∈ {0, 2𝜋/3, 4𝜋/3}, and so

cos(𝜃𝑖 − 𝜃 𝑗 ) =

if 𝜃𝑖 ≠ 𝜃 𝑗

if 𝜃𝑖 = 𝜃 𝑗

(2.48)

−1

1

.

cos(𝜃𝑖 − 𝜃 𝑗 ) =

1

− 1
2





if 𝜃𝑖 = 𝜃 𝑗

.

if 𝜃𝑖 ≠ 𝜃 𝑗

(2.49)

This is equivalent to the 3-state “standard” Potts model, which has 𝑞𝑖 ∈ {0, 1, 2}.

In fact, one

can use the standard Potts model code to simulate the 3-state vector Potts model by substituting

the factor 2 → 3/2 in the energy function Eq. (2.20). The 4-state clock model with interaction

strength 𝐽 is equivalent to two decoupled Ising models with interaction strengths 𝐽/2 [42]. That

is, 𝑍4,𝐽 = 𝑍2,𝐽/2𝑍2,𝐽/2, where 𝑍4,𝐽 is the partition function of the 4-state clock model, and 𝑍2,𝐽/2 is

the partition function of the 2-state clock model (i.e. Ising model) with half the normal interaction

29

strength. Since we have algorithms that can exactly calculate the Ising lattice partition function, this

means we can also calculate it for the 4-state clock model. In each of these cases (i.e. 𝑞 = 2, 3, 4),

the clock model has a second-order phase transition. However, for 𝑞 ≥ 5, the clock model has a

pair of BKT transitions. In the limit 𝑞 → ∞, the clock model becomes the 𝑋𝑌 model with a single

BKT transition at finite 𝛽.

The clock model has been studied using various approaches including analytical [25,43], Monte

Carlo [44–48], and tensor methods [49–51].

The canonical partition function takes the usual form

𝑍 =

∑︁

𝑒−𝛽𝐻𝑖 ,

𝑖

(2.50)

where the sum is over lattice configurations.

2.4.2 Thermodynamics and Observables

The results in this section were produced using MCMC with a heatbath updating algorithm.

The MCMC code was adapted from publicly available Fortran code developed by Bernd Berg [5].

As usual, the internal energy is the expectation value of the energy function Eq. (2.47) measured

on a sample of equilibrium lattice configurations

𝐸 = −

𝜕
𝜕 𝛽

ln 𝑍 = ⟨𝐻⟩ = −

(cid:43)

cos(𝜃𝑖 − 𝜃 𝑗 )

.

(cid:42)

∑︁

⟨𝑖, 𝑗⟩

(2.51)

At 𝛽 = 0, the system is completely disordered, and the angles allowed by Eq. (2.45) are equally

likely. Due to the Z𝑞 symmetry of these angles,

𝐸
𝑁

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝛽=0

= 0.

(2.52)

The minimum energy density, which occurs when 𝛽 → ∞ and all spins are aligned, is easily

calculated to be

𝐸
𝑁

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝛽→∞

= −𝑑,

(2.53)

where 𝑑 is the dimension of the system. Unless otherwise noted, all results in this dissertation are

for the models in 𝑑 = 2 dimensions. The energy in the large-𝛽 regime can be approximated using a

30

Figure 2.12 Here we show several thermodynamic quantities versus 𝛽 for the 𝑞 = 3, 4, 5 state clock
models at different lattice sizes. In the top row, we show the energy density, and in the second row
the specific heat. In the third row, we show the magnetization per lattice site, and in the bottom
row, we show the magnetic susceptibility. Note that the 2-state clock model is just the Ising model,
with results shown in Figure 2.2.

31

2.001.751.501.251.000.750.500.250.00E/Nq=34×48×816×1632×3264×642.001.751.501.251.000.750.500.250.00q=42.001.751.501.251.000.750.500.250.00q=5012345678C012340.000.250.500.751.001.251.501.750.00.20.40.60.81.0M/N0.00.20.40.60.81.00.00.20.40.60.81.00.000.250.500.751.001.251.50020406080100M0.000.250.500.751.001.251.500102030405060700.00.51.01.52.001020304050low-temperature expansion. See Appendix A.4. In the top row of Figure 2.12, we show the energy

density for the 𝑞 = 3, 4, 5 clock models on different lattice volumes.

As usual, the specific heat is defined as

𝐶 = −

𝛽2
𝑁

𝑑𝐸
𝑑𝛽

𝛽2
𝑁

=

(cid:16)

⟨𝐸 2⟩ − ⟨𝐸⟩2(cid:17)

.

(2.54)

In the second row of Figure 2.12, we show the specific heat for the 𝑞 = 3, 4, 5 clock models on

different lattice volumes.

We define the magnetization vector as [40, 46–48]

(cid:174)𝑀 = (cid:0)𝑀𝑥, 𝑀𝑦(cid:1) =

(cid:32) 𝑁
∑︁

𝑖=1

cos 𝜃𝑖,

(cid:33)

sin 𝜃𝑖

=

𝑁
∑︁

𝑖=1

𝑁
∑︁

𝑖=1

(cid:174)𝑆𝑖,

(2.55)

where 𝑁 is the number of sites and (cid:174)𝑆𝑖 is the spin-vector at site 𝑖. Formally, the magnetization is

⟨ (cid:174)𝑀⟩ =

1
𝛽

𝜕
𝜕 (cid:174)ℎ

ln 𝑍 =

(cid:42) 𝑁
∑︁

(cid:43)

(cid:174)𝑆𝑖

.

𝑖=1

(2.56)

However, as noted in the previous models studied, the formal spontaneous magnetization is unin-

teresting in MCMC simulations when there is no external field or explicit symmetry breaking. So

instead, we use the magnitude of the magnetization vector

𝑀 = | (cid:174)𝑀 |,

(2.57)

as a proxy magnetization. Often it is useful to work with the magnetization per spin 𝑀/𝑁.

Quantities that are often used as order parameters include ⟨𝑀⟩ [40, 45, 50, 52] and ⟨𝑀𝑥⟩ or ⟨𝑀𝑦⟩

or even ⟨𝑀 2⟩ = ⟨𝑀 2

𝑥 ⟩ + ⟨𝑀 2

𝑦 ⟩ [47]. In the third row of Figure 2.12, we show the magnetization

per site for the 𝑞 = 3, 4, 5 clock models on different lattice volumes. Analytically, we know that

⟨𝑀⟩ = 0 for 𝛽 = 0. However, in an MCMC lattice simulation, when computing an average like

⟨| (cid:174)𝑀 |⟩ from a sample of lattice configurations, one is always adding a (small) positive number, and

so in practice one sees ⟨| (cid:174)𝑀 |⟩| 𝛽=0 = 0 only in the infinite volume limit.

One can also define a complex magnetization as in [44, 53, 54]

𝑀𝐶 =

∑︁

𝑖

𝑒𝑖𝜃𝑖 = |𝑀𝐶 |𝑒𝑖𝜓.

32

(2.58)

The magnitude |𝑀𝐶 | then serves as an order parameter. The vector magnetization (cid:174)𝑀 = 𝑀𝑥 ˆ𝑥 + 𝑀𝑦 ˆ𝑦

given by Eq. (2.55) and the complex magnetization 𝑀𝐶 = 𝑀𝑥 + 𝑖𝑀𝑦 given by Eq. (2.58) are really

the same 2-component objects. One is written as a real 2D vector, and the other is written as a

complex number. In fact, Eq. (2.57) and |𝑀𝐶 | are exactly the same thing.

In the 𝑞-state clock model with 𝑞 ≥ 5 there are two BKT phase transitions. The order parameter

Eq. (2.57) can readily detect4 the small-𝛽 transition between the disordered and quasi-ordered

phases, but it is unable to detect the large-𝛽 transition between the quasi-ordered and ordered

phases. To detect the large-𝛽 transition, a “rotated magnetization” [44, 45, 53–55]

𝑚𝜓 = cos(𝑞𝜓).

is sometimes used. Here, 𝜓 = tan−1(𝑀𝑦/𝑀𝑥).

One can also look at the “population” as in [54]

𝑆 =

𝑞
𝑞 − 1

(cid:20) 1
𝑁

max (cid:8)𝑛0, 𝑛1, . . . , 𝑛𝑞−1(cid:9) −

(cid:21)

,

1
𝑞

(2.59)

(2.60)

where 𝑁 is the total number of lattice sites, and 𝑛𝑘 is the number of sites with spin angle 𝜃 𝑘 . If

the system is disordered, then max{𝑛0, 𝑛1, . . . , 𝑛𝑞−1} = 𝑁/𝑞 and 𝑆 = 0. On the other hand, if

the system is ordered with all 𝑁 spin variables having the same angle, then max𝑘=0,𝑞−1(𝑛𝑘 ) = 𝑁

and 𝑆 = 1. This order parameter is more sensitive to the lower temperature transition, i.e.

its

susceptibility shows a pronounced peak at the lower temperature transition between the ordered

and quasi-ordered phases.

For the magnetic susceptibility, we take the usual definition

𝜒𝑀 =

1
𝑁

(cid:16)

⟨𝑀 2⟩ − ⟨𝑀⟩2(cid:17)

(2.61)

Additional notes on magnetic susceptibility can be found in Appendix A.2.

2.4.3 Topological Defects: Domain Walls and Vortices

In the Ising model there are domain walls but no vortices because spin variables in the Ising

model are restricted to point in one of two opposite directions.

In the 𝑋𝑌 model, there are

4Via e.g. Binder cumulants, which will be discussed in a later section

33

(a) 𝛽 = 0.60

(b) 𝛽 = 1.0

(c) 𝛽 = 1.3

Figure 2.13 A few examples of equilibrium vortices from independent Monte Carlo simulations of
the 5-state clock model performed on 32 × 32 lattices with periodic boundary conditions. Vortex
positions are identified by computing the winding number Eq. (2.42) of each plaquette. A winding
number of +1 corresponds to a vortex, and a winding number of −1 corresponds to an antivortex. In
panel (a), an equilibrium configuration is shown for 𝛽 = 0.6, which is in the disordered phase. The
vortex-antivortex pair density is high and many are unbound. Panel (b) comes from a simulation
performed at 𝛽 = 1.0, which is close to the critical region of the 5-state clock model. Panel (c)
shows an equilibrium configuration at 𝛽 = 1.3, which is in the quasi-ordered phase. Here the vortex
pair density is very low. In this example we see no vortices.

vortices but no true domain walls since the spin degrees of freedom are continuous. In the clock

model, at least for sufficiently large 𝑞, we see both domain walls and vortices. Here, the low-

temperature physics is dominated by domain wall excitations, whereas the vortices are relevant in

the intermediate and high temperature phases [25].

There seems to be some confusion in the literature regarding vortices in the 𝑞-state clock model.

Some authors [25] claim that vortices appear only for 𝑞 ≥ 5, with the implication that they play a

very fundamental role in the BKT transition. However, the definition of a vortex clearly allows for

vortices also in the 𝑞 = 4 state clock model which does not have a BKT transition. For example,

one can construct a vortex in the 𝑞 = 4 state clock model by having the (ordered) spins around a

plaquette having the angles 0, 𝜋/2, 𝜋, 3𝜋/2.

Further discussion on topological defects can be found in [45, 47, 52, 56]. Discussion of

dynamics can be found in [45, 47, 56, 57].

34

2.4.4 Phase Diagram

The clock model interpolates between the Ising model (𝑞 = 2), which has a second-order phase

transition and the 𝑋𝑌 model (𝑞 → ∞), which has a BKT transition.

For 𝑞 = 2, 3, 4, the clock model has a second-order phase transition—so-called because the

second derivatives of the free energy (e.g. specific heat) are divergent while the first derivatives

(e.g. internal energy) are continuous at the critical point. At the critical point 𝛽𝑐, the correlation

length, specific heat, and susceptibilities become infinite (in the infinite-volume limit). We can

see this in Figure 2.12, which shows several thermodynamic functions versus 𝛽 for different lattice

sizes. The internal energy and magnetization remain continuous, but the specific heat and magnetic

susceptibility diverge with volume near the critical point. The critical point separates the model

into two phases. The high temperature paramagnetic phase (i.e. 𝛽 < 𝛽𝑐) is disordered, i.e.

the

magnetization order parameter 𝑀 is zero above the critical temperature. As the system is cooled

from the high temperature phase it passes through a second-order phase transition at the critical

point 𝛽 = 𝛽𝑐. Here the symmetry is spontaneously broken and the system chooses a magnetization

direction. That is, the order parameter 𝑀 becomes nonzero below the critical temperature. The low

temperature ferromagnetic phase (i.e. 𝛽 > 𝛽𝑐) features long-range order in the form of spontaneous

magnetization.

For 𝑞 ≥ 5, there are two Berezinskii-Kosterlitz-Thouless (BKT) transitions. The BKT transition

is sometimes referred to as an “infinite-order” phase transition. The free energy has an essential

singularity at 𝛽 = 𝛽𝑐, however, it is smooth and infinitely differentiable. The two BKT transitions,

which occur at the critical points 𝛽𝑐1 < 𝛽𝑐2, separate the model into three phases. The low

temperature (i.e. 𝛽 > 𝛽𝑐2) features long-range order i.e. magnetization. At 𝛽 = 𝛽𝑐2, there is a

phase transition believed to be of the BKT type. As 𝑞 → ∞, this critical point moves 𝛽𝑐2 → ∞.

Hence the 𝑋𝑌 model, which corresponds to 𝑞 = ∞, does not have this transition. The intermediate

temperature phase (i.e. 𝛽𝑐1 < 𝛽 < 𝛽𝑐2) goes by many names including “quasi-ordered phase”,

“quasi-liquid phase”, “topological phase”, “critical phase”, “spin-wave phase”, “condensed BKT

phase”, and “vortex-antivortex phase”. In this phase, the correlation length is infinite, and the spin-

35

Figure 2.14 The phase diagram of the 𝑞-state clock model in two dimensions. The black points
indicate critical points associated with second order phase transitions, and red points indicate
critical temperatures associated with BKT phase transitions.

spin correlation function decays as a power of the distance. There is no true long-range order in this

phase, however, the system is quasi-ordered and on a finite lattice, it is magnetized in this phase.

This phase is analogous to the low temperature phase of the 𝑋𝑌 model. The high temperature

paramagnetic phase (i.e. 𝛽 < 𝛽𝑐1) is disordered and correlations decay exponentially. This phase

is analogous to the high temperature phase of the 𝑋𝑌 model.

The phase diagram of the 𝑞-state clock model is shown in Figure 2.14. The critical points and

some critical exponents are tabulated in Table 2.2.

36

q23456βZqorderedphase2ndordertransitionBKTtransitionCriticalphaseTable 2.2 Here we show (for reference) the critical exponents for some 𝑞-state clock models. For
𝑞 = 2, this is the Ising model. For 𝑞 = 3, the clock model is in the same universality class as
the 3-state Potts model. For 𝑞 = 4, it has been shown [58] that the clock model is in the Ising
universality class. The final column gives the exact infinite-volume critical point for these three
cases. The Ising and 3-state Potts models are two well-studied universality classes, and their exact
critical exponents can be found in textbooks such as [5]. The critical point of the Ising Model can
similarly be found in textbooks such as [5]. The exact critical points for 𝑞 = 3, 4 and the estimated
critical points for the 𝑞 = 5, 6 clock models is taken from [51] (Supplemental Material). The
estimated critical point for the 𝑋𝑌 model (i.e. 𝑞 = ∞) is taken from [39].

√

√

𝛽𝑐
1
2 ln(1 +
2
3 ln(1 +
√
ln(1 +

2)
3)
2)

1.0503(2), 1.1039(2)

1.0957(2), 1.4491(8)
...
1.11995(6)

𝛼

0

1/3

0

...

𝑞

2

3

4

5

6
...
∞

𝛽

1/8

1/9

1/8

𝛾

7/4

13/9

7/4

𝛿

15

14

15

𝜂

1/4

4/15

1/4

𝜈

1

5/6

1

...

37

CHAPTER 3

MCMC SIMULATIONS AND THEIR ANALYSIS

3.1 Markov Chain Monte Carlo

The canonical ensemble in statistical mechanics is the ensemble that describes the distribution

of the possible (micro)states of a mechanical system in thermal equilibrium at a fixed temperature,

a fixed volume, and with a fixed number of degrees of freedom. The internal energy 𝐸 of such a

system can vary with the microstate. The probability 𝑃(𝑈) for the system to be in a microstate 𝑈

with energy 𝐸 (𝑈) is given by the Boltzmann distribution

𝑃(𝑈) =

𝑒−𝛽𝐸 (𝑈)
𝑍

,

(3.1)

where 𝛽 = 1/(𝑘 𝐵𝑇) is the inverse temperature with 𝑘 𝐵 being Boltzmann’s constant and 𝑇 being the

temperature. The partition function of the system has the form

𝑍 =

∑︁

𝑈

𝑒−𝛽𝐸 (𝑈),

(3.2)

where the sum is over all possible states of the system. Then the equilibrium expectation value of

an observable 𝑂 is

⟨𝑂⟩ =

∑︁

𝑈

𝑂 (𝑈)𝑃(𝑈) =

(cid:205)𝑈 𝑂 (𝑈)𝑒−𝛽𝐸 (𝑈)
(cid:205)𝑈 𝑒−𝛽𝐸 (𝑈)

.

(3.3)

More on the canonical ensemble can be found in standard textbooks such as [59].

In principle, one can compute the expectation value ⟨𝑂⟩ by enumerating all possible microstates

of the system, computing the internal energy 𝐸 (𝑈) and the observable 𝑂 (𝑈) for each microstate 𝑈,

and then computing Eq. (3.3). In practice, this approach is infeasible for even small systems. Con-

sider the Ising model on a 10 × 10 lattice. There are 100 sites each containing a spin variable which

can be in one of two possible states—so the number of possible microstates is 2100 (which is larger

than 1030). Computing this using a naive approach would take the current fastest supercomputer

millions of years1.

1With 2100 microstates and assuming the calculation of 𝑂𝑖 and 𝑃𝑖 takes 2 × 100 floating point operations per
100-site microstate, this is ∼ 1032 floating point operations. The current fastest supercomputer—Frontier—has an
Rmax value of 1102 PetaFLOPS. So total computation time is 1032 FLOP / 1102 petaFLOPS ∼ 2.9 million years.

38

Instead of enumerating all states, one could estimate Eq. (3.3) by taking the mean of a randomly

selected sample of the microstates. This would be an example of a “Monte Carlo” method. However,

a relatively tiny fraction of states actually contributes meaningfully to the sum, so if one were to

select the microstates uniformly, one would still need to sample an infeasibly large number of states

to get an accurate average. The key is to use “importance sampling” where the microstates are

selected according to their underlying probability distribution given in Eq. (3.1). If 𝑁 microstates

are sampled according to this distribution, then an approximation of Eq. (3.3) is given by the average

of the sample

⟨𝑂⟩ ≈

1
𝑁

𝑁
∑︁

𝑖=1

𝑂 (𝑈𝑖),

(3.4)

and the statistical error of the approximation goes as 1/

√

𝑁, meaning that only a “small” number

of microstates need to be sampled to get a good estimate. The question becomes how to sample

(or more accurately how to generate) microstates, which in our case will be lattice configurations,

according to the probability distribution Eq. (3.1). This is where we come to Markov chain Monte

Carlo (MCMC).

The idea is to start with some arbitrary configuration 𝑈0 and construct a chain of configurations,

where each configuration can be generated from the previous configuration

𝑈0 → 𝑈1 → 𝑈2 → · · · ,

(3.5)

such that this Markov chain eventually reaches an equilibrium distribution that matches our desired

probability distribution Eq. (3.1). After discarding some number of the initial configurations to

ensure equilibration, we can estimate our observables using Eq. (3.4). An obvious complication

is that the configurations in such a chain will be correlated with previous configurations. We will

discuss in Section 3.3 how to handle these autocorrelations. Another problem that can occur is

that of the Markov chain becoming stuck in one sector of configuration space and failing to sample

the whole space. This is analogous to a minimizer becoming stuck in a local minimum instead of

finding the global minimum. An example of this is discussed in Sec. 4.3.7.

39

Given a configuration 𝑈𝑡 in the Markov chain time series, some updating algorithm (several

will be discussed in Section 3.2) produces the next configuration 𝑈𝑡+1. Typically, this is done by

making some small change, e.g. flipping a single spin in the lattice.

Let 𝑇 (𝑈′|𝑈) be the conditional transition probability for the updating algorithm to take us from

some configuration 𝑈 to a configuration 𝑈′. Since it is a probability, 0 ≤ 𝑇 (𝑈′|𝑈) ≤ 1, and we have

the normalization condition (cid:205)𝑈′ 𝑇 (𝑈′|𝑈) = 1. We require that every possible configuration in our

distribution is reachable by our Markov chain in a finite number of steps. This is the requirement

of ergodicity. We need our Markov process to eventually reach the equilibrium distribution 𝑃(𝑈)

and then stay there, i.e. we require that there be no sources or sinks of probability. A sufficient

condition, known as the detailed balance condition, is

𝑇 (𝑈′|𝑈)𝑃(𝑈) = 𝑇 (𝑈|𝑈′)𝑃(𝑈′),

(3.6)

for every pair of configurations 𝑈 and 𝑈′. For a detailed introduction to Markov chain Monte Carlo,

see for example [5, 60].

3.2 Updating Algorithms

In this section, we discuss several algorithms that can be used to generate the next lattice

configuration 𝑈𝑡+1 from the current configuration 𝑈𝑡 in a Markov chain. Our examples will be

classical spin systems, but the same algorithms can be used for quantum field theories in the path

integral representation on the lattice.

3.2.1 Metropolis

The Metropolis updating algorithm [61] is one of the simplest MCMC updating algorithms. In

most modern applications it has been replaced with more efficient methods, however, it remains a

popular choice due to its simplicity and generality. It is a simple-to-implement updating algorithm

that works whenever one is able to compute the energy 𝐸 (𝑈) of the state 𝑈.

The detailed balance equation is

𝑇 (𝑈′|𝑈)𝑒−𝛽𝐸 (𝑈) = 𝑇 (𝑈|𝑈′)𝑒−𝛽𝐸 (𝑈′).

(3.7)

40

We can rearrange this as

𝑇 (𝑈′|𝑈)
𝑇 (𝑈|𝑈′)

= 𝑒−𝛽Δ𝐸 ,

(3.8)

where Δ𝐸 = 𝐸 (𝑈′) − 𝐸 (𝑈) is the internal energy difference between a candidate configuration 𝑈′

and the old configuration 𝑈.

The Metropolis updating algorithm, in its simplest form, proceeds as follows:

1. Given the current configuration 𝑈𝑡, generate a candidate configuration 𝑈′ by some random

process that selects uniformly on the group. For example, let 𝑈′ be 𝑈𝑡, but with a single spin

flipped to a random value from a uniform distribution

2. Accept this candidate as the new configuration (i.e. let 𝑈𝑡+1 = 𝑈′) with probability

𝑇 (𝑈′|𝑈𝑡) = min (cid:16)1, 𝑒−𝛽Δ𝐸 (cid:17)

.

(3.9)

Otherwise, keep the old configuration (i.e. let 𝑈𝑡+1 = 𝑈𝑡)

3. Repeat these steps

It is easy to show that detailed balance is satisfied.

𝑇 (𝑈′|𝑈)𝑒−𝛽𝐸 (𝑈) = min (cid:16)1, 𝑒−𝛽Δ𝐸 (cid:17)

𝑒−𝛽𝐸 (𝑈)

(cid:18)

𝑒−𝛽𝐸 (𝑈),

= min

𝑒−𝛽𝐸 (𝑈′)
𝑒−𝛽𝐸 (𝑈)

(cid:19)

𝑒−𝛽𝐸 (𝑈)

= min

(cid:18) 𝑒−𝛽𝐸 (𝑈)
𝑒−𝛽𝐸 (𝑈′)

(cid:19)

, 1

𝑒−𝛽𝐸 (𝑈′)

= 𝑇 (𝑈|𝑈′)𝑒−𝛽𝐸 (𝑈′).

(3.10)

All of the spin systems that we have considered in this thesis have been “local” in the sense

that if the spin variable at a single site is changed, then the energy change Δ𝐸 can be computed

by looking only at the stencil composed of that spin and its nearest neighbors. As a concrete

example, let us consider the Ising model with energy function Eq. (2.2). We will assume 𝐽 = 1 and

ℎ = 0, so the energy 𝐸 (𝑈) of a configuration is simply − (cid:205)

⟨𝑖, 𝑗⟩ 𝑆𝑖𝑆 𝑗 where the sum is over nearest
neighbor pairs, and the spin variables 𝑆𝑖 at a lattice site 𝑖 can take the value +1 or −1. The number

41

of terms in the sum 𝐸 (𝑈) = − (cid:205)

⟨𝑖, 𝑗⟩ 𝑆𝑖𝑆 𝑗 is 𝑁 𝑑 where 𝑁 is the number of lattice sites and 𝑑 is
the number of dimensions. Any specific site appears in 2𝑑 of these terms, which can be grouped

together and called a “stencil”. For example, in two dimensions, any given site variable 𝑆𝑘 appears

in 4 terms which we can denote: 𝑆𝑘 𝑆𝑟 + 𝑆𝑘 𝑆𝑙 + 𝑆𝑘 𝑆𝑢 + 𝑆𝑘 𝑆𝑑 where, for example, 𝑆𝑟 refers to the

nearest neighbor to the right. Respectively, 𝑆𝑙, 𝑆𝑢, and 𝑆𝑑 are the nearest neighbors to the left and

to the up and down directions. Thus, when a single spin 𝑆𝑘 at the site 𝑘 is changed to 𝑆′

𝑘 in the

two-dimensional Ising model with no external field, then the energy change is

Δ𝐸 = 𝐸 (𝑈′) − 𝐸 (𝑈) = (𝑆𝑘 − 𝑆′

𝑘 ) (𝑆𝑟 + 𝑆𝑙 + 𝑆𝑢 + 𝑆𝑑).

(3.11)

All other terms cancel out. For the other classical spin models we’ve considered, Δ𝐸 can be

computed similarly. The only change is the energy function 𝐸 (𝑈). Keep in mind, that the number

of terms in the stencil depends on the number of dimensions one is operating in.

A Metropolis “sweep” occurs when all the sites in the lattice have (on average) had a chance of

being updated once. The order in which lattice sites are visited and updated is a matter of choice—a

common choice is to sequentially update all of the sites. Pseudocode for a sequential Metropolis

sweep for the Ising model is as follows:

for SITE in LATTICE do

# Get the current value of the spin variable
OLDSPIN = get_site_variable(SITE)

# Get a random candidate spin variable
NEWSPIN = random_spin_value()

# Compute the change in energy if NEWSPIN were accepted
DELTAE = 0
for DIM in NDIMENSIONS do # Computing stencil

DELTAE += get_site_variable(SITE + 1 unit in forward dir. in this DIM)
DELTAE += get_site_variable(SITE + 1 unit in backward dir. in this DIM)

DELTAE *= (OLDSPIN - NEWSPIN)

# Accept or reject the candidate
R = random_float[0,1)
if R <= exp(-BETA*DELTAE) then # Accepting the candidate

set_site_variable(SITE) = NEWSPIN

42

# If rejected, the site variable remains OLDSPIN

For other classical spin models one would only need to change the Δ𝐸 calculation.

To sample an equilibrium distribution, an MCMC updating algorithm samples states around

the minimum of the energy function.

In the Metropolis updating scheme, a candidate for the

next configuration in the Markov chain is generated by randomly flipping a spin in the current

configuration.

If the candidate has a lower internal energy, then it is accepted unconditionally.

Otherwise, it is accepted probabilistically. This approach makes only small changes at a time—a

single spin flip per updating step—resulting in a rather slow advancement through the space of

possible configurations.

Depending on the spin system and the temperature regime, the Metropolis acceptance rate can

become very low. Whenever a candidate is rejected, the current configuration is repeated in the

Markov chain. Hence, a low acceptance rate implies a long autocorrelation time, and this means the

MCMC simulation must be run for a longer time in order to get a statistically-independent sample

of configurations. In the next section, we look at an updating algorithm that always has a 100%

acceptance rate.

3.2.2 Heatbath

The heatbath algorithm effectively combines steps 1 and 2 of the Metropolis algorithm to more

directly produce configurations distributed according to the desired probability distribution. Instead

of randomly flipping a spin and seeing if the energy is reduced, the heatbath algorithm looks at all

possible values for that spin and flips it probabilistically to the value that locally (considering it and

its nearest neighbors) minimizes the energy. The result is a much faster advancement through state

space, that is, one gets a Markov chain with a much smaller autocorrelation time, which means one

can generate the same effective statistics with a much shorter simulation.

The heatbath updating algorithm can be used when it’s possible to efficiently invert the cumu-

lative distribution function (CDF). In some cases, inverting the CDF may be unacceptably slow or

it may be a priori unknown, and then one has to rely on other updating algorithms.

For an equilibrium MCMC simulation, the probability to be in the configuration 𝑈 is given

43

by the probability density function (PDF) 𝑃(𝑈) = 𝐶′𝑒−𝛽𝐸 (𝑈) where 𝛽 is the inverse temperature,

𝐸 (𝑈) is the internal energy of configuration 𝑈, and 𝐶′ is determined by the normalization condition

(cid:205)𝑈 𝑃(𝑈) = 1. The normalization constant 𝐶′ is the reciprocal of the partition function, but in

practice, this quantity is not known during an MCMC simulation.

During an updating sweep, one site at a time is updated, so we can consider things at the level

of an individual site rather than an entire configuration. Suppose the spin system being considered

has site variables which can be in 𝑞 discrete states labeled by 𝑘. For example, for the Ising model,

𝑞 = 2 with site variable 𝑆𝑘 = ±1. For the 𝑞-state Potts models, the site variable takes the values

𝑆𝑘 ∈ {0, 1, 2, . . . , 𝑞 − 1}. For the 𝑞-state clock models in two dimensions, one can parameterize the

vector site variables by a single angle 𝜃 𝑘 = 2𝜋𝑘/𝑞 with 𝑘 ∈ {0, 1, 2, . . . , 𝑞 − 1}. For an equilibrium

configuration, the probability for a particular spin variable to be in the state 𝑘 is

𝑃(𝑘) = 𝐶𝑒−𝛽𝐸𝑘 ,

where the constant 𝐶 is determined by the normalization condition

𝑞−1
∑︁

𝑘=0

𝑃(𝑘) = 1 =⇒ 𝐶 =

1
(cid:205)𝑞−1
𝑘=0 𝑒−𝛽𝐸𝑘

.

(3.12)

(3.13)

The energy 𝐸𝑘 is the interaction energy of the particular spin variable with its nearest neighbors.

For example, for the Ising (see Eq. (2.2)), Potts (see Eq. (2.20)), and clock models (see Eq. (2.47)),

we have

𝐸 𝐼𝑠𝑖𝑛𝑔
𝑘

= −𝐽

𝑛𝑑𝑖𝑚𝑠
∑︁

∑︁

𝑆𝑘 𝑆𝜇 𝑗

𝜇=1
𝑛𝑑𝑖𝑚𝑠
∑︁

𝐸 𝑃𝑜𝑡𝑡𝑠
𝑘

= −2

𝑗=±

∑︁

𝑗=±

𝛿𝑆𝑘,𝑆𝜇 𝑗

𝐸𝐶𝑙𝑜𝑐𝑘
𝑘

= −

𝜇=1
𝑛𝑑𝑖𝑚𝑠
∑︁

∑︁

𝜇=1

𝑗=±

(cid:174)𝑆𝑘 · (cid:174)𝑆𝜇 𝑗 = −

𝑛𝑑𝑖𝑚𝑠
∑︁

∑︁

𝜇=1

𝑗=±

cos(𝜃 𝑘 − 𝜃 𝜇 𝑗 ),

(3.14)

(3.15)

(3.16)

where the first sum is over the number of dimensions, the second sum is over the forward and

backward directions, and 𝑆𝜇 𝑗 is the nearest neighbor site variable in the 𝜇 𝑗 direction. The other

spin variables contribute of course to the total energy of the lattice, but when we are considering a

44

single spin, the contributions from the rest of the lattice can be absorbed into the overall constant

𝐶. The CDF associated with Eq. (3.12) is

𝑧 = 𝐹 (𝑘) =

𝑘
∑︁

𝑗=0

𝑃( 𝑗).

The normalization condition implies that

𝐹 (𝑞 − 1) = 1.

(3.17)

(3.18)

The heatbath update selects 𝑘 distributed according to 𝑃(𝑘) by converting a uniformly distributed

random number 0 ≤ 𝑧 < 1 into

𝑘 = 𝐹−1(𝑧).

(3.19)

This is the value that our particular site variable should be set to.

To perform a sequential heatbath sweep, loop over all sites in the lattice. For each site:

1. Loop over the 𝑞 possible values of the spin variable, and for each one, calculate what would

be the local contribution 𝐸𝑘 to the energy given the fixed values of the neighboring spins

2. Calculate the CDF Eq. (3.17). It can be normalized using the condition Eq. (3.18)

3. Choose a random number 𝑧 uniformly in [0, 1)

4. Set the value 𝑆𝑘 of the spin variable using the smallest 𝑘 for which 𝐹 (𝑘) > 𝑧

With this procedure, one is setting the spin value to 𝑆𝑘 with probability 𝑃(𝑘) by effectively inverting

the CDF 𝐹 (𝑘).

Here is pseudocode for performing a sequential heatbath sweep:

# Cumulative distribution ’function’
CUMULDIST[-1] = 0

# Loop over all sites in the lattice
for SITE in LATTICE do

# For this site, calculate the energy contribution
# for each possible value of the site variable

45

for K = 0 to Q-1:

EK = 0

# Computing stencil
for DIM in NDIMENSIONS do

EK += ...MODEL DEPENDENT CALCULATION...

# Calculate the cumulative distribution function
CUMULDIST[K] = CUMULDIST[K-1] + exp(-BETA*EK)

# Set the value of the spin variable according to the probability distribution
R = random_float[0,1)
for K = 0 to Q-2:

P = CUMULDIST[K]/CUMULDIST[Q-1] # Normalized cum. dist.
if R < P:

set_site_variable(SITE) = set site variable to value K

# Otherwise
if(not yet set)

set_site_variable(SITE) = set site variable to value Q-1

In the heatbath algorithm, for each site update, one has to compute the CDF as a function of

the neighboring sites. For a lattice with 𝑁 sites, this procedure must be done 𝑁 times per “sweep”.

The new site variable is then chosen by effectively inverting the CDF. For discrete models such

as the Ising, Potts, and clock models—whose site variables take only 𝑞 discrete values—one can

efficiently compute and store the CDF. For a continuous model, such as the 𝑋𝑌 model—whose

site variables take continuous values 0 ≤ 𝜃 < 2𝜋—it is not possible to compute the exact CDF. It

can still be approximated to any desired precision by discretizing it, however, this approach is not

efficient since it would have to be repeated 𝑁 times per heatbath sweep.

3.2.3 Biased Metropolis Heatbath Algorithm (BMHA)

The Biased Metropolis Heatbath Algorithm (BMHA) [62–65] is the solution to the problem

of applying the heatbath algorithm to models with continuous degrees of freedom.

Instead of

calculating a CDF specific to each site update, one makes a discrete approximation of the general

CDF and stores it as a lookup table. To keep memory access efficient, this must be a rather poor

approximation of the CDF, however, the discretization error can be completely removed by adding a

Metropolis accept/reject step. In a very real sense, the BMHA interpolates between the Metropolis

46

and heatbath algorithms. Improving the CDF approximation in BMHA improves the acceptance

rate but also increases the table size which slows the lookup procedures. There is a balance to be

achieved. In practice, one can achieve efficient BMHA algorithms with acceptance rates ∼ 90%

even in regimes where Metropolis has very low acceptance.

At its simplest, we want to sample random variables 𝑥 ∈ [𝑥1, 𝑥2] from some PDF 𝑃(𝑥) with

associated CDF 𝐹 (𝑥). We do this via a Markov chain where each 𝑥𝑛𝑒𝑤 is related to the previously

sampled random variable 𝑥𝑜𝑙𝑑. The Metropolis algorithm does this as follows:

1. Uniformly generate 𝑥𝑛𝑒𝑤 ∈ [𝑥1, 𝑥2]

2. Accept with probability

𝑃𝑚𝑒𝑡𝑟 = min

(cid:18)

1,

𝑃(𝑥𝑛𝑒𝑤)
𝑃(𝑥𝑜𝑙𝑑)

(cid:19)

.

(3.20)

In contrast, the BMHA proceeds as follows (we use the notation of [65]):

1. Discretize 𝑥 into 𝑛𝑣𝑎𝑟 bins: 𝑥1 = 𝑥0 < 𝑥1 < · · · < 𝑥𝑛𝑣𝑎𝑟 = 𝑥2, where the bin widths

are Δ𝑥 𝑗 = 𝑥 𝑗 − 𝑥 𝑗−1 with 𝑗 = 1, . . . , 𝑛𝑣𝑎𝑟. The Δ𝑥 𝑗 are chosen nonuniformly such that

𝑥 𝑗 = 𝐹−1( 𝑗/𝑛𝑣𝑎𝑟)

2. Propose 𝑥𝑛𝑒𝑤 by uniformly picking a bin 𝑗𝑛𝑒𝑤 and proposing 𝑥𝑛𝑒𝑤 uniformly within that bin

3. Locate the bin 𝑗𝑜𝑙𝑑 to which 𝑥𝑜𝑙𝑑 belongs

4. Accept 𝑥𝑛𝑒𝑤 with probability

𝑃𝑏𝑚ℎ𝑎 = min

(cid:18)

1,

𝑃(𝑥𝑛𝑒𝑤)
𝑃(𝑥𝑜𝑙𝑑)

Δ𝑥 𝑗𝑛𝑒𝑤
Δ𝑥 𝑗𝑜𝑙𝑑

(cid:19)

.

(3.21)

In the limit 𝑛𝑣𝑎𝑟 → ∞, 𝑃𝑏𝑚ℎ𝑎 → 1, i.e. we recover the 100% acceptance rate of the heatbath

algorithm. Crucially, this modification of the Metropolis algorithm influences only the acceptance

rate—the same detailed balance conditions are satisfied as by the Metropolis algorithm. Note that

in an MCMC simulation Step 1.

is done only once at the start of the simulation and saved as a

lookup table.

47

In a slightly more complicated example, suppose we want to sample random variables 𝑥 ∈

[𝑥1, 𝑥2] from a PDF 𝑃(𝑥; 𝛼) that includes a parameter 𝛼 ∈ [𝛼1, 𝛼2], which depends on the

environment of the site being updated. In that case, one can uniformly discretize the parameter

𝛼1 ≤ 𝛼 ≤ 𝛼2 with 𝑛𝑝𝑎𝑟 bins. Within each of these bins, the variable 𝑥1 ≤ 𝑥 ≤ 𝑥2 is discretized into

𝑛𝑣𝑎𝑟 bins as in Step 1. of the BMHA. Each parameter bin will contain a different discretization of

the variable 𝑥, since the bin widths for the variable depend on the inverse CDF which itself depends

on the parameter 𝛼 through the PDF 𝑃(𝑥; 𝛼).

To make this more concrete, we consider the 𝑂 (2) or 𝑋𝑌 model as a specific example. This

is a “local” Hamiltonian, so we can factor any particular site (cid:174)𝑆𝑖 out of the interaction term. For

example, for the 𝑂 (2) model on a lattice with 𝑁 sites and no external field,

𝐻 = −

∑︁

⟨𝑖, 𝑗⟩

(cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 = −

𝑁
∑︁

𝑖=1

(cid:174)𝑆𝑖 ·

𝑛𝑑𝑖𝑚
∑︁

𝜇=1

(cid:174)𝑆𝑖+ ˆ𝜇 = −

1
2

𝑁
∑︁

𝑖=1

(cid:174)𝑆𝑖 · (cid:174)𝑆𝑖,+,

where

(cid:174)𝑆𝑖,+ =

𝑛𝑑𝑖𝑚
∑︁

𝜇=1

(cid:16) (cid:174)𝑆𝑖− ˆ𝜇 + (cid:174)𝑆𝑖+ ˆ𝜇

(cid:17)

,

(3.22)

(3.23)

is the “stencil” (i.e. the sum of the nearest neighbors) at site 𝑖. The site variable (cid:174)𝑆𝑖 is a unit vector,

however, the stencil is not since in 𝑑 dimensions, it is the sum of 2𝑑 unit vectors. For the Metropolis

accept/reject step, we compute the ratio of probabilities

𝑃(𝑥𝑛𝑒𝑤)
𝑃(𝑥𝑜𝑙𝑑)

𝑒−𝛽𝐸𝑛𝑒𝑤
𝑒−𝛽𝐸𝑜𝑙𝑑

=

= 𝑒−𝛽(𝐸𝑛𝑒𝑤−𝐸𝑜𝑙𝑑) = 𝑒−𝛽Δ𝐸 ,

(3.24)

where Δ𝐸 ≡ 𝐸𝑛𝑒𝑤 − 𝐸𝑜𝑙𝑑, and so we only care about the energy difference when a single site

variable (cid:174)𝑆𝑖 is changed to (cid:174)𝑆′
𝑖

Δ𝐸 = −

∑︁

𝜇

(cid:16) (cid:174)𝑆′

𝑖 · (cid:174)𝑆𝑖+ ˆ𝜇 + (cid:174)𝑆′

𝑖 · (cid:174)𝑆𝑖− ˆ𝜇

(cid:17)

+

∑︁

(cid:16) (cid:174)𝑆𝑖 · (cid:174)𝑆𝑖+ ˆ𝜇 + (cid:174)𝑆𝑖 · (cid:174)𝑆𝑖− ˆ𝜇

(cid:17)

= −

(cid:16) (cid:174)𝑆′

𝑖 − (cid:174)𝑆𝑖

(cid:17)

· (cid:174)𝑆𝑖,+.

(3.25)

𝜇

So the ratio of probabilities can be written as 𝑒 𝛽 (cid:174)𝑆′

𝑖 · (cid:174)𝑆𝑖,+/𝑒 𝛽 (cid:174)𝑆𝑖· (cid:174)𝑆𝑖,+, which implies the probability

distribution to be sampled is 𝑃( (cid:174)𝑆) = 𝑒 𝛽 (cid:174)𝑆· (cid:174)𝑆+. We can parameterize the site vector with an angle 𝜃,

48

so the PDF2 can be written as

𝑃(𝜃; 𝛼) = 𝑒𝛼 cos(𝜃−𝜃+),

𝛼 ≡ 𝛽

(cid:12)
(cid:12)
(cid:12)

(cid:174)𝑆+

(cid:12)
(cid:12)
(cid:12)

.

(3.26)

where 𝜃 is the random variable and 𝜃+ is the angle made by the stencil vector. Note that 𝛼 and

𝜃+ are parameters which depend on the environment/stencil of the site, and 𝛽 is fixed for a given
simulation. In practice, we do not need to treat 𝜃+ as a parameter since we can define ˜𝜃 = 𝜃 − 𝜃+,
sample the simpler PDF 𝑃( ˜𝜃; 𝛼) = 𝐶𝑒𝛼 cos( ˜𝜃), and then calculate 𝜃 = ˜𝜃 + 𝜃+. Since 𝜃 and 𝜃+ both
span [0, 2𝜋), we want −2𝜋 ≤ ˜𝜃 < 2𝜋. Note that the stencil (cid:174)𝑆+ is a sum of unit vectors, so in
𝑑-dimensions, 0 ≤ | (cid:174)𝑆+| ≤ 2𝑑. This implies that the parameter 𝛼 lies in 0 ≤ 𝛼 ≤ 2𝛽𝑑. The CDF

associated with Eq. (3.26) is

𝑧 = 𝐹 ( ˜𝜃; 𝛼) =

∫ ˜𝜃

−2𝜋

𝑃(𝜃′; 𝛼) 𝑑𝜃′

(3.27)

We can simulate the 𝑂 (2) model with “1 variable / 1 parameter” BMHA. The two-dimensional

lookup table is constructed by discretizing the parameter 0 ≤ 𝛼 ≤ 2𝛽𝑑 into 𝑛𝑝𝑎𝑟 bins with uniform
bin widths Δ𝛼𝑖 = 2𝜋/𝑛𝑝𝑎𝑟. Within each of those bins, the variable −2𝜋 ≤ ˜𝜃 ≤ 2𝜋 is discretized

into 𝑛𝑣𝑎𝑟 bins with bin widths Δ𝑥 𝑗 = 𝑥 𝑗 − 𝑥 𝑗−1 with 𝑗 = 1, . . . , 𝑛𝑣𝑎𝑟 and the bin widths Δ𝑥 𝑗 chosen

nonuniformly such that 𝑥 𝑗 = 𝐹−1(𝑧 = 𝑗/𝑛𝑣𝑎𝑟). An updating sweep of the lattice would proceed as

follows: Loop sequentially over all lattice sites. For each site,

1. Get the angle 𝜃𝑜𝑙𝑑 associated with the current site variable

2. Compute the stencil magnitude | (cid:174)𝑆+| and angle 𝜃+ and 𝛼 = 𝛽| (cid:174)𝑆+|

3. The shifted angle is ˜𝜃𝑜𝑙𝑑 ≡ 𝜃𝑜𝑙𝑑 − 𝜃+

4. Get the bin 𝑖 to which 𝛼 belongs

5. Within that 𝛼 bin, locate the bin 𝑗𝑜𝑙𝑑 to which ˜𝜃𝑜𝑙𝑑 belongs

2This is an unnormalized PDF, which is fine when computing the ratio of probabilities. However, when using the

PDF to construct the CDF, one must normalize it via numerical integration.

49

6. Still within that 𝛼 bin, propose ˜𝜃𝑛𝑒𝑤 by uniformly picking a ˜𝜃 bin 𝑗𝑛𝑒𝑤 and proposing ˜𝜃𝑛𝑒𝑤

uniformly within that bin

7. Accept ˜𝜃𝑛𝑒𝑤 with probability

(cid:32)

𝑃𝑏𝑚ℎ𝑎 = min

1,

𝑒𝛼 cos( ˜𝜃𝑛𝑒𝑤)
𝑒𝛼 cos( ˜𝜃𝑜𝑙𝑑)

Δ ˜𝜃 𝑗𝑛𝑒𝑤
Δ ˜𝜃 𝑗𝑜𝑙𝑑

(cid:33)

.

(3.28)

Otherwise, set ˜𝜃𝑛𝑒𝑤 = ˜𝜃𝑜𝑙𝑑

8. Unshift the angle 𝜃𝑛𝑒𝑤 = ˜𝜃𝑛𝑒𝑤 + 𝜃+ with 𝜃𝑛𝑒𝑤 ∈ [0, 2𝜋)

9. Update the site variable so that it points in the direction 𝜃𝑛𝑒𝑤

The BMHA can be generalized to a PDF 𝑃(𝑥, 𝑦, 𝑧, . . . ; 𝛼, 𝛽, 𝛾, . . .) with 𝑁𝑣𝑎𝑟 random variables

𝑥, 𝑦, 𝑧, . . . and 𝑁 𝑝𝑎𝑟 parameters 𝛼, 𝛽, 𝛾, . . .. Increasing the number of parameters does not signifi-

cantly complicate things since one just uniformly discretizes the space of the parameters and within

each resulting “cell”, discretizes the PDF. So a “1 variable / 𝑁 𝑝𝑎𝑟 parameter” PDF would result in

a (𝑁 𝑝𝑎𝑟 + 1)-dimensional lookup table. Increasing the number of variables is more complicated.

One approach would be to uniformly discretize each dimension of the parameter space so that one

has an 𝑁 𝑝𝑎𝑟-dimensional grid. Then within each of these cells, discretize the 𝑁𝑣𝑎𝑟-dimensional

variable space into “volumes” of equal probability. For more details, see Section 3 of [65].

To perform BMHA updating for the general 𝑂 (𝑛) model with total energy function 𝐻 =

⟨𝑖, 𝑗⟩

− (cid:205)
(cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 , where (cid:174)𝑆𝑖 and (cid:174)𝑆 𝑗 are 𝑛-dimensional unit vectors, the PDF to be sampled is
𝑃({𝜃𝑖}; {𝛼𝑖}) = 𝑒 𝛽 (cid:174)𝑆· (cid:174)𝑆+. Again, (cid:174)𝑆 is the random variable and (cid:174)𝑆+ is the stencil vector fixed by
the environment of (cid:174)𝑆. When expanded in the trigonometric functions of generalized spher-

ical coordinates, the dot product (cid:174)𝑆 · (cid:174)𝑆+ contains 𝑛 terms containing a total of 𝑛 − 1 angles

(these will be the random variables) 𝜃1, . . . , 𝜃𝑛−1 and 𝑛 parameters 𝛼1, . . . , 𝛼𝑛. However, one

can use the trigonometric identity cos(𝑎 − 𝑏) = cos 𝑎 cos 𝑏 + sin 𝑎 sin 𝑏 to eliminate one of

the parameters in return for shifting one of the angles. Thus for the 𝑂 (𝑛) model, we need

“𝑛 − 1 variable / 𝑛 − 1 parameter” BMHA. For example, for the 𝑂 (3) model, one can write the

50

PDF as 𝑃(𝜃1, 𝜃2; 𝛼1, 𝛼2) = exp(𝛼1 cos 𝜃1 + 𝛼2 sin 𝜃1 cos(𝜃2 − 𝜃+,2)), where 𝛼1 = 𝛽| (cid:174)𝑆+| cos 𝜃+,1,
𝛼2 = 𝛽| (cid:174)𝑆+| sin 𝜃+,1, and 𝜃+,𝑖 is the 𝑖th component of the stencil vector (cid:174)𝑆+.

3.3 Autocorrelations

In a Markov chain, where configurations are separated by a (computer) time step, consecutive

configurations tend to be correlated with each other. This “autocorrelation” of lattice configurations

results in the autocorrelation of any observables calculated on those configurations. In other words,

observables calculated in MCMC are generally not independent, and this complicates their statistical

analysis. A good discussion of autocorrelation is given in [5] and a briefer discussion is given

in [66].

For an observable 𝑋

𝐶𝑋 (𝑋𝑖, 𝑋𝑖+𝑡) = ⟨𝑋𝑖 𝑋𝑖+𝑡⟩ − ⟨𝑋𝑖⟩⟨𝑋𝑖+𝑡⟩,

(3.29)

is the correlation function between the observable 𝑋 measured at Markov times 𝑖 and 𝑖 + 𝑡. In

equilibrium, this autocorrelation function depends only on the time separation 𝑡, and one writes

𝐶𝑋 (𝑡) instead of 𝐶𝑋 (𝑋𝑖, 𝑋𝑖+𝑡). Typically, the normalized correlation function 𝑐 𝑋 (𝑡) has exponential

fall-off with 𝑡

𝑐 𝑋 (𝑡) ≡

𝐶𝑋 (𝑡)
𝐶𝑋 (0)

∼ 𝑒−𝑡/𝜏𝑋,𝑒𝑥 𝑝 ,

(3.30)

where 𝜏𝑋,𝑒𝑥 𝑝 is the exponential autocorrelation time for the observable 𝑋. In general, the autocor-

relation time depends on the observable 𝑋, the lattice volume, and the updating algorithm.

An estimator for the integrated autocorrelation time for an observable 𝑋 is given by

˜𝜏𝑋,𝑖𝑛𝑡 = 1 + 2

𝑇
∑︁

𝑡=1

𝑐 𝑋 (𝑡).

(3.31)

The integrated autocorrelation time 𝜏𝑋,𝑖𝑛𝑡, which is a measure of the self-correlation between

consecutive configurations in a Markov chain, is estimated by finding a window in 𝑡 for which ˜𝜏𝑋,𝑖𝑛𝑡

is nearly independent of 𝑡. In other words, one looks for a plateau in a plot of ˜𝜏𝑋,𝑖𝑛𝑡 versus 𝑡, and

the value of ˜𝜏𝑋,𝑖𝑛𝑡 at the plateau is taken to be the estimate 𝜏𝑋,𝑖𝑛𝑡. See Figure 3.1. In practice, it

may be hard to find a plateau. One often has to do a controlled search for the plateau by increasing

𝑇 and the statistics (number of sweeps) used in the simulation. See Fig. (3.2). The integrated

51

Figure 3.1 Plots of the integrated autocorrelation curve of the energy for the two-dimensional Ising
model at four different lattice sizes. Each simulation is performed at the critical point 𝛽𝑐 using
a heatbath algorithm. The curves are computed using Eq. (3.31), and the plateau value gives the
estimate of the integrated autocorrelation time 𝜏𝑖𝑛𝑡. Notice that 𝜏𝑖𝑛𝑡 grows with lattice volume, and
to obtain the plateau value, one must go to much larger values of 𝑡 as the lattice size is increased.

autocorrelation time is observable dependent and there can be orders of magnitude difference

between the integrated autocorrelation times of different observables 𝑋. In general, it also depends

on the lattice size and the updating algorithm.

In plots of ˜𝜏𝑋,𝑖𝑛𝑡 versus 𝑡 as in Figure 3.2, the error bars at nearby values of 𝑡 are correlated and

roughly the same size. To estimate 𝜏𝑋,𝑖𝑛𝑡, it usually suffices to take the maximum value and its error

bar. One should always check such plots by eye to make sure that one is getting the maximum. For

example, in the left panel of Figure 3.2, one is clearly not obtaining the maximum since the curve

is increasing beyond the extent of the plot.

In practice, analyzing the autocorrelations in a time series can be a subtle business. For

example, in systems with explicitly broken symmetry as we will see in the next chapter, there can be

very “long wavelength” autocorrelations coming from magnetization flips (when the entire lattice

flips magnetization) as well as ordinary autocorrelations coming from individual spin flips. If the

Markov chain is too short, an autocorrelation analysis may completely miss the long wavelength

autocorrelations. The safest approach is to start with very small lattices and very large statistics,

and slowly work up to larger lattices.

The effect of autocorrelations is to decrease the number of statistically-independent measure-

52

Figure 3.2 Here we show the integrated autocorrelation curve of the energy for the Ising model on
a 16 × 16 lattice near the critical point calculated from a time series of 216 heatbath sweeps with
one measurement per sweep. The integrated autocorrelations are estimated using Eq. (3.31), and
we are trying to find the maximum/plateau value. In the left panel, the time extent 𝑇 = 32 is clearly
insufficient to resolve the plateau. In the middle panel, we increased 𝑇 and located a peak which
gives an estimate of the integrated autocorrelation time. However, in this case we have large error
bars and don’t have a nice plateau, which indicates that our data was generated with insufficient
statistics to accurately resolve the integrated autocorrelation time. In the final panel, we increased
the statistics of the simulation by a factor of 256 to 224 heatbath sweeps, and we increased 𝑇 to 512.
Now we see a nice plateau with small error bars. In many practical examples, it would be difficult
to get sufficient statistics to resolve the plateau this well and one might be forced to estimate the
integrated autocorrelation time from plots more like the middle one.

ments in one’s sample.

If autocorrelation is not properly accounted for, the naively calculated

statistical error bars will be improperly small. For a sample {𝑋1, 𝑋2, . . . , 𝑋𝑁 } of 𝑁 autocorrelated

measurements, the number of effectively independent measurements is

𝑁𝑖𝑛𝑑𝑒 𝑝 =

𝑁
𝜏𝑋,𝑖𝑛𝑡

.

(3.32)

For uncorrelated measurements, the variance of the mean is 𝜎2
𝑋

= 𝜎2

𝑋/𝑁. The variance of the

mean, corrected for autocorrelations, is

𝜎2
𝑋,𝑐𝑜𝑟𝑟 = 𝜏𝑋,𝑖𝑛𝑡𝜎2

𝑋 = 𝜏𝑋,𝑖𝑛𝑡

𝜎2
𝑋
𝑁

.

(3.33)

Thus, the estimate that one would quote for the sample of correlated measurements {𝑋1, 𝑋2, . . . , 𝑋𝑁 }

is

where ˆ𝑋 is an estimator of the mean.

ˆ𝑋 ± 𝜎𝑋,𝑐𝑜𝑟𝑟,

(3.34)

53

 0 5 10 15 20 25 30 35 0 5 10 15 20 25 30 35τintt 0 50 100 150 200 250 300t 0 100 200 300 400 500 600tFigure 3.3 The integrated autocorrelation time (of the energy here) blows up at a phase transition
as shown here for the 𝑋𝑌 model using a Biased Metropolis Heatbath Algorithm on lattices of
various size. This is the cause of the critical slowing down that plagues MCMC simulations
near critical points. The high autocorrelation time means many configurations must be discarded
between measurements in order to get an independent sample, and so the overall computational
time required gets very large at the critical point.

According to [66], for equilibration, one should discard at least 20𝜏𝑋,𝑖𝑛𝑡 configurations. To

end up with a sample of independent measurements, one should discard a similar quantity of

configurations between consecutive measurements. For good statistics, increase this to 1000𝜏𝑋,𝑖𝑛𝑡,

and for statistics at the 1% level, increase it to 10, 000𝜏𝑋,𝑖𝑛𝑡. In practice, however, for computationally

intensive simulations as done in lattice QCD, one has to be satisfied with ∼ 1𝜏𝑋,𝑖𝑛𝑡 configurations

discarded between measurements.

For a system with a phase transition, 𝜏𝑋,𝑖𝑛𝑡 grows large at the critical point. Furthermore, this

growth is volume dependent so 𝜏𝑋,𝑖𝑛𝑡 blows up doubly as the critical point is approached and the

lattice volume is increased. See Figure 3.3, to see how 𝜏𝑋,𝑖𝑛𝑡 varies with 𝛽 and lattice size as one

approaches the critical point in the 𝑋𝑌 model. At the critical point, the integrated autocorrelation

time increases as [5, 66]

𝜏𝑋,𝑖𝑛𝑡 ∝ 𝐿𝑧,

(3.35)

where 𝐿 is the lattice extent in each direction, and 𝑧 is a dynamical critical exponent. This increase

as a power of the lattice volume as the critical point is neared is the cause of the critical slowing

down that plagues MCMC studies. Keep in mind that some number of configurations (> 𝜏𝑋,𝑖𝑛𝑡)

54

0.00.51.01.52.0020406080100int4×48×816×1632×3264×64Figure 3.4 The integrated autocorrelation time of the energy is calculated using the heatbath
algorithm for the two-dimensional Ising model at the critical point on four different lattice sizes.
See Figure 3.1. The results are fitted to Eq. (3.35) to get an estimate of the dynamical critical
exponent 𝑧.

must be discarded between saved configurations to mitigate the autocorrelations. This means as

the critical point is approached, the MCMC simulation must be run for longer and longer to obtain

equivalent statistics. For the two-dimensional Ising model with a heatbath algorithm, we find

𝜏𝐸,𝑖𝑛𝑡 ≈ 0.1733𝐿1.87,

(3.36)

with dynamical critical exponent 𝑧 = 1.873 ± 0.015. See Figure 3.4.

It must be noted that computing the integrated autocorrelation time is expensive. Typically, it

requires orders of magnitude more statistics to precisely estimate 𝜏𝑋,𝑖𝑛𝑡, for some observable 𝑋, than

it takes to get a seemingly precise estimate of 𝑋 itself. Nevertheless, if one wants a reliable estimate

of some observable 𝑋, one must also have some understanding of its integrated autocorrelation

time 𝜏𝑋,𝑖𝑛𝑡.

One obvious way to mitigate autocorrelations is to throw away most of the data. For example,

if 𝜏𝑖𝑛𝑡 = 10 for some time series, then saving only every 10th data point should in principle remove

the autocorrelations. Unfortunately, since the sample size is now 10 times smaller, the error bar of

the sample mean is now

√

10 times larger. Another approach is to do blocking and preaveraging

of the data. Chunking the time series into blocks larger than 𝜏𝑖𝑛𝑡 and taking the average of each

block produces a new (much shorter) time series which in principle is free of autocorrelations. See

55

Figure 3.5 Here we illustrate how blocking and preaveraging can be used to mitigate autocorrela-
tions. We construct a stochastic time series containing autocorrelation. This is done by choosing
100K uniformly random numbers {𝑟} in [−0.5, 0.5). The 𝑛th element 𝑎𝑛 of the autocorrelated
time series (plotted in top left) is 𝑎𝑛 = 𝑟 + 𝑎𝑛−1 where 𝑟 is a random number, however, if 𝑛 is a
multiple of 100, then 𝑎𝑛 = 𝑟. The center left panel shows the autocorrelation function 𝑐(𝑡) of the
series, and the bottom left panel shows that the integrated autocorrelation time is ∼ 65. Next, we
do blocking and preaveraging with block size 100. The resulting shorter time series is shown in
the top right. The associated correlation function (center right) and 𝜏𝑖𝑛𝑡 (bottom right), shows that
the preaveraged time series is not autocorrelated.

Figure 3.5. Although this new sample is smaller, its variance is also smaller (since each number is

already an average), and so the error bar does not increase by as much as if one simply threw away

most of the data. The better approach, discussed in the next section, is to jackknife with blocking.

3.4

Jackknife Error Analysis

In this section we follow the notation and approach of [5]. Suppose we have a sample of 𝑁

estimates of a quantity 𝑥

𝑥1, 𝑥2, 𝑥3, . . . , 𝑥𝑁 .

(3.37)

56

020000400006000080000100000t1050510time series050100150200250300t0.00.20.40.60.81.0c(t)050100150200250300t020406080100int02004006008001000t642024time series050100150200250300t0.00.20.40.60.81.0c(t)050100150200250300t0.00.51.01.52.0intThis sample may or may not be an ordered time series. Assuming the 𝑥𝑖 are uncorrelated, a correct

(i.e. unbiased) estimator of the expectation value of 𝑥 is the mean

𝑥 =

1
𝑁

𝑁
∑︁

𝑖=1

𝑥𝑖,

and a correct estimator of the variance of this sample mean is

𝑠2
𝑥 =

1
𝑁 (𝑁 − 1)

𝑁
∑︁

𝑖=1

(𝑥𝑖 − 𝑥)2.

(3.38)

(3.39)

Then our best estimate of 𝑥 is 𝑥 and our uncertainty is the standard deviation of the mean 𝑠𝑥, and

so we would quote the result

𝑥 ± 𝑠𝑥.

(3.40)

If we are interested in a function 𝑓 (𝑥), as we often are, we find that a correct estimator is 𝑓 = 𝑓 (𝑥),

i.e.

𝑓 calculated at the mean 𝑥. To calculate the uncertainty associated with 𝑓 , the conventional

approach is to use “error propagation” formulae to deduce the uncertainty from the uncertainty of

𝑥. However, this approach becomes difficult and even unreliable when 𝑓 (𝑥) is a nonlinear function

of 𝑥, and so we need a different approach.

The jackknife approach to error analysis is easy to implement and robust. See textbooks like [5]

for a complete introduction. We define jackknife estimators of 𝑓 (𝑥) as 𝑓 𝐽

𝑖 ≡ 𝑓 (𝑥𝐽

𝑖 ) where the

jackknife variable is

𝑥𝐽
𝑖 =

1
𝑁 − 1

∑︁

𝑥𝑘 ,

(3.41)

that is, 𝑥𝐽

𝑘≠𝑖
𝑖 is the mean of the sample when the 𝑖th data point is excluded. Then the jackknife mean is

𝐽

𝑓

=

1
𝑁

𝑁
∑︁

𝑖=1

𝑓 𝐽
𝑖 ,

(3.42)

Note that the superscript 𝐽 is not an index nor a power. Rather, it is just a label to indicate that the

quantity is from jackknife. An estimator of the variance of 𝑓

𝐽

is

𝑠2
𝑓

𝐽 =

𝑁 − 1
𝑁

𝑁
∑︁

(cid:16)

𝑖=1

𝑓 𝐽
𝑖 − 𝑓

𝐽 (cid:17) 2

.

(3.43)

57

Then the estimate that one would quote is

𝐽

𝑓

± 𝑠

𝐽 .

𝑓

An an example, let us consider a jackknife computation of the magnetic susceptibility

𝜒 =

1
𝑉

(cid:16)

⟨𝑀 2⟩ − ⟨𝑀⟩2(cid:17)

,

(3.44)

(3.45)

where we use 𝑉 here to indicate the number of lattice sites since we are using 𝑁 in this section

to denote the length of the time series. We can think of 𝜒(𝑀, 𝑀 2) as a two-variable function of

𝑀 and 𝑀 2. Assume that a MCMC simulation has given us a time series 𝑀1, 𝑀2, . . . , 𝑀𝑁 of the

magnetization measured on 𝑁 uncorrelated lattice configurations. We create a second time series

by squaring each of these to get 𝑀 2

1 , 𝑀 2

2 , . . . , 𝑀 2

𝑁 . We define the jackknife variables

𝑀 𝐽

𝑖 =

1
𝑁 − 1

∑︁

𝑘≠𝑖

𝑀𝑘 ,

(𝑀 2)𝐽

𝑖 =

1
𝑁 − 1

𝑀 2
𝑘

∑︁

𝑘≠𝑖

(3.46)

Then our estimator for the susceptibility is the jackknife mean

𝜒𝐽 =

1
𝑁

𝑁
∑︁

𝑖=1

𝜒𝐽
𝑖 ,

𝜒𝐽
𝑖 ≡

(cid:16)

1
𝑉

and our estimator for the variance of 𝜒𝐽 is

(𝑀 2)𝐽

𝑖 − (𝑀 𝐽

𝑖 )2(cid:17)

,

(3.47)

𝑠2
𝜒𝐽 =

𝑁 − 1
𝑁

𝑁
∑︁

𝑖=1

(cid:16)

𝑖 − 𝜒𝐽 (cid:17) 2
𝜒𝐽

.

(3.48)

So far we’ve assumed that the sample data are uncorrelated. However, we know that time

series data from an MCMC simulation will be autocorrelated. The simplest way to handle this

is to remove the autocorrelations by blocking and pre-averaging the data, and then applying the

jackknife machinery to the (de-correlated) pre-averaged data. Suppose we have a time series of 𝑁

data points 𝑥1, 𝑥2, 𝑥3, . . . , 𝑥𝑁 with an integrated autocorrelation time of 𝜏𝑥,𝑖𝑛𝑡. We chunk the time

series into 𝑁𝑏 blocks, labeled 𝑗 = 1, 2, . . . , 𝑁𝑏, each containing 𝑁/𝑁𝑏 of the original data points.

To mitigate the autocorrelations, the block size should be >> 𝜏𝑥,𝑖𝑛𝑡 (if 𝜏𝑥,𝑖𝑛𝑡 is unknown, we can

vary the block size, study the error versus block size, and extrapolate to infinite block size in the

end). We form a new time series 𝑦1, 𝑦2, . . . , 𝑦𝑁𝑏, where 𝑦 𝑗 is the average computed from block 𝑗.

58

This new time series forms a sample of nearly statistically independent data points and becomes

the input for our jackknife procedure.

The more robust and general approach is to do jackknife with blocking. Instead of blocking

and then doing jackknife on a shorter pre-averaged time series, one integrates blocking into the

jackknife procedure. This approach can be used to handle autocorrelations (by choosing a block

size >> 𝜏𝑥,𝑖𝑛𝑡, but also to handle the cases where the length 𝑁 of the original time series is very

large and the function being computed is too costly to be repeated 𝑁 times on the full time series.

The difference from the jackknife procedure detailed in Eq. (3.42)–(3.44), is simply that one now

excludes entire blocks in Eq. (3.41) rather than single data points. Once again, suppose our original

time series is 𝑥1, 𝑥2, 𝑥3, . . . , 𝑥𝑁 . We choose some number 𝑁𝑏 of blocks to chunk the time series

into. To maintain “Gaussian-ness”, one should use 𝑁𝑏 > 30. To mitigate autocorrelations, one

should choose 𝑁𝑏 such that the block size is much larger than the integrated autocorrelation time,

i.e. 𝑁/𝑁𝑏 >> 𝜏𝑖𝑛𝑡. I assume here that 𝑁 is a multiple of 𝑁𝑏. We label the blocks as ℬ𝑖 with

𝑖 = 1, 2, . . . , 𝑁𝑏. This time, we do not pre-average. The main difference is that we define the 𝑖th

jackknife variable now as

𝑥𝐽
𝑖 =

𝑘 for 𝑥𝑘∉ℬ𝑖
where the sum is over all data points not in block ℬ𝑖, i.e. 𝑥𝐽

1
𝑁 − 𝑁/𝑁𝑏

𝑁−𝑁/𝑁𝑏
∑︁

𝑥𝑘 .

(3.49)

𝑖 is simply the mean of the time series
𝑖 ≡ 𝑓 (𝑥𝐽
𝑖 ), the jackknife

when block ℬ𝑖 is excluded. The jackknife estimators of 𝑓 (𝑥) are now 𝑓 𝐽

mean is

and an estimator of the variance of 𝑓

𝐽

is

𝐽

𝑓

=

1
𝑁𝑏

𝑁𝑏∑︁

𝑖=1

𝑓 𝐽
𝑖 ,

𝑠2
𝑓

𝐽 =

𝑁𝑏 − 1
𝑁𝑏

𝑁𝑏∑︁

(cid:16)

𝑓 𝐽
𝑖 − 𝑓

𝐽 (cid:17) 2

.

𝑖=1
Superficially, this may seem equivalent to pre-averaging followed by the jackknife procedure

Eq. (3.42)–(3.44), and indeed it is when 𝑓 is a simple function of 𝑥. More commonly, however, we

have to deal with some multivariable nonlinear mapping where 𝑓 depends not only on 𝑥, 𝑦, . . ., but

also on the original data points {𝑥𝑖}, {𝑦𝑖}, . . .. We will see an example of this in Section 3.6.

59

(3.50)

(3.51)

3.5 Reweighting

In the canonical ensemble, we commonly describe the statistical properties of a system in

thermodynamic equilibrium by the partition function Eq. (3.2). In this form, the partition function

is a function of the inverse temperature 𝛽 and is a sum over all possible lattice configurations

𝑈. An MCMC simulation, performed at a particular 𝛽 = 𝛽0, is used to generate a chain of

configurations 𝑈1, 𝑈2, . . ., with frequency proportional to the Boltzmann weight 𝑒−𝛽0𝐸 (𝑈). After

accounting for equilibration and autocorrelations, this chain provides a representative sample of

the interesting configurations and is used to estimate various thermodynamic averages at 𝛽0. What

about estimating the thermodynamic averages at a different value 𝛽? Naively, this requires us to

repeat the MCMC simulation at 𝛽 even if 𝛽 is very close to 𝛽0.

Alternatively, we can write the partition function as a sum over energies

𝑍 (𝛽) =

∑︁

𝐸

Ω(𝐸)𝑒−𝛽𝐸 ,

(3.52)

where the density of states Ω(𝐸) gives the number of configurations with energy 𝐸. The probability

of observing the system with energy 𝐸 is

𝑃𝛽 (𝐸) =

Ω(𝐸)
𝑍 (𝛽)

𝑒−𝛽𝐸 .

(3.53)

An MCMC simulation at a fixed 𝛽 = 𝛽0 produces for us a Markov chain of 𝑛 configurations

𝑈1, 𝑈2, . . . , 𝑈𝑛, with some distribution of energies 𝐸. A histogram ℋ𝛽0 (𝐸) of these energies gives

an estimate of the equilibrium probability distribution

ℋ𝛽0 (𝐸)
𝑛

=

˜Ω(𝐸)
𝑍 (𝛽0)

𝑒−𝛽0𝐸 .

(3.54)

where ˜Ω(𝐸) is an estimate of the density of states. We can solve Eq. (3.54) for ˜Ω(𝐸) and use it to

replace the two instances of Ω(𝐸) in Eq. (3.53) to get
𝑍 (𝛽0)𝑒 𝛽0𝐸 (cid:105)
𝑍 (𝛽0)𝑒 𝛽0𝐸 (cid:105)

Ω(𝐸)𝑒−𝛽𝐸
(cid:205)𝐸 Ω(𝐸)𝑒−𝛽𝐸 =

(cid:104) ℋ𝛽0 (𝐸)
𝑛
(cid:104) ℋ𝛽0 (𝐸)
𝑛

𝑃𝛽 (𝐸) =

(cid:205)𝐸

𝑒−𝛽𝐸

𝑒−𝛽𝐸

=

ℋ𝛽0 (𝐸)𝑒−(𝛽−𝛽0)𝐸
(cid:205)𝐸 ℋ𝛽0 (𝐸)𝑒−(𝛽−𝛽0)𝐸

.

(3.55)

Thus, the histogram from a simulation performed at 𝛽0, can be used to estimate the probability

distribution at 𝛽. This is the basis of the “single histogram” reweighting method. See textbooks

60

Figure 3.6 Here we show an illustration of how useful multihistogram reweighting can be to
precisely locate the peak positions and heights for finite size scaling. The five black points are from
canonical MCMC simulations, and the gray points are computed via multihistogram reweighting
of the canonical time series. In this example, we are looking at the specific heat from the Extended-
𝑂 (2) model (discussed in a later chapter) with 𝑞 = 3 and ℎ𝑞 = 1 simulated on a 64 × 64 lattice.

such as [5, 67]. Given a canonical simulation performed at 𝛽0, we can estimate the value of an

observable 𝑂 at a new 𝛽 via the relation

⟨𝑂⟩𝛽 =

∑︁

𝐸

𝑂 (𝐸)𝑃𝛽 (𝐸) =

(cid:205)𝐸 𝑂 (𝐸)ℋ𝛽0 (𝐸)𝑒−(𝛽−𝛽0)𝐸
(cid:205)𝐸 ℋ𝛽0 (𝐸)𝑒−(𝛽−𝛽0)𝐸

.

(3.56)

To get an accurate estimate, there must be sufficient overlap between the histograms associated with

𝛽0 and 𝛽. In practice, the result is only accurate for small Δ𝛽 ≡ 𝛽 − 𝛽0. To maintain accuracy, Δ𝛽

generally must be taken smaller as a critical point is approached or as the lattice size is increased.

To accurately cover a wider range in 𝛽, one can use “multihistogram” reweighting.

In multihistogram reweighting [68, 69], multiple histograms (from multiple canonical simula-

tions) are stitched together to approximate the density of states. This improves the accuracy of the

reweighted observables, and typically allows one to reweight to a much larger range of 𝛽 values.

Reweighting is very useful in finite-size scaling [70] to obtain precise peak positions without

having to do a large number of canonical simulations at different 𝛽. An example is shown in

Figure 3.6 for illustration.

61

0.790.800.810.820.830.840.8501234567CReweightedCanonical Simulation3.6 Curve Fitting

3.6.1 Uncorrelated Fitting

Suppose we have 𝑁 uncorrelated data points 𝑦𝑖 (𝑥𝑖) with 𝑖 = 1, 2, . . . , 𝑁 and their standard

deviations 𝜎𝑖. We want to model the data by fitting it to some function 𝑦(𝑥𝑖; (cid:174)𝑎) with free parameters

(cid:174)𝑎. The goal is to estimate the best fit parameters (cid:174)𝑎 and their error bars. For linear fits, this problem

is easily solved with the linear least squares approach. However, for nonlinear fits, such as power

law fits which occur in finite size scaling, the fitting procedure is slightly more complicated. Here,

we only briefly sketch the general procedure since one can find the details in references like [5, 71].

The foundation of most fitting methods is the chi-squared function

𝜒2( (cid:174)𝑎) =

𝑁
∑︁

𝑖=1

(cid:18) 𝑦𝑖 − 𝑦(𝑥𝑖; (cid:174)𝑎)
𝜎𝑖

(cid:19) 2

.

(3.57)

Here, 𝑦𝑖 are the data points with standard deviations3 𝜎𝑖, and 𝑦(𝑥𝑖; (cid:174)𝑎) is the function we are fitting.

The components of (cid:174)𝑎 = (𝑎1, . . . , 𝑎 𝑀) are the free parameters to be estimated. The best estimates

of the free parameters are those which minimize Eq. (3.57). Thus, the task of a fitting procedure is

reduced to the problem of minimizing Eq. (3.57).

For linear or polynomial fitting, the minimization of Eq. (3.57) is straightforward. For nonlinear

fitting, it is more difficult, but there are standard algorithms such as the Levenberg-Marquardt

algorithm proposed by Levenberg [72] and improved by Marquardt [73]. The Levenberg-Marquardt

algorithm uses a weighted average of the Newton-Raphson and the steepest descent methods for

minimizing Eq. (3.57). Both of these are iterative methods. Initially, the weight is biased toward

the steepest descent method which converges towards a (possibly local) minimum even when the

initial guess (cid:174)𝑎0 for the free parameters is poor. The steepest descent method computes the next

iteration point, (cid:174)𝑎𝑛+1, from

(cid:174)𝑎𝑛+1 = (cid:174)𝑎𝑛 − 𝛾∇𝜒2( (cid:174)𝑎𝑛),

(3.58)

where 𝛾 is a small constant. The disadvantage of the steepest descent method is slow convergence,

so once convergence is detected, the Levenberg-Marquardt algorithm shifts the weight to the

3Including the error bars of the data points is important so that the best fit properly gives more weight to points

with smaller error bars.

62

Newton-Raphson method, which converges rapidly but needs a good initial approximation. In the

Newton-Raphson method, the next iteration point is computed as

(cid:174)𝑎𝑛+1 = (cid:174)𝑎𝑛 − 𝐻−1∇𝜒2( (cid:174)𝑎𝑛),

where 𝐻 is an 𝑀 × 𝑀 Hessian matrix with components

𝐻 𝑗 𝑘 = 2

𝑁
∑︁

𝑖=1

1
𝜎2
𝑖

(cid:20) 𝜕𝑦(𝑥𝑖; (cid:174)𝑎)
𝜕𝑎 𝑗

𝜕𝑦(𝑥𝑖; (cid:174)𝑎)
𝜕𝑎𝑘

− [𝑦𝑖 − 𝑦(𝑥𝑖; (cid:174)𝑎)]

𝜕2𝑦(𝑥𝑖; (cid:174)𝑎)
𝜕𝑎 𝑗 𝜕𝑎𝑘

(cid:21) (cid:12)
(cid:12)
(cid:12) (cid:174)𝑎= (cid:174)𝑎𝑛

.

(3.59)

(3.60)

After finding the parameters (cid:174)𝑎 which minimize Eq. (3.57), one computes the inverse of half of the

Hessian

𝐶 =

(cid:21) −1

.

(cid:20) 1
2

𝐻

(3.61)

This is the covariance matrix of the standard errors in the fitted parameters (cid:174)𝑎.

In practice, there are a variety of software packages available which implement fitting procedures

such as the one described above. In Python, a common choice is to use scipy.optimize.curve_fit,

which defaults to the Levenberg-Marquardt algorithm for unconstrained fitting. Such a fitting

package will return the best fit parameters (cid:174)𝑎 along with a covariance matrix. For uncorrelated data,

the covariance matrix is diagonal, with the diagonal elements being the estimated variances of the

fit parameters.

The purpose of the curve fitting procedure is typically to estimate some figure of interest 𝐴 and

to quantify the uncertainty 𝜎𝐴 of that estimate. If the figure of interest is one of the fit parameters

(e.g. suppose we are fitting to the form 𝐵𝑥 𝐴, and we are interested in the exponent 𝐴), then we can

simply use a standard fitting package, which gives us the best estimates of 𝐴 and 𝐵 as well as the

covariance matrix. The square roots of the diagonal elements of the covariance matrix give us the

uncertainties 𝜎𝐴 and 𝜎𝐵 of the fit parameters. On the other hand, if our figure of interest is some

nonlinear function of the fit parameters, then the fitting package will not calculate the uncertainty

𝜎𝐴 for us. In that case, one can use a form of “bootstrap” to estimate 𝜎𝐴. For example, suppose we

are fitting to a model 𝑦(𝑥𝑖; (cid:174)𝑎) with fit parameters (cid:174)𝑎, and the figure of interest is the maximum value

of the best fit model, i.e. 𝐴 = max(𝑦(𝑥𝑖; (cid:174)𝑎)). The fitting package returns the best fit parameters (cid:174)𝑎

63

along with the uncertainties 𝜎𝑎1, 𝜎𝑎2, . . ., but we want 𝐴 and 𝜎𝐴. Calculating 𝐴 is straightforward

as one can use the usual calculus approach to obtain the maximum of a given function. To estimate

𝜎𝐴 using a bootstrap approach, we treat each original data point 𝑦𝑖 (𝑥𝑖) as coming from a Gaussian

distribution with standard deviation given by the error bar 𝜎𝑖 associated with that point. For each

data point 𝑦𝑖 (𝑥𝑖), we select a Gaussian random from that distribution, fit the curve to that set of 𝑁

random numbers, and calculate the maximum. This is repeated a large number of times so that one

ends up with many estimates of the maximum 𝐴. The result that one quotes is the mean of this

sample, and the error bar is the standard deviation of the sample.

3.6.2

Jackknife Fitting

When fitting correlated data, one can in principle use the approach detailed above, but now the

chi-squared function is the more general

𝜒2( (cid:174)𝑎) =

∑︁

𝑖, 𝑗

(𝑦𝑖 − 𝑦(𝑥𝑖; (cid:174)𝑎)) ˆ𝐶−1
𝑖 𝑗

(cid:0)𝑦 𝑗 − 𝑦(𝑥 𝑗 ; (cid:174)𝑎)(cid:1) .

(3.62)

where the elements ˆ𝐶−1

𝑖 𝑗 of the inverse covariance matrix are themselves estimated from the data.
In practice, this matrix often has very small eigenvalues—resulting in an unstable 𝜒2( (cid:174)𝑎). As a

result, this approach is often unworkable in practice. Besides, this approach wouldn’t help us to get

the error bar associated with a more complicated property (e.g. the maximum) of the best fit. The

solution is to use the jackknife method.

The jackknife approach to fitting is a very general method that works for both uncorrelated

data and correlated data (provided one uses appropriate block sizes). Rather than attempt a formal

description of this procedure, we detail here an example.

In finite-size scaling, one needs to

extract peak locations and heights from various thermodynamic quantities, e.g. the specific heat

𝐶 = 𝛽2(⟨𝐸 2⟩ − ⟨𝐸⟩2)/𝑉, where 𝑉 is the number of lattice sites, and 𝐸 is the internal energy.

Consider Figure 3.6. We want to extract the peak (location and height) along with proper error

bars. Our procedure is as follows:

1. Choose the set of data points 𝐶 (𝛽1), 𝐶 (𝛽2), . . . , 𝐶 (𝛽𝑁 ) to include in the fit. In practice, most

or all of these data points are computed via a reweighting procedure from a few canonical

64

simulations, and so these points are correlated with each other. For simplicity, we assume

here that there is no reweighting procedure. Instead, we will assume that the 𝐶 (𝛽𝑖) data point

will be coming from an energy time series of length 𝑀 generated by a canonical MCMC

simulation performed at 𝛽𝑖. So we end up with 𝑁 time series of the energy 𝐸 each of length

𝑀. Let 𝐸𝑖, 𝑗 be the 𝑗th element of the 𝑖th time series (𝑖 = 1, . . . , 𝑁, 𝑗 = 1, . . . , 𝑀)

2. We choose the number of jackknife blocks 𝑁𝑏 to use when blocking the time series. One

should use 𝑁𝑏 > 30, but also ensure that the number of elements 𝑀/𝑁𝑏 in a block is much

larger than the integrated autocorrelation time 𝜏𝑖𝑛𝑡. Let ℬ𝑖,𝑘 represent the 𝑘th block in the

𝑖th time series

3. Then the 𝑘th jackknife estimator of the specific heat from the 𝑖th time series (𝐶 𝐽

𝑘 (𝛽𝑖)) is
obtained by computing the specific heat on the entire time series 𝐸𝑖 excluding the 𝑘th block.

𝑖𝑘 ≡ 𝐶 𝐽

That is,

where

𝐶 𝐽

𝑖𝑘 =

(cid:18)

𝛽2
𝑖
𝑉

(𝐸𝑖)𝐽

𝑘 =

1
𝑀 − 𝑀/𝑁𝑏

𝑀−𝑀/𝑁𝑏
∑︁

𝐸𝑖,𝑘 ,

𝑘 for 𝐸𝑖,𝑘∉ℬ𝑖,𝑘

(𝐸 2

𝑖 )𝐽

𝑘 −

(cid:17) 2(cid:19)

(cid:16)

(𝐸𝑖)𝐽
𝑘

(3.63)

(𝐸 2

𝑖 )𝐽

𝑘 =

1
𝑀 − 𝑀/𝑁𝑏

𝑀−𝑀/𝑁𝑏
∑︁

𝑘 for 𝐸 2

𝑖,𝑘∉ℬ𝑖,𝑘

𝐸 2
𝑖,𝑘

(3.64)

So for each of the 𝑁 time series, we end up with 𝑁𝑏 estimators of the specific heat

4. Our fitting procedure requires that our data points have proper error bars. At this stage, we

can get those error bars by calculating the jackknife variance of the specific heat estimators

𝑠2
𝐶𝑖

𝐽 =

𝑁𝑏 − 1
𝑁𝑏

𝑁𝑏∑︁

(cid:16)

𝑘=1

𝐶 𝐽
𝑖𝑘 − 𝐶𝑖

𝐽 (cid:17) 2

,

𝐽

𝐶𝑖

=

1
𝑁𝑏

𝑁𝑏∑︁

𝑘=1

𝐶 𝐽
𝑖𝑘 .

(3.65)

5. For 𝑘 = 1, we plot the 𝑁 specific heat estimators 𝐶 𝐽

𝑖1 with the error bars given by 𝑠

𝐽 . We

𝐶𝑖

perform our fitting procedure (e.g. fitting a cubic polynomial using a least squares algorithm)

on this set of points and extract the maximum which we label 𝑀 𝐽

1 . We repeat this for
, (𝑘 = 1, . . . , 𝑁𝑏) of the maximum

𝑘 = 2, . . . , 𝑁𝑏, to end up with 𝑁𝑏 jackknife estimators 𝑀 𝐽
𝑘

65

6. We then calculate the jackknife mean and variance of the maximum as

𝐽

𝑀

=

1
𝑁𝑏

𝑁𝑏∑︁

𝑘=1

𝑀 𝐽
𝑘 ,

𝑠2
𝑀

𝐽 =

𝑁𝑏 − 1
𝑁𝑏

𝑁𝑏∑︁

(cid:16)

𝑘=1

𝑀 𝐽

𝑘 − 𝑀

𝐽 (cid:17) 2

.

The final result we quote for the maximum is then

𝐽

𝑀

± 𝑠

𝐽 .

𝑀

(3.66)

(3.67)

For a recent paper on alternative approach to correlated fitting, see [74] and references therein.

3.6.3 Goodness-of-fit Measures

After obtaining the best fit for a given model, one needs a way to quantify how good the fit is

since a bad best fit suggests that one is using a poor model. This brings us to “goodness-of-fit”

measures. A common goodness-of-fit measure is the reduced chi-squared statistic or “chi-squared

per degrees of freedom” computed as

𝜒2
𝜈 =

𝜒2
𝜈

,

(3.68)

where 𝜒2 is Eq. (3.57) computed at the best fit parameters, and 𝜈 = 𝑁 − 𝑀 is the number of degrees

of freedom calculated as the number of points being fitted minus the number of fitting parameters.

Typically, a good fit will result in 𝜒2

𝜈 ∼ 1, with 𝜒2

𝜈 >> 1 implying a bad fit and 𝜒2

𝜈 << 1 suggesting

an overfit.

An alternative goodness-of-fit measure described in [5, 71] is

𝑄 = 1 − 𝑃

(cid:18) 𝜈
2

,

𝜒2
2

(cid:19)

,

(3.69)

where 𝑃(𝑎, 𝑥) = (1/Γ(𝑎)) ∫ 𝑥

0

𝑒−𝑡𝑡𝑎−1𝑑𝑡 is the lower incomplete gamma function. Again, 𝜒2 is

Eq. (3.57) computed at the best fit parameters, and 𝜈 is the number of degrees of freedom. The

number 𝑄 is the probability that a chi-square as poor as 𝜒2 will occur by chance. In other words,

it gives the likelihood that the difference between the fit and the data is due to chance. According

to [71], if 𝑄 > 0.1 then the fit is believable. If 𝑄 < 0.001, then the model should be called into

question. If 0.001 < 𝑄 < 0.1 then the fit may be acceptable if the errors are nonnormal or have

been moderately underestimated.

66

3.7 Finite Size Scaling

To obtain critical exponents for second-order transitions, we consider the following leading

finite-size-scaling ansätze [75–77]

𝑑𝑈𝑀
𝑑𝛽

(cid:12)
(cid:12)
(cid:12)
(cid:12)𝑚𝑎𝑥

= 𝑈0 + 𝑈1𝐿1/𝜈

𝐶 |𝑚𝑎𝑥 = 𝐶0 + 𝐶1𝐿𝛼/𝜈

⟨𝑀⟩|𝑖𝑛 𝑓 𝑙 = 𝑀0 + 𝑀1𝐿−𝛽/𝜈

𝜒𝑀 |𝑚𝑎𝑥 = 𝜒0 + 𝜒1𝐿𝛾/𝜈

𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥 = 𝐹0 + 𝐹1𝐿2−𝜂.

(3.70)

(3.71)

(3.72)

(3.73)

(3.74)

where 𝑈

| (cid:174)𝑀 |

is the Binder cumulant (to be defined and discussed in the next chapter) with respect

to the proxy magnetization, 𝐶 is the specific heat, 𝑀 is the magnitude of the proxy magnetization,

𝜒𝑀 is the magnetic susceptibility of the proxy magnetization, and 𝐹 ( (cid:174)𝑞) is the structure factor (to

be defined and discussed in the next chapter). Note, when 𝛼 = 0, as it is for transitions of the Ising

class, the ansatz Eq. (3.71) must be modified to

𝐶 |𝑚𝑎𝑥 = 𝐶0 + 𝐶1𝐿𝛼/𝜈 ln 𝐿.

(3.75)

The maxima can be extracted by performing an MCMC scan in 𝛽 to locate the approximate

position of the peaks followed by a higher resolution scan centered on the peaks. The integrated

autocorrelation time of the energy should be estimated from each canonical time series. The

canonical runs can then be “stitched” together using multihistogram reweighting to interpolate

to 𝛽 values between the canonical MCMC runs. The maxima can be extracted from jackknife

bins by fitting a cubic polynomial to the peak. The jackknife method should be used in order to

get appropriate error bars for the final results. This is repeated at different lattice sizes and can

be fitted to the above finite-size-scaling ansätze in Python using the curve_fit function in the

scipy.optimize package.

For the exponent 1/𝜈, one can fit the derivative of the Binder cumulant to the form given by

Eq. (3.70). Instead of taking a finite difference derivative, one can follow [77] and use the analytical

67

form

𝑑𝑈𝑀
𝑑𝛽

=

2⟨𝑀 4⟩[⟨𝑀 2⟩⟨𝐸⟩ − ⟨𝑀 2𝐸⟩]
3⟨𝑀 2⟩3

−

⟨𝑀 4⟩⟨𝐸⟩ − ⟨𝑀 4𝐸⟩
3⟨𝑀 2⟩2

.

(3.76)

where 𝐸 is the internal energy and 𝑀 is the proxy magnetization.

For the exponent −𝛽/𝜈, one can fit Eq. (3.72), where ⟨𝑀⟩|𝑖𝑛 𝑓 𝑙 is the value of the proxy

magnetization at its inflection point. To determine the location of the inflection point, one can use

the maximum of the derivative, which can be computed without finite difference as

𝑑⟨𝑀⟩
𝑑𝛽

= ⟨𝐸⟩⟨𝑀⟩ − ⟨𝐸 𝑀⟩,

(3.77)

where 𝐸 is the internal energy. As with other maxima, a cubic polynomial can be fitted to the

peak to extract the maximum 𝑑⟨𝑀⟩/𝑑𝛽|𝑚𝑎𝑥 and its location 𝛽𝑝𝑐. We then interpolate the proxy

magnetization to the point 𝛽𝑝𝑐 using a cubic polynomial to obtain ⟨𝑀⟩|𝑖𝑛 𝑓 𝑙 = ⟨𝑀⟩(𝛽𝑝𝑐).

The pseudocritical points 𝛽𝑝𝑐 (i.e.

the peak locations) for second-order transitions should

approach the infinite volume critical point 𝛽𝑐 as

𝛽𝑝𝑐 = 𝛽𝑐 +

𝑎
𝐿1/𝜈

.

(3.78)

To distinguish between second-order and BKT transitions, one can check whether the specific

heat diverges with volume. For a BKT transition, the specific heat does not diverge with volume,

and so Eq. (3.71) is not suitable for BKT transitions. However, this check may not be sufficient. If

the critical exponent 𝛼 is negative, then the specific heat plateaus also for second-order transitions.

See for example, [40]. For more on distinguishing BKT from first- and second-order transitions,

see [40, 41].

There are well-known scaling and hyperscaling relations, which relate the different critical

exponents. Their derivations can be found in textbooks such as Ref. [59]. In particular, we will

consider the relations

𝛾
𝜈
𝛼
𝜈
𝛾
𝜈

𝑑 −

= 2 − 𝜂
2
𝜈

=

− 𝑑

= 2

𝛽
𝜈

,

68

(3.79)

(3.80)

(3.81)

where 𝑑 = 2 in our case.

In a second-order transition, the critical exponent 𝜈 is defined by the scaling of the correlation

length 𝜉 ∝ 𝜏−𝜈, where 𝜏 ≡ (𝑇 −𝑇𝑐)/𝑇𝑐 = 𝛽𝑐/𝛽 − 1. However, for a BKT transition, 𝜉 diverges faster

than any power of 𝜏, and so we cannot define 𝜈 in the conventional way. See for example, [15],

where it is shown that for the 𝑋𝑌 model, 𝜉 ∝ 𝑒−𝑏𝜏−1/2 or [47], where it is stated that 𝜈 should be ∞

for 𝑞 = 5, 6.

One can define the exponent ˜𝜈 for a BKT transition [54] (we use tilde to distinguish from the

ordinary 𝜈) by the scaling

𝜉 ∝ 𝑒𝑏𝜏− ˜𝜈 ,

(3.82)

and use this to label the universality class of the system. For the 𝑋𝑌 model, ˜𝜈 = 1/2. Then the

pseudocritical point for a BKT transition should approach the infinite volume critical point as

𝛽𝑝𝑐 = 𝛽𝑐 +

𝑎
(ln 𝐿 + 𝑏)1/ ˜𝜈

.

(3.83)

This is different from second-order scaling. However, as noted in [54], fits to this form may be

unstable, and so this scaling relation may not be a good way to extract the critical exponent ˜𝜈. To

investigate the 𝑞 = 5 state clock model, they ended up using this equation with ˜𝜈 fixed at 1/2

𝛽𝑝𝑐 = 𝛽𝑐 +

𝑎
(ln 𝐿 + 𝑏)2

.

(3.84)

This form was used to find 𝛽𝑐, but even then it seemed unsuitable as it was only reliable for very

large volumes.

To extract the 𝛽, 𝛾, and 𝜂 critical exponents from BKT transitions, one can use the scaling

ansätze Eq. (3.72), (3.73), and (3.74), however, in principle, one should replace 𝜈 → ˜𝜈. One has

to be careful with the two BKT transitions in the clock models when 𝑞 ≥ 5. One may not be able

to use the proxy magnetization in all cases with Eq. (3.72) or (3.73). For example, for the second

transition in the 𝑞 = 5 clock model, [54] uses a “rotated magnetization” like Eq. (2.59) instead of

the proxy magnetization.

Note, for BKT transitions in the 𝑞 ≥ 5 clock models, the number of peaks in the specific heat

corresponds to the number of phase transitions, however, the location of the peaks do not correspond

69

to the transition temperatures [51]. Thus, the specific heat alone cannot be used to locate the critical

temperatures. For more details on finite-size scaling for BKT transitions, see [78].

3.8 MCMC in Practice

3.8.1 Outline of an MCMC Simulation

An MCMC simulation typically follows an outline like:

# Initialize everything
initialize()

# Equilibrate the lattice
for iequi=1 to nequi:
updating_sweep()

# Measurement sweeps
for imeas=1 to nmeas:

# Discards
for idisc=1 to ndisc:
updating_sweep()

save_or_measure()

# Finish up
finalize()

In the initialization stage, the model parameters (e.g. 𝛽, lattice dimensions, boundary conditions,

etc.) and run parameters (e.g. nequi, nmeas, ndisc, start_type, etc) must be read in, and the

lattice arrays must be initialized. At this stage, one chooses whether to do a hot start (i.e. lattice with

random values), a cold start (i.e. a lattice with uniform values), or to continue from a previously

saved lattice.

It takes some number of equilibrating sweeps for the Markov chain to reach the equilibrium

distribution. There are several ways to estimate the number of equilibrating sweeps that are needed

including:

1. Measure some observable and see how many sweeps it takes to settle down to some value

2. Even better, do a hot start and a cold start and see how many sweeps it takes for the

70

Figure 3.7 In these illustrations, the 2-state Potts model (aka the Ising model) is simulated on a
100x100 lattice using both cold (ordered) and hot (disordered) starts. On the left, the simulation is
performed at 𝛽 = 0.4, and we see that the lattice equilibrates quickly. On the right, the simulation
is performed in the ordered phase at 𝛽 = 0.8, and we see that at least one of the start types is having
trouble reaching equilibration. Since we know the exact energy density for the two-dimensional
Ising model on a finite lattice, we can tell that the ordered start is at the correct value. The random
start has significantly more trouble reaching equilibration. In general, it seems that for a simulation
performed in the ordered phase, the random start may take significantly longer to reach equilibration
than the ordered start [5].

measurements from these two runs to converge. See Figure 3.7

3. Do multiple runs with different RNG seeds and see how many sweeps it takes for the

measurements to converge

Measurements taken during the equilibration stage are generally discarded—they do not form part

of the sample of equilibrium measurements.

After equilibration, one can begin taking measurements. Here one has the choice of in situ or

ex situ measurements. In situ measurements are taken while the MCMC simulation is running. For

example, after an updating sweep, the energy, magnetization, and other measurements are taken

before proceeding to the next updating sweep. Ex situ measurements are taken outside of the MCMC

simulation. This requires the lattice configuration itself be written to file after each measurement

sweep. The choice of measurement type often comes down to the computational cost of the updating

sweep, the cost of each measurement, and I/O and file storage considerations. For classical spin

71

Figure 3.8 A time series study of magnetization for the 2-state Potts model (i.e. Ising model) on a
20x20 lattice. At the top left, we perform 165K heatbath sweeps at 𝛽 = 0.20. We see that the lattice
is not magnetized because 50% of the spins are in the 𝑞 = 0 state and 50% are in the 𝑞 = 1 state.
At the top right, we repeat the simulation but for 𝛽 = 0.44, which is the critical value of 𝛽 in the
infinite volume limit. Here we see a partially magnetized lattice (the spins tend to be aligned 80/20
instead of 50/50). We see that the lattice magnetization flips often between 𝑞 = 0 (red) dominating
80% of the lattice and 𝑞 = 1 (blue) dominating 80% of the lattice. At the bottom left, we repeat
with 𝛽 = 0.48. Now the lattice is magnetized more strongly with spins aligned 95/5. We still see
magnetization flips, but not as frequently. Finally, at the bottom right, we repeat the simulation
with 𝛽 = 0.80. Now the lattice is completely magnetized with ∼ 100% of the spins staying in the
𝑞 = 0 state.

systems in two dimensions, one typically does in situ measurements. The updating sweep and

the measurements are fairly cheap, so a large number of them can be performed quickly. Writing

the lattice to file after each sweep might slow it down significantly, and the storage requirements

would be much larger. On the other hand, in lattice QCD for example, the updating sweeps and

measurements are much more costly, and so it makes sense to save the lattice configuration to file

after an updating sweep, and perform the measurements separately.

Some researchers prefer to immediately histogram in situ measurements. The benefit is that

it can reduce the memory and storage requirements, and most things of interest—e.g. means

and standard deviations—can be calculated from the histograms. Furthermore, the shape of the

histograms can signal the presence of phase transitions as was seen in Figure 2.7. Nevertheless,

72

some useful information is lost, and it may be preferable to store the entire time series observables

rather than histogramming them. To understand how the Markov chain is behaving it is sometimes

useful to examine the time series observable (i.e. the observable plotted versus Monte Carlo sweep

number). An example of this is shown in Figure 3.8 for the 2-state Potts model. Such an examination

can help inform the researcher on the statistical properties of the MCMC simulation. For example,

looking at the time series is a quick way to check that the simulation has reached equilibrium. One

can always histogram the time series later if needed.

When there is a large autocorrelation time, the effective statistics is significantly reduced, and

it may not be worthwhile to measure and save the observable on every sweep. In such cases, one

often discards some number of sweeps for each measured sweep. In practice, the autocorrelation

time may not be known.

If an estimate is known, the discards per measurement sweep can be

set to mitigate the autocorrelations, but one should still perform an autocorrelation analysis of the

saved measurements. If there remains any residual autocorrelation, it can be mitigated by blocking

techniques as discussed in Sections 3.3 and 3.4.

3.8.2 Planning and Organization

The goal of MCMC lattice studies is varied and ranges from studying and developing new

techniques and tools to characterizing some new spin or gauge model. Part of the process of

understanding a new lattice model is to map out the phase diagram. This requires locating the

critical values of the parameters and then doing a finite-size scaling analysis after performing

simulations at those critical points on increasingly large lattices. Characterizing the phase diagram

of even a relatively simple spin system can take hundreds of thousands of CPU hours and generate

terabytes of data to be stored and analyzed. For example, the work described in Chapter 4 required

the following resources:

• 30K+ individual MCMC simulations were required just to perform a basic scan of the

parameter space at a single lattice size

• Up to 800 nodes on a computing cluster were running simultaneously (trivial parallelization)

73

• Large autocorrelations required Markov chains of length billions in some cases

• Several terabytes of hard disk space were used for the saved time series observables (many

more were discarded)

• 500K+ CPU hours were used on the computing cluster

This is only the resources used from the MCMC side, i.e.

it does not include the computing

resources needed for the complimentary approach which involved tensor methods. Such a study

therefore requires some planning and setup. Given the computing time involved, such work typically

takes place on a computing cluster where many nodes can be used in a trivial kind of parallelization

to simultaneously run many MCMC simulations at different parameter values. For runs on large

lattices, which take a long time, real parallelization may be employed to spread the work of a single

simulation over multiple cores and/or nodes. This of course assumes a lattice codebase capable of

this kind of parallelization.

Typically, the main simulation code is written in a fast compiled language like C++. During

or when the simulation completes, the time series measurements are written to a file on disk.

Later, they can be analyzed by reading the time series into another script which computes e.g. the

integrated autocorrelation time and the jackknife mean and error bar of the observable. These

analysis scripts may be a combination of shell scripts, C++ scripts, and Python scripts. During the

simulation, one typically records some fundamental observables which can be used to construct

other observables. For example, for a two-dimensional spin system, one might record the time

series magnetization components 𝑀𝑥 and 𝑀𝑦. During the analysis stage, additional observables

such as the magnetization and the Binder cumulant can be constructed from these components and

analyzed.

File organization becomes important when one has to deal with gigabytes or terabytes of data.

The author found it useful to use the following top-level directories:

• /CODEBASE/: Located in the user’s home directory on the computing cluster, this directory

contains the lattice codebase itself and all analysis scripts

74

• /RUNS/: Located on the cluster’s scratch space, this is the directory where the actual simu-

lations are run. Intermediate files and error outputs are left here to be auto-deleted by the

cluster after some number of days. If a simulation fails, the error files should be examined

before deletion

• /STORAGE/: This is the long-term storage space where the simulation writes the main time

series observables. This is where the vast majority of storage space is needed—space which

may exceed the home directory allotment

• /OUTPUTS/: Located in the user’s home directory, this directory contains summary infor-

mation obtained from the analysis scripts. That is, after analyzing a time series (stored in

/STORAGE/), the results would be written to /OUTPUTS/

It may be useful to use a common directory structure within /RUNS/, /STORAGE/, and /OUTPUTS/.

For example, for the Ising model with two parameters—inverse temperature 𝛽 and external field

ℎ—the following structure could be used: [modelname]_[ndim]_[length]_[id]/[beta]_[h]/.

For example, a run on a 16 × 16 lattice at 𝛽 = 1.3 and ℎ = 0 would take place in the “run_path”

directory /ising_nd2_nl16_1/1p3000_0p0000/. The id tag serves to distinguish runs which are

otherwise identical, e.g. they differ only in the random seed used. This directory structure makes

it easy to analyze and gather results versus 𝛽. For example, to get energy versus 𝛽 for ℎ = 0, one

would simply cd into /STORAGE/ising_nd2_nl16_1/, loop over all directories /*_0p0000/ therein,

and extract the energy from each time series file. The run itself takes place at /RUNS/run_path with

the time series observables written to /STORAGE/run_path. During analysis, summary results are

written to /OUTPUTS/run_path.

75

CHAPTER 4

THE EXTENDED-O(2) MODEL

4.1

Introduction

In recent years, the idea of using quantum computers or quantum simulation experiments to

approach the real-time evolution or the finite-density behavior of lattice models of interest for

high-energy physics has gained considerable interest [79–88]. As the current noisy intermediate

scale quantum (NISQ) devices that are available to implement this research program have a very

limited number of quantum computing units, such as qubits, trapped ions or Rydberg atoms,

it is essential to optimize the discretization procedure. Starting from the standard Lagrangian

formulation of lattice field theory models with continuous field variables, one can either discretize

the field variables [89–91] used in the path integral, expand the Boltzmann weights using character

expansions [81, 92, 93], or use the quantum link method [94, 95].

Models with continuous Abelian symmetries are of great physical interest. Besides the elec-

tromagnetic interactions of charged particles in 3+1 dimensions, this also includes models where

a mass gap is dynamically generated [96, 97] or a Berezinskii-Kosterlitz-Thouless (BKT) transi-

tion [15, 36, 37] occurs. For models with a 𝑈 (1) symmetry, the character expansion mentioned

above is simply the Fourier series. It has been shown [3, 98] that the truncation of these series

preserves the original symmetry. On the other hand, the Z𝑞 clock approximation of the integration

over the circle only preserves the Z𝑞 discrete subgroup. A recent proposal applies the Z𝑞 clock

approximation to the simulation of the Abelian gauge theory in 2 + 1 dimensions, where transforma-

tions between the electric representation and the magnetic representation can significantly reduce

the required computational resources [99]. In order to decide how good the Z𝑞 approximation is in

a variety of situations, it is useful to build a continuous family of models interpolating among the

various possibilities.

Here we focus on the case of the O(2) nonlinear sigma model in 1+1 dimensions. This model

was key to understanding the BKT transition [15, 36, 37, 100] and the corresponding Z𝑞 clock

model has been studied extensively [25, 43–57, 78, 101–104]. We propose to interpolate among

76

these models by starting with the standard O(2) Hamiltonian and introducing a symmetry-breaking

term,

Δ𝐻 (ℎ𝑞, 𝑞) = −ℎ𝑞

cos(𝑞𝜑𝑥).

∑︁

𝑥

(4.1)

When 𝑞 is an integer, if we take the limit ℎ𝑞 → ∞, we recover the Z𝑞 clock model. For the rest

of the discussion, it is important to realize that the O(2)-symmetric Hamiltonian is 2𝜋-periodic for

all the 𝜑𝑥 variables. In contrast, Δ𝐻 has a 2𝜋/𝑞 periodicity. When 𝑞 is an integer, if we apply the

shift 𝑞 times we obtain the periodicity of the O(2) action. In order to interpolate among the clock

models, we will consider noninteger values of 𝑞 while keeping a fixed 𝜑 interval of length 2𝜋. The

model and the effect of the symmetry breaking are discussed later both in the standard Lagrangian

and tensor formulations.

The idea of having a doubly continuous set of models is interesting from a theoretical point of

view but also from a quantum simulation point of view. If we attempt to quantum simulate these

models using Rydberg atoms as in Refs. [105–107], it is possible to tune the ratio 𝑅𝑏/𝑎 of the radius

for the Rydberg blockade and the lattice spacing, as well as local chemical potentials continuously.

This allowed interpolations among Z𝑞 phases for different integer values of 𝑞 [107]. Sequences of

clock models also appear in models for nuclear matter when the number of colors is varied [108].

It is often a difficult task to detect BKT transitions in the quantum Hamiltonian approach, as

it is hard to find a good indicator of BKT transitions that has a clear discontinuity, peak or dip.

The equivalence of the path integral formulation and statistical mechanics can be used to access

universal features and detect phase transitions in statistical mechanics based on Monte Carlo (MC)

simulations and tensor renormalization group (TRG) calculations. The Markov chain MC (MCMC)

simulations efficiently explore the typical set of the physical configurations. MC calculations

use the universal jump in the helicity modulus [109] as an indicator for BKT transitions. But

ambiguities in the definition of the helicity modulus in the Z5 clock model can result in controversial

conclusions [44, 110]. The TRG [111, 112] calculations provide a coarse-grained theory where the

size of the lattice spacing doubles at each step. If the truncations performed are under control,

one can go to the thermodynamic limit quickly. Calculation of the magnetic susceptibility in the

77

presence of a weak external field is a universal method to detect critical points that can be easily

implemented in the TRG. However, it does not show a peak to indicate the large-𝛽 BKT transition

in the five-state clock model [50, 113]. The study of the ℎ𝑞 → ∞ limit with fractional 𝑞 not only

provides us a clear picture of what phases the symmetry-breaking term will drive the 𝑋𝑌 model

to, paving the way to discussions for the full phase diagram at finite ℎ𝑞, but also brings us a new

tool to detect BKT transitions in Z𝑛 models.

In contrast, the calculation of the specific heat at

increasing volume allows us to discriminate between a second-order phase transition—where it

diverges logarithmically with the volume in the Ising case—and a BKT transition or a crossover.

To define the models that we consider in this chapter we start with the two-dimensional classical

𝑂 (2) nonlinear sigma model, or 𝑋𝑌 model, where the spin degrees of freedom (cid:174)𝑆 are unit vectors

whose possible directions are confined to a plane. They reside on the sites of a two-dimensional

lattice of 𝑁 sites. The Hamiltonian is

𝐻𝑂 (2) = −

𝑁
∑︁

𝑥=1

2
∑︁

𝜇=1

(cid:169)
(cid:173)
(cid:171)

(cid:174)𝑆𝑥 · (cid:174)𝑆𝑥+ ˆ𝜇 + (cid:174)ℎ · (cid:174)𝑆𝑥(cid:170)
(cid:174)
(cid:172)

,

(4.2)

where the sum on 𝑥 is over the sites of the two-dimensional lattice, and on 𝜇 = 1, 2 over the directions.

The field, (cid:174)ℎ, is a uniform constant external magnetic field. It is convenient to parameterize the

spins (cid:174)𝑆 with a single angle 𝜑 ∈ [𝜑0, 𝜑0 + 2𝜋). The Hamiltonian then takes the form

𝐻𝑂 (2) = −

∑︁

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥) − ℎ ∑︁

cos(𝜑𝑥 − 𝜑ℎ),

𝑥,𝜇

𝑥

(4.3)

where ℎ = | (cid:174)ℎ| and 𝜑ℎ is the direction of the external field that in the absence of other symmetry-

breaking terms can be set to zero for convenience.

Next, let us extend the model by introducing a term that can favor certain values of the angle:

𝐻ext-𝑂 (2) = −

∑︁

𝑥,𝜇

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥) − ℎ𝑞

∑︁

cos(𝑞𝜑𝑥) − ℎ ∑︁

cos(𝜑𝑥 − 𝜑ℎ).

(4.4)

𝑥

𝑥

We call the model with the Hamiltonian (4.4) the “extended-O(2)” model.

The canonical partition function is

∫

𝑍 =

𝑑𝑈 𝑒−𝛽𝐻 (𝑈),

78

(4.5)

where the integration is over lattice configurations 𝑈. After parameterizing the spins (cid:174)𝑆 by an angle

𝜑, the partition function is

𝑍 =

∫ 𝜑0+2𝜋

(cid:214)

𝜑0

𝑑𝜑𝑥
2𝜋

𝑒−𝛽𝐻.

(4.6)

𝑥
Specific cases of the Extended-O(2) model have been studied fairly extensively. For 𝑞 = 2, 3,

an early normalization group (RG) analysis [114] showed that the model at finite ℎ𝑞 should be in

the Ising and 3-state Potts universality class respectively. However, in [115] they show that the

critical exponents vary with ℎ𝑞, and conclude that for 𝑞 = 3, there is a BKT phase for sufficiently

small ℎ𝑞. The 𝑞 = 4 with finite ℎ𝑞 case, sometimes referred to as the 𝑋𝑌 ℎ4 model, has been studied

extensively due to its relevance to physical systems [116, 117]. The RG analysis [114] showed

there to be a second-order phase transition with nonuniversal critical exponents which vary with

ℎ𝑞. The BKT phase of the O(2) was recovered only for ℎ𝑞 = 0. In [118], numerical MC studies

of this case showed that the critical exponent 𝜈 does vary with ℎ𝑞. This was supported further

by [40, 119], and more discussion can be found in [116, 120]. However, in [41], it was found that

for sufficiently small ℎ𝑞, the second-order phase transition for 𝑞 = 4 is replaced by BKT transitions.

This is supported by [115]. However, recent work [117, 121] suggests that the BKT phase at finite

ℎ𝑞 is essentially a finite-size effect and might not persist to infinite volumes. For 𝑞 ≥ 5, the RG

analysis [114], showed there should be a BKT phase between a low-temperature ferromagnetic

phase and a high-temperature paramagnetic phase. For 𝑞 = 6, this is supported by numerical MC

studies on triangular lattices [40]. The 𝑞 = 5 case was shown [122] to have two transitions for any

finite ℎ𝑞.

In the case ℎ𝑞 = 0, the Extended-O(2) model reduces to the ordinary 𝑂 (2) model, which has

been studied extensively [18, 21, 24, 39, 123–126]. In the limit ℎ𝑞 → ∞, the Extended-O(2) model

reduces to the Extended 𝑞-state clock model. With ℎ𝑞 → ∞ and integer 𝑞, it reduces to the ordinary

𝑞-state clock model, which has been studied extensively [25, 43–57, 78, 101–104].

Previous work (cited above) was restricted to integer values of 𝑞, and typically focused only on

one or two values. Furthermore, many focused only on large or small values of ℎ𝑞. In this thesis,

we perform a comprehensive study of the Extended-O(2) model over a large parameter space with

79

𝛽, ℎ𝑞 ∈ R ≥ 0 and with 𝑞 ∈ R. Much of the material in this chapter has been published [127–129].

4.2 The ℎ𝑞 → ∞ Phase Diagram

4.2.1

Introduction

For integer 𝑞 the limit ℎ𝑞 → ∞ forces the spin angles to take the values 𝜑(𝑘)

𝑥 = 2𝜋𝑘/𝑞 with

𝑘 ∈ Z. Thus, while ℎ𝑞 = 0 corresponds to the O(2) model, ℎ𝑞 → ∞ corresponds to the 𝑞-state

clock model. The action defined in Eq. (4.4) is also valid for noninteger 𝑞 and we therefore consider

Eq. (4.4) as our definition of the extension of the 𝑞-state clock model to noninteger 𝑞 in the ℎ𝑞 → ∞

limit. In this case the angle 𝜑(𝑘)

𝑥

takes the values

𝜑0 ≤ 𝜑(𝑘)

𝑥 =

2𝜋𝑘
𝑞

< 𝜑0 + 2𝜋,

(4.7)

with 𝑘 ∈ Z and some choice of domain [𝜑0, 𝜑0 + 2𝜋). By varying 𝜑0, we can obtain different sets of

angles that are equivalent to either 𝑘 = 0, 1, . . . , ⌊𝑞⌋ (case 1) or 𝑘 = 0, 1, . . . , ⌊𝑞⌋ − 1 (case 2), since

in the ℎ𝑞 → ∞ limit—in the absence of an external field—the action only depends on the relative
angle between nearest-neighbor sites Δ𝜑(𝑘)

(see Appendix B.1). Case 2 just has one

𝑥,𝜇 = 𝜑(𝑘)

𝑥+ ˆ𝜇 − 𝜑(𝑘)

𝑥

fewer angle than case 1. As shown in Fig. 4.1, the angular distance between two adjacent values of

𝜑𝑥 on a circle takes two values: 2𝜋(𝑞 − ⌊𝑞⌋)/𝑞 < 2𝜋/𝑞 for case 1, and 2𝜋/𝑞 < 2𝜋(1 + 𝑞 − ⌊𝑞⌋)/𝑞

for case 2. These values including 0 have the largest Boltzmann weights in the partition function.

The “leftover angle” is

˜𝜙 ≡ 2𝜋

(cid:18)

1 −

(cid:19)

,

⌊𝑞⌋
𝑞

(4.8)

in case 1 and 2𝜋/𝑞 + ˜𝜙 in case 2. With the choice 𝜑0 = 0, we have case 1, while choosing 𝜑0 = −𝜋

is equivalent to case 2 (1) for odd (even) ⌊𝑞⌋. At noninteger 𝑞 the Z𝑞 symmetry is explicitly broken

since the action is not invariant under the operation 𝑘 → mod (𝑘 + 1, ⌊𝑞⌋). But there is still a Z2

symmetry because the action is invariant under the operation 𝑘 → ⌊𝑞⌋ − 𝑘.

We also consider the limit ℎ𝑞 → ∞ directly by simply restricting the values of the originally

continuous angle 𝜑 to the values given in Eq. (4.7):

𝐻ext-𝑞 = −𝛽 ∑︁

𝑥,𝜇

cos(𝜑(𝑘)

𝑥+ ˆ𝜇 − 𝜑(𝑘)

𝑥

) − ℎ ∑︁

cos(𝜑(𝑘)

𝑥 − 𝜑ℎ).

𝑥

(4.9)

80

Figure 4.1 Arrows indicate the allowed spin orientations for the extended 𝑞-state clock model with
the choice 𝜑0 = 0 (left) and 𝜑0 = −𝜋 (right). In this example, 𝑞 = 5.5. Blue and red denote the
leftover angles in the two cases. See Eq. (4.8).

We call the model (4.9) the “extended 𝑞-state” clock model for all values of 𝑞 and the “fractional-

𝑞-state” clock model for fractional values of 𝑞. For integer 𝑞 the extended 𝑞-state clock model

reduces to the ordinary 𝑞-state clock model. Numerical results presented in later sections are from

the extended 𝑞-state clock model.

For the extended 𝑞-state clock model, the degrees of freedom are discrete, so we write the

partition function now as

𝑍 =

∑︁

(cid:214)

𝜑 (𝑘 )

𝑥

𝑑𝜑𝑥
2𝜋

𝑒−𝛽𝐻.

(4.10)

With the models defined we turn to observables. The main observables that we compute to study

the critical behavior are the internal energy, magnetization, and their corresponding susceptibilities.

These quantities are defined in the same way for both the continuous and discrete angle cases. The

internal energy is defined as

(cid:42)

𝐸 = ⟨𝐻ext-𝑞⟩ =

−

∑︁

𝑥,𝜇

(cid:43)

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥)

= −

𝜕
𝜕 𝛽

ln 𝑍,

where ⟨· · · ⟩ denotes the ensemble average. The specific heat is

𝐶 =

−𝛽2
𝑁

𝜕𝐸
𝜕 𝛽

𝛽2
𝑁

=

(⟨𝐸 2⟩ − ⟨𝐸⟩2).

In addition we consider the magnetization

(cid:174)𝑀 =

𝜕
𝜕 (cid:174)ℎ

ln 𝑍 =

(cid:42)

∑︁

(cid:43)

(cid:174)𝑆𝑥

.

𝑥

81

(4.11)

(4.12)

(4.13)

2π/q2π/q2π/q2π/q2π/q2π(cid:16)1−bqcq(cid:17)≡˜φϕ0=0(case1)2π/q2π/q2π/q2π/q2π/q+˜φϕ0=−π(case2)The magnetic susceptibility defined in a manifestly O(2)-invariant way is

𝜒 (cid:174)𝑀 =

1
𝑁

𝜕 (cid:174)𝑀
𝜕 (cid:174)ℎ

(cid:16)

1
𝑁

=

⟨ (cid:174)𝑀 · (cid:174)𝑀⟩ − ⟨ (cid:174)𝑀⟩ · ⟨ (cid:174)𝑀⟩

(cid:17)

.

(4.14)

We note that in Monte Carlo simulations at zero external field in a finite system in the absence

of explicit symmetry breaking terms the definition of the spontaneous magnetization (4.13) gives

⟨ (cid:174)𝑀⟩ = 0. In such situations one often resorts to using a proxy observable [45–47, 50, 52, 78]

| (cid:174)𝑀 | =

(cid:42)(cid:12)
(cid:12)
∑︁
(cid:12)
(cid:12)
(cid:12)

𝑥

(cid:174)𝑆𝑥

(cid:43)

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

in place of (cid:174)𝑀. The corresponding susceptibility is

𝜒

| (cid:174)𝑀 | =

1
𝑁

(cid:16)

⟨| (cid:174)𝑀 |2⟩ − ⟨| (cid:174)𝑀 |⟩2(cid:17)

.

(4.15)

(4.16)

While one expects that | (cid:174)𝑀 | indicates the same critical behavior, in general, | (cid:174)𝑀 | is numerically

different from (cid:174)𝑀 except deep in the ordered phase. Nevertheless, we expect both definitions of the

magnetic susceptibility—Eqs. (4.14) and (4.16)—possess the same critical behavior, and can be

relied upon to extract universal features. In the next section we detail the methods used to study

the observables defined above.

4.2.2 Methods

4.2.2.1 Monte Carlo

The allowed spin orientations in the extended 𝑞-state clock model, given by Eq. (4.7), are

discrete, and the model can be studied using a heatbath algorithm. The heatbath algorithm is a

MCMC algorithm that drives the lattice toward equilibrium configurations by choosing the new

spin at each update according to the probability distribution defined by its neighboring spins. We

adapted Fortran code developed by Bernd Berg for the standard Potts model [5].

Initial exploration of the extended 𝑞-state clock model was performed via MC on a 4 × 4 lattice

with zero external magnetic field.

We used a heatbath algorithm to study the extended 𝑞-state clock model on a 4 × 4 lattice. The

general structure of the Monte Carlo algorithm is as follows:

82

# Equilibration:
for iequi = 1 to nequi do

heatbath sweep

end

# Measurements:
for irpt = 1 to nrpt do

for imeas = 1 to nmeas do

for idisc = 1 to ndisc do

heatbath sweep

end
measure observables

end
save measurements to file

end

Parameters used in the primary data production for the extended 𝑞-state clock model are given in

Table 4.1. The total number of heatbath sweeps performed (not including equilibration) is nrpt

× nmeas × ndisc. The number of measurements taken is nrpt × nmeas. Ensemble averages and

error bars for the energy and magnetization were calculated after binning with nrpt bins each of

size 𝑛𝑚𝑒𝑎𝑠. For specific heat and magnetic susceptibility, the measurements were binned and

jackknifed.

We studied the extended 𝑞-state clock model on a 4 × 4 lattice with zero external magnetic field.

For each 𝛽, we initialized to a random lattice (hot start) then we performed 215 equilibrating sweeps

followed by 222 measurement sweeps. Each measurement sweep was followed by 28 discarded

sweeps. We calculated the energy density and specific heat as defined in Eqs. (4.11) and (4.12). We

calculated the proxy magnetization and susceptibility as defined in Eqs. (4.15) and (4.16). Results

for 1 < 𝑞 < 6 are shown in Figs. 4.2–4.6.

In the large-𝛽 (low temperature) regime, the heatbath algorithm has difficulty appropriately

sampling the configuration space. At large-𝛽, the lattice freezes along a particular magnetization

direction. When 𝑞 ∈ Z all magnetization directions are equivalent, however, when 𝑞 ∉ Z, the

discrete rotational symmetry is broken and the direction of magnetization matters. The configu-

ration space is split into two sectors with different thermodynamic properties. In one sector, the

magnetization is in the direction 0 or − ˜𝜙, defined in Eq. (4.8), where relatively large fluctuations

83

Table 4.1 The Monte Carlo parameters used in the primary data production for the extended 𝑞-
state clock model. See Figs. 4.2–4.6. Here, 𝑞 refers to the 𝑞-state clock model, ℎ is the external
magnetic field, nequi is the number of equilibrating heatbath sweeps used, nrpt is the repetitions
of measurement sweeps, nmeas is the number of measurement sweeps, and ndisc is the number of
sweeps discarded between each measurement sweep.

𝑞

𝛽

ℎ

[1.1, 6.0] with Δ𝑞 = 0.1
[0.0, 2.0] with Δ𝛽 = 0.01
0.0

Lattice

Start type

𝑛𝑒𝑞𝑢𝑖

𝑛𝑟 𝑝𝑡

𝑛𝑚𝑒𝑎𝑠

𝑛𝑑𝑖𝑠𝑐

4 × 4
Random (hot)
215
26
216
28

Figure 4.2 Monte Carlo results for the extended 𝑞-state clock model on a 4 × 4 lattice for 1.1 ≤
𝑞 ≤ 2.0. The top panel shows energy density and specific heat, and the bottom panel shows proxy
magnetization and magnetic susceptibility. Statistical error bars are omitted since they are smaller
than the line thickness.

84

−2.0−1.5−1.0−0.50.0E/Nq=1.1q=1.2q=1.3q=1.4q=1.5q=1.6q=1.7q=1.8q=1.9q=2.00.00.51.01.5C0.00.51.01.52.0β0.000.250.500.751.00|~M|/N0.00.51.01.52.0β0.00.51.01.5χ|~M|Figure 4.3 Monte Carlo results for the extended 𝑞-state clock model on a 4 × 4 lattice for 2.1 ≤
𝑞 ≤ 3.0. The top panel shows energy density and specific heat, and the bottom panel shows proxy
magnetization and magnetic susceptibility. Statistical error bars are omitted since they are smaller
than the line thickness. Dashed lines indicate regions where we have data but we do not have the
uncertainty fully under control.

Figure 4.4 Monte Carlo results for the extended 𝑞-state clock model on a 4 × 4 lattice for 3.1 ≤
𝑞 ≤ 4.0. The top panel shows energy density and specific heat, and the bottom panel shows proxy
magnetization and magnetic susceptibility. Statistical error bars are omitted since they are smaller
than the line thickness. Dashed lines indicate regions where we have data but we do not have the
uncertainty fully under control.

85

−2.0−1.5−1.0−0.50.0E/Nq=2.1q=2.2q=2.3q=2.4q=2.5q=2.6q=2.7q=2.8q=2.9q=3.00.00.51.01.5C0.00.51.01.52.0β0.000.250.500.751.00|~M|/N0.00.51.01.52.0β0.000.250.500.751.00χ|~M|−2.0−1.5−1.0−0.50.0E/Nq=3.1q=3.2q=3.3q=3.4q=3.5q=3.6q=3.7q=3.8q=3.9q=4.00.00.51.01.5C0.00.51.01.52.0β0.000.250.500.751.00|~M|/N0.00.51.01.52.0β0.00.20.40.6χ|~M|Figure 4.5 Monte Carlo results for the extended 𝑞-state clock model on a 4 × 4 lattice for 4.1 ≤
𝑞 ≤ 5.0. The top panel shows energy density and specific heat, and the bottom panel shows proxy
magnetization and magnetic susceptibility. Statistical error bars are omitted since they are smaller
than the line thickness. Dashed lines indicate regions where we have data but we do not have the
uncertainty fully under control.

Figure 4.6 Monte Carlo results for the extended 𝑞-state clock model on a 4 × 4 lattice for 5.1 ≤
𝑞 ≤ 6.0. The top panel shows energy density and specific heat, and the bottom panel shows proxy
magnetization and magnetic susceptibility. Statistical error bars are omitted since they are smaller
than the line thickness.

86

−2.0−1.5−1.0−0.50.0E/Nq=4.1q=4.2q=4.3q=4.5q=4.5q=4.6q=4.7q=4.8q=4.9q=5.00.00.51.01.5C0.00.51.01.52.0β0.000.250.500.751.00|~M|/N0.00.51.01.52.0β0.00.20.40.6χ|~M|−2.0−1.5−1.0−0.50.0E/Nq=5.1q=5.2q=5.3q=5.4q=5.5q=5.6q=5.7q=5.8q=5.9q=6.00.00.51.01.5C0.00.51.01.52.0β0.000.250.500.751.00|~M|/N0.00.51.01.52.0β0.00.20.40.6χ|~M|Figure 4.7 The measured energy density for 𝑞 = 4.5 at different number of heatbath sweeps for
different random number generator (RNG) seeds. The vertical axis is the energy density. The
horizontal axis gives the number of heatbath sweeps used (not including equilibration sweeps).
The different colors indicate runs with different seeds for the RNG. All runs were performed at
𝛽 = 2.5 with hot-started 4 × 4 lattices. This shows that the configuration space consists of two
distinct sectors, and the heatbath algorithm has difficulty appropriately sampling both sectors unless
the number of heatbath sweeps is taken very large.

may still be possible due to the spins being able to flip over the relatively small “leftover angle”.

In the other sector, the magnetization is in one of the other directions where fluctuations are less

likely. To appropriately sample the configuration space one has to use very large statistics or run

multiple heatbath streams at the same parameters but with different seeds for the random number

generator. In Fig. 4.7, we show an example of this phenomenon for 𝑞 = 4.5 and 𝛽 = 2.5 on a 4 × 4

lattice. In this example, we need ≳ 232 heatbath sweeps to adequately sample both sectors. This

MC slowdown makes it difficult to study larger lattices, and is a strong motivation for using an

alternative method such as TRG.

The Monte Carlo slowdown is illustrated by an explosion of the integrated autocorrelation time

in the intermediate-𝛽 regime. For an observable 𝑂, an estimator of the integrated autocorrelation

time is given by

˜𝜏𝑂,𝑖𝑛𝑡 = 1 + 2

𝑇
∑︁

𝑡=1

𝐶 (𝑡)
𝐶 (0)

(4.17)

where 𝐶 (𝑡) = ⟨𝑂𝑖𝑂𝑖+𝑡⟩ − ⟨𝑂𝑖⟩⟨𝑂𝑖+𝑡⟩ is the correlation function between the observable 𝑂 measured

at Markov times 𝑖 and 𝑖 + 𝑡. The integrated autocorrelation time 𝜏𝑂,𝑖𝑛𝑡 is estimated by finding a

window in 𝑡 for which ˜𝜏𝑂,𝑖𝑛𝑡 is nearly independent of 𝑡. The integrated autocorrelation time for

87

-2-1.98-1.96-1.94-1.92-1.9-1.88-1.86220222224226228230232〈E〉/ VHeatbath SweepsEﬀect of Diﬀerent Random Seedsiseed1=1iseed1=2iseed1=3iseed1=4iseed1=5iseed1=6iseed1=7Figure 4.8 The integrated autocorrelation time of the energy density for several 𝑞 on a 4 × 4 lattice
using a heatbath algorithm. At large 𝛽, the integrated autocorrelation time 𝜏𝑖𝑛𝑡 grows abruptly when
𝑞 ∉ Z. Note the log scale on the vertical axis. Connecting lines are included to guide the eyes. The
rapid increase of the integrated autocorrelation time for noninteger 𝑞 occurs because the MCMC
approach is having difficulty sampling the full configuration space. Details are given in the main
text.

4 < 𝑞 ≤ 5 is shown in Fig. 4.8. The values of 𝑇 needed to extract these points are given in

Fig. 4.9. The rapid increase of the integrated autocorrelation times for noninteger 𝑞 we believe

occurs because the MCMC approach is having difficulty sampling the full configuration space as

described above. This effect is discussed in much more detail in Section 4.3.7.

To mitigate the effect of autocorrelation in our results, we discarded 28 heatbath sweeps between

each saved measurement. The saved measurements were then binned (i.e. preaveraged) with bin

size 216 before calculating the means and variances.

4.2.2.2 Tensor Renormalization Group (TRG)

For this model, the heatbath approach suffers from a slowdown that makes it difficult to study the

large-𝛽 regime already on very small lattices. An alternative approach, the tensor renormalization

group (TRG), which does not suffer from this slowdown, was used to study the model on much

larger lattices and in the thermodynamic limit. This allows us to perform finite-size scaling and

characterize the phase transitions in the extended 𝑞-state clock models. The TRG results are

validated by comparison with exact and Monte Carlo results on small lattices (see Appendix B.2).

88

0.00.51.01.52.0β100101102103τintq=4.1q=4.2q=4.3q=4.4q=4.5q=4.6q=4.7q=4.8q=4.9q=5.0Figure 4.9 The values of 𝑇 defined in Eq. (4.17) needed to extract the integrated autocorrelation
times shown in Fig. 4.8. The vertical axis gives the values of 𝑞 and the horizontal axis gives the
values of 𝛽. Blank/white regions in this heatmap indicate cases where 𝑇 > 216 is needed to get
a reliable estimate of the integrated autocorrelation time. These were not attempted due to the
computational cost.

The TRG methods are not exact. Because the bond dimension, 𝐷bond, of the coarse-grained tensor

increases exponentially with the renormalization group (RG) steps, a truncation for 𝐷bond must be

applied to avoid uncontrolled growth of memory needs on classical computers. For noncritical

phases, the fixed-point tensor of the RG flow has small 𝐷bond, but a larger bond dimension is

needed to have the correct RG flow near the critical point. It has been reported that TRG methods

using 𝐷bond = 40 can locate the phase transition point with an error of order 10−4 for the Ising

model [113], and of order 10−3 for the O(2) model [24, 113] and the clock models [50, 113].

To perform TRG calculations, we need to express the partition function as a contraction of a

tensor network. We rewrite the weight of each link by a singular value decomposition (SVD):

𝑒 𝛽 cos(cid:104) 2 𝜋

𝑞 (𝑘 𝑥+ ˆ𝜇−𝑘 𝑥)(cid:105)

=

⌊𝑞⌋
∑︁

𝑛𝑥,𝜇=0

𝑈𝑘 𝑥+ ˆ𝜇𝑛𝑥,𝜇𝐺𝑛𝑥,𝜇𝑉𝑘 𝑥𝑛𝑥,𝜇 .

(4.18)

Then we sum over the original 𝑘 indices and the partition function can be expressed in the dual

space from the expansion in terms of 𝑛 indices

𝑍 =

∑︁

(cid:214)

𝑇𝑙𝑟𝑑𝑢,

{𝑛}

𝑥

89

(4.19)

0.00.20.40.60.81.01.21.41.61.82.0β4.14.24.34.44.54.64.74.84.95.0q29210211212213214215216Twhere 𝑙 = 𝑛𝑥− ˆ𝑠,𝑠, 𝑟 = 𝑛𝑥,𝑠, 𝑑 = 𝑛𝑥− ˆ𝜏,𝜏, 𝑢 = 𝑛𝑥,𝜏 for each site 𝑥, and the local rank-four tensor is

defined as

𝑇𝑙𝑟𝑑𝑢 = √︁𝐺𝑙𝐺𝑟𝐺 𝑑𝐺𝑢𝐶𝑙𝑟𝑑𝑢,

𝐶𝑙𝑟𝑑𝑢 =

⌊𝑞⌋
∑︁

𝑘 𝑥=0

𝑒ℎ cos(cid:16) 2 𝜋

𝑞 𝑘 𝑥−𝜓ℎ

(cid:17)

𝑈𝑘 𝑥𝑙𝑉𝑘 𝑥𝑟𝑈𝑘 𝑥 𝑑𝑉𝑘 𝑥𝑢.

(4.20)

(4.21)

Notice that for integer 𝑞, the matrix 𝑈 = 𝑉 −1 can be chosen as 𝑈𝑘𝑛 = exp(𝑖2𝜋𝑘𝑛/𝑞), then if ℎ = 0,

the tensor 𝐶𝑙𝑟𝑑𝑢 becomes a 𝛿-function that gives a Z𝑞 selection rule for values of 𝑛:

mod (cid:0)𝑛𝑥− ˆ𝑠,𝑠 + 𝑛𝑥− ˆ𝜏,𝜏 − 𝑛𝑥,𝑠 − 𝑛𝑥,𝜏, 𝑞(cid:1) = 0.

(4.22)

The tensor reformulation of the expectation value of a local observable can be obtained in the

same way. For example, the first component of the magnetization is equal to the expectation value

of cos (𝜑) at an arbitrary site 𝑥0, 𝑚1 = ⟨cos(𝜑(𝑘)

𝑥0 )⟩, which can be expressed as

(cid:205)

𝑚1 =

{𝑛} 𝑇 i
(cid:205)

(cid:206)𝑥≠𝑥0
(cid:206)𝑥 𝑇𝑙𝑟𝑑𝑢

𝑥0,𝑙𝑟𝑑𝑢

{𝑛}

𝑇𝑙𝑟𝑑𝑢

,

(4.23)

where 𝑇 i

𝑥0,𝑙𝑟𝑑𝑢 =

√

𝐺𝑙𝐺𝑟𝐺 𝑑𝐺𝑢𝐶i

𝑥0,𝑙𝑟𝑑𝑢 is an impure tensor residing at site 𝑥0, and

𝐶i

𝑥0,𝑙𝑟𝑑𝑢 =

⌊𝑞⌋
∑︁

𝑘 𝑥0 =0

cos

(cid:18) 2𝜋
𝑞

𝑘𝑥0

(cid:19)

𝑒ℎ cos(cid:16) 2 𝜋

𝑞 𝑘 𝑥0 −𝜓ℎ

(cid:17)

𝑈𝑘 𝑥0

𝑙𝑉𝑘 𝑥0

𝑟𝑈𝑘 𝑥0

𝑑𝑉𝑘 𝑥0

𝑢.

(4.24)

To compute the internal energy, we need to calculate the expectation values of link interactions

𝜖𝜇 = ⟨cos(𝜑(𝑘)

𝑥0+ ˆ𝜇 − 𝜑(𝑘)

𝑥0 )⟩. Taking 𝜇 = 𝑠 as an example, we perform another SVD for the target link

cos

(cid:20) 2𝜋
𝑞

(cid:0)𝑘𝑥0+ ˆ𝑠 − 𝑘𝑥0

(cid:1)

(cid:21)

𝑒 𝛽 cos(cid:104) 2 𝜋

𝑞 (𝑘 𝑥0+ ˆ𝑠−𝑘 𝑥0)(cid:105)

=

⌊𝑞⌋
∑︁

𝑛𝑥0,𝑠=0

𝑈i

𝑘 𝑥0+ ˆ𝑠𝑛𝑥0,𝑠

𝐺i

𝑛𝑥0,𝑠

𝑉 i

𝑘 𝑥0

,

𝑛𝑥0,𝑠

(4.25)

and introduce two impure tensors ˜𝑇 i

𝑥0+ ˆ𝑠,𝑙𝑟𝑑𝑢, ˜𝑇 i

by replacing 𝑈𝑘 𝑥0+ ˆ𝑠,𝑙, 𝐺𝑙 with 𝑈i
respectively. Thus the tensor reformulation of the expectation value of the link interaction is

𝑙 and replacing 𝑉𝑘 𝑥0

,𝑟, 𝐺𝑟 with 𝑉 i

𝑘 𝑥0+ ˆ𝑠,𝑙, 𝐺i

,𝑟, 𝐺i

𝑘 𝑥0

𝑥0,𝑙𝑟𝑑𝑢 residing at nearest neighbor sites 𝑥0 + ˆ𝑠, 𝑥0,
𝑟 in Eq. (4.20),

written as

(cid:205)

{𝑛}

𝜖𝜇 =

˜𝑇 i
𝑥0+ ˆ𝜇,𝑙𝑟𝑑𝑢
(cid:205)

˜𝑇 i
𝑥0,𝑙𝑟𝑑𝑢
(cid:206)𝑥 𝑇𝑙𝑟𝑑𝑢

{𝑛}

(cid:206)𝑥≠𝑥0+ ˆ𝜇,𝑥0

𝑇𝑙𝑟𝑑𝑢

.

(4.26)

90

In the following, we use TRG and higher-order TRG (HOTRG) to contract tensor networks with

impure tensors [130, 131] up to a volume 𝑁 = 𝐿2 = 224 × 224, calculate the first component of

the magnetization (cid:174)𝑚 = (𝑚1, 𝑚2) and internal energy 𝐸 = 𝜖𝑠 + 𝜖𝜏, and take derivatives of (cid:174)𝑚 and

𝐸 with respect to (cid:174)ℎ and 𝛽, respectively, to find the magnetic susceptibility and specific heat.1,2

The locations and heights of the peaks of 𝜒𝑀 and 𝐶𝑣 are obtained via a spline interpolation on

datasets with Δ𝛽 = 10−3. The tensor contraction in HOTRG is performed with ITensors Julia

Library [132].

4.2.3 Thermodynamics

In the extended 𝑞-state clock model, there is a Z𝑞 symmetry when 𝑞 ∈ Z. When 𝑞 ∉ Z, this

symmetry is explicitly broken. We choose 𝜑0 = 0, so the allowed spin orientations divide the unit
circle into ⌈𝑞⌉ arcs of which ⌈𝑞⌉ − 1 have measure 2𝜋/𝑞. The remainder has measure ˜𝜙 given

in Eq. (4.8) and illustrated in Fig. 4.1. There remains a Z2 symmetry and an approximate Z⌈𝑞⌉

symmetry.

Monte Carlo results obtained with a heatbath algorithm on a 4 × 4 lattice with zero external

field are shown for 1.1 ≤ 𝑞 ≤ 6.0 in Fig. 4.2–4.6. The four panels show the energy density and the

specific heat defined in Eqs. (4.11) and (4.12) as well as the proxy magnetization and susceptibility

defined in Eqs. (4.15) and (4.16). For example, for 𝑞 = 5, the energy density is zero at 𝛽 = 0 because

there is no linear term in the series expansion of the partition function due to the Z5 symmetry. The

nonzero energy density at 𝛽 = 0 for 4 < 𝑞 < 5 is consistent with the explicit Z5 symmetry breaking.

As 𝑞 → 4+, the stronger symmetry breaking results in a more negative energy density. There is

a double-peak structure in the specific heat, where the large-𝛽 peak moves toward 𝛽 = ∞ as 𝑞 is

decreased. The proxy magnetization also increases for smaller values of 𝑞 with stronger symmetry

breaking, and the peak of the magnetic susceptibility moves toward smaller 𝛽 values. Note that

the double-peak structure of this proxy magnetic susceptibility will appear at larger system sizes

𝐿 ≥ 12 [46]. The true magnetic susceptibility with an external field at large volumes will show the

1One can assume that the TRG and the HOTRG return the same results if the bond dimension is sufficiently large.
2If it is not declared in the main text or in the captions, the bond dimension is set to be sufficiently large so that the

outputs converges.

91

Figure 4.10 The energy density for the fractional-𝑞-state clock model from TRG for 𝑞 = 4.1, 4.3,
4.5, 4.7, and 4.9. These results were obtained with a volume 1024 × 1024. The energy is shifted
vertically by 2 (to make it positive) and plotted on a log scale to better illustrate the difference
between the curves.

double-peak structure (see below).

In Fig. 4.10, we show the logarithm of the energy density of this model from TRG for 𝑞 = 4.1,

4.3, 4.5, 4.7, and 4.9 with 0 ≤ 𝛽 ≤ 16 and 𝑁 = 1024 × 1024. For large enough 𝛽, we have

𝑍 = 𝑒2𝑁 𝛽 (cid:104)

⌈𝑞⌉ + 2𝑁𝑒−4𝛽(1−cos ˜𝜙) + · · ·

(cid:105)

,

(4.27)

so that the energy density converges exponentially with 𝛽. The results in Fig. 4.10 confirm this

behavior. We also notice that for smaller 𝑞, there is a larger range of 𝛽 where the energy density

does not change much. In this range, 1 − cos ˜𝜙 is close to zero for 𝑞 close to 4 from above, and the

inverse temperature 𝛽 is large enough that terms containing larger angular distances are negligible,

but 𝛽 is still not large enough to change the values of exp[−𝛽(1 − cos ˜𝜙)] significantly from 1 and

ignore higher orders. The result for 𝑞 = 4.1 shown in Fig. 4.10 indicates that the specific heat is

almost zero for 𝛽 > 2.5, which is confirmed in Fig. 4.11.

In Fig. 4.11, we show the specific heat for 𝑞 = 4.1, 4.5, 4.9, and 5.0 at volumes ranging from

4 × 4 to 128 × 128. For generic 𝑞, there are two peaks in the specific heat.3 In Fig. 4.11 we see only

3There is only a single peak for 𝑞 = 2, 3, 4 and for fractional 𝑞 just below these integers.

92

0246810121416β10−710−510−310−1101E/N+2q=4.1q=4.3q=4.5q=4.7q=4.9Figure 4.11 Specific heat of the extended 𝑞-state clock model from TRG for 𝑞 = 4.1, 4.5, 4.9, and
5.0 at volumes from 4 × 4 up to 128 × 128. All vertical axes use a shared scale, and all horizontal
axes use a shared scale. In general, there is a double-peak structure in the specific heat (for 𝑞 = 4.1,
the second peak is at 𝛽 ∼ 75). Insets show the height of the second peak versus the linear system
size 𝐿 =

𝑁 plotted on a log scale, where the solid line is a linear fit.

√

a single peak for 𝑞 = 4.1 since the second peak is at much larger 𝛽. For 𝑞 ∉ Z and not too close to

5, the first peak shows little or no dependence on volume. The second peak grows logarithmically

with volume, as shown in the insets for 𝑞 = 4.5, 4.9. This is in contrast to the integer case 𝑞 = 5

where there are two BKT transitions and both peaks show little dependence on volume for lattice

sizes larger than 32 × 32. Because the specific heat is the second-order derivative of free energy,

the results in Fig. 4.11 indicate that the first peak is associated with either a crossover or a phase

transition with an order larger than 2, and the second peak is associated with a second-order phase

transition. To conclusively characterize the phase transitions, if any, associated with these two

peaks in the fractional-𝑞-state clock model, we study the magnetic susceptibility in the next two

subsections.

We find that the thermodynamic curves vary smoothly for 𝑛 < 𝑞 ≤ 𝑛 + 1 where 𝑛 is an integer.

When 𝑞 is taken slightly larger than 𝑛 from below, these curves change abruptly since an additional

degree of freedom is introduced. The specific heat exhibits a double-peak structure with the second

93

0.00.51.01.52.02.53.0Cq=4.14×48×816×1632×3264×64128×128q=4.50.00.51.01.52.02.53.03.54.0β0.00.51.01.52.02.53.0Cq=4.90.00.51.01.52.02.53.03.54.0βq=5.0232527L024peakheight232527L0123peakheightFigure 4.12 The magnetic susceptibility as a function of 𝛽 for finite volumes. 𝐷bond = 40.

peak at very large 𝛽. As 𝑞 is increased further, this second peak moves toward small 𝛽, until at

𝑞 = 𝑛 + 1, the thermodynamic curves of the integer-(𝑛 + 1)-state clock model are recovered.

In the small-𝛽 (high temperature) regime, all allowed angles are nearly equally accessible, and

the model behaves approximately like a ⌈𝑞⌉-state clock model. The model is dominated by the

approximate Z⌈𝑞⌉ symmetry, and there is a peak in the specific heat. In Fig. 4.8, we show that

at intermediate 𝛽, an explosion of the integrated autocorrelation time of the energy is observed

in the MC simulation as the model quickly reduces to a rescaled Ising model. At large beta, the

configuration space separates into thermodynamically distinct sectors, and the Markov chain has

trouble adequately sampling both sectors. This is discussed further in Section 4.2.2.1. At large-𝛽,

spin flips across the “leftover angle” ˜𝜙 are strongly favored relative to spin flips across the other

angles. Thus, in the large-𝛽 regime, the model behaves as a rescaled Ising model. The existence of

an Ising critical point is conclusively established via TRG in Sec. 4.2.5.

We next present our TRG results and discuss the phase transitions in the fractional-𝑞-state clock

model in the rest of this section. We first present the results for the magnetic susceptibility without

an external field at small volumes in Fig. 4.12. For 𝑞 < 5, there is a small-𝛽 peak converging

94

10−410−2100χMq=4.14×48×816×1632×3264×64128×12810−1100101102q=4.50.00.51.01.52.02.53.03.54.0β100101102103104χMq=4.90.00.51.01.52.02.53.03.54.0β100101102103104q=5.0Figure 4.13 The magnetic susceptibility as a function of 𝛽 at 𝑁 = 224 × 224. For 𝑞 = 4.995, the
external field is ℎ = 4 × 10−5. For 𝑞 = 4.999, 4.9999, the external field is ℎ = 4 × 10−5 for the lower
curve and ℎ = 2 × 10−5 for the upper curve.

quickly with volume, which means the peak is associated with a crossover. As 𝑞 is increased,

there is a high plateau moving toward small 𝛽. The height of the plateau increases with volume

as a power law (notice the logarithmic scale in the 𝑦-axis). The divergent plateau signals a phase

transition. As there is no spontaneous symmetry breaking in small volumes, the response of the

system to an external field remains high at low temperatures, so we cannot see the transition from a

critical phase to the symmetry-breaking phase where there is a small magnetic susceptibility based

on these results for ℎ = 0 and small volumes. But notice that for fractional 𝑞 like 𝑞 = 4.9, there is

a higher plateau after the first one for each volume. This is due to an “approximate Z⌈𝑞⌉ symmetry

breaking” after a Z2 symmetry breaking. This “approximate Z⌈𝑞⌉ symmetry breaking” is not a

true phase transition so it moves to 𝛽 = ∞ in the thermodynamic limit as shown in the results for

𝑞 = 4.9. There is only one Z5 symmetry breaking for 𝑞 = 5 so there is a single plateau for each

volume. We will use the magnetic susceptibility with a weak external field in the thermodynamic

limit to detect the phase transitions.

In Fig. 4.13 we present the magnetic susceptibility in the thermodynamic limit as a function of

𝛽 for 𝑞 = 4.995, 4.999, 4.9999 with a small external field ℎ = 4 × 10−5, 2 × 10−5. The height of the

large-𝛽 peak is large for all three values of 𝑞, indicating a phase transition near this peak. The small-

95

𝛽 peak for 𝑞 = 4.995, ℎ = 4 × 10−5 is invisible because it is very small. For 𝑞 = 4.999, ℎ = 4 × 10−5,

closer to 5, the small-𝛽 peak is higher, and the large-𝛽 peak moves toward the large-𝛽 BKT transition

point 𝛽BKT

𝑞=5,𝑐2 for 𝑞 = 5. When the external field is decreased to ℎ = 2 × 10−5, the large-𝛽 peak
height is almost doubled, while the small-𝛽 peak height does not change, which means there is no

phase transition near the small-𝛽 peak. For 𝑞 = 4.9999 (closer to 5) with ℎ = 4 × 10−5, one can

see that the small-𝛽 peak becomes higher than the large-𝛽 one, and the large-𝛽 peak is fading away,

which is consistent with the results in Refs. [50, 113] for the five-state clock model. When the

external field is decreased to ℎ = 2 × 10−5, the large-𝛽 peak grows a much larger amount of height

than the small-𝛽 one does and becomes higher than the small-𝛽 one. We have confirmed that the

small-𝛽 peak will eventually converge for small enough ℎ. All these behaviors are evidence that

for all fractional 𝑞 > 4, the small-𝛽 peak in 𝜒𝑀 does not diverge, and the large-𝛽 peak diverges at

ℎ = 0 indicating a phase transition.

In the following, we discuss the behavior of the two peaks in the thermodynamic limit in detail.

The main observations are that for 𝜑0 = 0 and 𝑞 > 4, there are two peaks in the specific heat and the

magnetic susceptibility, the small-𝛽 one is finite and is associated with a crossover, and the large-𝛽

one diverges which is characteristic of an Ising critical point. When 𝑞 is approaching an integer

from below, the height of the crossover peak of the magnetic susceptibility diverges as ⌈𝑞⌉ − 𝑞 goes

to zero with a power law:

𝑀 ∼ ( ⌈𝑞⌉ − 𝑞)−𝑦 .
𝜒∗

We can formulate the scaling hypothesis with Δ𝑞 = ⌈𝑞⌉ − 𝑞 and ℎ,

𝑓 (𝜆 𝑝Δ𝑞, 𝜆𝑟 ℎ) = 𝜆𝑑 𝑓 (Δ𝑞, ℎ) ,

(4.28)

(4.29)

where 𝜆 parameterizes a scale transformation, 𝑝 and 𝑟 are the scaling dimensions, and 𝑑 = 2 in two-

dimensional space. Notice that the reduced temperature should not enter the homogeneous function

independently because there is an essential singularity in the correlation length as a function of

temperature for BKT transitions. We assume Eq.

(4.29) holds for any critical temperature, in

particular, the BKT crossover peak position 𝛽BKT(Δ𝑞, ℎ), which is a power law function of Δ𝑞 and

96

ℎ, considered in the following calculations. Then the magnetization and the magnetic susceptibility

satisfy the following relations:

𝜆𝑟 𝑀 (𝜆 𝑝Δ𝑞, 𝜆𝑟 ℎ) = 𝜆𝑑 𝑀 (Δ𝑞, ℎ) ,

𝜆2𝑟 𝜒𝑀 (𝜆 𝑝Δ𝑞, 𝜆𝑟 ℎ) = 𝜆𝑑 𝜒𝑀 (Δ𝑞, ℎ) ,

from which one can obtain

𝑀 ∼ ℎ 1

𝛿 ∼ (Δ𝑞)

1
𝛿′ ,

𝜒𝑀 ∼ ℎ−1+ 1

𝛿 ∼ (Δ𝑞)− 𝛿−1

𝛿′

,

(4.30)

(4.31)

(4.32)

(4.33)

where 1/𝛿 = (𝑑 − 𝑟)/𝑟, 1/𝛿′ = (𝑑 − 𝑟)/𝑝. We then have 𝛿′ = (𝛿 − 1)/𝑦 = 14/𝑦, where we have

used the fact that 𝛿 = 15 for BKT transitions. Note that the expansion of the action to the first order

in Δ𝑞 is

𝑆ext-𝑞 = 𝑆Z⌈𝑞 ⌉ + 𝛽

Δ𝑞
𝑞

Δ𝜑𝑥,𝜇 sin(Δ𝜑𝑥,𝜇),

(4.34)

is the action for the integer-𝑞 clock model. The perturbation term that breaks the Z⌈𝑞⌉

where 𝑆Z⌈𝑞⌉
symmetry has a very different form from the one for the external field ℎ cos(𝜑𝑥). We numerically

show in the following that 𝛿′ is equal to the magnetic critical exponent 𝛿 = 15 for the BKT

transitions and Ising critical points in two dimensions. Thus if we define a new susceptibility as

𝜕 𝑀/𝜕Δ𝑞, which should scale as (Δ𝑞)−1+1/𝛿′, the exponent 𝑦′ = 1 − 1/𝛿′ = 14/15, still the same

as 𝑦. However, calculating 𝛿′ from 𝑦′ requires higher accuracy since 𝛿′ = 1/(1 − 𝑦′) where one

significant digit is subtracted in the denominator. Both the peak position of the crossover and the

Ising critical point go to the two BKT transition points at integer 𝑞 with the same power law, which

provides us a new way to locate the phase transitions in models with Z𝑛 symmetry. However, for

𝜑0 = −𝜋 and odd ⌊𝑞⌋, both peaks of 𝜒𝑀 are finite for fractional 𝑞 so there are no critical points.

For 𝑞 → 5+, only the small-𝛽 peak can be used to extract the BKT transition of 𝑞 = 5 since the

large-𝛽 peak fades away.

4.2.4 Small-𝛽 peak: Crossover

We have shown in Fig. 4.13 that for 𝑞 = 4.999, the height of the small-𝛽 peak converges to

about 400 for small enough external field. The dependence of the peak height on the external field

97

Figure 4.14 The small-𝛽 peak of magnetic susceptibility as a function of 𝛽 for 𝑞 = 4.3. The peak
height converges when the external field is taken to zero.

is larger for values of 𝑞 closer to an integer from below. In Fig. 4.14, we show another example for

𝑞 = 4.3. One can see that the ℎ dependence of the peak height is much smaller. The peak height

at ℎ = 10−2 differs from the value in the ℎ → 0 limit by only about 0.04. As the external field

is decreased, the peak height converges to a constant 𝜒𝑀 ≈ 1.2, implying that there is no phase

transition around this peak. This is true for all fractional 𝑞. Thus, for fractional 𝑞, the first peak in

the specific heat is associated with a crossover rather than a true phase transition. As 𝑞 approaches

5 from below, we expect the small-𝛽 peak height to diverge because there is a BKT transition for

𝑞 = 5, and we expect the location of the peak to go to the small-𝛽 BKT transition point.

To check this, we calculate the converged peak height 𝜒∗

𝑀 of the magnetic susceptibility for
values of 5 − 𝑞 ≤ 10−4 with 𝐷bond = 40 and 𝐷bond = 50, where we use 𝑑ℎ = 10−10 in the numerical

differentiation. We plot 𝜒∗

height on 𝐷bond is very small for 𝐷bond ≥ 40. Applying a linear fit to ln( 𝜒∗

𝑀 versus 5 − 𝑞 in Fig. 4.15, where we see that the dependence of the peak
𝑀) versus ln(5 − 𝑞)

shows that the peak height diverges as 𝑞 → 5− with a power law




(5 − 𝑞)−0.9389(6)

(5 − 𝑞)−0.948(6)

𝜒∗
𝑀 ∼

if 𝐷bond = 40

if 𝐷bond = 50,

from which we obtain 𝛿′ = 14.77(9) for 𝐷bond = 40 and 𝛿′ = 14.911(10) for 𝐷bond = 50. The

(4.35)

98

0.10.20.30.40.50.60.50.60.70.80.91.01.11.2Mh=102h=103h=104Figure 4.15 The log-log plot of the maximal value of the magnetic susceptibility 𝜒𝑀 for the small-𝛽
peak as a function of 5 − 𝑞. The peak height diverges with a power law when 𝑞 → 5−.

Figure 4.16 Same as Fig. 4.15, but for 𝑞 → 6−.

value of 𝛿′ is close to the magnetic critical exponent 𝛿 = 15 for BKT transitions and Ising critical

points in two dimensions, but now 5 − 𝑞 (rather than the external field ℎ) is playing the role of the

symmetry-breaking parameter. Since both ℎ and 5 − 𝑞 break the Z5 symmetry to a Z2 symmetry,

the agreement on the critical exponents is reasonable. The value of 𝛿′ is also checked for 𝑞 → 6−

in Fig. 4.16, where a larger 𝐷bond is applied in HOTRG. In this case, the linear fit gives

𝜒∗
𝑀 ∼





(6 − 𝑞)−0.937(2)

if 𝐷bond = 50

(6 − 𝑞)−0.9329(16)

if 𝐷bond = 60,

99

(4.36)

1514131211109ln(5q)891011121314ln(*M)Dbond=400.948(6)ln(5q)0.49(7)Dbond=500.9389(6)ln(5q)0.382(8)161514131211109ln(6q)89101112131415ln(*M)Dbond=500.937(2)ln(6q)0.36(3)Dbond=600.9329(16)ln(6q)0.31(2)from which we obtain 𝛿′ = 14.94(3) with 𝐷bond = 50 and 𝛿′ = 15.007(26) with 𝐷bond = 60. Within

uncertainty, the results of 𝑞 → 6−, 𝐷bond = 60 agree perfectly with the value of the magnetic critical

exponent 𝛿 = 15.

The location of the converged peak 𝛽𝑝 as a function of 5 − 𝑞 is depicted in Fig. 4.17. The

discrepancy of the peak positions between 𝐷bond = 40 and 𝐷bond = 50 is invisible at 5 − 𝑞 = 10−4

and increases as 5 − 𝑞 is decreased, which is reasonable because a larger bond dimension is needed

for systems closer to a critical point. The overall discrepancy is small. We extrapolate the peak

position to 𝑞 = 5 with a power law and obtain the small-𝛽 BKT transition point 𝛽BKT

𝑞=5,𝑐1 = 1.0494(6)
𝑞=5,𝑐1 = 1.0506(4) with 𝐷bond = 50 for the five-state clock model. The
results are consistent with 1.0503(2) in Ref. [51]. The same procedure is performed for 𝑞 → 6− in

with 𝐷bond = 40 and 𝛽BKT

Fig. 4.18. The peak positions 𝛽𝑝 for 𝐷bond = 50 and those for 𝐷bond = 60 have little discrepancy for

6 − 𝑞 ≥ 7 × 10−7. For 6 − 𝑞 < 7 × 10−7, 𝛽𝑝 for 𝐷bond = 60 becomes larger than that for 𝐷bond = 50.

The power law fit gives us 𝛽BKT

𝑞=6,𝑐1 = 1.0983(4) with 𝐷bond = 50 and 𝛽BKT

𝑞=6,𝑐1 = 1.1019(5) for

𝐷bond = 60. The results are consistent with 1.101(4) in Ref. [133].

We have shown that ⌈𝑞⌉ − 𝑞 plays the same role as an external magnetic field for the small-𝛽

BKT transition in integer-𝑞-state clock models. The magnetic susceptibility is always finite for

fractional 𝑞 < ⌈𝑞⌉, and diverges as 𝑞 → ⌈𝑞⌉− with a critical exponent 𝑦 = 14/15. This provides

us an alternative way to extract the locations of BKT transitions in clock models. However, the

situation is very different for the large-𝛽 peak of the magnetic susceptibility.

4.2.5 Large-𝛽 peak: Ising criticality

To understand the large-𝛽 peak in the specific heat, we again study the magnetic susceptibility

𝜒𝑀. For a fixed 𝑞, the critical point, if any, is given by the location of the peak of 𝜒𝑀 in the limit

ℎ → 0 where the height 𝜒∗

the susceptibility as a function of the external field ℎ. The linear fit of ln( 𝜒∗

on peak positions of 𝜒𝑀 for small values of ℎ. In Fig. 4.19, we present the peak height 𝜒∗

𝑀 of the peak is infinite. A power law extrapolation to ℎ = 0 is performed
𝑀 of
𝑀) versus ln(ℎ) gives
𝑀 ∼ ℎ−0.93318(16), from which we obtain the magnetic critical exponent 𝛿 = 14.97(4). This value
𝜒∗
is consistent with the value 𝛿 = 15 of the BKT transitions and Ising critical points. We have shown

100

Figure 4.17 The power law extrapolation of the small-𝛽 peak position to 𝑞 = 5 from below. Here
the extrapolation gives 𝛽𝑐 = 1.0506(4).

Figure 4.18 Same as Fig. 4.17, but for 𝑞 → 6−.

before that there is no phase transition around the small-𝛽 peak of 𝜒𝑀. A BKT transition should

be accompanied with a continuous critical region, so the divergent large-𝛽 peak of 𝜒𝑀 must be an

Ising critical point. In the Ising universality class,

𝜉 ∼ |𝛽 − 𝛽𝑐 |−𝜈 ∼ 𝐿,

𝑀 ∼ (𝛽 − 𝛽𝑐) 𝛽 ∼ 𝐿−1/8 ∼ ℎ1/15,

𝜒𝑀 ∼ |𝛽 − 𝛽𝑐 |−𝛾 ∼ 𝐿7/4,

(4.37)

101

1061051045q1.0101.0151.0201.0251.0301.0351.040p0.41(2)(5q)0.256(7)+1.0494(6)Dbond=400.287(8)(5q)0.214(4)+1.0506(4)Dbond=501071061051046q1.041.051.061.071.08p0.384(6)(6q)0.201(2)+1.0983(4)Dbond=500.355(5)(6q)0.186(2)+1.1019(5)Dbond=60Figure 4.19 The maximal magnetic susceptibility in Fig. 4.22 as a function of the external field for
𝑞 = 4.9. A linear fit of the log-log plot gives the exponent 𝛿 = 14.97(4). This is consistent with
the value 𝛿 = 15 of the Ising universality class. 𝐷bond = 40.

where 𝜈 = 1, 𝛽 = 1/8, 𝛾 = 7/4 are the universal critical exponents. There is a universal function

relating 𝜒𝑀/𝐿1.75 and 𝐿 (𝛽 − 𝛽𝑐) with fixed ℎ𝐿15/8. In Fig. 4.21, we plot 𝜒𝑀/𝐿1.75 versus 𝐿 (𝛽 − 𝛽𝑐)

for various lattice sizes around the large-𝛽 peak of 𝜒𝑀, where the value of 𝛽𝑐 is obtained from the

Ising approximation in Eq. (4.40) described below. One can see that all the data collapse onto a

single curve, which gives strong evidence that this is a critical point of the Ising universality class.

In Fig. 4.20, we show the logarithm of 𝜒𝑀 as a function of log2(𝐿) for 𝛽 = 1.1. For 𝑞 = 5,

𝛽 = 1.1 is between two BKT points so is a critical point, and 𝜒𝑀 keeps increasing with a positive

power of 𝐿 as expected. When 𝑞 < 5, even for a very small 5 − 𝑞 = 10−6, 𝜒𝑀 always saturates at a

large enough volume. These results again prove that there are no BKT transitions in fractional 𝑞,

no matter how close to ⌈𝑞⌉ the 𝑞 is. Because the maximal value of 𝜒𝑀 should diverge as a negative

power of 5 − 𝑞 when 𝑞 → 5−, the increment of the height of the plateaus between 5 − 𝑞 = 10−𝑛 and

5 − 𝑞 = 10−𝑛−1 should be a constant for any integer 𝑛, which is also confirmed in Fig. 4.20.

We next obtain the Ising critical point by extrapolating the peak position of 𝜒𝑀 to ℎ = 0. An

example for 𝑞 = 4.9 is shown in Fig. 4.22. The power law extrapolation gives 𝛽𝑐 = 1.44614(2).

We can repeat the same procedure for other values of 𝑞. But notice that we need a larger bond

dimension in TRG when 𝑞 is very close to an integer from below, because more degrees of freedom

become important, and the critical point is close to a BKT transition point. The phase transition in

102

15.515.014.514.013.513.012.512.011.5ln(h)8.59.09.510.010.511.011.512.0ln(*M)0.93318(16)ln(h)2.317(2)Figure 4.20 The dependence of magnetic susceptibility on the size of the system 𝐿 for 𝛽 = 1.1 with
𝐷bond = 40. The line on the circles is a linear fit for the first ten points: 1.275(3) log2(𝐿) − 0.74(2).

Figure 4.21 Data collapse of the rescaled magnetic susceptibility versus the rescaled inverse
temperature for 𝑞 = 4.3. The reduced external field is ℎ𝐿15/8 = 40, and 𝛽𝑐 ≈ 9.3216, which is the
approximate critical point of the model with 𝑞 = 4.3, is obtained from Eq. (4.40).

103

246810121416log2(L)0.02.55.07.510.012.515.017.520.0ln(M)q=55q=1035q=1045q=1055q=106302520151050510L(c)0.000150.000200.000250.000300.000350.000400.00045M/L1.75L=32L=64L=128L=256L=512Figure 4.22 The power law extrapolation of the peak position of 𝜒𝑀 to zero external field for
𝑞 = 4.9. Here the extrapolation gives 𝛽𝑐 = 1.44614(2). 𝐷bond = 40.

the Ising universality class is a transition from a disordered phase to a symmetry-breaking phase.

The structure of the fixed-point tensor in TRG can easily characterize this phase transition. As

proposed in Ref. [134], the symmetry-breaking indicator

𝑋 =

((cid:205)𝑟𝑢 𝑇𝑟𝑟𝑢𝑢)2
(cid:205)𝑙𝑟𝑑𝑢 𝑇𝑙𝑟𝑢𝑢𝑇𝑟𝑙𝑑𝑑

(4.38)

should be 1 in the disordered phase and 2 in the Z2 symmetry breaking phase. Thus the discontinuity

in 𝑋 for the fixed point tensor as a function of 𝛽 can be used to locate the phase transition.

An example for 𝑞 = 4.9 is shown in Fig. 4.23, where the value of 𝑋 changes from 1 to 2 at

1.4461 < 𝛽 < 1.4462, consistent with the result from the extrapolation of the peak position of 𝜒𝑀

in Fig. 4.22. The advantage of this method is that we only need to contract a single tensor network

for each value of 𝛽 and scan a 𝛽 range once to locate the phase transition point. This saves us a lot

of computational effort when we extrapolate the Ising critical point to 𝑞 = ⌈𝑞⌉ from below.

Now we use the value of 𝑋 to locate the Ising critical point and find the large-𝛽 BKT transitions

for 𝑞 = 5, 6. Notice that for 𝑞 = 5, although both a small external field and a small deviation of

𝑞 from an integer break the Z⌈𝑞⌉ symmetry to a Z2 symmetry, the magnetic susceptibility with a

weak external field does not have a peak around the large-𝛽 BKT transition, so it fails to predict

the location of the phase transition [50, 113], but here we always have an Ising critical point for

fractional 𝑞. In Fig. 4.24(a), we calculate the Ising critical point for 𝑞 → 5− with 𝐷bond = 40 and

104

107106105h1.444001.444251.444501.444751.445001.445251.445501.445751.44600p0.94(12)h0.53(1)+1.44614(2)Figure 4.23 The 𝛽 dependence of 𝑋, defined in Eq. (4.38), from the fixed point tensor for
𝑞 = 4.9. The discontinuity is located between 1.4461 and 1.4462, consistent with the result from
the extrapolation of the peak position of 𝜒𝑀 to zero external field in Fig. 4.22. The tensor bond
dimension is 40.

extrapolate the result to 𝑞 = 5 with a power law. The value of 5 − 𝑞 is between 6 × 10−4 and 10−3,

where the dependence of 𝛽𝑐 on 𝐷bond is small. The extrapolated BKT transition point for 𝑞 = 5 is

𝛽BKT
𝑞=5,𝑐2 = 1.1027(14), consistent with the result 1.1039(2) obtained in Ref. [51]. As a comparison,
we also present the extrapolation of the crossover peak with the same 𝐷bond in the same figure. The

exponents of the two power law scalings are the same within uncertainties, and the values of the

exponents are consistent with 0.2677(84) obtained in Ref. [50] for magnetic susceptibility with an

external field ℎ ≤ 10−3.

The results for 𝑞 → 6− are shown in Fig. 4.24(b), where we use a larger bond dimension

𝐷bond = 60 and data with 6−𝑞 ≤ 10−4. The extrapolated BKT transition point is 𝛽BKT

𝑞=6,𝑐2 = 1.435(3),
consistent with 1.441(6) in Ref. [133]. Comparing the Ising critical points and the crossover peak

positions, we find that the power law scaling parts are exactly the same except for a minus sign

within uncertainties, which means they approach the two BKT transition points in the same manner.

We believe this behavior can be seen for all 𝑞 → 𝑛− for integer 𝑛 ≥ 5. To determine the exponent

accurately, we need to use a larger 𝐷bond and data closer to an integer, which is beyond the scope

of this work. However, this exponent should be the same as the power law scaling of 𝛽𝑝 of

𝜒𝑀 in a weak external field for all clock models with integer 𝑞 ≥ 5, where there is always an

105

1.44561.44581.44601.44621.44641.44661.01.21.41.61.82.0XFigure 4.24 Power law extrapolation of the small-𝛽 peak position (triangles) of 𝜒𝑀 with zero
external field to 𝑞 = 5 [lower curve in (a)] and to 𝑞 = 6 [lower curve in (b)] . The same procedure
is performed for the Ising critical points (circles) at higher 𝛽 [upper curves in (a) and (b)]. The
horizontal dashed lines in (a) are the locations of two BKT transitions for 𝑞 = 5 obtained from
Ref. [51]. The horizontal dash-dotted lines in (b) are the locations of two BKT transitions for 𝑞 = 6
obtained from Ref. [133]. 𝐷bond = 40 in (a) and 𝐷bond = 60 in (b).

emergent O(2) symmetry [25, 135]. In the limit of O(2) model, this exponent is found to be around

0.162 [24, 39, 136].

At large 𝛽, the fractional-𝑞-state clock model is a rescaled Ising (𝑞 = 2) model because the link

interactions for the two angles 0 and ˜𝜙 dominate the weights in the partition function. There are

two peaks in the specific heat, and if these peaks are sufficiently separated, then the second peak is

that of an Ising model where 𝛽 is rescaled as 𝛽 → 𝛼𝛽, with a rescaling factor

𝛼 =

1 − cos ˜𝜙
2

,

(4.39)

where the small angular distance ˜𝜙 in the model depends on 𝑞 and is defined in Eq. (4.8). Thus

the critical point 𝛽𝑐 of the fractional-𝑞-state clock model can be approximated by the critical point

𝛽rIsing of the two-dimensional rescaled Ising model,

𝛽𝑐 ≃ 𝛽rIsing ≡

√

ln (cid:16)1 +
2(cid:17)
1 − cos ˜𝜙

.

(4.40)

In Table 4.2 we list some of these critical points for different values of 𝑞. These points give

approximately the location of the large-𝛽 peak in the specific heat for the extended 𝑞 clock model.

The critical point 𝛽rIsing of the rescaled Ising model (see Table 4.2) is a good approximation for

the true critical point 𝛽𝑐 for values of 𝑞 that are not too close to an integer from below. For example,

106

0.00000.00020.00040.00060.00080.00105q0.9751.0001.0251.0501.0751.1001.1251.150c(a)0.287(17)(5q)0.272(13)+1.1027(14)0.428(5)(5q)0.264(2)+1.0484(3)0.000000.000020.000040.000060.000080.000106q1.11.21.31.41.5c(b)0.35(2)(6q)0.186(12)+1.435(3)0.355(5)(6q)0.186(9)+1.1019(5)Table 4.2 The critical points of the rescaled Ising model for different values of 𝑞. They are
calculated by Eq. (4.40) and give the approximate location of the large-𝛽 peak in the specific heat
of the fractional 𝑞-state clock model in the infinite-volume limit.

𝑞

3.6

3.7

3.8

3.9

4.0

4.1

4.2

4.3

4.4

4.5

4.6

4.7

4.8

4.9

5.0

5.1

5.2

5.3

5.4

5.5

5.6

5.7

5.8

5.9

6.0

𝛽rIsing
1.7627

1.4054

1.1681

1.0022

∞
75.2052

19.8386

9.3216

5.5521

3.7673

2.7764

2.1665

1.7627

1.4808

∞
116.284

30.3313

14.0839

8.2861

5.5521

4.0399

3.1120

2.4995

2.0728

∞

𝑞

1.1

1.2

1.3

1.4

1.5

1.6

1.7

1.8

1.9

2.0

2.1

2.2

2.3

2.4

2.5

2.6

2.7

2.8

2.9

3.0

3.1

3.2

3.3

3.4

3.5

𝛽rIsing
5.5521

1.7627

1.0022

0.7209

0.5876

0.5163

0.4764

0.4544

0.4437

∞
19.8386

5.5521

2.7764

1.7627

1.2755

1.0022

0.8329

0.7209

0.6433

∞
43.0567

11.5787

5.5521

3.3770

2.3409

107

Figure 4.25 The specific heat for 𝑞 = 4.5 for several different lattice sizes. The dotted curves are
from TRG and the solid curves are from the rescaled Ising model. For 16 × 16 and larger lattices,
the rescaled Ising model accurately captures the second peak in the specific heat.

in Fig. 4.25 we compare the specific heat from TRG with the specific heat of the rescaled Ising

model for 𝑞 = 4.5 and several different lattice sizes. For lattices 16 × 16 and larger, the specific

heat for the rescaled Ising model accurately captures the large-𝛽 peak of the fractional-𝑞-state clock

model. As 𝑞 approaches an integer from below, the approximation begins to fail. In Fig. 4.26, we

compare the rescaled Ising critical points with the true critical points as 𝑞 → 5 from below. Most

of the true critical points are obtained from 𝑋 defined in Eq. (4.38), and five points are from 𝜒𝑀

and agree perfectly with those from 𝑋. For 4.0 < 𝑞 ≲ 4.7, the difference between the true critical

point and the Ising approximation is less than 0.01. As 𝑞 → 5−, the difference becomes larger and

is around 0.175 for 𝑞 = 5.

4.2.6

Integration interval

For the extended 𝑞-state clock model, the allowed angles are restricted to the integration interval

𝜑 ∈ [𝜑0, 𝜑0 + 2𝜋). All results presented so far used 𝜑0 = 0, which is in the so-called case 1. We

also considered the possibility 𝜑0 = −𝜋. As shown in Fig. 4.27, for 𝑞 = 4.5, the number of allowed

spin orientations and their relative sizes are the same. In fact, the models with 𝜑0 = 0 and 𝜑0 = −𝜋

are equivalent and in case 1 for all even ⌊𝑞⌋. For odd ⌊𝑞⌋, the choice of 𝜑0 = 0 or 𝜑0 = −𝜋 results

108

0.00.51.01.52.02.53.03.54.0β0.00.51.01.52.02.5C4×48×816×1632×3264×64Figure 4.26 The difference between the critical point of the rescaled Ising model 𝛽rIsing and the
true critical point 𝛽𝑐 for fractional-𝑞-state clock models as a function of 𝑞. The circles are from the
discontinuity of values of 𝑋, 𝛽𝑐𝑋, the stars are from the extrapolations of the peak position of 𝜒𝑀
to zero external field, 𝛽𝑐 𝜒. The inset shows the values of the true critical point as a function of 𝑞
and the solid line is Eq. (4.40).

Figure 4.27 The allowed spin orientations for the extended 𝑞-state clock model when 𝜑0 = 0 (left
column) and 𝜑0 = −𝜋 (right column) for two different values of 𝑞. When 𝑞 = 4.5 (top row),
the number of allowed spin orientations and their relative sizes are the same, and the models are
equivalent. When 𝑞 = 5.5 (bottom row), there are six allowed spin orientations when 𝜑0 = 0, but
only five allowed orientations when 𝜑0 = −𝜋, so the two models are different. The dashed gray
lines indicate the Z2 symmetry axes.

109

4.54.64.74.84.95.0q0.0000.0250.0500.0750.1000.1250.1500.175rIsingccXc4.64.85.0q123crIsingFigure 4.28 The magnetic susceptibility at 𝑁 = 224 × 224, ℎ = 0, and 𝐷bond = 40, as a function
of 𝛽 for 𝜑0 = −𝜋. The susceptibility does not diverge with the volume, which implies there is no
phase transition here. This is different from the model with 𝜑0 = 0, which has a divergent peak in
the susceptibility corresponding to a second-order phase transition.

Figure 4.29 Same as Fig. 4.28, but for 𝑞 = 5.001, 5.00001. The large-𝛽 peak fades away and moves
across the BKT phase transition point at 𝑞 = 5.

in a different number of allowed spin orientations and different thermodynamic behaviors. For

example, when 5 < 𝑞 < 6, there are six allowed orientations when 𝜑0 = 0, but only five allowed

orientations when 𝜑0 = −𝜋 (see Fig. 4.27 for 𝑞 = 5.5). One can show that the model with 𝜑0 = −𝜋

is in case 2 for all odd ⌊𝑞⌋. For integer values of 𝑞, the extended 𝑞-state clock model reduces to the

ordinary 𝑞-state clock model for both 𝜑0 = 0 and 𝜑0 = −𝜋.

We consider 5 < 𝑞 < 6 and 𝜑0 = −𝜋. In Fig. 4.28, we show the magnetic susceptibility at ℎ = 0

110

0.00.51.01.52.02.50246810Mq=5.100q=5.300q=5.500q=5.900q=5.990q=5.9990.850.900.951.001.051.101.1550100150200250300350Mq=5.001000500010000150002000025000Mq=5.00001Figure 4.30 Same as Fig. 4.15, but for 𝜑0 = −𝜋, 𝑞 → 5+. 𝐷bond = 50.

as a function of 𝛽. First of all, one can see that the magnetic susceptibility is finite for all values of

𝑞 presented here, which means there are no phase transitions for any 5 < 𝑞 < 6 and 𝜑0 = −𝜋. The

maximal values of 𝜒𝑀 are larger for 𝑞 closer to 5. For 𝑞 = 5.1, we see a double-peak structure in

𝜒𝑀, with the large-𝛽 peak higher than the small-𝛽 peak. As 𝑞 increases toward 6, the first crossover

peak fades away and disappears around 𝑞 = 5.3. The magnetic susceptibility eventually converges

to a single-peak structure with the peak position around 𝛽 = 1.5 and the peak height around 4. In

order to see the behavior of 𝜒𝑀 for 𝑞 → 5+, we plot 𝜒𝑀 versus 𝛽 for 𝑞 = 5.001 and 𝑞 = 5.00001

in Fig. 4.29. We see that the maximal value of 𝜒𝑀 is much larger than those in Fig. 4.28, because

we are approaching a Z5 clock model from above. Unlike 𝑞 = 5.1, the small-𝛽 peak becomes

higher than the large-𝛽 peak for 𝑞 = 5.001, and the large-𝛽 peak fades away as we move closer to

𝑞 = 5. The small-𝛽 peak moves towards the small-𝛽 BKT transition for 𝑞 = 5 so it can be used to

extrapolate the value of 𝛽BKT

for 𝑞 = 5 from right to left so it fails to predict the value of 𝛽BKT

𝑞=5,𝑐1, while the large-𝛽 peak moves across the large-𝛽 BKT transition
𝑞=5,𝑐2. This means that the magnetic
susceptibility cannot capture the crossover going to the large-𝛽 BKT transition point for 𝑞 → 5+.

But the crossover should go to the BKT transition point as the Z⌊𝑞⌋ symmetry is recovered. The

cross derivative of the free energy should be able to capture this [113], but we will just focus on the

magnetic susceptibility.

We then check the divergent behavior of the small-𝛽 peak of 𝜒𝑀 as 𝑞 → 5+ using 𝐷bond = 50.

111

1514131211109ln(q5)891011121314ln(*M)0.9275(15)ln(q5)0.45(2)Figure 4.31 Same as Fig. 4.17, but for 𝜑0 = −𝜋, 𝑞 → 5+. 𝐷bond = 50.

diverges as a power law 𝜒∗

Figure 4.30 shows the linear fit for ln( 𝜒∗

𝑀) versus ln(𝑞 − 5). We see that the peak height of 𝜒𝑀
𝑀 ∼ 1/(𝑞 − 5)0.9275(15), which again gives a value of the critical exponent
𝛿′ = 15.09(2), close to the expected value 15. We extrapolate the peak position to 𝑞 = 5 from above

in Fig. 4.31. One can see that the power law scaling is the same as case 1 where 𝑞 → 5−, 𝜑0 = 0,

and gives 𝛽BKT

𝑞=5,𝑐1 = 1.0514(5), consistent with the result in Fig. 4.17.

4.2.7 Phase diagram

The clock model with integer 𝑞 has been studied extensively [25, 43–57, 78, 101–104]. For

𝑞 = 2, 3, 4, there is a disordered phase and a Z𝑞 symmetry-breaking phase separated by a second-

order phase transition. For 𝑞 ≥ 5, there is a disordered phase at small-𝛽 and a Z𝑞 symmetry-breaking

phase at large-𝛽 with a critical phase for intermediate 𝛽 between them. The boundaries of the critical

phase are two BKT transition points of infinite order [25]. In our extended 𝑞-state clock model,

one must make a choice of the integration interval 𝜑 ∈ [𝜑0, 𝜑0 + 2𝜋). For the choice 𝜑0 = 0 and

fractional 𝑞, both the specific heat and the magnetic susceptibility have a double-peak structure. We

have shown that the small-𝛽 peak is associated with a crossover, and the large-𝛽 peak is associated

with a phase transition of the Ising universality class. For the choice 𝜑0 = −𝜋 and fractional 𝑞, the

phase structure is a little more complicated. For even ⌊𝑞⌋, we get the same behavior as with 𝜑0 = 0,

but for odd ⌊𝑞⌋, we get a trivial case with no critical point.

112

106105104q51.0101.0151.0201.0251.0301.0351.040p0.284(7)(q5)0.202(4)+1.0514(5)𝛽

𝛽

disordered
𝑍2 ordered
𝑍𝑞 ordered
2nd order trans.
BKT trans.
Critical phase
Crossover

disordered
𝑍2 ordered
𝑍𝑞 ordered
2nd order trans.
BKT trans.
Critical phase
Crossover

2

3

4

5

6

𝑞

2

3

4

5

6

𝑞

Figure 4.32 The phase diagram of the extended 𝑞-state clock model [i.e. the ℎ𝑞 = ∞ plane of the
extended-O(2) model] for 𝜑0 = 0 (left) and 𝜑0 = −𝜋 (right). For 𝑞 = 2, 3, 4, there is a second-order
phase transition with a Z𝑞 ordered phase at large 𝛽. For finite integer 𝑞 ≥ 5, there is a critical phase
between a pair of BKT transitions and a Z𝑞 ordered phase at large 𝛽. For fractional 𝑞 > 2 with
𝜑0 = 0, there is a crossover line, a second-order transition line, and a Z2 ordered region between
every consecutive pair of integers. For 𝜑0 = −𝜋, the same is true for every other consecutive pair
of integers.

The phase diagrams for both 𝜑0 = 0 and 𝜑0 = −𝜋 are shown in Fig. 4.32. For 𝜑0 = 0, as

𝑞 → ⌈𝑞⌉−, the Z⌈𝑞⌉ symmetry is recovered. For 𝑞 > 4 and 𝜑0 = 0, both the small-𝛽 crossover

line and the large-𝛽 Ising critical point are smoothly connected to the small-𝛽 BKT transition point

and the large-𝛽 BKT transition point for integer 𝑞 from below, respectively. Notice that for 𝑞 < 4

and 𝜑0 = 0, only the large-𝛽 Ising critical point is smoothly connected to the second-order phase

transition for integer 𝑞 from below, while the crossover peak fades away for 𝑞 close enough (around

3.9 for 3 < 𝑞 < 4) to an integer from below. When 𝑞 → ⌊𝑞⌋+ and 𝜑0 = 0, the Ising critical point

goes to infinity, while the crossover line goes to a smaller value than the phase transition point for

the ⌊𝑞⌋-state clock model because there is 1 more degree of freedom than the Z⌊𝑞⌋ clock model.

For even ⌊𝑞⌋ and 𝜑0 = −𝜋, the phase diagram is the same as that for even ⌊𝑞⌋ and 𝜑0 = 0. For odd

⌊𝑞⌋ and 𝜑0 = −𝜋, the Z⌊𝑞⌋ symmetry is recovered when 𝑞 → ⌊𝑞⌋+. For 5 < 𝑞 < 6, both crossover

lines are smoothly connected to the two BKT transition points for integer 𝑞 from above. When 𝑞 is

increased toward an even ⌈𝑞⌉ and 𝜑0 = −𝜋, the small-𝛽 crossover line fades away and the large-𝛽

crossover line goes to a larger value than the large-𝛽 BKT transition for integer 𝑞 because there is 1

fewer degree of freedom than the ⌈𝑞⌉-state clock model. For 3 < 𝑞 < 4, there is only one crossover

113

Figure 4.33 For the extended-O(2) model, the phase diagram is three-dimensional. In the ℎ𝑞 = 0
plane, it is the 𝑋𝑌 model for all values of 𝑞. The 𝑋𝑌 model has a disordered phase at small 𝛽, a
single BKT transition at 𝛽𝑐 = 1.11995(6) [39], and a critical phase at large 𝛽. In the ℎ𝑞 = ∞ plane,
it is the extended 𝑞-state clock model, which has the phase diagram shown in Fig. 4.32. In this
example we have 𝜑0 = 0. Establishing the phase diagram at finite-ℎ𝑞 will be addressed in future
work.

line that is smoothly connected to the second-order phase transition point for 𝑞 = 3 and goes to

around 0.77 when 𝑞 → 4−.

4.2.8 Summary and Outlook

Interpolations among Z𝑛 clock models have been realized experimentally using a simple Rydberg

simulator, where Z𝑛 (𝑛 ≥ 2) symmetries emerge by tuning continuous parameters, the detuning and

Rabi frequency of the laser coupling, and the interaction strength between Rydberg atoms [107].

This paves the way to quantum simulation of lattice field theory with discretized field variables.

We are interested in a theory that can interpolate among the O(2) model and Z𝑛 clock models. We

define an extended-O(2) model by adding a symmetry breaking term ℎ𝑞 cos(𝑞𝜑𝑥) to the action of

the two-dimensional O(2) model. For integer 𝑞, Z𝑞 clock models emerge for large enough ℎ𝑞. For

fractional 𝑞, we believe there exists a much more interesting phase structure. The first step to graph

the full phase diagram in the (ℎ𝑞, 𝑞, 𝛽) cube is to consider the limit ℎ𝑞 → ∞. In this section, we

studied the fractional-𝑞-state clock model as the ℎ𝑞 → ∞ limit of the extended-O(2) model with

114

β?23456qhqhq=0hq=∞angular variables in the domain [𝜑0, 𝜑0 + 2𝜋). In this limit, the angular variable takes discrete

values 𝜑𝑥,𝑘 = 2𝜋𝑘/𝑞 with 𝑘 integer. By varying 𝜑0, the set of values of integer 𝑘 take either case 1

(0, 1, . . . , ⌊𝑞⌋) or case 2 (0, 1, . . . , ⌊𝑞⌋ − 1). In case 1, Z⌈𝑞⌉ symmetry is recovered as 𝑞 → ⌈𝑞⌉−,

while Z⌊𝑞⌋ symmetry is recovered as 𝑞 → ⌊𝑞⌋+ in case 2.

For the integer-𝑞-state clock model, there is a single second-order phase transition when 𝑞 =

2, 3, 4. When 𝑞 ≥ 5, there are two BKT transitions with a critical phase between them. We

studied the fractional-𝑞-state clock model using Monte Carlo (MC) and tensor renormalization

group (TRG) methods. We establish the phase diagram of the model for both 𝜑0 = 0 and 𝜑0 = −𝜋.

When 𝜑0 = 0, we are in case 1, and analysis of the finite-size scaling shows a crossover and a phase

transition of the Ising universality class. When 𝜑0 = −𝜋, we are in case 1 for even ⌊𝑞⌋ and in case

2 for odd ⌊𝑞⌋. There are no critical points for case 2.

In case 1, we found that there are two peaks in both the specific heat and the magnetic

susceptibility. The height of the small-𝛽 peak is always finite for fractional 𝑞. The large-𝛽 peak

diverges and characterizes an Ising critical point. When 𝑞 → ⌈𝑞⌉− and 𝑞 < 4, the large-𝛽 Ising

critical point is smoothly connected to the second-order phase transition point for Z⌈𝑞⌉ clock models,

while the small-𝛽 peak fades away. When 𝑞 → ⌈𝑞⌉− and 𝑞 > 4, the large-𝛽 Ising critical point

and the position of the small-𝛽 peak are smoothly connected, with the same power law scaling

∼ (⌈𝑞⌉ − 𝑞)𝑏, to the large and small BKT points respectively for Z⌈𝑞⌉ clock models. We also

found that the height of the small-𝛽 peak of the magnetic susceptibility diverges as a power law

1/( ⌈𝑞⌉ − 𝑞)14/15, from which we obtain a critical exponent 𝛿′ = 15 in the ansatz of the scaling

of the magnetization 𝑀 ∼ (⌈𝑞⌉ − 𝑞)1/𝛿′. This critical exponent is equal to 𝛿 associated with the

magnetization with an external field 𝑀 ∼ ℎ1/𝛿.

In case 2, there are no critical points. When

𝑞 → ⌊𝑞⌋+ and 𝑞 > 5, the small-𝛽 peak also goes to the small BKT point with the same power law

scaling and the same 𝛿′ exponent as case 1, while the large-𝛽 peak fades away and cannot be used

to extrapolate the large BKT point of Z⌊𝑞⌋ clock models.

To use the magnetic susceptibility to locate a critical point, a weak external field must be applied

for the magnetic susceptibility to be finite, and extrapolate the peak position to ℎ = 0. This method

115

works in most cases, but the peak fades away near the large-𝛽 BKT point of integer-𝑞-state clock

models. Our procedure provides an alternative approach to locate the BKT transitions of clock

models, by breaking the Z⌈𝑞⌉ symmetry to a Z2 symmetry in the 𝑞 direction instead of ℎ direction.

This procedure creates an Ising critical point that can be used to extrapolate the large-𝛽 BKT point

for clock models.

Our results clarify what phases the symmetry-breaking term ℎ𝑞 cos(𝑞𝜑𝑥) will drive the system

to. These phases should have boundaries in the finite-ℎ𝑞 direction. For small enough ℎ𝑞, the

extended-O(2) model should go back to the same universality class as the ordinary 𝑋𝑌 model,

which has been studied extensively [18, 21, 24, 39, 123–126]. For the ordinary 𝑋𝑌 model, there

is a single BKT transition from a disordered phase to a quasi-long-range-ordered critical phase at

𝛽𝑐 = 1.11995(6) [39]. Figure 4.33 shows the work that remains to be done to figure out the phase

diagram in the (𝛽, ℎ𝑞, 𝑞) space interpolating between the known phase diagram at ℎ𝑞 = 0 and the

phase diagram at ℎ𝑞 = ∞ discussed here. There should be a rich phase diagram in the finite-ℎ𝑞

region, which will be investigated in the next section.

4.3 The Extended-𝑂 (2) Model at Finite-ℎ𝑞

In the previous section, we studied the ℎ𝑞 → ∞ limit of the Extended-𝑂 (2) model. In that

limit, we could neglect the symmetry-breaking term in the Hamiltonian and directly restrict the

spins to the clock angles. Now we return to the full Hamiltonian

𝐻ext-𝑂 (2) = −

∑︁

𝑥,𝜇

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥) − ℎ𝑞

∑︁

cos(𝑞𝜑𝑥) − ℎ ∑︁

cos(𝜑𝑥 − 𝜑ℎ).

(4.41)

𝑥

𝑥

When ℎ𝑞 = 0, this is the classic 𝑋𝑌 model which is known to have a BKT transition. When ℎ𝑞 > 0,

the second term breaks periodicity and we must choose 𝜑 ∈ [𝜑0, 𝜑0 + 2𝜋) for some choice 𝜑0.

From here on, we use 𝜑0 = 0. When ℎ𝑞 → ∞, the continuous angle 𝜑𝑥 at a site 𝑥 is forced into the

discrete “clock” angles

𝜑0 ≤ 𝜑𝑥,𝑘 =

2𝜋𝑘
𝑞

< 𝜑0 + 2𝜋,

𝑘 = 0, 1, . . . , ⌊𝑞⌋,

(4.42)

as illustrated in Figure 4.34. For ℎ𝑞 → ∞, these angles are effectively the only allowed angles in

the model. This limit of the model was studied extensively in Sec. 4.2. For finite but large ℎ𝑞, these

116

Figure 4.34 Here we illustrate how the spin angle distribution changes with ℎ𝑞 for the example
𝑞 = 4.5. From left to right, we have ℎ𝑞 = 0, 1, 4, 64. For large values of ℎ𝑞, the spins strongly
prefer the “clock” angles defined by 𝜑 = 2𝜋𝑘/𝑞 for 𝑘 = 0, 1, . . . , ⌊𝑞⌋. Note the “leftover” angle,
which in this case is ˜𝜙 = 𝜋/4.5. These results are obtained by setting 𝛽 = 0 in the model where 𝛽
is treated as a coupling instead of as an inverse temperature.

angles are preferred angles.

For integer 𝑞 and ℎ𝑞 > 0, the 𝑂 (2) symmetry is broken down to a Z𝑞 symmetry. For noninteger

𝑞 and ℎ𝑞 > 0, the 𝑂 (2) symmetry is broken down to a Z2 symmetry. This is because for noninteger

𝑞, there is a “leftover” angle

˜𝜙 = 2𝜋

(cid:18)

1 −

(cid:19)

.

⌊𝑞⌋
𝑞

(4.43)

See Fig. 4.34 for an illustration of this leftover angle for the case 𝑞 = 4.5. To “see” the residual

Z2 symmetry, imagine drawing a line that bisects the leftover angle. The angles above this line are

symmetric with the ones below the line. Because of the leftover angle’s relation to the residual Z2

symmetry it plays an important role in this model.

We consider a variety of observables. We define the internal energy as

𝐸 = −

𝜕
𝜕 𝛽

ln 𝑍 = (cid:10)𝐻ext-𝑂 (2)

(cid:11) ,

where ⟨. . . ⟩ denotes the ensemble average. We define the specific heat

𝐶𝑉 =

−𝛽2
𝑉

𝜕𝐸
𝜕 𝛽

𝛽2
𝑉

=

(⟨𝐸 2⟩ − ⟨𝐸⟩2).

(4.44)

(4.45)

We consider also the “interaction energy” as the expectation value of only the interaction term in

the Hamiltonian

˜𝐸 =

(cid:42)

−

∑︁

𝑥,𝜇

(cid:43)

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥)

.

(4.46)

117

02π4.54π4.56π4.58π4.502π4.54π4.56π4.58π4.502π4.54π4.56π4.58π4.502π4.54π4.56π4.58π4.5The corresponding specific heat

˜𝐶𝑉 =

𝛽2
𝑉

(⟨ ˜𝐸 2⟩ − ⟨ ˜𝐸⟩2),

is expected to have the same critical behavior as the full specific heat Eq. (4.45).

With zero external field, ℎ = 0, we measure a proxy magnetization

⟨| (cid:174)𝑀 |⟩ =

(cid:42)(cid:12)
(cid:12)
∑︁
(cid:12)
(cid:12)
(cid:12)

𝑥

(cid:174)𝑆𝑥

(cid:43)

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

.

(4.47)

(4.48)

where (cid:174)𝑆𝑥 is the two-component unit vector sitting at the lattice site 𝑥. The corresponding magnetic

susceptibility is

𝜒

| (cid:174)𝑀 | =

1
𝑉

(cid:16)

⟨| (cid:174)𝑀 |2⟩ − ⟨| (cid:174)𝑀 |⟩2(cid:17)

.

(4.49)

For the proxy order parameter defined by Eq. 4.48, we have the corresponding Binder cumulant

𝑈

| (cid:174)𝑀 | = 1 −

⟨| (cid:174)𝑀 |4⟩
3⟨| (cid:174)𝑀 |2⟩2

.

(4.50)

In the limit 𝛽 → ∞, everything is frozen and so ⟨| (cid:174)𝑀 |4⟩ = ⟨| (cid:174)𝑀 |2⟩2, and this Binder cumulant goes

to the trivial value 2/3. In the limit 𝛽 → 0, the magnetization (cid:174)𝑀 is a 2-dimensional Gaussian

distribution centered at zero. The magnitude | (cid:174)𝑀 | of such a distribution is itself a Rayleigh

distribution, which has a fourth moment ⟨| (cid:174)𝑀 |4⟩ = 2⟨| (cid:174)𝑀 |2⟩2. Hence, in the limit 𝛽 → 0, this Binder

cumulant goes to the trivial value 1/3. The Binder cumulant varies rapidly with 𝛽 and varies with

lattice size. However, it has been shown that at a critical value of 𝛽, the Binder cumulant takes a

universal value independent of lattice size [137], and as such, it is a useful quantity for identifying

phase transitions and locating their critical points.

We also consider the Binder cumulant

𝑈𝜙 = 1 −

⟨𝑀 4
𝜙⟩
3⟨𝑀 2
𝜙⟩2

,

(4.51)

of the “rotated magnetization" 𝑀𝜙 = ⟨cos(𝑞𝜙), where 𝜙 ≡ arctan(𝑀𝑦/𝑀𝑥), and where 𝑀𝑥 and 𝑀𝑦

are the 𝑥 and 𝑦-components of the magnetization (cid:174)𝑀.

We consider also the structure factor

𝐹 ( (cid:174)𝑝) =

1
𝐿2

∑︁

∑︁

𝑖

𝑗

𝑒𝑖((cid:174)𝑥𝑖−(cid:174)𝑥 𝑗 )· (cid:174)𝑝 ⟨ (cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 ⟩,

(4.52)

118

which is the Fourier transform of the spin-spin correlator ⟨ (cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 ⟩. For an 𝐿 × 𝐿 lattice, we

define (cid:174)𝑝 = 2𝜋 (cid:174)𝑛/𝐿, with (cid:174)𝑛 ∈ {(0, 0), (0, 1), (1, 0), . . . , (𝐿/2, 𝐿/2)}. The position vector (cid:174)𝑥𝑖 is the 2-

component Cartesian vector corresponding to the 𝑖th lattice site. For periodic boundary conditions,

we can write Eq. 4.52 as

𝐹 ( (cid:174)𝑝) =

𝑖
Now there is only a single sum over the lattice—making this an efficient observable from which to

∑︁

𝑒−𝑖 (cid:174)𝑝·(cid:174)𝑥𝑖 (cid:174)𝑆𝑖

2

.

(4.53)

1
𝐿2

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)

obtain the critical exponent 𝜂 associated with the correlation function.

4.3.1 An Initial Survey

For the Extended-𝑂 (2) model, there is a three-parameter (𝛽, 𝑞, ℎ𝑞) phase diagram. For ℎ𝑞 → ∞,

the Extended-𝑂 (2) model reduces to the Extended-𝑞-state Clock Model—a phase diagram that we

established in the previous section and in [127]. For integer 𝑞, it is the well-studied ordinary clock

model. For noninteger 𝑞, there is a crossover at small 𝛽 and a second-order phase transition of the

Ising universality class at large 𝛽. On the other hand, when ℎ𝑞 → 0, the Extended-𝑂 (2) model

reduces to the 𝑋𝑌 model which has a BKT transition at 𝛽𝑐 ≈ 1.12. This phase transition occurs for

all values of 𝑞 since 𝑞 is irrelevant when ℎ𝑞 = 0. Thus, we have the picture given by Figure 4.33.

Now, we want to replace the question mark in that figure. What is the phase diagram between

ℎ𝑞 = 0 and ℎ𝑞 = ∞?

We started with an initial survey of the parameter space on small 4 × 4 lattices. Using

Monte Carlo, we did a scan with 𝛽 ∈ [0, 2], Δ𝛽 = 0.02, 𝑞 ∈ [1.1, 6.0], Δ𝑞 = 0.1, and ℎ𝑞 =

0.1, 1, 2, 4, 16, 64. This required a total of 30,300 Monte Carlo simulations, so despite the small

lattice size, this survey required a nontrivial number of CPU hours. In Figure 4.35 we plot heatmaps

of the specific heat. The specific heat often diverges with volume at phase transitions and shows a

peak at fixed volumes. Therefore, a heatmap of the specific heat can serve as a proxy for the phase

diagram of the model, and this one is very suggestive. Each panel in this figure is at a different

value of the symmetry-breaking parameter ℎ𝑞. In the bottom-right panel, ℎ𝑞 = 64. Compare this

with the phase diagram of the Extended-𝑞-state clock model shown in the left panel of Figure 4.32.

The picture is very similar, which leads us to conclude that ℎ𝑞 = 64 is already effectively ℎ𝑞 = ∞.

119

Figure 4.35 Here we show a heatmap of the specific heat obtained via Monte Carlo on a 4 × 4
lattice. Each panel shows the result at a different value of the symmetry-breaking parameter ℎ𝑞.
For general noninteger 𝑞, we see two peaks in the specific heat—one at small-𝛽 near 𝛽 ∈ [0.5, 1]
and one at large-𝛽. Note that each pixel corresponds to a distinct MCMC simulation.

Notice that there are discontinuities as one moves left or right in this picture across integer values

of 𝑞. This should not be surprising. When 𝑞 is dialed across an integer, the number of preferred

angles in the model increases or decreases by one. For example, for 𝑞 = . . . , 4.8, 4.9, 5.0, there

are five preferred angles in the model (these are the allowed angles when ℎ𝑞 = ∞), but when

𝑞 = 5.0 + 𝜖, there are six preferred angles in the model. Thus, it is not surprising that the phase

diagram shows discontinuities at large ℎ𝑞. As ℎ𝑞 is dialed to smaller values (e.g. ℎ𝑞 = 2 or 1), the

discontinuities smooth out and the proxy phase diagram becomes of a set of smooth curves. At

small ℎ𝑞 (e.g ℎ𝑞 = 0.1), most of the curves have faded away and what remains is a horizontal line

that connects to the BKT transition of the 𝑋𝑌 model at ℎ𝑞 = 0.

120

0.00.51.01.52.0hq=0.1hq=1hq=223456q0.00.51.01.52.0hq=423456qhq=1623456qhq=640.00.51.01.52.02.5Specific Heat, 4x4 LatticeFigure 4.36 Here we show a heatmap of the proxy magnetization obtained via Monte Carlo on a
4 × 4 lattice. Each panel shows the result at a different value of the symmetry-breaking parameter
ℎ𝑞. Note that each pixel corresponds to a distinct MCMC simulation.

For a generic noninteger 𝑞, we see two peaks (for sufficiently large lattices) in the specific

heat—one at small-𝛽 near 𝛽 ∈ [0.5, 1] and one at large-𝛽 with its 𝛽-position depending on 𝑞. The

main question for this section of this thesis is whether these peaks correspond to phase transitions,

and if so, what kind of transitions are they?

In Figure 4.36, we show similar heatmaps of the proxy magnetization. Since this quantity

illustrates the ordered and disordered phases, it gives us further information about the phase diagram.

Finally, in Figure 4.37, we show heatmaps of the susceptibility of the proxy magnetization. The

magnetic susceptibility generally diverges at a phase transition, however, to see all of the expected

peaks in this quantity, one has to go to larger lattices.

121

0.00.51.01.52.0hq=0.1hq=1hq=223456q0.00.51.01.52.0hq=423456qhq=1623456qhq=640.00.20.40.60.81.0Proxy Magnetization, 4x4 LatticeFigure 4.37 Here we show a heatmap of the susceptibility of the proxy magnetization obtained via
Monte Carlo on a 4 × 4 lattice. Each panel shows the result at a different value of the symmetry-
breaking parameter ℎ𝑞. Note that each pixel corresponds to a distinct MCMC simulation.

4.3.2 Autocorrelations and the Implementation of BMHA

In the ℎ𝑞 → ∞ limit, discussed in previous sections, the degrees of freedom of the Extended-

𝑂 (2) model can be treated as discrete. From the MCMC side, this allowed us to use an efficient

heatbath algorithm. Furthermore, it allowed us to use a TRG method to study the system at

large volumes. The model is more difficult to study at finite ℎ𝑞. The degrees of freedom are now

continuous, and MCMC heatbath is not an option. We are left with the Metropolis algorithm, which

suffers from low acceptance rates and leads to large autocorrelation times as shown in Fig. 4.38.

The TRG algorithm we had worked only for the ℎ𝑞 → ∞ limit. We needed to make some algorithm

advancements.

122

0.00.51.01.52.0hq=0.1hq=1hq=223456q0.00.51.01.52.0hq=423456qhq=1623456qhq=640.00.20.40.60.81.01.21.4Susceptibility of Proxy Magnetization, 4x4 LatticeFigure 4.38 The integrated autocorrelation time of the internal energy (left) and the proxy magne-
tization (right) for several lattice sizes 𝐿. The three different colors correspond to three different
parameter sets. The solid lines are from the BMHA algorithm, and the dashed lines are from
the Metropolis algorithm. We see that the integrated autocorrelation time is one to two orders of
magnitude smaller for the BMHA algorithm. All results shown are from simulations performed at
𝛽 = 1. In practice, we find that the integrated autocorrelation time increases with larger 𝛽. Note,
connecting lines were added only to help guide the eyes.

From the MCMC side, we can implement a BMHA algorithm which is designed to approach

heatbath acceptance rates. The result is integrated autocorrelation times which are orders of

magnitude better than the Metropolis algorithm as shown in Fig. 4.38. The BMHA implementation

of the 𝑂 (2) model described in Sec. 3.2.3 uses a “1 variable / 1 parameter” BMHA algorithm. For

the Extended-𝑂 (2) model, the PDF can be written in the form

𝑃(𝜑; 𝛼, 𝜑+) = 𝑒𝛼 cos(𝜑−𝜑+)+𝛽ℎ𝑞 cos(𝑞𝜑),

𝛼 ≡ 𝛽

(cid:12)
(cid:12)
(cid:12)

(cid:174)𝑆+

(cid:12)
(cid:12)
(cid:12)

(4.54)

where 𝜑 is the BMHA variable and 𝛼 and 𝜑+ are BMHA parameters. The model parameters 𝛽, ℎ𝑞,
and 𝑞 are fixed for a given simulation. Recall that (cid:174)𝑆+ is the “stencil”, i.e. the sum of the nearest

neighbors at a given site, and 𝜑+ is the angle of the stencil vector. The presence of the second term

in the exponential means that, unlike in the 𝑂 (2) case, we cannot eliminate the angle 𝜑+ by a shift

of the variable 𝜑. Instead, we now have to use a “1 variable / 2 parameter” BMHA algorithm. This

increases the dimensionality of the BMHA lookup table, but it does not significantly complicate

things since each parameter can be uniformly discretized.

123

46810L101102τintEq=4.9,β=1.0,hq=1q=4.9,β=1.0,hq=64q=5.0,β=1.0,hq=6446810L|~M|4.3.3

𝛽 as a Coupling or as Inverse Temperature

We define the Extended-𝑂 (2) model to be the model with Hamiltonian

𝐻ext-𝑂 (2) = −

∑︁

𝑥,𝜇

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥) − ℎ𝑞

∑︁

cos(𝑞𝜑𝑥) − ℎ ∑︁

cos(𝜑𝑥 − 𝜑ℎ),

(4.55)

𝑥

𝑥

and canonical partition function

𝑍 =

∫ 𝜑0+2𝜋

𝜑0

𝑑𝜑𝑥
2𝜋

(cid:214)

𝑥

𝑒−𝛽𝐻.

(4.56)

With this definition, 𝛽, which appears only in the Boltzmann factor and not in the Hamiltonian,

plays the role of the thermodynamic inverse temperature.

An alternative definition of the model is to write the Hamiltonian as

˜𝐻ext-𝑂 (2) = −𝛽 ∑︁

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥) − ˜ℎ𝑞

∑︁

cos(𝑞𝜑𝑥) − ℎ ∑︁

cos(𝜑𝑥 − 𝜑ℎ),

(4.57)

𝑥,𝜇

𝑥

𝑥

and then write the partition function as

˜𝑍 =

∫ 𝜑0+2𝜋

𝜑0

𝑑𝜑𝑥
2𝜋

(cid:214)

𝑥

𝑒− ˜𝐻.

(4.58)

Notice that 𝛽 is now attached to the first term of the Hamiltonian rather than in the Boltzmann factor

of the partition function. In this field-theoretic definition, 𝛽, ˜ℎ𝑞, and ℎ are coupling parameters.

This was the definition that was used in [127].

Not surprisingly, there are slight differences between these two models as shown in Fig. 4.39.

In this figure, the left panel shows the interaction energy of the Extended-𝑂 (2) model with 𝑞 = 4.9

on a 4 × 4 lattice. The right panel shows the corresponding specific heat. The results in blue are

from the model with 𝛽 as the inverse temperature and ℎ𝑞 = 5, and the results in red are from the
model where 𝛽 is treated as a coupling and with ˜ℎ𝑞 = 5.

Although there are slight differences between the two models, the general critical behavior

should be the same since results from one model can be mapped to the other with ˜ℎ𝑞 = 𝛽ℎ𝑞. Note
that the two models coincide when 𝛽 = 1 and ℎ𝑞 = ˜ℎ𝑞. For a given 𝑞, the same kind of phase

transition(s) should occur in both models although the critical temperatures may differ. In fact,

124

Figure 4.39 (Left) The interaction energy of the Extended-O(2) model with 𝑞 = 4.9 and ℎ𝑞 = ˜ℎ𝑞 = 5
on a 4 × 4 lattice. (Right) The corresponding specific heat. The results in blue are from the model
with 𝛽 as the inverse temperature, and the results in red are from the model where 𝛽 is treated as
a coupling. The results coincide for 𝛽 = 1 but differ slightly elsewhere. The connecting lines are
only to guide the eyes.

performing the same kind of survey as in Figure 4.35 but with 𝛽 treated as a coupling, shows the

same kind of pictures but stretched a bit in the 𝛽 dimension.

From a field theory perspective, it is more natural to treat 𝛽 as a coupling parameter attached to

the interaction term of the Hamiltonian. On the other hand, from a thermodynamics perspective,

it is more natural to have 𝛽 factored out of the Hamiltonian.

In the latter approach, one can

use the standard definition Eq. (4.44) of the internal energy as the derivative of the free energy

with respect to 𝛽, and 1/𝛽 is the true thermodynamic temperature. In the end, our need to use

temperature reweighting when doing the finite-size-scaling analysis forced us to use the choice

Eq. (4.55) such that 𝛽 is the true inverse temperature. One could use temperature reweighting also

with the alternative definition, however, it would not be as straightforward since one would have

to factor 𝛽 out of the action. But this would leave a nontrivial interplay between 𝛽 and ˜ℎ𝑞 in the
sense that any change in 𝛽 would effectively also change ˜ℎ𝑞. Similar considerations would apply to

the external field coupling ℎ, however, for the present Monte Carlo studies, we have ℎ = 0. Unless

otherwise noted, the results shown in this thesis are from the model with Hamiltonian Eq. (4.55)

125

0.00.51.01.52.0β−2.00−1.75−1.50−1.25−1.00−0.75−0.50−0.250.00˜E/Nβasinversetemperatureβascoupling0.00.51.01.52.0β0.00.20.40.60.81.0˜CVwhere 𝛽 serves as the inverse temperature.

4.3.4 The Clock Model Limit

In the clock model, we have the energy function

𝐻 = −

∑︁

𝑥,𝜇

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥),

with the angles 𝜑(𝑘)

𝑥

selected discretely as

𝜑0 ≤ 𝜑(𝑘)

𝑥 =

2𝜋𝑘
𝑞

< 𝜑0 + 2𝜋,

(4.59)

(4.60)

with 𝑘 ∈ Z. For example, if 𝜑0 = 0, then 𝑘 = 0, 1, 2, . . . , ⌊𝑞⌋ with 𝑞 ∈ R. In general, 𝜑0 must be

specified because, for example, the number of 𝑘’s in a 2𝜋 domain varies depending on 𝜑0.

In the zero-field Extended-𝑂 (2) model, we have the energy function

𝐻 = −

∑︁

𝑥,𝜇

cos(𝜑𝑥+ ˆ𝜇 − 𝜑𝑥) − ℎ𝑞

cos(𝑞𝜑𝑥).

∑︁

𝑥

with the angles 𝜑𝑥 selected continuously in

𝜑0 ≤ 𝜑 ∈ R < 𝜑0 + 2𝜋.

(4.61)

(4.62)

Now, the allowed angles are the continuous values 𝜑 ∈ [𝜑0, 𝜑0 + 2𝜋), but when ℎ𝑞 is taken large, the

Boltzmann factor 𝑒−𝛽𝐻 ∼ 𝑒 𝛽ℎ𝑞 (cid:205) cos(𝑞𝜑) forces the angles to take the values 𝜑𝑥 = 2𝜋𝑘/𝑞 with 𝑘 ∈ Z.

Hence, we claim the Extended-𝑂 (2) model reduces to the clock model in the limit ℎ𝑞 → ∞. This

is true for integer values of 𝑞, however, for noninteger 𝑞, this claim fails as we can see in Fig. 4.40.

For non-integer 𝑞, the 2𝜋-rotational symmetry is broken. Thus, we have to more carefully define

the domain of the angles 𝜑𝑥. For example, one could choose 𝜑0 = −𝜋 and then 𝜑𝑥 ∈ [−𝜋, 𝜋) or one

could choose 𝜑0 = 0 and then 𝜑𝑥 ∈ [0, 2𝜋). These different choices lead to very different phase

diagrams as we showed for the clock model in Fig. 4.32. We prefer the choice 𝜑𝑥 ∈ [0, 2𝜋), simply

because the phase diagram gives a more consistent periodic picture as 𝑞 is increased4. However,

with the choice 𝜑𝑥 ∈ [0, 2𝜋), there is a hard cutoff at the angle 𝜑 = 0. When ℎ𝑞 = ∞, this is not a

4See the left panel of Fig. 4.32 as opposed to the right panel in the same figure.

126

Figure 4.40 Here we compare the 𝑞-state clock model, which has been extended to noninteger
values of 𝑞 and has energy function Eq. (4.59) (black curves), with the Extended-O(2) model
which has energy function Eq. (4.61) (colored curves) on a 4 × 4 lattice. For the Extended-
O(2) model, we consider increasing values of finite ℎ𝑞 in order to understand the limit ℎ𝑞 → ∞.
Naively, we expect the Extended-O(2) model to limit to the clock model when ℎ𝑞 is taken large. The
Extended-O(2) model results are from lower statistics runs. The top row compares the “interaction”
energy, the second row compares the corresponding specific heat, the third row compares the proxy
magnetization, and the bottom row compares the susceptibility of the proxy magnetization. The
different columns correspond to different values of 𝑞 with 𝑞 = 4.1, 4.7, 5.0 from left to right. In
the naive approach (𝜑 ∈ [0, 2𝜋)) illustrated here, the ℎ𝑞 → ∞ limit of the Extended-𝑂 (2) model
is the clock model only when 𝑞 is integer. For noninteger values of 𝑞, the ℎ𝑞 → ∞ limit of the
Extended-𝑂 (2) model differs from the clock model.

127

−2.0−1.5−1.0−0.50.0˜E/Vq=4.1clockhq=1hq=2hq=4hq=8hq=16hq=32hq=64q=4.7q=5.00.000.250.500.751.001.25˜C0.00.20.40.60.81.0h|~M|i/V0.00.51.01.52.0β0.00.20.40.60.8χ|~M|0.00.51.01.52.0β0.00.51.01.52.0βproblem, but at finite ℎ𝑞, this cutoff skews the distribution of angles severely enough that the model

at finite ℎ𝑞 does not smoothly connect with the model at infinite ℎ𝑞 when ℎ𝑞 → ∞. To fix this, we

need to slightly shift the angle domain such that 𝜑𝑥 ∈ [−𝜀, 2𝜋 − 𝜀). To match the clock model (i.e.

ℎ𝑞 = ∞ case), one needs an 𝜀 that varies with 𝑞 and satisfies the condition 0 < 𝜀 < 2𝜋(1 − ⌊𝑞⌋/𝑞).

In general, we choose

𝜀 = 𝜋

(cid:18)

1 −

(cid:19)

.

⌊𝑞⌋
𝑞

That is, the partition function Eq. (4.6) is modified to be

𝑍 =

∫ 2𝜋−𝜀

(cid:214)

−𝜀

𝑥

𝑑𝜑𝑥
2𝜋

𝑒−𝛽𝐻.

(4.63)

(4.64)

After taking the ℎ𝑞 → ∞ limit, the 𝜀 → 0 limit can be taken to connect with the clock models.

To understand why such a shift is needed, see Appendix B.3. In Fig. 4.41, we see the effect of

implementing such a shift in the angles. Now the Extended-𝑂 (2) model connects to the clock

model in the ℎ𝑞 → ∞ limit for both integer and noninteger 𝑞.

There remains one more subtlety. When 𝑞 is noninteger, one may find that the Extended O(2)

model with ℎ𝑞 → ∞ and the clock model do not agree when 𝛽 → 0. This can be seen in Figure 4.41.

For non-integer 𝑞, the Z𝑞 symmetry is explicitly broken, and for example, the energy density of the

clock model does not go to zero in the limit 𝛽 → 0 as can be seen in Figs. 4.2–4.6. In contrast,

the energy density in the Extended O(2) model does go to zero in the limit 𝛽 → 0 for all values of

ℎ𝑞. The discrepancy can be understood by considering the Boltzmann factor of the Extended O(2)

model, which has the form

𝑒−𝛽𝐻 = 𝑒 𝛽 (cid:205) cos(𝜑𝑥+ ˆ𝜇−𝜑𝑥)+𝛽ℎ𝑞 (cid:205) cos(𝑞𝜑𝑥)

(4.65)

The clock limit occurs when ℎ𝑞 → ∞ and the second term becomes very large—forcing the angles

into the discrete values 𝜑𝑥 = 2𝜋𝑘/𝑞. When 𝛽 → 0, the two limits compete against each other. To

match the clock model in the ℎ𝑞 → ∞ limit for all values of 𝛽, one has to adjust ℎ𝑞 such that 𝛽ℎ𝑞

stays large as one approaches 𝛽 = 0. Incidentally, when 𝛽 is treated as a coupling attached only to

the interaction term instead of as a proper inverse temperature, then this discrepancy at small 𝛽 does

128

Figure 4.41 Here we compare the 𝑞-state clock model (black curves) with the ℎ𝑞 → ∞ limit of the
Extended-O(2) model (colored curves) on a 4 × 4 lattice. The Extended-O(2) model results are
from lower statistics runs. The top row compares the “interaction” energy, the second row compares
the corresponding specific heat, the third row compares the proxy magnetization, and the bottom
row compares the susceptibility of the proxy magnetization. The different columns correspond to
different values of 𝑞 with 𝑞 = 4.1, 4.7, 5.0 from left to right. Here, 𝜑 ∈ [−𝜀, 2𝜋 − 𝜀) with 𝜀 given
by Eq. (4.63), and now the Extended-O(2) model becomes the clock model in the ℎ𝑞 → ∞ limit.
Notice that the matching breaks down at very small 𝛽 in this illustration when 𝑞 is noninteger. As
explained in the text, the proper matching procedure would be to adjust ℎ𝑞 such that 𝛽ℎ𝑞 stays large
as one approaches 𝛽 = 0.

129

−2.0−1.5−1.0−0.50.0˜E/Vq=4.1clockhq=1hq=2hq=4hq=8hq=16hq=32hq=64q=4.7q=5.00.000.250.500.751.001.25˜C0.00.20.40.60.81.0h|~M|i/V0.00.51.01.52.0β0.00.20.40.60.8χ|~M|0.00.51.01.52.0β0.00.51.01.52.0βFigure 4.42 (Left) The phase diagram of a quantum simulator with continuously tunable parameters
composed of Rydberg atoms [106, 107]. Reproduced with permission from Springer Nature. Note
the similarity with a heatmap (right) of the specific heat from the Extended-O(2) model with finite
ℎ𝑞 obtained from TRG on a 𝐿 = 1024 lattice. Note that each pixel corresponds to a distinct TRG
calculation.

not occur because in the Boltzmann factor, 𝑒− ˜𝐻 = 𝑒 𝛽 (cid:205) cos(𝜑𝑥+ ˆ𝜇−𝜑𝑥)+ℎ𝑞 (cid:205) cos(𝑞𝜑𝑥), 𝛽 is not multiplying

the symmetry-breaking term.

4.3.5 The Search for Exotic Phases

As shown in Fig. 4.42, the heatmap of the specific heat for the Extended-𝑂 (2) model at finite

ℎ𝑞 is similar to the phase diagram of a quantum simulator composed of Rydberg atoms [106, 107].

The two models both contain crystalline Z𝑞 ordered phases for 𝑞 = 2, 3, 4, and both have a small

set of continuously tunable parameters. For the Extended-𝑂 (2) model, these parameters are 𝛽, 𝑞,

and ℎ𝑞. For the Rydberg simulator, these parameters include the interaction range 𝑅𝐵/𝑎 and the

detuning Δ/Ω. However, the physics of these two models is quite different. The Rydberg model,

for example, contains long-range interactions whereas the Extended-𝑂 (2) model contains only

nearest-neighbor interactions. As such, the similarity between the Rydberg phase diagram and our

proxy phase diagram may be purely coincidental. However, we considered it worth investigating.

The heatmaps of the specific heat at large ℎ𝑞 show discontinuities similar to the clock model

130

phase diagram shown in Fig. 4.32. However, at small ℎ𝑞, these discontinuities seem to disappear,

and the heatmaps show smooth lobes. This motivated us to explore these regions in parameter space

to look for exotic phases such as the floating phases in Rydberg chains [138] or the commensurate-

incommensurate phase transition in the Pokrovsky-Talapov model [139].

We start by looking at several thermodynamic functions in the vicinity of the “hotspot” near

𝑞 ∼ 3, 𝛽 ∼ 0.8 and ℎ𝑞 ∼ 1 (see the bright yellow point in the right panel of Fig. 4.42). This

point is chosen because it appears as a “triple point” in the proxy phase diagram formed by the

heatmap of the specific heat. That is, 𝑞 = 3 is where three curves converge to a point. Similar

“triple point” features are seen near 𝑞 = 4, 𝑞 = 5, and 𝑞 = 6. If exotic phases exist near any of

these points, we expect them to exist near all of these points. We could just as well have chosen

to study one of those other points. From left to right, the columns in Fig. 4.43 correspond to

𝑞 = 2.8, 2.9, 3.0, 3.1, 3.2. From top to bottom, the rows in this figure illustrate the energy density,

specific heat, proxy magnetization, and susceptibility of the proxy magnetization. For 𝑞 = 3, we see

the classic signals of a second order phase transition. The specific heat and susceptibility diverge

with volume near 𝛽 = 0.8. For 𝑞 = 2.8 and 2.9, there are two features of interest, which manifest

as two peaks in the magnetic susceptibility. For 𝑞 = 2.9, the peaks are near 𝛽 = 0.7 and 𝛽 = 0.85.

For 𝑞 = 2.8, the peaks are near 𝛽 = 0.6 and 𝛽 = 1. For both 𝑞 = 2.8 and 2.9, the specific heat

and magnetic susceptibility appear to plateau with increasing volume near the small-𝛽 peak. This

implies that there is only a crossover here and not a true transition. For the large-𝛽 peak, both

the specific heat and magnetic susceptibility seem to diverge with volume. This suggests a phase

transition here. For 𝑞 = 3.1 and 3.2, we see only a single peak in the specific heat and susceptibility,

and both quantities appear to plateau here, which suggests that this is a crossover rather than a phase

transition. For 𝑞 = 3.1 and 3.2, there seem to be phase transitions at much larger 𝛽—beyond the

range of these plots and far from this “hotspot”.

The heatmaps of the specific heat, magnetization, and magnetic susceptibility show smooth

curves for intermediate values of ℎ𝑞 (e.g. for ℎ𝑞 ∼ 1). This suggests the possibility that the phase

diagram will have smooth phase transition curves even as one crosses integer values of 𝑞. This

131

Figure 4.43 From left to right, the columns correspond to 𝑞 = 2.8, 2.9, 3.0, 3.1, 3.2. From top to
bottom, the rows illustrate the energy density, specific heat, proxy magnetization, and susceptibility
of the proxy magnetization. The different colors correspond to the different lattice sizes 𝐿 =
4, 8, 16, 32. Connecting lines are drawn to help guide the eyes. For 𝑞 = 2.8 and 2.9 we see evidence
of a crossover at smaller 𝛽 and a phase transition at larger 𝛽. For 𝑞 = 3.0, there is only a phase
transition, and for 𝑞 = 3.1 and 3.2 we see only a crossover for the illustrated 𝛽 range.

132

−2.5−2.0−1.5−1.0E/Vq=2.8L=4L=8L=16L=32q=2.9q=3.0q=3.1q=3.2012345C0.00.20.40.60.81.0h|~M|i/V0.60.8β0102030χ|~M|0.60.8β0.60.8β0.60.8β0.60.8βFigure 4.44 The specific heat (left) and the entanglement entropy (right) from TRG for various 𝑞
near 𝑞 = 3 with 𝛽 ∈ [0, 2] and ℎ𝑞 = 1. These results were obtained from lattices of size 𝐿 = 1024.
Note that each pixel corresponds to a distinct TRG calculation. The specific heat suggests a smooth
curve in the phase diagram, however, the entanglement entropy shows a clear discontinuity across
𝑞 = 3.

would be very different from the model in the ℎ𝑞 → ∞ limit where there are clear discontinuities

at integer values of 𝑞 in the phase diagram. Furthermore, comparison of these heatmaps at

intermediate ℎ𝑞 with the phase diagrams of Rydberg atom chains suggests the possibility of exotic

phases. However, a closer look at how the thermodynamic functions behave with lattice size near

𝑞 = 3 (see Fig. 4.43), suggests a more mundane picture. In particular, it suggests that for 𝑞 = 3

and 𝑞 ≲ 3, near ℎ𝑞 ∼ 1, there is a second-order phase transition near 𝛽 = 0.8, however for 𝑞 ≳ 3,

there is only a crossover in this region. This is similar to the model at ℎ𝑞 → ∞—suggesting that

the phase diagram at ℎ𝑞 → ∞ persists to finite ℎ𝑞.

This conclusion is supported by some large-volume results obtained from TRG. See Fig. 4.44.

Here we have the specific heat (left panel) and the entanglement entropy (right panel) for various 𝑞

near 𝑞 = 3 with 𝛽 ∈ [0, 2] and ℎ𝑞 = 1. These results were obtained from lattices of size 𝐿 = 1024.

The specific heat suggests a smooth curve in the phase diagram, however, the entanglement entropy

shows a clear discontinuity across 𝑞 = 3. For 𝑞 = 3, the entanglement entropy for large 𝛽 is ≈ ln 3,

which suggests a Z3 ordered phase. Inside the “lobe,” the entanglement entropy is ≈ ln 2, which

suggests Z2 order.

133

2.62.682.762.842.923.03.083.163.243.323.4q0.00.51.01.52.0βSpeciﬁcHeatforhq=10.00.51.01.52.02.53.02.62.682.762.842.923.03.083.163.243.323.4q0.00.51.01.52.0βEntanglementEntropyforhq=10.00.51.01.52.02.5Figure 4.45 Here we show the distribution of spin angles 𝜑 via histograms. These results were
obtained on 4 × 4 lattices with 𝛽 = ℎ𝑞 = 1 and 𝑞 = 2.8 (left), 𝑞 = 3.0 (center), and 𝑞 = 3.2 (right).

Another investigative technique we can use is to study the distribution of the spin angles 𝜑. In

Fig. 4.45, we show histograms of the spin angles for several different values of 𝑞. From left to

right, the histograms correspond to 𝑞 = 2.8, 3.0, 3.2. All three examples are from 4 × 4 lattices

and with 𝛽 = ℎ𝑞 = 1. Compare with Fiq. 4.34, which uses polar histograms. The peaks in the

distribution correspond to the preferred “clock angles” 2𝜋𝑘/𝑞 with 𝑘 = 0, 1, . . . , ⌊𝑞⌋. For a given

set of parameters (𝑞, 𝛽, ℎ𝑞), a Monte Carlo simulation was run using a BMHA algorithm with

220 equilibrating sweeps followed by 213 measurement sweeps each of which was separated by 28

discarded sweeps. For each measurement sweep, the lattice configuration was saved to file. For

a 4 × 4 system, each configuration contains 16 angles—one for each site. These 213 × 16 angles

were histogrammed to obtain a pictures like those in Fig. 4.45. For 𝑞 = 3, there are three equal

and equidistant peaks as expected for a system with Z3 symmetry. When 𝑞 is noninteger, that

symmetry is broken and the peaks are no longer equal or evenly spaced. Now, the system seems

to be dominated by the two preferred angles which are closest to each other: 𝜑 = 0 (or 2𝜋) and
𝜑 = 2𝜋⌊𝑞⌋/𝑞 = − ˜𝜙. Interestingly, the effect is stronger for 𝑞 just above an integer (e.g. 𝑞 = 3.2)

than for 𝑞 just below an integer (e.g. 𝑞 = 2.8).

To illustrate angle histograms over a much larger parameter space, we use a heatmap, which

converts the histogram height into color as shown in Fig. 4.46. These results are again with ℎ𝑞 = 1

and 𝐿 = 4, but now with 𝛽 ∈ [0, 1.2] and for 𝑞 = 2.5, 2.8, 2.9, 3.0, 3.1, 3.2, and they further

demonstrate that the system is strongly affected by the “leftover” angle when 𝑞 is noninteger.

However, it also shows that this effect becomes weaker as 𝑞 approaches the next integer from below.

134

0π2π3π22πϕ0.00.10.20.30.40π2π3π22πϕ0.00.10.20.30π2π3π22πϕ0.00.20.40.6Figure 4.46 Heatmaps of the angle histograms (see Fig. 4.45) for a 4 × 4 lattice with ℎ𝑞 = 1,
𝛽 ∈ [0, 1.2], and 𝑞 = 2.5, 2.8, 2.9, 3.0, 3.1, 3.2. Note that each pixel corresponds to a distinct
MCMC simulation.

For example, the effect is large for 𝑞 = 3.1 but small for 𝑞 = 2.9. An alternative representation is

shown in Fig. 4.47, where we plot a heatmap for constant 𝛽 = 1 and varying 𝑞. We looked also at

histogram heatmaps for smaller ℎ𝑞 = 0.1 and for even and odd lattice sites separately, but nothing

fundamentally different was seen.

Since the angle histogram for e.g. 𝑞 = 2.9 is very similar to that of 𝑞 = 3.0 (likewise 3.9

is similar to 4.0 and so on), this suggests the possibility that there is a Z3 ordered phase also for

𝑞 = 2.9. In other words, the lines of Z𝑞 order in the phase diagram may have some width at finite

ℎ𝑞. However, we were unable to find any evidence of such “thick” 𝑍𝑞 order lines.

At this point, we have not found evidence for exotic phases or really anything different from

the ℎ𝑞 → ∞ case. However, given the similarity between the phase diagram of a Rydberg atom

system and our heatmap of the specific heat (see Fig. 4.42), we would be remiss if we did not

check for the floating phases that are found in Rydberg chains. Evidence for these phases includes

nontrivial structures (curves) in heatmaps of the structure factor 𝐹 ( 𝑝) when plotted with momenta

135

0.000.250.500.751.00βq=2.5q=2.8q=2.90π2π3π22πϕ0.000.250.500.751.00βq=3.00π2π3π22πϕq=3.10π2π3π22πϕq=3.2Figure 4.47 Heatmaps of the angle histograms for constant 𝛽 = 1 and varying 𝑞 ∈ [1.1, 6.0].
Results are from 4 × 4 lattices with ℎ𝑞 = 1. The red labels on the right edge correspond to the
𝑞-values shown in Fig. 4.45. Note that each pixel corresponds to a distinct MCMC simulation.

Figure 4.48 Heatmaps of the structure factor 𝐹 ( 𝑝) with momenta 𝑝 on the vertical axis and with 𝑞
on the horizontal axis. The different panels correspond to different 𝛽 with 𝛽 = 0.7, 0.9, 1.1 from
left to right. These results are from 32 × 32 lattices with ℎ𝑞 = 1 and are from the model where 𝛽 is
treated as a coupling. Note that each pixel corresponds to a distinct MCMC simulation.

136

0π2π3π22πϕ123456q2.83.03.2123456q0π4π23π4πp=2πn/Lβ=0.7123456qβ=0.9123456qβ=1.1100101102Figure 4.49 Heatmaps of the staggered structure factor 𝐹 ( 𝑝) with momenta 𝑝 on the vertical axis and
with 𝑞 on the horizontal axis. The different panels correspond to different 𝛽 with 𝛽 = 0.7, 0.9, 1.1
from left to right. These results are from 32 × 32 lattices with ℎ𝑞 = 1 and are from the model where
𝛽 is treated as a coupling. Note that each pixel corresponds to a distinct MCMC simulation.

on one axis and the tunable parameter on the other axis [138]. In our case, we studied the structure

factor 𝐹 ( 𝑝) on 𝐿 = 32 lattices with 𝑝 = 2𝜋𝑛/𝐿 and 𝑛 = 0, 1, . . . , 𝐿/2. To keep things simple, we

considered only on-axis modes such as (cid:174)𝑝 = (0, 0), (0, 1), (0, 2), . . . , (0, 𝐿/2). We used ℎ𝑞 = 1 and

looked at several 𝛽-values. See Fig. 4.48. No evidence of floating phases is seen. We consider also

the “staggered” structure factor, which can be computed using the same definition of the structure

factor after rotating all spin vectors on odd-parity sites by the angle 𝜋. The results given in Fig. 4.49

show no evidence of exotic phases.

4.3.6 Binder Cumulants

The Binder cumulant

𝑈𝑀 = 1 −

⟨𝑀 4⟩
3⟨𝑀 2⟩2

.

(4.66)

is the fourth order cumulant of the magnetization. Here, 𝑀 = | (cid:174)𝑀 | is the magnitude of the

magnetization vector. In the limit 𝛽 → ∞, everything is frozen and so ⟨| (cid:174)𝑀 |4⟩ = ⟨| (cid:174)𝑀 |2⟩2, and

this Binder cumulant goes to the trivial value 2/3. In the limit 𝛽 → 0, the magnetization (cid:174)𝑀 is a

2-dimensional Gaussian distribution centered at zero. The magnitude | (cid:174)𝑀 | of such a distribution is

itself a Rayleigh distribution, which has a fourth moment ⟨| (cid:174)𝑀 |4⟩ = 2⟨| (cid:174)𝑀 |2⟩2. Hence, in the limit

𝛽 → 0, this Binder cumulant goes to the trivial value 1/3. The Binder cumulant varies rapidly

with 𝛽 and varies with lattice size. However, it has been shown that at a critical value of 𝛽, the

137

123456q0π4π23π4πp=2πn/Lβ=0.7123456qβ=0.9123456qβ=1.10.10.20.30.40.50.6Binder cumulant takes a universal value independent of lattice size [137], and as such, it is a useful

quantity for identifying phase transitions and locating their critical points.

There are several things that one can do with the Binder cumulant. The first is to identify the

critical points associated with phase transitions. Since the Binder cumulant takes a universal value

at a critical point regardless of lattice size, one can perform simulations on different lattice sizes

and identify the critical temperature by seeing where the Binder cumulant curves cross. See for

example, [140, 141].

The Binder cumulant can also be used to identify the type of phase transition. For second order

phase transitions, the temperature dependency of the Binder cumulants at different lattice sizes

have a distinct intersection point. In other words, the Binder cumulant curves intersect at a single

point. See for example, [140]. In contrast, at a BKT transition the Binder cumulants intersect at

all critical temperatures, that is, the Binder cumulant curves lie on top of each other in the critical

phase. Binder cumulants can be used to distinguish first- and second-order phase transitions by

considering also the cumulant of the energy [142]. See also [143, 144].

Finally, the Binder cumulant can be used to extract the critical exponent 𝜈 via finite-size scaling.

See for example, [46, 140, 145]. One can also show “data collapse” of Binder cumulants to provide

support for a particular value of 𝜈 [54, 146–152]. Other finite-size-scaling of Binder cumulants is

discussed in [153–156].

The Binder cumulant has been used to study the Ising model [157] and the 𝑋𝑌 model [158,159].

It has also been used to study the clock models [46, 52, 54, 78]. For the 𝑞-state clock model with

𝑞 ≥ 5, there are two BKT transitions. The small-𝛽 transition separates the disordered and quasi-

long-range-ordered phases, and the large-𝛽 transition separates the quasi-long-range-ordered and

long-range-ordered phases. The Binder cumulants as defined in Eq. (4.66) for different lattice sizes

merge at the small-𝛽 transition and run together for larger values of 𝛽. Thus, this Binder cumulant

is useful for obtaining the small-𝛽 critical point, but not for obtaining the large-𝛽 critical point. To

138

Figure 4.50 Here, we show the specific heat (top row), magnetic susceptibility (second row), Binder
cumulant (third row), and the Binder cumulant of the rotated magnetization (bottom row). From left
to right, the columns correspond to 𝑞 = 2, 3, 4, 5, 6. In all cases, ℎ𝑞 = 1. The different color shades
correspond to different lattice sizes ranging from 𝐿 = 4 (light) to 𝐿 = 128 (dark). Connecting lines
are added to help guide the eyes. The vertical dashed gray lines show where the Binder cumulants
intersect.

extract the latter, one can define the Binder cumulant [44, 45, 54, 55, 160]

𝑈𝜙 = 1 −

⟨𝑀 4
𝜙⟩
3⟨𝑀 2
𝜙⟩2

,

(4.67)

of the “rotated magnetization" 𝑀𝜙 = ⟨cos(𝑞𝜙), where 𝜙 ≡ arctan(𝑀𝑦/𝑀𝑥), and where 𝑀𝑥 and 𝑀𝑦

are the 𝑥 and 𝑦-components of the magnetization (cid:174)𝑀. This Binder cumulant allows one to extract

the large-𝛽 transition in the clock models but not the small-𝛽 transition.

In Fig. 4.50, we show some Binder cumulants for the Extended-𝑂 (2) model with ℎ𝑞 = 1 and

integer 𝑞. The columns correspond to different values of 𝑞 with 𝑞 = 2, 3, 4, 5, 6 from left to

right. The first two rows show the specific heat and magnetic susceptibility. For 𝑞 = 2, 3, 4 the

specific heat and susceptibility both seem to diverge with volume—suggesting a second-order phase

transition. For 𝑞 = 5, 6 we see two peaks in the specific heat. Both peaks converge to a finite height

even as the volume is increased. This suggests the presence of two BKT transitions as in the clock

model (i.e. ℎ𝑞 = ∞) case. This is supported by the behavior of the susceptibility which seems to

diverge with volume at all 𝛽-values between the two peaks. The small-𝛽 transition is marked by a

139

012345Cq=2L=4L=8L=16L=32L=64L=12802468q=30123q=40.00.51.01.52.0q=50.00.51.01.52.0q=60100200300400χ|~M|01002003004000501001502000501001502000501001502000.580.600.620.640.66U|~M|0.620.630.640.650.660.6400.6450.6500.6550.6600.6400.6450.6500.6550.6600.65000.65250.65500.65750.66000.30.40.50.60.70.8β0.600.620.640.660.68Uφ0.60.70.80.91.0β0.5750.6000.6250.6500.6750.70.80.91.01.1β0.500.550.600.650.60.81.01.21.41.6β0.500.550.600.651.01.52.02.5β0.500.550.600.65Figure 4.51 As in Fig. 4.50 but with ℎ𝑞 = 0.1.

prominent peak in the susceptibility while the critical region up to the large-𝛽 transition is marked

by a “shoulder” in the susceptibility. The third row of the figure shows the Binder cumulant given

by Eq. (4.66). The bottom row of the figure shows the Binder cumulant of the rotated magnetization

given by Eq. (4.67). Notice that for 𝑞 = 5, 6, the Binder cumulants defined by Eq. (4.66) cross at

the small-𝛽 transition and those defined by Eq. (4.67) cross at the large-𝛽 transition. The crossing

or intersection points are highlighted by the vertical dashed gray lines. For 𝑞 = 2, 3, 4 where there

seems to be a single transition, both kinds of Binder cumulants cross at the same 𝛽. In Fig. 4.51,

we show the same set of plots but for a smaller symmetry-breaking parameter ℎ𝑞 = 0.1.

For noninteger values of 𝑞, the Binder cumulant behaves in an unexpected manner. In Fig. 4.52,

we show some Binder cumulants for noninteger 𝑞 with ℎ𝑞 = 1 near the “hotspot”. Note that this

data is from the model where 𝛽 is treated as a coupling, and so the transition points may be slightly

shifted from our usual model where 𝛽 is treated as an inverse temperature. The qualitative picture

should be the same regardless of whether 𝛽 is treated as a coupling or as an inverse temperature.

From left to right, the columns correspond to 𝑞 = 2.8, 2.9, 3.0, 3.1, 3.2. The top row corresponds

to the ordinary Binder cumulant defined in Eq. (4.66). The different shades correspond to different

lattice sizes from 𝐿 = 4 (light) to 𝐿 = 64 (dark). Notice that for 𝑞 = 3, the Binder cumulants

cross near the expected transition point. Compare with Fig. 4.43 where other thermodynamic

140

01234Cq=2L=4L=8L=16L=32L=64L=1280.00.51.01.52.0q=30.00.51.01.5q=40.00.51.01.5q=50.00.51.01.5q=60100200300χ|~M|01002003000501001502000501001502000501001502000.600.620.640.66U|~M|0.6450.6500.6550.6600.65000.65250.65500.65750.66000.65000.65250.65500.65750.66000.65000.65250.65500.65750.66000.60.81.01.2β0.500.550.600.65Uφ0.60.81.01.2β0.500.550.600.650.60.81.01.2β0.500.550.600.651.01.52.02.53.0β0.500.550.600.65123β0.500.550.600.65Figure 4.52 Here we look at some Binder cumulants for the model with noninteger 𝑞 and ℎ𝑞 = 1
near the “hotspot”. From left to right, the columns correspond to 𝑞 = 2.8, 2.9, 3.0, 3.1, 3.2. The
different shades correspond to different lattice sizes from 𝐿 = 4 (light) to 𝐿 = 64 (dark). The
top row corresponds to the ordinary Binder cumulant defined in Eq. (4.66). For noninteger 𝑞, the
Binder cumulants pinch together instead of cross at a second-order phase transition. In the bottom
row, we show the Binder cumulant defined in Eq. (4.69) that has been modified to emphasize the
Ising-ness of the model. This Binder cumulant shows again the classic crossing phenomenon at the
phase transition. This data is from the model where 𝛽 is treated as a coupling, and so the transition
points may be slightly shifted from our usual model where 𝛽 is treated as an inverse temperature.

quantities are plotted for the same values of 𝑞 and ℎ𝑞. For 𝑞 = 3.1 and 3.2 the Binder cumulants

do not cross—further evidence that there is only a crossover in this small-𝛽 regime. The expected

second-order phase transition for 𝑞 = 3.1 and 3.2 is at much larger 𝛽. For 𝑞 = 2.8 and 2.9, the

Binder cumulants seem to intersect near the expected second-order transition point, however, they

“pinch” together instead of crossing each other. We have been unable to find other examples of

such Binder cumulant “pinching” in the literature.

To understand the Binder cumulant “pinch” as seen in Fig. 4.52 for 𝑞 = 2.8 and 2.9, we looked

at the time series proxy magnetization | (cid:174)𝑀 | and the angle 𝜑 that (cid:174)𝑀 makes with the 𝑥-axis for a

simulation run in the pinched region. We found that the magnetization angle tended to point in

the direction − ˜𝜙 < 𝜑 < 0 where ˜𝜙 is the “leftover” angle defined in Eq. (4.43). For example, for

141

0.500.550.600.65U|~M|q=2.8L=4L=8L=16L=32L=640.400.450.500.550.600.65q=2.90.30.40.50.6q=3.00.400.450.500.550.600.65q=3.10.500.550.600.65q=3.20.51.0β0.00.20.40.6U˜M0.51.0β0.00.20.40.60.51.0β0.00.20.40.60.51.0β−1.0−0.50.00.50.51.0β−1.0−0.50.00.5𝑞 = 2.9, the preferred angles of the model are 𝜑 = 0, 2𝜋/2.9, 4𝜋/2.9, and the leftover angle is
˜𝜙 = 2𝜋 − 4𝜋/2.9. In this example, the system is spending its time magnetized mostly in the 𝜑 = 0
direction or in the nearby 𝜑 = 4𝜋/2.9 ≡ − ˜𝜙 direction. This suggests that one should be able to turn

it into an Ising model by projecting the magnetization (cid:174)𝑀 onto a coordinate system that is rotated

by ˜𝜙/2. That is, we define a new magnetization

˜𝑀 = 𝑀𝑥 sin

(cid:19)

(cid:18) ˜𝜙
2

+ 𝑀𝑦 cos

(cid:19)

(cid:18) ˜𝜙
2

,

(4.68)

where 𝑀𝑥 and 𝑀𝑦 are the components of the ordinary magnetization vector (cid:174)𝑀. Then we use a

Binder cumulant

𝑈 ˜𝑀 = 1 −

⟨ ˜𝑀 4⟩
3⟨ ˜𝑀 2⟩2

.

(4.69)

In the bottom row of panels in Fig. 4.52, we show examples of this quantity for 𝑞 = 2.8, 2.9, 3.0, 3.1, 3.2

for the model with ℎ𝑞 = 1. For 𝑞 = 2.8 and 2.9, the pinching phenomenon is now replaced with

the classic crossing behavior. For 𝑞 = 3.0, the expected crossing behavior is also seen at the

second-order transition point. For 𝛽 larger than the critical point, the Binder cumulant Eq. (4.69),

seems to occasionally get stuck at 𝑈 ˜𝑀 = 0 yielding unreliable results. For 𝑞 = 3.1 and 3.2, the

expected Ising transition is at 𝛽 much larger than the values shown in Fig. 4.52, thus no crossing is

seen. Instead, we see 𝑈 ˜𝑀 ≈ 0 for sufficiently large lattices.

4.3.7 Broken Symmetry and Ultra-long Autocorrelations: A Cautionary Tale

When 𝑞 is noninteger, the Z𝑞 symmetry is broken. This results in a practical and pernicious

difficulty for MCMC approaches to studying this model. In practice, we found that a naive MCMC

approach leads to misleading conclusions about the scaling behavior and therefore to an incorrect

determination of critical exponents. This was only discovered after we found apparent scaling law

violations for some critical exponents.

To understand what happens when 𝑞 is noninteger, it will help to consider the integer case first,

for example 𝑞 = 5. Furthermore, for simplicity we will consider the clock limit (i.e. ℎ𝑞 → ∞) where

the so-called “clock angles” are the only allowed angles for the spin variables. When 𝑞 = 5, the

five allowed angles form a Z5 symmetry group. See Fig. 4.53. In the low temperature limit, the Z5

142

Figure 4.53 The clock angles for 𝑞 = 5.0 (gray) and 𝑞 = 4.9 (black). For 𝑞 = 5, there are five
equidistant angles which divide the unit circle into five equal sectors. For 𝑞 = 4.9, there are again
five angles, but the Z5 symmetry is broken. Now there are four equal sectors and one slightly
smaller “leftover” sector.

symmetry is spontaneously broken, and the system chooses one of the five possible magnetization

directions. See Fig. 4.54. For sufficiently low temperature (i.e.

large 𝛽), the lattice is mostly

magnetized in one direction. In a MCMC simulation, an individual spin variable may easily flip to

a neighboring direction, however, the probability for the whole lattice to flip magnetization direction

is very small. The MCMC chain is effectively stuck or frozen in a particular magnetization direction.

Fortunately, this is not a problem because the five magnetization directions are thermodynamically

equivalent because, for example:

• In a fully magnetized lattice, the probability of any given spin flipping to either neighboring

direction is the same regardless of the overall magnetization direction

• The much smaller probability of the entire lattice flipping to either neighboring direction is

the same regardless of the overall magnetization direction

If the five directions were not equivalent, then the ensemble average of a thermodynamic observable

from a Markov chain stuck in one magnetization direction would be different from the ensemble

average from one stuck in a different direction. In that case, to get an accurate estimate of the true

143

2q02q12q22q32q4Figure 4.54 At very low temperatures, the spin system tends to be fully or nearly fully magne-
tized. For the 𝑞 = 5.0 or 𝑞 = 4.9 state clock models, there are five possible directions for this
magnetization—corresponding to the five “clock angles” (See Fig. 4.53). When 𝑞 = 5, there is Z5
symmetry, and these directions are equivalent. When 𝑞 = 4.9, this is not true.

value of the observable, one would need a Markov chain long enough to sample all magnetization

directions many times. In the limit where temperature goes to zero, this would require an infinitely

long Markov chain. Note, we refer here to a fully magnetized lattice only because that is easier to

visualize and describe. However, this symmetry effect occurs also for nonmagnetized lattices. This

may all be obvious and seemingly trivial, but it’s worth pointing out because this equivalence of the

magnetization directions does not hold when the Z𝑞 symmetry is broken, which is what happens

when 𝑞 is noninteger.

Consider what happens with 𝑞 = 4.9. Now the clock angles 𝜑 = 0, 2𝜋/4.9, 4𝜋/4.9, 6𝜋/4.9, 8𝜋/4.9

are no longer equivalent. The clock angles divide the unit circle into four equal sectors and one

slightly smaller leftover sector. See Fig. 4.53. As usual, at low temperature the lattice will tend to

be magnetized in one of the five angles. However, these angles are no longer equivalent. Suppose

Case 1 is when the lattice is magnetized in the 𝜑 = 0 or 𝜑 = 8𝜋/4.9 and Case 2 is when it is

magnetized in any of the other three directions. In Case 1, the lattice is magnetized in one of the

directions adjacent to the slightly smaller “leftover” sector. As such, it is easier for a given spin to

flip to a neighboring direction in Case 1 than it is in Case 2. Similarly, it is easier for the entire

lattice to flip magnetization directions in Case 1 than it is in Case 2. Since Case 1 and Case 2

result in different spin-flipping probabilities, they define two thermodynamically distinct sectors.

At large 𝛽, a 𝑞 = 4.9 lattice will tend to freeze in one of the two Case 1 directions or in one of

the three Case 2 directions. Two different MCMC simulations at the same parameters could give

completely different ensemble averages if the Markov chain is too short and each simulation gets

144

Figure 4.55 Here we show some thermodynamic results for the model with 𝑞 = 4.9, ℎ𝑞 = 4
on a 16 × 16 lattice. Clockwise from the top-left panel we show the energy density, magnetic
susceptibility, structure factor, and specific heat. Data points are connected with lines to help guide
the eyes. Near 𝛽 = 1.5 we see that all four thermodynamic quantities become unstable in a manner
that suggests the Markov chain became stuck in one of two thermodynamically distinct sectors.
Clearly, the naive error bars (here the statistical variance only) are not enough to capture the true
uncertainty.

stuck in a different sector. The problem gets more severe with increasing ℎ𝑞, increasing 𝛽, or

increasing lattice size. The end result is that for noninteger 𝑞 in the low temperature limit or the

large lattice size limit, with the MCMC approach one needs infinite statistics to get an accurate

estimate of thermodynamic observables. This freezing effect was briefly discussed in Sec. 4.2.2.1

in the context of the ℎ𝑞 → ∞ limit of the model, but the effect occurs also at finite and even at

small ℎ𝑞.

This effect can be seen in thermodynamic curves as in Fig. 4.55.

In this figure, we show

the energy density, magnetic susceptibility, structure factor, and specific heat for the model with

𝑞 = 4.9 and ℎ𝑞 = 4 on a 16 × 16 lattice. Near 𝛽 = 1.5 we see that all four thermodynamic

quantities become unstable in a manner that suggests the Markov chain became stuck in one of two

thermodynamically distinct sectors. Clearly, the naive error bars (here the statistical variance only)

are not enough to capture the true uncertainty. Even a standard autocorrelation analysis does not

capture this effect since the Markov chain may be stuck in one sector for the entire simulation. The

145

−6.0−5.5−5.0−4.5−4.0−3.5−3.0E/V01234χ|~M|0.51.01.52.0β0.00.51.01.52.02.5C0.51.01.52.0β0123456F(~p)only way out seems to be to increase the statistics until the Markov chain repeatedly samples both

thermodynamic sectors. However, the statistics needed makes the MCMC approach infeasible for

even moderate lattices of size 𝐿 > 32.

One way to clearly show the impact of the freezing phenomenon on observables is to run parallel

simulations which differ only in the random number generator (RNG) seed. By chance, some of

the runs may end up stuck in one sector while other runs may end up stuck in the other sector. See

Fig. 4.56. Here we show observables estimated on Markov chains of different lengths. The data is

from the model with 𝑞 = 4.9, ℎ𝑞 = 1, and 𝛽 = 1.66. The left column comes from an 8 × 8 lattice,

and the right column comes from a 16×16 lattice. The rows, from top to bottom, are energy density,

specific heat, proxy magnetization, susceptibility of the proxy magnetization, and structure factor.

The horizontal axis is the log base-2 of the number of updating (BMHA) sweeps performed. That

is, ‘12’ on the horizontal axis corresponds to 212 updating sweeps. The different color points (with

connecting lines to guide the eyes) correspond to different seeds for the random number generator.

The error bars on the data points come only from the statistical variance of the data. These results

show that the system tends to freeze in one of two thermodynamically distinct sectors, and a reliable

estimate of the true value of an observable can only be obtained with very large statistics such that

both sectors are sampled. For example, for the 8 × 8 results shown in the left column, we see that

one needs a Markov chain of 218 to 220 BMHA sweeps to properly sample both sectors. Even there

the error bars need to be much larger to account for the ultra-long autocorrelations caused by the

freezing phenomenon. The problem worsens dramatically with increasing lattice size as we see for

the 16 × 16 results in the right column where one needs statistics of 227 to get reliable averages.

For lattices larger than 32 × 32, it becomes completely infeasible to reach the necessary statistics.

Keep in mind that this phenomenon is a result of the broken Z𝑞 symmetry and only occurs when 𝑞

is not integer.

A practical question is, can one still do finite size scaling with Monte Carlo to extract critical

exponents despite this freezing phenomenon? For noninteger 𝑞, the model exhibits two potential

phase transitions. These manifest as two peaks in the specific heat, for example. Since this freezing

146

Figure 4.56 Here we show observables estimated on Markov chains of different lengths. The data
is from the model with 𝑞 = 4.9, ℎ𝑞 = 1, and 𝛽 = 1.66. The left column comes from an 8 × 8 lattice,
and the right column comes from a 16 × 16 lattice. The rows, from top to bottom, are energy
density, specific heat, proxy magnetization, susceptibility of the proxy magnetization, and structure
factor. The horizontal axis is the log base-2 of the number of updating (BMHA) sweeps performed.
That is, ‘12’ on the horizontal axis corresponds to 212 updating sweeps. The different color points
(with connecting lines to guide the eyes) correspond to different seeds for the random number
generator. These results indicate that the system tends to freeze in one of two thermodynamically
distinct sectors, and a reliable estimate of the true value of an observable can only be obtained
with very large statistics such that both sectors are sampled. This problem worsens dramatically
with increasing lattice size. It is a result of the broken Z𝑞 symmetry and only occurs when 𝑞 is not
integer.

147

−2.50−2.48−2.46−2.44E/Vseed=50seed=51seed=52seed=53seed=54seed=55seed=56seed=57seed=58seed=59seed=601.71.81.9C0.920.930.94|~M|0.050.060.070.080.09χ|~M|121416182022242628log2(BMHAsweeps)0.30.40.5F(~p)−2.50−2.48−2.46−2.44E/Vseed=50seed=51seed=52seed=53seed=54seed=55seed=56seed=57seed=58seed=59seed=601.61.82.02.22.42.6C0.900.910.920.930.94|~M|0.10.20.3χ|~M|121416182022242628log2(BMHAsweeps)0.51.01.5F(~p)Figure 4.57 Finite size scaling of the magnetic susceptibility (left) and the structure factor (right)
for the model with 𝑞 = 4.9 and ℎ𝑞 = 1 at the large-𝛽 peak where we expect a phase transition.
According to the scaling law Eq. (3.79), these two quantities must scale with the same exponent.
That scaling law is clearly violated here since 𝛾/𝜈 = 1.566(12) and 2 − 𝜂 = 1.729(10). We
believe this apparent violation is due to the freezing effect which occurs with MCMC at large lattice
sizes. The result is that with MCMC alone, we cannot perform reliable finite size scaling for the
Extended-𝑂 (2) model with noninteger 𝑞.

phenomenon seems to occur just after the second peak (see Fig. 4.55), one might optimistically

think there is sufficient reliable data to perform finite size scaling. That is what we attempted to

do. See Fig. 4.57. In practice, the finite-size scaling seemed to work very well with nice peaks and

power law scaling. It wasn’t until we had enough data to check the scaling laws such as Eq. (3.79)

that we found them to be severely violated for noninteger 𝑞. Such a scaling law violation implies

that there is no phase transition here or there is something wrong with the MCMC results, and

we believe it is the latter. To reliably study the model at noninteger 𝑞, we are forced to use an

alternative approach such as tensor renormalization group (TRG) methods which do not suffer from

this freezing effect.

4.3.8 Tensor Renormalization Group (TRG)

Using the character expansion

𝑒 𝛽 cos(𝜃1−𝜃2) =

∞
∑︁

𝑚=−∞

𝐼𝑚 (𝛽) 𝑒𝑖𝑚(𝜃1−𝜃2),

(4.70)

148

where 𝐼 is the modified Bessel function of the first kind, the partition function 𝑍 can be expressed

as

where

𝑍 =

∑︁

(cid:214)

{𝑥,𝑡}

𝑛

𝐴𝑥𝑛𝑡𝑛𝑥𝑛− ˆ1𝑡𝑛− ˆ2

(4.71)

√︃

𝐴𝑖 𝑗 𝑘𝑙 =

𝐼𝑖 (𝛽) 𝐼 𝑗 (𝛽) 𝐼𝑘 (𝛽) 𝐼𝑙 (𝛽)

∫ 2𝜋

0

d𝜃
2𝜋

𝑒𝑖𝜃 (𝑖+ 𝑗−𝑘−𝑙)𝑒 𝛽ℎ cos 𝜃.

(4.72)

If we do not have the symmetry-breaking term ℎ𝑞 cos 𝜃, there is the “selection rule”, so that we can

analytically evaluate the elements of the tensor. In the presence of the symmetry-breaking term,

one may evaluate the integral using the Gaussian quadrature rule. There are some possible choices

for polynomial functions to be used. For the noncompact scalar theories, Hermite polynomials

are used. In our case the integral for each angle has a finite range, so Legendre polynomials are

suitable. Then the elements of the tensor are defined by

√︃

𝐴𝑖 𝑗 𝑘𝑙 ≈

𝐼𝑖 (𝛽) 𝐼 𝑗 (𝛽) 𝐼𝑘 (𝛽) 𝐼𝑙 (𝛽)

𝐾
∑︁ d𝜃
2𝜋

𝑒𝑖𝜃 (𝑖+ 𝑗−𝑘−𝑙)𝑒 𝛽ℎ cos 𝜃.

(4.73)

Note that the elements of 𝐴 can be complex.

There is another (rather simpler) way to discretize the angles. By simply replacing the integral

by summations, we obtain

(cid:32)

(cid:214)

(cid:33)

𝐾
∑︁

(cid:214)

𝑍 ≈

𝑛

𝛼𝑛=1

𝑛

𝑤𝛼𝑛
2

2
(cid:214)

𝜈=1

ℎ (cid:0)𝜋𝑥𝛼𝑛 + 𝜋, 𝜋𝑥𝛼𝑛+ ˆ𝜈 + 𝜋(cid:1) .

(4.74)

The local Boltzmann factor ℎ can be regarded as a 𝐾 × 𝐾 matrix now and can be decomposed

numerically by singular values:

ℎ𝑖 𝑗 =

𝐾
∑︁

𝑘=1

𝑈𝑖𝑘 𝜎𝑘𝑉 †
𝑘 𝑗 ,

(4.75)

where {𝜎} is the singular values and 𝑈 and 𝑉 are unitary matrices. Using them, we can approxi-

mately make a tensor network representation of 𝑍:

(cid:32)

(cid:214)

(cid:33)

𝐾
∑︁

(cid:214)

𝑍 ≈

𝑛

𝛼𝑛=1

𝑛

𝑇𝑥𝑛𝑡𝑛𝑥𝑛− ˆ1𝑡𝑛− ˆ2

,

149

(4.76)

where the tensor is defined by

𝑇𝑖 𝑗 𝑘𝑙 =

𝐾
∑︁

𝛼=1

𝑤𝛼
2

√𝜎𝑖𝜎𝑗 𝜎𝑘 𝜎𝑙𝑈𝛼𝑖𝑈𝛼 𝑗𝑉 †

𝑘𝛼𝑉 †
𝑙𝛼.

(4.77)

To balance between accuracy and efficiency, one may take a large 𝐾 and initially truncate the

bond dimension of the tensor. For the present work, 𝐾 is set to 1024, and the accuracy is examined

through a comparison with MC for some parameters. Note that in this construction the elements

of 𝑇 are real. In our work we employ the simpler discretization method.

Whereas Monte Carlo is a well-established approach with easily quantifiable uncertainties, it

is an approach that sometimes struggles with critical slowing down. In the Extended-O(2) model

studied in this paper, the Monte Carlo approach has no problem when 𝑞 is integer. However,

when 𝑞 is noninteger, the Markov chain suffers from large autocorrelation times which worsen

with increasing lattice size. The result is that for noninteger 𝑞, it becomes infeasible to produce

de-correlated samples for large volumes e.g. for 𝐿 >> 32. In contrast, the TRG approach does not

suffer from autocorrelation or critical slowing down. However, because of the truncations used,

there are systematic uncertainties that are not always well-quantified. To be confident that our TRG

methods are reliable, we compare them against MC in the regimes which are accessible to Monte

Carlo.

In Fig. 4.58, we consider several different values of 𝑞 and ℎ𝑞 at two different volumes. For

noninteger values of 𝑞, we compare small volumes 𝐿 = 16, 32. For integer values of 𝑞, we are

able to compare larger volumes like 𝐿 = 128, 256, since autocorrelations in Monte Carlo are much

less severe for integer 𝑞.

In the top panel of each figure, we compare the specific heat from

Monte Carlo and TRG. With TRG, we compute the free energy and apply smoothing splines before

computing the specific heat as the second derivative of the free energy. These figures show very

good agreement between Monte Carlo and TRG for the specific heat.

With Monte Carlo, a zero external field (ℎ = 0) is used, and so the magnetization (cid:174)𝑀 averaged

over many equilibrium configurations would average to zero for all 𝛽. Instead, we measure a proxy

magnetization | (cid:174)𝑀 | defined in Eq. (4.15). The corresponding susceptibility is defined in Eq. (4.16).

With TRG, we use a different approach. With TRG, external magnetic fields are applied, and the

150

(a) 𝑞 = 3.9, ℎ𝑞 = 0.1

(b) 𝑞 = 3.9, ℎ𝑞 = 1

(c) 𝑞 = 4.1, ℎ𝑞 = 0.1

(d) 𝑞 = 4.1, ℎ𝑞 = 1

(e) 𝑞 = 5, ℎ𝑞 = 0.1

(f) 𝑞 = 5, ℎ𝑞 = 1

Figure 4.58 In each figure, we compare the specific heat from Monte Carlo (error bars and no
connecting lines) with the specific heat from TRG (solid line). For the top two figures, we show the
case 𝑞 = 3.9 with ℎ𝑞 = 0.1 (left) and ℎ𝑞 = 1 (right). The 𝛽 range was chosen to show both peaks
in the specific heat, although at these small volumes, only one peak is visible for ℎ𝑞 = 0.1. In the
middle two figures, we show the case 𝑞 = 4.1 with ℎ𝑞 = 0.1 (left) and ℎ𝑞 = 1 (right). The 𝛽 range
was chosen to show small-𝛽 peak in the specific heat. For the case ℎ𝑞 = 1, the specific heats of
the two different volumes lie directly on top of each other. In the bottom two figures, we show the
case 𝑞 = 5 with ℎ𝑞 = 0.1 (left) and ℎ𝑞 = 1 (right). At integer 𝑞, we are able to reliably go to larger
volumes with Monte Carlo, so we compare 𝐿 = 128 and 𝐿 = 256 here. The 𝛽 range was chosen to
show the small-𝛽 peak in the specific heat.

151

0.60.70.80.91.01.11.21.31.4β0.00.51.01.52.02.5CTRG,L=16TRG,L=32MC,L=16MC,L=320.60.70.80.91.01.11.21.31.4β0.00.51.01.52.02.5CTRG,L=16TRG,L=32MC,L=16MC,L=320.750.800.850.900.951.00β0.00.51.01.52.02.5CTRG,L=16TRG,L=32MC,L=16MC,L=320.650.700.750.800.850.90β0.00.51.01.52.02.5CTRG,L=16TRG,L=32MC,L=16MC,L=320.980.991.001.011.021.031.04β0.00.51.01.52.02.5CTRG,L=128TRG,L=256MC,L=128MC,L=2560.980.991.001.011.021.031.04β0.00.51.01.52.02.5CTRG,L=128TRG,L=256MC,L=128MC,L=256Figure 4.59 We compute the susceptibility for 𝑞 = 3.9 and ℎ𝑞 = 1 in the vicinity of the second-order
transition in two different ways. In the top panel, we see how the susceptibility peaks from Monte
Carlo approach the critical point, and in the bottom panel we look at the susceptibility peaks from
TRG. From the Monte Carlo peak positions, we estimate 𝛽𝑐 = 1.19511(32) (vertical blue line) by
fitting the peak locations to the finite-size-scaling form Eq. (3.78). From the TRG peak positions,
we estimate 𝛽𝑐 = 1.196 (vertical dashed line) via the same finite-size scaling form. As described
in the main text, because of large autocorrelations, the Monte Carlo results at noninteger 𝑞 are not
strictly reliable for 𝐿 > 32.

susceptibility is computed by taking the numerical derivative of the free energy with respect to

the external field. Because of the different approaches used by Monte Carlo and TRG, it is not

meaningful to directly compare the magnetization or the magnetic susceptibility. However, we

expect the critical behavior to be the same in the thermodynamic limit. We show an example of

this in Fig. 4.59 for the case 𝑞 = 3.9 and ℎ𝑞 = 1. In the top panel, we show how the susceptibility

(calculated from Monte Carlo) peaks approach the critical point with increasing volume. The

critical point 𝛽𝑐 = 1.19511(32) (vertical blue line) was estimated by fitting the Monte Carlo peak

locations to the finite-size scaling form Eq. (3.78). In the bottom panel of Fig. 4.59, we do the

same but with the susceptibility calculated using TRG. Here, we estimate 𝛽𝑐 = 1.196 (vertical

dashed line) using the same finite-size scaling form. Note, that the Monte Carlo results are not

strictly reliable for 𝐿 > 32 in this figure since for 𝑞 = 3.9 there are very large and unmitigated

autocorrelations. Nevertheless, it seems the finite-size-scaling shift of the peaks are in general

152

0204060χβc=1.19511(32)MC,L=32MC,L=64MC,L=128MC,L=2561.141.161.181.201.22β050001000015000χβc=1.196TRG,L=64TRG,L=128TRG,L=256TRG,L=512TRG,L=1024TRG,L=2048agreement with the finite-size-scaling shift of the TRG peaks.

4.3.9 Mapping the Phase Diagram

4.3.9.1

Integer 𝑞

For integer values of 𝑞, the Extended-O(2) model retains a rotational symmetry. When ℎ𝑞 = 0,

this is the full 𝑂 (2) symmetry. When ℎ𝑞 is finite, this symmetry reduces to a Z𝑞 symmetry. The

model is easier to study at integer 𝑞 because the autocorrelations remain manageable even at large

𝛽.

To get a good survey of the model, we study 𝑞 = 2, 3, . . . , 6 at ℎ𝑞 = 0.1, 1.0. At infinite ℎ𝑞,

the Extended-O(2) model reduces to the ordinary clock model which for 𝑞 = 2, 3, 4 has a single

second-order transition, and for 𝑞 ≥ 5, has a pair of infinite-order BKT transitions. See Table 2.2.

We start with the assumption that the same kind of transitions persist to finite ℎ𝑞. We show then

that this assumption fails to hold at small ℎ𝑞.

We performed MCMC simulations at 𝐿 = 16, 32, 64, 96, 128, 160, 192, 224, 256. A

BMHA updating algorithm was used with an acceptance rate around 90%. Each run started with

215 equilibration sweeps followed by 216 measurement sweeps. Each measurement sweep was

separated by 26 (or more) discarded sweeps to help mitigate autocorrelation. Later during the

analysis stage, the residual integrated autocorrelation time was estimated and the effect removed

by binning the observables. For some of the larger volumes, more runs were performed to increase

the statistics.

Using the FSS fit forms Eq. (3.70)–(3.75), we extracted the exponents 1/𝜈, 𝛼/𝜈, −𝛽/𝜈, 𝛾/𝜈,

and 2 − 𝜂 for integer 𝑞 and ℎ𝑞 = 0.1, 1.0. These are tabulated in Table B.1. The critical exponents

𝜈, 𝛼, 𝛽, 𝛾, and 𝜂 are then extracted and listed in Table 4.3. For example, the exponent 𝛼 is obtained

by taking the ratio of measured exponents 𝛼/𝜈 and 1/𝜈. This is done at the level of jackknife bins

in order to get reliable error bars on the final critical exponents. For 𝑞 = 5, 6, we do not include

results from Eq. (3.70) since the quantity seems unstable and the fits tend to be bad. As a result,

the bare exponents for 𝑞 = 5, 6, which depend on 1/𝜈 are not listed in Table 4.3.

For 𝑞 = 2, 3, 4, we fit the thermodynamic quantities to Eq. (3.78) to estimate the critical exponent

153

Table 4.3 The first two columns refer to the model parameters, and the remaining columns list
the critical exponents obtained for the model. They are obtained from the ratio exponents (e.g.
Table B.1, which are in turn obtained from finite-size-scaling. Some exponents could not be reliably
extracted with our data. For 𝑞 = 5, 6, the results refer to the transition at small inverse temperature.

𝑞

2
2
3
3
4
4
5
5
6
6

ℎ𝑞
0.1
1.0
0.1
1.0
0.1
1.0
0.1
1.0
0.1
1.0

𝜈

𝛼

𝛽

𝛾

𝜂

1.044(53)
1.022(71)
0.658(24)
0.809(26)
2.49(89)
1.20(14)
N/A
N/A
N/A
N/A

0.021(18)
-0.010(16)
0.354(35)
0.311(21)
-0.30(12)
-0.162(19)
N/A
N/A
N/A
N/A

0.319(94)
N/A
N/A
N/A
0.76(87)
0.78(61)
N/A
N/A
N/A
N/A

1.859(96)
1.78(12)
1.291(45)
1.411(36)
4.3(1.5)
2.09(24)
N/A
N/A
N/A
N/A

0.287(13)
0.251(14)
0.3651(93)
0.295(16)
0.2570(85)
0.2532(93)
0.2716(72)
0.2653(94)
0.2880(86)
0.2654(92)

Table 4.4 Here we tabulate the jackknife average of the estimates of 𝜈 and 𝛽𝑐 listed in Tables B.2
and B.3. For 𝑞 = 5, 6, the results refer to the transition at small inverse temperature.

𝑞

2
2
3
3
4
4
5
5
6
6

ℎ𝑞
0.1
1.0
0.1
1.0
0.1
1.0
0.1
1.0
0.1
1.0

𝜈

1.059(50)
0.985(58)
0.878(21)
0.812(22)
2.04(18)
1.361(74)
N/A
N/A
N/A
N/A

𝛽𝑐
0.87756(46)
0.65448(24)
1.00044(56)
0.82584(11)
1.0841(84)
0.9916(18)
1.136(11)
1.128(12)
1.1251(83)
1.1143(92)

154

Figure 4.60 Here we look at data for some volumes (𝐿 = 16, . . . , 256) for the model with 𝑞 = 2
and ℎ𝑞 = 0.1. The top panel shows the magnetic susceptibility, the bottom panel shows the specific
heat, and the inset within the top panel shows the Binder cumulant. The data is from reweighted
Monte Carlo with error bars included. The vertical dashed line is at 𝛽𝑐 ≈ 0.8773 where the
Binder cumulant curves cross. This figure illustrates the behavior typical of a second-order phase
transition—the Binder cumulants cross at the transition point, the magnetic susceptibility diverges,
and the specific heat diverges logarithmically.

𝜈 and critical points 𝛽𝑐. The results are tabulated in Table B.2. Similarly, we use Eq. (3.84), to

estimate the BKT critical points for 𝑞 = 5, 6, and list them in Table B.3. The estimates of 𝜈 and 𝛽𝑐

are averaged (at the level of jackknife bins) to obtain the final estimates listed in Table 4.4. Compare

the estimates of the critical exponent 𝜈 in Table 4.3, which were obtained from Eq. (3.70) with the

estimates in Table 4.4, which were obtained from Eq. (3.78). The results are consistent except for

the model with 𝑞 = 3 and ℎ𝑞 = 0.1. The cause of this discrepancy is not clear.

For 𝑞 = 2 with ℎ𝑞 = ∞ (i.e.

the Ising model), there is a second-order phase transition at

𝛽𝑐 = ln(1 +

√

2)/2 ≈ 0.4407 with critical exponents 𝜈 = 1, 𝛼 = 0, 𝛾 = 7/4, 𝛽 = 1/8, and

𝜂 = 1/4 (see Table 2.2). With ℎ𝑞 = 1.0, the thermodynamic quantities show Ising-like divergences

near a critical point, which we estimate to be 𝛽𝑐 = 0.65448(24) in the infinite-volume limit.

From finite-size-scaling, we find 1/𝜈 = 0.978(68), 𝛼/𝜈 = −0.010(16), 𝛾/𝜈 = 1.745(19), and

2 − 𝜂 = 1.749(14). The bare exponents are 𝜈 = 1.022(71), 𝛼 = −0.010(16), 𝛾 = 1.78(12), and

155

02004006008001000M16x1632x3264x64128x128256x2560.800.850.900.951.000.00.51.01.52.02.53.03.54.0CV0.860.880.900.550.600.65U|M|𝜂 = 0.251(14). These are consistent with the critical exponents of the Ising universality class.

We were unable to extract 𝛽/𝜈 from our data. When we decrease ℎ𝑞 to 0.1, the picture changes

a little. The thermodynamic quantities show similar Ising-like divergences, however, the model

is no longer in the Ising universality class. The specific diverges logarithmically (see Fig. 4.60).

The critical point is now at 𝛽𝑐 = 0.87756(46). From finite-size-scaling, we find 1/𝜈 = 0.958(49),

𝛼/𝜈 = 0.020(17), 𝛾/𝜈 = 1.780(16), 𝛽/𝜈 = 0.306(81), and 2 − 𝜂 = 1.713(13). The bare exponents

𝜈 = 1.044(53), 𝛼 = 0.021(18), 𝛽 = 0.319(94), 𝛾 = 1.859(96), and 𝜂 = 0.287(13), are not

consistent with the Ising universality class, however, the scaling relations (in particular Eq. (3.79)

and (3.81)) are violated, so this case is somewhat inconclusive. As ℎ𝑞 → 0, the critical point

moves to larger 𝛽 and seems to connect with the BKT transition near 𝛽𝑐 = 1.12 of the 𝑋𝑌 model

when ℎ𝑞 = 0. So for 𝑞 = 2 with large ℎ𝑞, there is a second-order phase transition of the Ising

universality class. However, at small ℎ𝑞, the second-order transition is no longer of the Ising

class—contradicting an early renormalization group analysis [114].

For 𝑞 = 3 with ℎ𝑞 = ∞ (i.e. the 3-state Potts model), there is a second-order phase transition

at 𝛽𝑐 = 2 ln(1 +

√

3)/3 ≈ 0.6700, with critical exponents 𝜈 = 5/6, 𝛼 = 1/3, 𝛾 = 13/9, 𝛽 = 1/9,

and 𝜂 = 4/15 (see Table 2.2). At ℎ𝑞 = 1.0, the thermodynamic functions show second-order

divergences near a critical point, which we estimate to be 𝛽𝑐 = 0.82584(11) in the infinite-volume

limit. From finite-size-scaling, we find 1/𝜈 = 1.236(40), 𝛼/𝜈 = 0.385(21), 𝛾/𝜈 = 1.743(16), and

2 − 𝜂 = 1.705(16) which are consistent with the exponents at ℎ𝑞 = ∞. The bare exponents are

𝜈 = 0.809(26), 𝛼 = 0.311(21), 𝛾 = 1.411(36), and 𝜂 = 0.295(16). When we reduce ℎ𝑞 to 0.1, the

critical point moves to 𝛽𝑐 = 1.00044(56). Assuming a second-order transition, we find from finite-

size-scaling 1/𝜈 = 1.521(57), 𝛼/𝜈 = 0.538(48), 𝛾/𝜈 = 1.962(16), and 2 − 𝜂 = 1.6349(93), which

are very different from the 3-state Potts universality class. The bare exponents are 𝜈 = 0.658(24),

𝛼 = 0.354(35), 𝛾 = 1.291(45), and 𝜂 = 0.3651(93). The scaling relations are violated in this case.

As ℎ𝑞 → 0, the critical point moves to larger 𝛽 and seems to connect with the BKT transition of

the 𝑋𝑌 model when ℎ𝑞 = 0.

For 𝑞 = 4 with ℎ𝑞 = ∞, the model is known to be in the Ising universality class with critical point

156

Figure 4.61 Here we look at data for some volumes (𝐿 = 16, . . . , 256) for the model with 𝑞 = 4 and
ℎ𝑞 = 0.1. The top panel shows the magnetic susceptibility, the bottom panel shows the specific heat,
and the inset within the top panel shows the Binder cumulant. The data is from reweighted Monte
Carlo with error bars included. The vertical dashed line is at 𝛽𝑐 ≈ 1.09 where the Binder cumulant
curves merge. This figure illustrates the behavior typical of a BKT phase transition—the Binder
cumulants merge at the transition point, the magnetic susceptibility diverges, and the specific heat
plateaus.

at 𝛽𝑐 = ln(1+

√

2) ≈ 0.8814 and critical exponents given in Table 2.2. At ℎ𝑞 = 1, the thermodynamic

functions seem to diverge near a critical point which we estimate to be 𝛽𝑐 = 0.9916(18) in

the infinite-volume limit. From finite-size-scaling, we find 1/𝜈 = 0.834(96), 𝛾/𝜈 = 1.742(14),

−𝛽/𝜈 = −0.65(51), and 2−𝜂 = 1.7468(93). When extracting the exponent 𝛼/𝜈, attempts to fit to the

form Eq. (3.71) failed. However, a fit to the form Eq. (3.75) yielded 𝛼/𝜈 = −0.1349(90). The bare

exponents are 𝜈 = 1.20(14), 𝛼 = −0.162(19), 𝛽 = 0.78(61), 𝛾 = 2.09(24), and 𝜂 = 0.2532(93).

With the large error bars, these exponents are still consistent with the Ising universality class.

When we reduce ℎ𝑞 to 0.1, the critical point moves to 𝛽𝑐 = 1.0841(84). Assuming a second-order

transition, we find from finite-size-scaling 1/𝜈 = 0.40(14), 𝛼/𝜈 = −0.120(19), 𝛾/𝜈 = 1.728(16),

−𝛽/𝜈 = −0.31(31), and 2 − 𝜂 = 1.7430(85). The bare exponents are 𝜈 = 2.49(89), 𝛼 = −0.30(12),

𝛽 = 0.76(87), 𝛾 = 4.3(1.5), and 𝜂 = 0.2570(85). These are not consistent with the Ising

universality class. Since 𝛼 is negative and 𝜈 is finite, the specific heat decreases with increasing

157

0100200300400500600M16x1632x3264x64128x128256x2560.91.01.11.21.30.00.51.01.52.02.5CV1.061.081.101.120.6500.6550.660U|M|Figure 4.62 Here we look at data for some volumes (𝐿 = 16, . . . , 256) for the model with 𝑞 = 5 and
ℎ𝑞 = 1.0. The top panel shows the magnetic susceptibility, and the bottom panel shows the specific
heat. The inset within the top panel shows the Binder cumulant of the proxy magnetization. The
vertical dashed line is at 𝛽𝑐1 ≈ 1.08, where the Binder cumulant curves cross. The inset within the
bottom panel shows the Binder cumulant of the rotated magnetization defined in Eq. (4.51). The
vertical dotted line is at 𝛽𝑐2 ≈ 1.22, where the Binder cumulant curves cross. The data is from
reweighted Monte Carlo with error bars included. This figure illustrates the behavior typical of the
two BKT transitions that occur in the ordinary clock model for 𝑞 ≥ 5. The magnetic susceptibility
diverges at both transitions, and the specific heat plateaus at both transitions. For 𝐿 = 256, we
show only results near the first transition point. Already here the error bars are rather large.

volume. This can be seen in Fig. 4.61. Other thermodynamic quantities diverge, and the Binder

cumulants cross suggesting a BKT transition instead of a second-order transition. The scaling

relation Eq. (3.80) does not hold in this case, but the other two relations do hold. As ℎ𝑞 → 0, the

critical point moves to slightly larger 𝛽 and seems to connect with the BKT transition of the 𝑋𝑌

model when ℎ𝑞 = 0.

Finally, we consider 𝑞 = 5, 6. At ℎ𝑞 = ∞, these models both show a pair of BKT transitions. At

finite ℎ𝑞 there also seems to be two BKT transitions for 𝑞 = 5, 6. See Fig. 4.62. To extract critical

exponents, we look at maxima in the thermodynamic quantities. This works well for the low-𝛽

transition, but not for the large-𝛽 transition since many of the thermodynamic quantities do not show

a well-defined peak near the second transition. Nevertheless, the transition point can be located

158

0100200300400500600M16x1632x3264x64128x128256x2560.80.91.01.11.21.31.41.5012345CV1.001.051.101.150.6550.660U|M|1.101.151.201.251.300.520.540.56Uby the Binder cumulant crossings. In a BKT transition, the specific heat does not diverge with

volume, and one cannot use Eq. (3.71) to extract a critical exponent 𝛼. Furthermore, the correlation

length diverges faster than any power, and so Eq. (3.70) cannot be used to extract 𝜈. In fact, in

the conventional sense, 𝜈 must be infinite for a BKT transition. Fits to the forms Eq. (3.72)–(3.74)

are performed and exponents recorded in Table B.1. The results seem generally consistent with

BKT transitions. For 𝑞 = 5, 6, estimates of the critical points for the small-𝛽 transition obtained by

fitting to Eq. (3.84) are listed in Table B.3. The average values are listed in Table 4.4. However, it

is not clear that Eq. (3.84) is a reliable way to estimate the critical point for BKT transitions, and

the method of Binder cumulant crossings may be preferred. For 𝑞 = 5, 6, the small-𝛽 transition

connects to the BKT transition of the 𝑋𝑌 model when ℎ𝑞 = 0. The large-𝛽 transition moves to

larger 𝛽 as ℎ𝑞 → 0. Recently, it was estimated that for 𝑞 = 5, the large-𝛽 transition limits to

𝛽𝑐2 ≈ 2.27 as ℎ𝑞 → 0 [122].

We summarize now the results for integer 𝑞. Recall that for ℎ𝑞 = ∞, this model becomes the

ordinary 𝑞-state clock model, which is known to have a second-order transition for 𝑞 = 2, 3, 4, and a

pair of BKT transitions for 𝑞 ≥ 5. In this work, we’ve studied what happens at finite ℎ𝑞. For 𝑞 ≥ 5,

there seems to be a pair of BKT transitions for all values of ℎ𝑞 > 0. For 𝑞 = 2, 3, 4, there appears

to be a second-order transition for large and intermediate values of ℎ𝑞. The critical exponents, and

hence the universality class, seems to vary with ℎ𝑞, and this transition appears to turn into a BKT

transition in the small ℎ𝑞 limit. There are very few results in the literature even for integer 𝑞 (we

found no literature on noninteger 𝑞) that give specific numbers such as critical exponents or critical

points. Of the few results that we found, our findings seem to agree with, for example, [40, 41, 117]

but disagree with [115]. The reason for this is not clear to us.

4.3.9.2 Noninteger 𝑞

At noninteger 𝑞, the explicitly broken Z𝑞 symmetry causes difficulties for the MCMC approach.

In fact, we are unable to get reliable results on lattices larger than 32 × 32 due to the freezing

effect described in Sec. 4.3.7. This makes finite-size scaling infeasible with the MCMC approach.

Instead, we use TRG to investigate the noninteger 𝑞 regime. We validate the TRG approach by

159

Figure 4.63 Heatmaps from TRG of the specific heat (top row) and entanglement entropy (bottom
row) for the Extended-O(2) model with ℎ𝑞 = 1 (left column) and ℎ𝑞 = 0.1 (right column). Each
pixel is a distinct TRG calculation performed with 𝐿 = 1024 and bond dimension 40. There are a
few missing points (white) in the top-left panel.

comparison with MCMC on smaller lattices. See Sec. 4.3.8.

With TRG, we started with a low precision scan of the parameter space for 𝑞 ∈ [1, 6], 𝛽 ∈ [0, 2],

ℎ𝑞 = 0.1, 1.0, and 𝐿 = 4, 8, . . . , 1024. For example, we show in Fig. 4.63 heatmaps of the specific

heat (top row) and entanglement entropy (bottom row) for ℎ𝑞 = 1 (left column) and ℎ𝑞 = 0.1

(right column). Each pixel is a data point obtained from TRG performed with 𝐿 = 1024 and bond

dimension 40. The color in each heatmap ranges from dark to light and this corresponds to a

value ranging from 0 to 2.5. The cutoff choice of 2.5 (which truncates some of the specific heat

values) was made to increase the contrast in the heatmaps. For ℎ𝑞 = 1, the heatmap of the specific

heat shows smooth lobes suggesting smooth lines in the phase diagram even as one crosses integer

values of 𝑞. When ℎ𝑞 is reduced to 0.1, the lines at large 𝛽 fade away leaving a thick horizontal

line that connects to the BKT transition at ℎ𝑞 = 0. However, heatmaps of the entanglement entropy

show that there remain definite discontinuities as one crosses integer values of 𝑞. This suggests a

phase diagram at finite-ℎ𝑞 that is similar to the phase diagram at infinite ℎ𝑞.

160

0.00.51.01.52.0βSpeciﬁcHeathq=1.0hq=0.123456q0.00.51.01.52.0βEnt.Entropy23456qFigure 4.64 Here we look at the behavior of the magnetic susceptibility near the small-𝛽 peak for
noninteger 𝑞. In this example, 𝑞 = 4.1. We obtain the susceptibility from TRG with bond dimension
32 after applying small external magnetic fields ℎ = 10−5, 𝛿ℎ = 10−5. In the left panel, ℎ𝑞 = 0.01
and in the right panel, ℎ𝑞 = 0.001. For sufficiently large volumes, the magnetic susceptibility peaks
plateau—implying a crossover. For a true phase transition, the peaks would diverge with increasing
volume. As the symmetry-breaking parameter ℎ𝑞 is decreased toward zero, one must go to larger
volumes to show that the peaks are not diverging.

At noninteger values of 𝑞, the specific heat generally shows two peaks. In the ℎ𝑞 = ∞ limit, we

found that the first peak is only a crossover, but the second peak is associated with a second-order

phase transition of the Ising universality class. Here, we investigate the finite ℎ𝑞 regime.

First we consider the small-𝛽 peak, which we know is a crossover at ℎ𝑞 = ∞. On the other

hand, this peak connects to the BKT transition of the 𝑋𝑌 model when ℎ𝑞 = 0. The question is;

what happens at finite ℎ𝑞? In Fig. 4.64, we find that the magnetic susceptibility plateaus with

volume—indicating a crossover—since a phase transition here would result in divergence of the

susceptibility with volume. However, as we take ℎ𝑞 → 0, it becomes more and more difficult

to distinguish between crossover and BKT transition as it takes larger and larger lattices for the

susceptibility to plateau. We conclude that most likely the small-𝛽 peak corresponds to a crossover

even as ℎ𝑞 → 0 and only becomes a BKT transition exactly at ℎ𝑞 = 0. However, we suspect that to

prove this for values of ℎ𝑞 very close but not equal to zero one would require very large lattices.

Next, we consider the large-𝛽 peak, which corresponds to a second-order transition of the Ising

universality class when ℎ𝑞 = ∞. As 𝑞 approaches an integer from below, the second-order transition

(i.e.

the second peak) occurs at relatively small values of 𝛽. As 𝑞 is decreased toward the next

161

0.80.91.01.1β050100150200250300350χL=32L=64L=128L=256L=512L=10240.80.91.01.1β05001000150020002500χFigure 4.65 In the top panel, data collapse of the magnetic susceptibility from TRG for the large-𝛽
peak for the model with 𝑞 = 3.9 and ℎ𝑞 = 1. The estimate 𝛽𝑐 = 1.196 was obtained by fitting
the peak positions to the finite-size scaling form Eq. (3.78). To extract the magnetic susceptibility,
an external field ℎ = 40/𝐿15/8 was imposed (𝑑ℎ = 10−5 for the numerical differentiation). In the
bottom panel, data collapse of the specific heat from TRG for the same model. These results are
consistent with there being a second-order phase transition here of the Ising universality class.

integer, the critical point moves to large values of 𝛽. In Figure 4.65, we show TRG results for 𝑞 = 3.9

and ℎ𝑞 = 1 near the large-𝛽 peak. In the top panel, we show the data collapse of the magnetic

susceptibility from TRG. The estimate 𝛽𝑐 = 1.196 was obtained by fitting the peak positions to

the finite-size scaling form Eq. (3.78). To extract the magnetic susceptibility, an external field

ℎ = 40/𝐿15/8 was imposed (𝑑ℎ = 10−5 for the numerical differentiation). In the bottom panel, we

show the data collapse of the specific heat from TRG for the same model. The specific heat was

computed from the second derivative of the free energy after applying smoothing splines to the free

energy. The data collapse curves are consistent with a second-order phase transition of the Ising

universality class.

4.3.9.3 The Phase Diagram at Finite ℎ𝑞

In Fig. 4.66 we present four two-dimensional conjectures for the full three-parameter phase

diagram. Each slice is at a different value of the symmetry-breaking parameter ℎ𝑞. In the top-left

panel, the phase diagram at ℎ𝑞 = 0. This is the well-studied 𝑋𝑌 model at all values of 𝑞. There is a

162

L(β−βc)0.0000.0010.0020.0030.004(χ−χ0)/L1.75L=64L=128L=256L=512L=1024−30−20−1001020L(β−βc)0.10.20.3(C−C0)/lnLL=32L=64L=128L=256L=512(a) ℎ𝑞 = 0

(b) Small ℎ𝑞

(c) Intermediate ℎ𝑞

(d) ℎ𝑞 → ∞

Figure 4.66 Here we present four two-dimensional conjectures for the full three-parameter phase
diagram. Each slice is at a different value of the symmetry-breaking parameter ℎ𝑞. In the top-left
panel, the phase diagram at ℎ𝑞 = 0. This is the well-studied 𝑋𝑌 model at all values of 𝑞. There is a
single BKT transition near 𝛽 = 1.12 and a BKT critical phase at larger 𝛽. The top-right panel gives
the conjectured phase diagram at small ℎ𝑞 > 0. For noninteger 𝑞, there is generally a crossover at
small 𝛽 and an Ising phase transition at larger 𝛽 with an Ising ordered phase at large 𝛽. For integer
𝑞, our data is consistent with BKT transitions for all integer 𝑞 given sufficiently small (but positive)
ℎ𝑞. For 𝑞 = 2, 3, 4 our conjecture that these are BKT transitions is rather tenuous as it is possible
that the apparent BKT transition is really the result of studying the model on too small lattices.
For integer 𝑞 ≥ 5, there are two BKT transitions with the second transition appearing at larger 𝛽.
This transition does not seem to connect to the BKT line at ℎ𝑞 = 0 in the limit ℎ𝑞 → 0 [122]. In
the bottom-left panel, ℎ𝑞 is increased to intermediate values. The most significant change is that
for 𝑞 = 2, 3, 4 the transitions are now second-order, however, they are not in the same universality
class as the clock model (i.e. ℎ𝑞 = ∞) transitions. Finally, in the bottom-right panel, we show the
phase diagram for the ℎ𝑞 → ∞ case. For integer 𝑞, this is the ordinary 𝑞-state clock model, and for
noninteger 𝑞, this is what we call the “Extended 𝑞-state clock model”.

163

q23456βdisorderedBKTBKTtrans.q23456βdisorderedZ2orderedZqorderedIsingtrans.2ndordertrans.BKTtrans.CriticalphaseCrossoverq23456βdisorderedZ2orderedZqorderedIsingtrans.2ndordertrans.BKTtrans.CriticalphaseCrossoverq23456βdisorderedZ2orderedZqorderedIsingtrans.2ndordertrans.BKTtrans.CriticalphaseCrossoversingle BKT transition near 𝛽 = 1.12 and a BKT critical phase at larger 𝛽. The top-right panel gives

the conjectured phase diagram at small ℎ𝑞 > 0. For noninteger 𝑞, there is generally a crossover at

small 𝛽 and an Ising phase transition at larger 𝛽 with an Ising ordered phase at large 𝛽. For integer

𝑞, our data is consistent with BKT transitions for all integer 𝑞 given sufficiently small (but positive)

ℎ𝑞. For 𝑞 = 2, 3, 4 our conjecture that these are BKT transitions is rather tenuous as it is possible

that the apparent BKT transition is really the result of studying the model on too small lattices.

For integer 𝑞 ≥ 5, there are two BKT transitions with the second transition appearing at larger 𝛽.

This transition does not seem to connect to the BKT line at ℎ𝑞 = 0 in the limit ℎ𝑞 → 0 [122]. In

the bottom-left panel, ℎ𝑞 is increased to intermediate values. The most significant change is that

for 𝑞 = 2, 3, 4 the transitions are now second-order, however, they are not in the same universality

class as the clock model (i.e. ℎ𝑞 = ∞) transitions. Finally, in the bottom-right panel, we show the

phase diagram for the ℎ𝑞 → ∞ case. For integer 𝑞, this is the ordinary 𝑞-state clock model, and for

noninteger 𝑞, this is what we call the “Extended 𝑞-state clock model”.

4.4 Future Questions

There are many unanswered questions about the models studied in this thesis. Some of the ones

that the author finds most interesting include the following list of questions which are asked in no

particular order:

• What is really happening in the ℎ𝑞 → 0 limit for 𝑞 = 2, 3, 4? Is it a second-order phase

transition for all ℎ𝑞 > 0 or does it turn into a BKT transition at finite ℎ𝑞? This should be a

straightforward question to answer, but it will likely require very large lattices.

• In the 𝑂 (2) model and in the 𝑞-state clock models, vortices are often claimed to play a

significant role in the BKT phase transitions. What role do vortices play in the Extended-

𝑂 (2) model? This model at noninteger 𝑞 might help us to clarify the role of vortices in phase

transitions in two-dimensional systems.

• The Extended-𝑂 (2) model offers multiple ways in which one can continuously vary a param-

eter to go from a BKT transition to a second-order transition and vice versa. For example,

164

by varying ℎ𝑞 (for many values of 𝑞), one can go from the 𝑋𝑌 model with a BKT transition

at ℎ𝑞 = 0 to a model with a second-order transition at ℎ𝑞 > 0. Alternatively, one can go

from a second-order transition to a BKT transition for example by dialing from 𝑞 = 4.999999

to 𝑞 = 5. There are many other examples in the Extended-𝑂 (2) model in which one can

vary ℎ𝑞 or 𝑞 to switch between a model with a BKT transition and one with a second-order

transition. This makes the model a nice sandbox within which to study these two kinds of

phase transitions and their relationship.

• It is interesting to note that the BKT critical point found in the O(2) model—and here

at the limit of the extended phase diagram—can also be reached through a completely

different interpolation. By considering the O(3) nonlinear sigma model with an additional

symmetry breaking term which breaks the O(3) symmetry down to an O(2) symmetry, one

can interpolate between Z2 to O(3), and from O(3) to O(2) by tuning the sign, and magnitude,

of the additional symmetry-breaking term [161]. Further additional symmetry breaking

terms could be interesting. Positive-definite worm algorithms have been constructed for the

O(3) nonlinear sigma model, and could be used to simulate the model efficiently [162, 163].

• One of the original motivations for studying these models was related to making Z𝑞 approx-

imations of 𝑈 (1) gauge theories. It would be interesting to revisit that motivation especially

to see how 𝑞-state clock models with noninteger values of 𝑞 might be useful when making

discrete approximations of 𝑈 (1).

• In this thesis, we have focused on the Extended 𝑞-state clock model and the Extended-O(2)

model in two dimensions. It might be interesting to study these models in other dimensions

as well.

• In this thesis, we interpolate between the 𝑂 (2) model and its discrete Z𝑞 approximations by

adding a term to the O(2) model which contains a symmetry-breaking parameter ℎ𝑞 and a

discretization parameter 𝑞. We go further and consider 𝑞-state clock models with noninteger

𝑞. It might be interesting to consider a similar approach to interpolate between 𝑂 (3) and

165

various discrete approximations of 𝑂 (3) and further to consider the possible generalizations

of such models to noninteger values of the discretization parameter.

166

CHAPTER 5

QUANTUM FIELD THEORIES ON THE LATTICE

5.1

Introduction

The previous chapters of this thesis focused on the study of spin systems using primarily MCMC

methods. Although these general methods were developed for and work well for classical systems,

one can also apply them to the study of quantum field theories. In fact, the field of lattice QCD uses

many of the same numerical methods that were described in this thesis. As such, in the concluding

chapter of this thesis we give a brief sketch of QCD and how it is studied on the lattice.

The author’s study of the Extended-𝑂 (2) model, which was detailed extensively in the previous

chapter, has several connections to the quantum simulation of quantum field theories. In fact, the

original motivation for studying the model stemmed from recent attempts to quantum simulate

lattice models with continuous Abelian symmetries using discrete approximations. Thus, as a

natural step beyond conventional lattice field theory we conclude with a sketch of the next generation

approach—quantum simulation. But first, we introduce QCD in the continuum.

5.2 QCD in the Continuum

Quantum chromodynamics (QCD), a key component of the Standard Model of particle physics,

is the theory of the strong interaction which occurs between color-charged particles like quarks and

gluons.

The QCD Lagrangian is easy to write down, and it looks like the Lagrangian of simpler theories

such as quantum electrodynamics. However, it is a non-Abelian theory and the gauge field interacts

with itself, making the theory a lot more difficult to deal with. The QCD Lagrangian density is

L =

∑︁

𝜓 𝑓
𝑖

(cid:16)
𝑖𝛾𝜇𝐷 𝜇

𝑖 𝑗 − 𝑚 𝑓 𝛿𝑖 𝑗

(cid:17)

𝜓 𝑓

𝑗 −

𝑓

1
4

𝜇𝜈𝐹 𝜇𝜈
𝐹 𝑎
𝑎 ,

(5.1)

where 𝐷 𝜇

𝑖 𝑗 = 𝜕𝜇𝛿𝑖 𝑗 + 𝑖𝑔(𝑇 𝑎)𝑖 𝑗 𝐴𝜇
𝜈 is the gluon field strength tensor. The quark fields 𝜓 𝑓
𝜇 𝐴𝑐

𝑎 is the gauge covariant derivative, and 𝐹 𝑎

𝑔 𝑓𝑎𝑏𝑐 𝐴𝑏
down, charm, strange, top, bottom} which vary in mass 𝑚 𝑓 . The gluon fields 𝐴𝜇

𝑖 (𝑥) come in six flavors 𝑓 = {up,
𝑎 (𝑥) are coupled

𝜇𝜈 = 𝜕𝜇 𝐴𝑎

𝜈 − 𝜕𝜈 𝐴𝑎

𝜇 −

to the quark fields with coupling strength 𝑔 via the the 𝑆𝑈 (3) generators 𝑇 𝑎 and to each other via

167

𝛼𝑠 (𝑄2)

t
n
e
m
e
n
fi
n
o
c

asymptotic freedom

𝑄2

Figure 5.1 In QCD, the coupling constant 𝛼𝑠 = 𝑔2/4𝜋 varies with the energy scale—a phenomenon
referred to as the “running of the coupling”. As the energy and momentum transfer 𝑄2 increases,
the coupling decreases logarithmically—a phenomenon known as asymptotic freedom. On the
other hand, the coupling increases with decreasing energy, resulting in confinement.

the 𝑆𝑈 (3) structure constants 𝑓𝑎𝑏𝑐. Repeated indices indicate summation with 𝑎, 𝑏, 𝑐 = 1, . . . , 8

with 𝑖, 𝑗 = 1, 2, 3 (color indices) and with 𝜇, 𝜈 = 0, 1, 2, 3 (Dirac indices). The QCD action is the

Lagrangian density integrated over all spacetime

∫

𝑆 =

𝑑4𝑥 L.

(5.2)

At high energies and small length scales, the coupling between color-charged particles becomes

weak. This is the phenomenon of asymptotic freedom [164,165]. See Figure 5.1. In this regime, the

coupling is small, and perturbation theory is applicable. At low energies and larger length scales,

the coupling between particles becomes strong, resulting in the phenomenon of confinement. To

study processes in this regime, one must use nonperturbative methods such as lattice QCD [166].

The Green’s functions, i.e. the time-ordered vacuum expectation values

𝐺 (𝑥1, . . . , 𝑦1, . . . , 𝑧1, . . .) =

(cid:68)0

(cid:12)
(cid:12)
(cid:12)

T

(cid:110) ˆ𝐴(𝑥1) · · · ˆ𝜓(𝑦1) · · ·

ˆ𝜓(𝑧1) · · ·

0(cid:69)

,

(cid:111)(cid:12)
(cid:12)
(cid:12)

(5.3)

encode everything in the theory. The 2-point Green’s function, for example, is the propagator. The

Green’s functions can be written as path integrals

𝐺 (𝑥1, . . . , 𝑦1, . . . , 𝑧1, . . .) =

∫ D 𝐴 D𝜓 D𝜓 𝐴(𝑥1) · · · 𝜓(𝑦1) · · · 𝜓(𝑧1) · · · 𝑒𝑖𝑆[ 𝐴,𝜓,𝜓]
∫ D 𝐴 D𝜓 D𝜓 𝑒𝑖𝑆[ 𝐴,𝜓,𝜓]

(5.4)

168

where the integration goes over all field configurations.

The QCD generating functional takes the form1

𝑍 [𝐽, 𝐽𝜓, 𝐽𝜓] =

∫ D 𝐴 D𝜓 D𝜓 𝑒𝑖 ∫ 𝑑4𝑥

(cid:16)

L [ 𝐴,𝜓,𝜓]+𝐽𝜇 𝐴𝜇+𝐽𝜓𝜓+𝜓𝐽𝜓

∫ D 𝐴 D𝜓 D𝜓 𝑒𝑖𝑆[ 𝐴,𝜓,𝜓]

(cid:17)

.

(5.5)

Notice the similarity between the QCD generating functional and the partition function of a classical

field theory. The generating functional is so-called because one can generate the Green’s functions

by taking functional derivatives as in

𝐺 (𝑥1, . . . , 𝑦1, . . . , 𝑧1, . . .) =

𝛿 · · ·
· · · 𝑖𝛿𝐽 (𝑥1) · · · 𝑖𝛿𝐽𝜓 (𝑦1) · · · 𝑖𝛿𝐽𝜓 (𝑧1)

𝑍 [𝐽, 𝐽𝜓, 𝐽𝜓]

(cid:12)
(cid:12)
(cid:12)
(cid:12)
(cid:12)𝐽=𝐽𝜓=𝐽𝜓=0

.

(5.6)

5.2.1 Perturbative QCD

Here, we give only the barest sketch of the established field of perturbative QCD, mainly to

contrast it later with the ab initio approach of lattice QCD.

The non-interacting Lagrangian density L0 (e.g. L with 𝑔 = 0) is quadratic in each of the fields.

This allows one to “complete the square” and evaluate the field integrals as Gaussian integrals, to

obtain2

𝑍0 [𝐽, 𝐽𝜓, 𝐽𝜓] = N 𝑒𝑖 ∫ 𝑑4𝑥

(cid:16) 1
2 𝐽 𝜇 (𝑥)𝐷 𝜇𝜈 (𝑥−𝑦)𝐽 𝜈 (𝑦)+𝐽𝜓 (𝑥)𝑆𝐹 (𝑥−𝑦)𝐽𝜓 (𝑦)

(cid:17)

,

(5.7)

where 𝐷 𝜇𝜈 (𝑥 − 𝑦) is the free gauge field propagator, 𝑆𝐹 (𝑥 − 𝑦) is the free fermion propagator, and

N is an undetermined constant factor. Then the generating functional of the full theory can be

obtained perturbatively by expanding in 𝑔

𝑍 [𝐽, 𝐽𝜓, 𝐽𝜓] = N exp

∫

(cid:40)

𝑖

𝑑4𝑧 L𝑖𝑛𝑡𝑒𝑟𝑎𝑐𝑡𝑖𝑛𝑔

(cid:32)

−𝑖

𝛿
𝛿𝐽

, 𝑖

𝛿
𝛿𝐽𝜓

, −𝑖

𝛿
𝛿𝐽𝜓

(cid:33)(cid:41)

𝑍0 [𝐽, 𝐽𝜓, 𝐽𝜓]

(5.8)

See, for example, [167]. Notice that the infinite-dimensional integrals over the field configurations

(with measures D 𝐴 D𝜓 D𝜓) are gone now, so there is a possibility of actually evaluating some of

these integrals.

1In practice, there may also be “ghost” sources, and the Lagrangian may contain ghost fields and a gauge fixing

term.

2This is simply a schematic picture as we are neglecting things like gauge fixing and Fadeev-Popov terms.

169

To connect to collider experiments, one may be interested in computing the elements of the

scattering matrix, as these can be related to collider observables such as cross-sections and decay

widths. For scattering, one assumes an “incoming” state |{𝑝𝑖}⟩ prepared in the distant past with

a definite number of particles with definite momenta {𝑝𝑖}. The particles interact in a localized

volume and a localized time and scatter from each other. In the far future, the “outgoing” state

|{𝑘 𝑗 }⟩ particles are assumed to no longer be interacting. Then the elements of the scattering

matrix are denoted 𝑆 𝑗𝑖 = ⟨{𝑘 𝑗 }|{𝑝𝑖}⟩, and these are related to the momentum space Green’s

functions. One approach of “perturbative QCD” is to expand such momentum space path integrals

in powers of the coupling constant 𝑔. It turns out that there is an extremely useful diagrammatic

representation—called “Feynman diagrams”—of the resulting integrals.

Unfortunately, many of the integrals in the perturbative expansion are divergent. Some diverge

in the IR (energy → 0) limit and others diverge in the UV (momentum → ∞) limit. To resolve the

divergences, the integrals must be regularized in some manner. There are several regularization

approaches including Pauli-Villars which regulates the UV divergences, adding small masses which

regulates the IR divergences, and dimensional regularization which cures both types of divergences.

In dimensional regularization, one exploits the fact that the integrals converge in higher dimensions

to solve the integrals in arbitrary 𝑑 dimensions and then take the limit 𝑑 → 4 in the end. This

approach is often preferred because it preserves both Lorentz invariance and gauge invariance.

Dimensional regularization allows one to evaluate the integrals, which are divergent in 𝑑 = 4

dimensions, by going to arbitrary dimensions 𝑑. However, the results remain divergent when

one takes the limit 𝑑 → 4. To connect the theory with the physical world, one can directly or

indirectly redefine the fields or parameters of the theory via physical observables. In effect, one

can absorb the divergent terms into a redefinition of the fields or parameters in a process called

“renormalization”. At the lowest order in perturbation theory, the parameters of the theory (e.g.

quark masses) are physically meaningful. However, at higher orders where the divergences occur,

these “bare” parameters are no longer the physical parameters. Rather, it is the renormalized

parameters that are physically meaningful.

170

Perturbative QCD at small 𝛼𝑠 has proven very useful, however, at some point, the coupling is

no longer a small parameter and perturbation theory can no longer be used. Even in the regimes

where it does work, convergence is slow—even at large momentum—and this makes perturbative

QCD calculations very difficult.

5.3 Lattice Regularization

Here, we follow closely Ref. [66]. Recall that the vacuum expectation value of an observable

𝑂 can be written as a path integral

⟨𝑂⟩ =

∫ D 𝐴 D𝜓 D𝜓 𝑂 [ 𝐴, 𝜓, 𝜓] 𝑒𝑖𝑆[ 𝐴,𝜓,𝜓]
∫ D 𝐴 D𝜓 D𝜓 𝑒𝑖𝑆[ 𝐴,𝜓,𝜓]

.

(5.9)

See for example, Eq. (5.3) and (5.4). The question is how to evaluate such integrals. These

are infinite-dimensional integrals over the field configurations. Even worse, the integrands are

strongly oscillating (note the imaginary unit in the exponentials), and so they behave very poorly

under numerical integration.

In the approach of perturbative QCD, such path integrals can be

expanded in powers of the coupling 𝛼𝑠 = 𝑔2/4𝜋. However, this only works in the very high energy

perturbative regime and even there, the convergence is slow.

Recall that in the canonical ensemble of classical statistical mechanics, the equilibrium expec-

tation value of an observable 𝑂 is

⟨𝑂⟩ =

∫ D𝑈 𝑂 (𝑈) 𝑒−𝛽𝐸 (𝑈)
∫ D𝑈 𝑒−𝛽𝐸 (𝑈)

,

(5.10)

where the integral is over all possible microstates 𝑈 of the system, 𝐸 (𝑈) is the energy of the

microstate, 𝑂 (𝑈) is the value of the observable on that microstate, and 𝛽 is the inverse temperature.

See Eq. (3.3). In lattice QCD, we exploit the similarity between Eq. (5.9) and (5.10) to reinterpret

the QCD expectation value as an expectation value in classical statistical physics. That is, we plan

to evaluate objects like Eq. (5.9) with some kind of Monte Carlo integration. However, first we

need to put QCD on a lattice.

Defining QCD on a finite lattice has several important consequences:

1. The infinite-dimensional integrals become finite-dimensional integrals

171

2. The lattice serves as a regularizer—removing the divergences from the theory. The finite

lattice spacing removes the UV divergences by limiting the momenta, and the finite lattice

volume removes the IR divergences

3. To connect to the physical world, one must carefully take the continuum (lattice spacing → 0)

and thermodynamic (volume → ∞) limits of any observable calculated on the lattice

4. Much like the continuum theory, the lattice theory renormalizes the parameters. To connect

lattice results with the physical world, one must find a way to relate a lattice observable to a

real world quantity—a process called “setting the scale”

For later convenience, we rescale the gluon fields as

𝐴𝜇 (𝑥) →

1
𝑔

𝐴𝜇 (𝑥).

Then the QCD action becomes

∫

𝑆 =

𝑑4𝑥 ∑︁
𝑓

𝜓 𝑓 (cid:0)𝑖𝛾𝜇 (𝜕 𝜇 + 𝑖 𝐴𝜇) − 𝑚 𝑓 (cid:1) 𝜓 𝑓 −

∫

𝑑4𝑥

1
4𝑔2

𝜇𝜈𝐹 𝜇𝜈
𝐹 𝑎
𝑎 ,

(5.11)

(5.12)

with 𝐹 𝑎

𝜇𝜈 = 𝜕𝜇 𝐴𝑎

𝜈 − 𝜕𝜈 𝐴𝑎

𝜇 − 𝑓𝑎𝑏𝑐 𝐴𝑏

𝜇 𝐴𝑐

𝜈. Now the coupling 𝑔 appears only as a factor in front of the

gauge term. From here on, we neglect the color indices 𝑖, 𝑗 that appear in Eq. (5.1).

One key difference between Eq. (5.9) and (5.10) is the complex exponential in the first. To

reinterpret the QCD expectation value as a classical expectation value, we need to perform a Wick

rotation to imaginary time (𝑡 → 𝑖𝑡). This yields the Euclidean QCD action

𝑆𝐸 =

𝑑4𝑥 ∑︁
𝑓
(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)

𝜓 𝑓

∫

(cid:124)

(cid:0)𝛾𝜇 (𝜕𝜇 + 𝑖 𝐴𝜇) + 𝑚 𝑓 (cid:1) 𝜓 𝑓

+

(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)

(cid:125)

(cid:123)(cid:122)
𝑆𝐹

∫

(cid:124)

𝑑4𝑥
(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)

1
4𝑔2
(cid:123)(cid:122)
𝑆𝐺

𝜇𝜈𝐹 𝑎
𝐹 𝑎
𝜇𝜈
(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)(cid:32)

(cid:125)

,

(5.13)

where 𝑆𝐹 is the fermion part of the action and 𝑆𝐺 is the gauge part of the action. Note that

the lattice works as a regulator whether it’s Euclidean or Minkowski. The reason to go to a

Euclidean lattice is so that 𝑒𝑖𝑆 → 𝑒−𝑆𝐸 becomes real and can be interpreted as a probability

weight. Perhaps confusingly, the gamma matrices 𝛾𝜇 in this expression are now the Euclidean

172

Figure 5.2 The variable 𝑈𝜇 (𝑛) is associated with the link pointing in the direction ˆ𝜇 from the lattice
site 𝑛 to the site 𝑛 + ˆ𝜇.

gamma matrices which obey the anti-commutation relations {𝛾𝑢, 𝛾𝜈} = 2𝛿𝜇𝜈I instead of the usual

Minkowski gamma matrices which obey the anti-commutation relations {𝛾𝑢, 𝛾𝜈} = 2𝑔𝜇𝜈I where

𝑔𝜇𝜈 is the Minkowski metric tensor 𝑔𝜇𝜈 = diag(1, −1, −1, −1).

We want to discretize Eq. (5.13) for a lattice. We construct a 4-dimensional Euclidean lattice

with sites labeled by 𝑛 = (𝑛1, 𝑛2, 𝑛3, 𝑛4) and separated by lattice spacing 𝑎. We define the quark

fields on the lattice sites

𝜓(𝑥) −→ 𝜓(𝑛).

We can use a symmetric finite-difference approximation of the derivative

𝜕𝜇𝜓(𝑛) −→

𝜓(𝑛 + ˆ𝜇) − 𝜓(𝑛 − ˆ𝜇)
2𝑎

.

(5.14)

(5.15)

Thus for a single free fermion (i.e. no gauge fields), we obtain the discretized fermion action

𝐹 = 𝑎4 ∑︁
𝑆0

𝑛

4
∑︁

𝜇=1

𝜓(𝑛) (cid:169)
(cid:173)
(cid:171)

𝛾𝜇

𝜓(𝑛 + ˆ𝜇) − 𝜓(𝑛 − ˆ𝜇)
2𝑎

.

+ 𝑚𝜓(𝑛)(cid:170)
(cid:174)
(cid:172)

(5.16)

In the continuum, gauge invariance of the free fermion action requires the introduction of a

direction-dependent gauge field

𝐴𝜇 (𝑥) =

8
∑︁

𝑖=1

𝐴(𝑖)

𝜇 (𝑥)𝑇𝑖 ∈ 𝑠𝑢(3),

(5.17)

where the 𝑇𝑖 are generators of the Lie group 𝑆𝑈 (3), and the 𝐴𝜇 (𝑥) are elements of the Lie algebra

𝑠𝑢(3). On the lattice, one plays a similar game to enforce gauge invariance, but now it turns out

that one needs a field of direction-dependent objects which are elements 𝑈𝜇 (𝑛) of the 𝑆𝑈 (3) Lie

group instead of the algebra. They are related to the original fields 𝐴𝜇 (𝑥) as

𝑈𝜇 (𝑛) = 𝑒𝑖𝑎 𝐴𝜇 (𝑛) ∈ 𝑆𝑈 (3).

(5.18)

173

Figure 5.3 In lattice QCD, the quarks fields are defined on the lattice sites and the gauge fields are
defined on the links between the sites. Gauge invariant objects can be constructed from closed
loops and from paths going from one quark field to another. The ordered product of link variables
along an elementary square is called a plaquette. The image on the right is originally from JICFus.

By convention, we associate these direction-dependent objects with the “links” between the lattice

sites as illustrated in Figure 5.2.

This leads us to the so-called “naive” fermion action for a single flavor of fermion

𝑆𝐹 = 𝑎4 ∑︁

𝜓(𝑛)

𝑛

4
∑︁

𝜇=1

𝛾𝜇

𝑈𝜇 (𝑛)𝜓(𝑛 + ˆ𝜇) − 𝑈−𝜇 (𝑛)𝜓(𝑛 − ˆ𝜇)
2𝑎

+ 𝑚𝜓(𝑛)

.

(5.19)















Although this lattice fermion action has the right continuum limit, it generates unwanted additional

fermions. This “doubling problem” can be resolved by adding an additional term to obtain the

“Wilson” fermion action. This is beyond the scope of this thesis, but detailed discussions can be

found in any textbook on LQCD.

For the lattice gauge action, our first priority is to construct something that reduces to the

continuum gauge action in the continuum limit 𝑎 → 0. To do this, we need gauge-invariant objects.

It turns out that taking the ordered product of link variables along a closed loop and then tracing

over the final matrix results in a gauge-invariant object. The ordered product of link variables

around the simplest closed loop, called the plaquette (see Figure 5.3), is

□𝜇𝜈 (𝑛) = 𝑈𝜇 (𝑛)𝑈𝜈 (𝑛 + ˆ𝜇)𝑈†

𝜇 (𝑛 + ˆ𝜈)𝑈†

𝜈 (𝑛).

(5.20)

174

The trace of plaquettes is one of the simplest gauge-invariant objects on the lattice, and one can use

them to construct the Wilson gauge action

𝑆𝐺 =

2
𝑔2

∑︁

∑︁

Re Tr (cid:16)1 − □𝜇𝜈 (𝑛)

(cid:17)

.

𝑛

𝜇<𝜈

(5.21)

It can be shown that in the continuum limit, this recovers the continuum gauge action, e.g. Ref. [66].

With Eq. (5.19) and (5.21) we now have a full albeit rudimentary lattice definition of QCD.

Next we shall see what we can do with this.

5.4 MCMC

As before, the main objects are correlators i.e. Green’s functions as in Eq. (5.9). But now,

instead of doing a perturbative expansion, Feynman diagrams, and loop integrals, we reinterpret an

equilibrium QFT as a classical statistical physics problem. That is, we estimate the Euclidean path

integrals (obtained by Wick rotation 𝑖𝑆 → −𝑆𝐸 )

⟨𝑂⟩ =

∫ D𝑈 D𝜓 D𝜓 𝑂 [𝑈, 𝜓, 𝜓] 𝑒−𝑆𝐸 [𝑈,𝜓,𝜓]
∫ D𝑈 D𝜓 D𝜓 𝑒−𝑆𝐸 [𝑈,𝜓,𝜓]

.

(5.22)

via Markov chain Monte Carlo (MCMC). At this stage, all quantities are defined in terms of a

discrete spacetime lattice.

5.4.1 MCMC with Pure Gauge

We start with the simpler example of “pure gauge”, in which the theory consists only of

gluons—no quarks. The vacuum expectation value of observables is

⟨𝑂⟩ =

∫ D𝑈 𝑂 [𝑈]𝑒−𝑆𝐺 [𝑈]
∫ D𝑈 𝑒−𝑆𝐺 [𝑈]

.

(5.23)

Compare this with the classical path integral Eq. (5.10). We can interpret Eq. (5.23) as the

classical expectation value of 𝑂 [𝑈] where the 𝑈 are importance-sampled with Boltzmann factor

𝑒−𝑆𝐺 [𝑈]. That is, the action 𝑆𝐺 [𝑈] plays the role of the classical energy quantity 𝛽𝐸 (𝑈). This

reinterpretation is only possible because the Wick rotation converted the complex exponential into

a real exponential which can be interpreted as a probability density. Using the approach detailed in

Section 3.1 for classical spin systems, we can estimate Eq. (5.23) with a Monte Carlo simulation

𝑂 [𝑈𝑛],

⟨𝑂⟩ ≈

1
𝑁

∑︁

𝑈𝑛

175

(5.24)

provided a sample of 𝑁 gauge configurations 𝑈𝑛, distributed with probability ∝ 𝑒−𝑆𝐺 [𝑈]. The

statistical error of the mean is

𝑁
As we did for classical spin systems in Section 3.1, we sample lattice configurations 𝑈 by using

(5.25)

1
√

∝

.

a Markov chain. Now instead of configurations having e.g. a 2-component spin at each lattice site,

we have a complex 3 × 3 𝑆𝑈 (3) matrix at each lattice link.

To get {𝑈𝑛} distributed with probability ∝ 𝑒−𝑆𝐺 [𝑈], we want a Markov process (aka updating

algorithm) that starts with an arbitrary configuration 𝑈0 and stochastically generates more

𝑈0

𝑢 𝑝𝑑𝑎𝑡𝑒
−−−−−→ 𝑈1

𝑢 𝑝𝑑𝑎𝑡𝑒
−−−−−→ 𝑈2

𝑢 𝑝𝑑𝑎𝑡𝑒
−−−−−→ · · ·

(5.26)

To sample the desired distribution, it is sufficient to show that for every pair of configurations 𝑈

and 𝑈′, the Markov process satisfies the detailed balance condition [60, pp 289]

𝑒−𝑆𝐺 [𝑈] 𝑃(𝑈 → 𝑈′) = 𝑒−𝑆𝐺 [𝑈′] 𝑃(𝑈′ → 𝑈),

(5.27)

where 𝑃(𝑈 → 𝑈′) is the probability that the process generates 𝑈′ given 𝑈. The process must also

be ergodic i.e. it must be able to access the whole space of configurations.

The prototypical Markov process is the Metropolis algorithm:

1. Given current state 𝑈, generate a new candidate 𝑈′ via some random process that selects

uniformly on the group

2. Accept 𝑈′ as the new configuration with probability

𝑃𝐴 (𝑈 → 𝑈′) = min (cid:16)1, 𝑒𝑆𝐺 [𝑈′]−𝑆𝐺 [𝑈] (cid:17)

.

(5.28)

3. Repeat

Alternatives include the heatbath algorithm [168–171] in which Steps 1 and 2 are combined in an

optimized way. The implementation depends on the details of the gauge group. Another approach

is that of hybrid Monte Carlo (HMC) [172], which uses molecular dynamics evolution, and ensures

detailed balance by a Metropolis accept/reject step at the end. The HMC algorithm is the standard

gauge configuration updating algorithm for fermionic lattice QFTs.

176

5.4.2 MCMC with Fermions

To include fermions in our lattice MCMC, we return to the full (Euclidean) path integral

⟨𝑂⟩ =

∫ D𝑈 D𝜓 D𝜓 𝑂 [𝑈, 𝜓, 𝜓] 𝑒−𝑆𝐸 [𝑈,𝜓,𝜓]
∫ D𝑈 D𝜓 D𝜓 𝑒−𝑆𝐸 [𝑈,𝜓,𝜓]

.

(5.29)

The Euclidean action is composed of the gauge action and fermionic action, and it can be written

in the form

𝑆𝐸 [𝑈, 𝜓, 𝜓] = 𝑆𝐺 [𝑈] + 𝑆𝐹 [𝑈, 𝜓, 𝜓] = 𝑆𝐺 [𝑈] + 𝜓𝑀 [𝑈]𝜓

(5.30)

where the “Dirac operator” 𝑀 [𝑈] depends on the gauge fields but not the quark fields 𝜓 and 𝜓.

The fermionic variables 𝜓 and 𝜓 are not ordinary numbers or matrices like we are used to dealing

with in classical MCMC. Rather, they are non-commuting Grassman variables. These must be

integrated out in some way before we can proceed with MCMC.

We start with the Matthews-Salam formula [173, 174]. Given a Grassman algebra with 2𝑁

generators, 𝜃𝑖, 𝜃𝑖 for 𝑖 = 1, 2, . . . , 𝑁 and some complex 𝑁 × 𝑁 matrix 𝑄, it can be shown that

∫

DΘDΘ 𝑒Θ𝑄Θ ≡

(cid:32)

∫

(cid:214)

(cid:33)

𝑑𝜃𝑖𝑑𝜃𝑖

𝑖

𝑒(cid:205)𝑖, 𝑗 𝜃𝑖𝑄𝑖 𝑗 𝜃 𝑗 = det[𝑄].

(5.31)

We can use this to rewrite the denominator of Eq. (5.29) as

∫

𝑍 =

D𝑈 D𝜓 D𝜓 𝑒−𝑆𝐸 [𝑈,𝜓,𝜓] =

∫

D𝑈 𝑒−𝑆𝐺 [𝑈] det 𝑀 [𝑈].

(5.32)

Unless the observable 𝑂 is independent of the fermion fields 𝜓 and 𝜓, the same result cannot be

used to simplify the numerator of Eq. (5.29). However, if 𝑂 is bilinear in the fermion fields, then

one can use a generalization of Eq. (5.31). The end result (see LQCD textbooks such as [66] for

the details) is that Eq. (5.29) can be written as

⟨𝑂⟩ =

∫

1
𝑍

D𝑈 𝑂 [𝑈, 𝑀 −1 [𝑈]] 𝑒−𝑆𝐺 [𝑈] det 𝑀 [𝑈],

(5.33)

where 𝑍 is given by Eq. (5.32).

The Grassman variables are now gone from Eq. (5.33), so provided that det 𝑀 [𝑈] is real and

nonnegative, then we can once again interpret ⟨𝑂⟩ as a classical path integral and use MCMC to

177

estimate the observable 𝑂. Now, we sample the configurations 𝑈 from the probability distribution

𝑃(𝑈) =

1
𝑍

𝑒−𝑆𝐺 [𝑈] det 𝑀 [𝑈].

Then as before,

∫

⟨𝑂⟩ =

D𝑈 𝑃(𝑈)𝑂 [𝑈, 𝑀 −1 [𝑈]] ≈

1
𝑁

∑︁

𝑈𝑛

𝑂 [𝑈𝑛].

(5.34)

(5.35)

We cannot generally ensure that det 𝑀 [𝑈] is real and nonnegative. However, we can for the

case of two mass-degenerate quark flavors. Fortunately, QCD with two mass-degenerate dynamical

light quarks is a pretty good approximation of full QCD, so we will use that example here. For two

degenerate flavors, Eq. (5.34) becomes

𝑃(𝑈) =

1
𝑍

𝑒−𝑆𝐺 [𝑈] det 𝑀𝑢 [𝑈] det 𝑀𝑑 [𝑈] =

1
𝑍

𝑒−𝑆𝐺 [𝑈] det (cid:2)𝑀 𝑀 †(cid:3) ,

(5.36)

where 𝑀𝑢 refers to the up quark and 𝑀𝑑 refers to the down quark. Here, we combined the two quark

determinants since for degenerate flavors, they are the same. Thus, we are assured that det[𝑀 𝑀 †]

is real and nonnegative and that 𝑃(𝑈) can be interpreted as a probability distribution function.

Note that 𝑀 [𝑈] is a huge matrix of size proportional to 𝑉 × 𝑉 where 𝑉 is the volume of the

lattice. So for even a moderately sized lattice with 𝑉 = 323 × 64, 𝑀 [𝑈] has millions of rows

and millions of columns. This makes the direct calculation of det 𝑀 infeasible. Also, note that

𝑀 [𝑈] is a functional of the lattice configuration 𝑈, and so it has to be recomputed for each lattice

configuration. It turns out that one can express the fermion determinant as a Gaussian integral of a

bosonic “pseudofermion” field (see LQCD textbooks e.g. [66] for details)

det (cid:2)𝑀 𝑀 †(cid:3) ∝

∫

D𝜙†D𝜙 𝑒−𝜙† (𝑀 𝑀 †) −1𝜙,

(5.37)

where 𝜙† and 𝜙 are bosonic fields. We have effectively traded the determinant det[𝑀 𝑀 †] for the

inverse (𝑀 𝑀 †)−1. The inverse (𝑀 𝑀 †)−1 is a very large matrix of size proportional to 𝑉 × 𝑉 and

highly nonlocal, however, now there is the possibility of using iterative methods to approximate

𝑉-length vectors of the form (𝑀 𝑀 †)−1𝜙. With this, our path integral becomes

∫

⟨𝑂⟩ =

D𝑈D𝜙†D𝜙 𝑃(𝑈)𝑂 [𝑈, 𝑀 −1] with 𝑃(𝑈) =

1
𝑍

𝑒−𝑆𝐺 [𝑈]−𝜙† (𝑀 𝑀 †) −1𝜙.

(5.38)

178

In principle, we could use the Metropolis algorithm as before. However, Metropolis is a local

algorithm and applying it to a problem like this would be abysmally inefficient. The solution is to

use a global updating algorithm like Hybrid Monte Carlo (HMC) or Rational Hybrid Monte Carlo

(RHMC) [175, 176].

5.4.3 The Hybrid Monte Carlo (HMC) Algorithm

For simplicity, we consider a real scalar field 𝑄. The vacuum expectation value of an observable

𝑂 is given by

⟨𝑂⟩ =

∫ D𝑄 𝑂 [𝑄]𝑒−𝑆[𝑄]
∫ D𝑄 𝑒−𝑆[𝑄]

.

(5.39)

We introduce a field 𝑃 of momenta conjugate to the field variables 𝑄. Since the observable 𝑂 does

not depend on 𝑃, we can just as well compute ⟨𝑂⟩ as

⟨𝑂⟩ =

∫ D𝑄D𝑃 𝑂 [𝑄]𝑒− 1
∫ D𝑄D𝑃 𝑒− 1

2 𝑃2−𝑆[𝑄]

2 𝑃2−𝑆[𝑄]

.

(5.40)

Note that Eq. (5.40) is equivalent to Eq. (5.39) since the Gaussian integrals over 𝑃 cancel out in the

numerator and denominator. However, Eq. (5.40) can be interpreted as the vacuum expectation value

of an observable 𝑂 [𝑄] in a classical and nonrelativistic microcanonical system with Hamiltonian

𝐻 [𝑄, 𝑃] =

1
2

𝑃2 + 𝑆[𝑄].

(5.41)

For such a system, the classical equations of motion (with respect to a fictitious “computer” time)

are

(cid:164)𝑃 = −

𝜕𝐻
𝜕𝑄

= −

𝜕𝑆
𝜕𝑄

,

(cid:164)𝑄 =

𝜕𝐻
𝜕𝑃

= 𝑃.

(5.42)

Thus, given a configuration 𝑄, we can invent some conjugate field 𝑃 and then evolve the two

forward in fictitious time by integrating the “molecular dynamics equations” Eq. (5.42) to get a new

configuration 𝑄′. Doing this repeatedly results in an ensemble of configurations, upon which one

can estimate ⟨𝑂⟩.

The evolution of Eq. (5.42) is along a surface of constant energy. To obtain configurations at

different energy, the conjugate field 𝑃 is periodically refreshed with random values. In practice,

the integration of Eq. (5.42) cannot be done exactly due to discretization errors. To correct for this,

179

a Metropolis accept-reject step is performed at the end of each evolution “trajectory”—accepting

the new configuration with a probability that depends on the change of the total Boltzman factor.

5.4.4 Example: Hadron Spectroscopy

Before we conclude this brief overview of lattice QCD, it will be helpful to consider an example

of the kind of ab initio quantities that can be computed in LQCD. We sketch a simple example

here—measuring the mass of a pion. Specifically, we will focus on the 𝜋+ meson. We follow

closely Ref. [66].

We start by identifying lattice interpolating operators 𝑂 and 𝑂 for the pions such that the

corresponding Hilbert space operators ˆ𝑂 and ˆ𝑂† annihilate and create the pions. These operators

are functionals of the lattice fields, and have the quantum numbers of the pion. We can use

𝑂𝜋+ (𝑛) = 𝑑 (𝑛)𝛾5𝑢(𝑛),

𝑂𝜋+ (𝑚) = 𝑢(𝑚)𝛾5𝑑 (𝑚),

(5.43)

where 𝑢, 𝑢, 𝑑, and 𝑑 are the up and down quark spinors.

Next, we construct 2-point correlators from these interpolating operators. For example, the 𝜋+

correlator

(cid:68)

𝑂𝜋+ (𝑛)𝑂𝜋+ (𝑚)

(cid:68)

(cid:69)

=

𝑑 (𝑛)𝛾5𝑢(𝑛)𝑢(𝑚)𝛾5𝑑 (𝑚)

(cid:69)

.

(5.44)

The fermionic part of the expectation value can be computed in closed form. After simplifying and

making use of Wick’s theorem, it can be shown that (see any LQCD textbook for details)

(cid:68)

𝑂𝜋+ (𝑛)𝑂𝜋+ (𝑚)

(cid:69)

𝐹

= −tr (cid:2)𝛾5𝑀 −1

𝑢 (𝑛, 𝑚)𝛾5𝑀 −1

𝑑 (𝑚, 𝑛)(cid:3) ,

(5.45)

where 𝑀 −1 is the inverse of the Dirac operator, and can be understood as a quark propagator. The

object on the right-hand side is something that can be measured on the lattice. However, given that

𝑀 −1 is a highly nonlocal matrix of size proportional to 𝑉 × 𝑉 where 𝑉 is the volume of the lattice,

measuring this quantity is a highly nontrivial endeavor. It is done by inserting “quark sources” and

then applying iterative methods such as the conjugate gradient algorithm. The subscript 𝐹 on the

left indicates that this is only the fermionic part of the expectation value. The gauge part remains.

180

(5.46)

(5.47)

(5.48)

(5.49)

The full expectation value is the path integral

(cid:68)

𝑂𝜋+ (𝑛)𝑂𝜋+ (𝑚)

(cid:69)

= −

∫

1
𝑍

where

D𝑈 𝑒−𝑆𝐺 [𝑈] det[𝑀𝑢] det[𝑀𝑑] tr (cid:2)𝛾5𝑀 −1

𝑢 (𝑛, 𝑚)𝛾5𝑀 −1

𝑑 (𝑚, 𝑛)(cid:3) ,

∫

𝑍 =

D𝑈 𝑒−𝑆𝐺 [𝑈] det[𝑀𝑢] det[𝑀𝑑].

As before, we assume mass-degenerate quarks, and we express the quark determinants in terms of

D𝑈D𝜙†D𝜙 𝑒−𝑆𝐺 [𝑈]−𝜙† (𝑀 𝑀 †) −1𝜙 tr (cid:2)𝛾5𝑀 −1(𝑛, 𝑚)𝛾5𝑀 −1(𝑚, 𝑛)(cid:3) ,

pseudofermion fields. Then,

(cid:68)

𝑂𝜋+ (𝑛)𝑂𝜋+ (𝑚)

(cid:69)

= −

∫

1
𝑍

where

∫

𝑍 =

D𝑈D𝜙†D𝜙 𝑒−𝑆𝐺 [𝑈]−𝜙† (𝑀 𝑀 †) −1𝜙.

On the other hand, after appropriate momentum projection, one can write such correlators as a

spectral decomposition

(cid:68)

𝑂𝜋+ ((cid:174)0, 𝑛𝑡)𝑂𝜋+ ((cid:174)0, 0)

(cid:69)

=

∑︁

𝑘

⟨0| ˆ𝑂|𝑘⟩⟨𝑘 | ˆ𝑂†|0⟩𝑒−𝑛𝑡 𝐸𝑘 = 𝐴0𝑒−𝑛𝑡 𝐸0 + 𝐴1𝑒−𝑛𝑡 𝐸1 + · · · .

(5.50)

Since the lowest energy 𝐸0 corresponds to the rest mass of the particle, this gives us a way to

connect the pion correlator to the mass of the pion.

So to compute the 𝜋+ pion mass using LQCD, one would proceed as follows:

1. Generate, using e.g. the HMC algorithm, an ensemble of 𝑁 de-correlated gauge configura-

tions 𝑈 using the distribution weight

𝑃(𝑈) =

1
𝑍

𝑒−𝑆𝐺 [𝑈]−𝜙† (𝑀 𝑀 †) −1𝜙.

(5.51)

2. On each of these configurations, measure the pion correlator

𝐶 (𝑛𝑡) = − tr (cid:104)

𝛾5𝑀 −1((cid:174)0, 𝑛𝑡, (cid:174)0, 0)𝛾5𝑀 −1((cid:174)0, 0, (cid:174)0, 𝑛𝑡)

(cid:105)

(5.52)

for different 𝑛𝑡

181

3. Extract the mass 𝑚 from fits to exponential decays

𝐶 (𝑛𝑡) ≈ 𝐴0𝑒−𝑛𝑡 𝑚

(5.53)

4. Average the measurements over the ensemble of configurations (Eq. (5.24)) to obtain the best

estimate

5. Repeat for different lattice sizes and extrapolate to zero lattice spacing and infinite lattice

volume

5.4.5 Challenges

As we’ve seen in the preceding section, for lattice QCD with dynamical fermions, the distri-

bution weight for the gauge fields is proportional to determinants of huge matrices. Similarly,

the calculation of correlators requires the inversion of these huge matrices. Because of their size,

neither the determinants nor the inverses of these matrices can be calculated directly, and so the

state-of-the-art in lattice QCD is algorithms which allow us to estimate these quantities indirectly.

Even the most basic calculations are difficult and necessitate the use of the latest technologies

(GPUs, TPUs, etc.) and high performance algorithms such as multigrid methods.

Conventional lattice QCD works because by Wick-rotating the path integrals to imaginary

time, we are able to reinterpret those path integrals as classical partition functions of a system

in a heat bath at some particular temperature and thereby employ our extensive classical MCMC

methods. However, this means we are always studying a system in equilibrium. The study of

real-time phenomena such as evolution and decay dynamics is completely out of reach with this

approach. To study real-time lattice QCD, we will need completely new approaches such as

quantum simulation.

A similar problem occurs with nonzero chemical potential.

In the vacuum, the chemical

potential (i.e. the net number of baryons or quarks) is zero. However, this is not true in matter, and

especially not in extreme situations such as in heavy-ion collisions or in ultra-dense matter such

as neutron stars. In conventional lattice QCD, calculations are done at zero (or very nearly zero)

182

chemical potential and for good reason. It turns out that introducing a nonzero chemical potential

results in a complex fermion determinant, which means we can no longer interpret the path integral

as a classical partition function.

5.5 Quantum Simulation of Lattice Quantum Field Theories

5.5.1

Introduction

In a quantum many-body system, the Hilbert space grows exponentially with the system size.

The simulation of such systems in the strongly-coupled regime is an extremely hard problem for

classical computers. Even when we simplify to equilibrium systems in Euclidean time, as we

do in lattice QCD, our conventional methods suffer from sign problems. Real-time dynamics are

laughably out of reach. But what if we could construct a simple quantum system that emulates a

quantum field theory in some way? By using quantum objects instead of classical bits, the expo-

nential growth of the Hilbert space with the system size would match that of the QFT. Furthermore,

nothing would prevent us from evolving the simple system forward in time to gain insight into the

dynamics of the QFT.

The idea of quantum simulation seems to have originated with Richard Feynman [177]. He

gave a talk in 1982 in which he noted that phenomena in quantum field theory, assuming a theory

in which spacetime is discrete, can be imitated by phenomena in solid state physics. In a way, a

lattice of atoms can imitate quantum field theory. He postulated, therefore, that one could build

quantum simulators in the lab which could tell us things about quantum field theory:

“I [. . .] believe it’s true that with a suitable class of quantum machines you could imitate

any quantum system, including the physical world.”

In this section, we consider the quantum simulation of quantum field theories. That is, we

consider quantum systems that can be made to emulate a quantum field theory in some manner

such that a measurement of the quantum system gives us information about the quantum field

theory. There are really two distinct classes of quantum simulators. An analog quantum simulator

is a special-purpose device—built and tuned specifically to emulate the Hamiltonian of a specific

183

quantum field theory.

In an analog simulation, time is not discretized—the simulator evolves

continuously in real time. This may be accomplished by setting up and tuning the device, and then

letting it evolve naturally on its own. An example would be cold atoms hopping around on an optical

lattice. In a digital quantum simulator, the time evolution proceeds stroboscopically typically via

the successive application of quantum logic gates. The simulator could be a special-purpose device

or it could be a universal quantum computer. The drawback of digital simulations is a need for

error correction which means that useful results may be decades in the future.

In lattice QCD, spacetime is discretized and one typically works in the Lagrangian formalism.

In both digital and analog quantum simulations, only space is discretized at the beginning, and one

works in the Hamiltonian formalism. In a digital simulation, time is eventually discretized and time

evolution proceeds stroboscopically. In an analog simulation, time never gets discretized. Rather,

the system is prepared in some initial state and then allowed to evolve in real time. General reviews

of both analog and digital quantum simulation include [79, 80, 178, 179], and there is even a recent

textbook on the subject [180].

5.5.2 Analog Simulation

In lieu of the preferred analytical solution, the modern scientist often resorts to computational

simulation. But even that can be so difficult that one must occasionally resort to physical models.

In general, the simulation of natural phenomena, such as fluid dynamics, is computationally

difficult. So difficult that scale models and wind tunnels are still used to study aerodynamic forces

on vehicles. The validity of this kind of analog simulation rests on there being some kind of

isomorphism between the problem of interest (e.g. vehicle aerodynamics) and the physical model

(e.g. scale model in a wind tunnel). In the case of a scale model, the similarity is obvious. But in

analog simulation, the similarity is not always obvious. Consider for example that electric circuits

have been used to model neural systems [181] and that the flow of water in a set of tanks and pipes

has been used to model the economy of the United Kingdom [182]. The general idea of analog

simulation is to map a complex system of interest to a simpler physical system which can be built

and studied in the lab. For example, the analog simulation of black holes using Bose-Einstein

184

condensates has been proposed [183, 184]. In our case, we want to map complicated systems like

lattice quantum field theories to simpler quantum-mechanical systems that we can build and study

in the lab.

In an analog quantum simulation, a Hamiltonian of interest (e.g. of a gauge theory) is mapped

to a simpler and controllable quantum system (e.g. an optical lattice). After setup and initialization,

the simple system is evolved forward in time. At the end, some quantity is measured on the simple

system and interpreted in terms of the system of interest. Analog simulation of gauge theories is

promising because it would bypass the sign problems inherent in conventional lattice QCD and

allow us to study the real-time dynamics of the theory [86].

Analog quantum simulators are purpose-built devices—they are not universal quantum com-

puters. They are allowed to evolve in real time as opposed to digital simulators which are strobo-

scopically evolved in computer time by the application of quantum gates. By mapping the degrees

of freedom onto continuous interactions, an analog simulator requires fewer qubits than a digital

algorithm. This makes the analog simulator potentially more efficient than digital simulators in

the current noisy intermediate-scale quantum (NISQ) era [178]. Given their purpose-built nature,

analog simulators are limited to simpler interactions, however, they are also more easily scalable

to large system size [185]. Finally, with analog simulators, we may be able to perform qualitative

experiments to determine, e.g. if a given model has a phase transition, even before we are able to

perform precision measurements.

Schematically, the general approach for an analog simulation is as follows:

1. Map the Hamiltonian of the system of interest 𝐻𝑠𝑦𝑠 directly onto the Hamiltonian 𝐻𝑠𝑖𝑚 of a

simpler and controllable simulator. This assumes that you can find a simulator and a mapping

for which this is possible. This may turn out to be too difficult, so instead of studying 𝐻𝑠𝑦𝑠,

you may need to study a simpler effective many-body model of the system of interest. Often

the mapping will require some cleverness [186]

2. Prepare the state to be measured. In the simplest case, this simply means to let the simulator

185

settle into equilibrium

3. Measure the state. In the simplest case, measuring the simulator in its prepared state will

yield information about the system of interest

4. Repeat the previous two steps many times to get a statistical average of the observables of

interest

The most common technologies proposed for the analog quantum simulation of quantum field

theories include optical lattices, trapped ion chains, and superconducting circuits. In an optical

lattice, overlapping laser beams are used to create interference patterns, which can be used to trap

atoms via the optical dipole force in the regions where destructive interference occurs. Atoms

can hop from site to site and atoms that meet at the same site interact with each other. Various

lattice geometries can be realized by superimposing laser fields at different angles. The optical

lattice can be tuned by adjusting intensity, frequency, and phase of the lasers. The atoms used in

an optical lattice would typically be neutral alkaline atoms since they have only a single valence

electron—simplifying things. An individual atom in the lattice can be addressed and manipulated,

for example, flipping its spin via a focused laser beam. In the end, the lattice configuration can

be measured, for example, by fluorescence. For excellent review articles that include information

on analog simulations with optical lattices, see [80, 86, 87, 185–190]. In a trapped ion chain, cold

ions are contained in electromagnetic traps. A qubit can be encoded in two internal states of the

ion and manipulated by laser pulses and measured via fluorescence. For excellent review articles

that include information on using trapped ions to perform analog quantum simulations of LGTs,

see [80, 84, 186, 187, 189, 191]. In superconducting circuits, qubits are constructed from circuit

elements and Josephson junctions. Qubits can be coupled via resonators and manipulated using

microwave radiation. Superconducting circuits are very tunable and seem promising due to their

apparent scalability—they can be fabricated using the existing technology of integrated circuits.

For excellent review articles that include information on using superconducting circuits to perform

analog quantum simulations of LGTs, see [80, 186, 187, 189, 192].

186

Analog quantum simulation has been around for a while in the condensed matter field. In [193],

it was shown that an ultracold dilute gas of bosonic atoms in an optical lattice could be described by

a Bose-Hubbard model. This model describes “the hopping of bosonic atoms between the lowest

vibrational states of the optical lattice sites”. The Bose-Hubbard Hamiltonian is

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

𝑏†
𝑖 𝑏 𝑗 +

𝜖𝑖 ˆ𝑛𝑖 +

∑︁

𝑖

1
2

𝑈 ∑︁
𝑖

ˆ𝑛𝑖 ( ˆ𝑛𝑖 − 1),

(5.54)

where 𝑏𝑖 and 𝑏†

𝑖 𝑏𝑖
counts the number of bosonic atoms at lattice site 𝑖. The parameter 𝑈 gives the strength of the

𝑖 are the annihilation and creation operators, and the number operator ˆ𝑛𝑖 = 𝑏†

repulsion of two atoms on a lattice site, 𝐽 is the hopping matrix element between adjacent sites, and

𝜖𝑖 describes the energy offset of each lattice site. In [194], this was experimentally implemented by

loading Rb-87 atoms into a three-dimensional optical lattice. The optical lattice potential had the

form

𝑉 (𝑥, 𝑦, 𝑧) = 𝑉0 (cid:2)sin2(𝑘𝑥) + sin2(𝑘 𝑦) + sin2(𝑘 𝑧)(cid:3) .

(5.55)

In each individual well (or trap), i.e. at each lattice site, the confining potential for an atom can be

approximated by a harmonic potential. In their setup, there are ∼ 65 lattice sites in each dimension

with an average atom number of up to 2.5 atoms per lattice site near the center of the 3D lattice.

The researchers were able to observe and study the quantum phase transition from superfluid to

Mott insulator. Simulating the Bose-Hubbard model on an optical lattice is straightforward since

the Bose-Hubbard Hamiltonian can be directly related to the Hamiltonian of interacting atoms on a

lattice. However, for other models, it is generally not that straightforward, and work must be done

to relate a Hamiltonian of interest to a Hamiltonian that is simple enough to simulate on an optical

lattice but not so simple that it becomes a useless analogue.

The quantum simulation of gauge theories on analog simulators is significantly more challenging

than for condensed matter systems such as the Bose-Hubbard model. This is largely because the

gauge theories of high energy physics realize certain symmetries that are usually not naturally

realized in an analog simulator.

In order to simulate a gauge theory, there are two particularly

187

problematic requirements; simulator must be able to include both fermionic and bosonic degrees

of freedom, and it must have local gauge invariance [86].

Because the gauge theories of interest to us contain both fermionic and gauge fields, optical

lattices seem attractive because they can hold both fermionic and bosonic atoms. However, there

remains a complication.

In Wilson’s formulation of lattice gauge theory, the gauge fields are

defined on the links of the lattice. These continuous gauge degrees of freedom give rise to an

infinite-dimensional Hilbert space at each link, and it is not obvious how to represent such objects

on an analog simulator. One solution is to truncate the link Hilbert space in some way such that

the model with truncated gauge fields still approximates the full theory. For example, in a theory

with 𝑈 (1) gauge fields, the fields may be approximated with Z𝑞 [195]. To optimize such a Z𝑞

approximation, it is useful to have a continuous family of models that interpolate among the various

possibilities. Doing this was one of the original motivations for our study of the Extended-𝑂 (2)

model [127–129], which resulted in this thesis. Similarly, 𝑆𝑈 (𝑁) gauge fields may be “digitized”

in some way to represent them with discrete subgroups. This has been studied for both 𝑆𝑈 (2) [196]

and 𝑆𝑈 (3) [197–199].

Several strategies have been proposed to include local gauge invariance in an analog simulation.

One possibility is to add a penalty term to the Hamiltonian that suppresses symmetry-violating

transitions. This is straightforward for Abelian models, but not for non-Abelian models. Another

option is to exploit microscopic intrinsic symmetries of the atomic matter and combine with spatially

arranged potentials to engineer gauge invariance up to high energy. See reviews such as [187, 189].

5.5.3 Digital Simulation

For a quantum simulation to be digital, the only requirement is that the time evolution proceeds

stroboscopically. In this sense, digital quantum simulation can be done on special-purpose machines

such as optical lattices of Rydberg atoms or on trapped ion chains. Typically, however, when

referring to “digital” simulation, a narrower definition is being used in that the simulation is being

performed on a universal gate-based quantum computer. This is a more specific kind of quantum

device in which the time evolution is discretized and implemented by the application of quantum

188

logic gates, for example [200–207]. This is the approach that we focus on here. For review articles

on digital quantum simulation, see [80, 87, 186, 187, 208].

In general, the digital simulation of a quantum field theory involves three steps:

1. Initial state preparation: The first step is to initialize the set of qubits to the state |𝜓(0)⟩.

This may be challenging, and finding an efficient algorithm may not be possible. If |𝜓(0)⟩

is the ground state of the theory, one approach called “adiabatic state preparation” is to start

the system in the ground state of the simpler non-interacting theory and then to adiabatically

turn on the interactions. The author and others recently studied a variety of state preparation

methods for the Schwinger model [209, 210].

2. Unitary evolution: The initial state is time-evolved to the final state

|𝜓(𝑡)⟩ = 𝑈 (𝑡) |𝜓(0)⟩ ,

𝑈 (𝑡) = 𝑒−𝑖ℏ𝐻𝑡,

by applying the unitary operator 𝑈 to the initial state. The challenge here is to write 𝑈 as a

sequence of quantum gate operations. This may be achieved by writing 𝐻 in terms of Pauli

matrices, which we know how to implement via quantum gates. Then by discretizing time 𝑡

in a process known as Trotterization, one may approximate 𝑈 as a product of quantum gate

operations

𝑛
More generally, any unitary operation can be quantum simulated—the challenge is to do it

𝑈 (𝑡) ≈

(cid:214)

𝑒−𝑖ℏ𝐻𝑛𝛿𝑡 .

efficiently.

3. Measurement: Once the system is in the desired final state, it is measured. One possibility

is to use quantum state tomography. However, this approach requires a lot of resources, and

more direct alternatives are preferred.

In practice, the three steps above give a single data point since the act of measuring a quantum state

will also destroy it. Therefore, the three steps are repeated many times to obtain a statistical mean

and error bar.

189

The main challenge in implementing digital quantum simulations is the same as the main

challenge facing universal quantum computation—error correction and its need for an order of

magnitude more qubits. In the NISQ era, there may be little that we can accomplish unless we can

significantly reduce the need for error correction in our digital simulations. Another challenge is

how to represent fermionic degrees of freedom on a gate-based quantum computer. One approach

is the Jordan-Wigner transformation [211] in which fermionic modes are mapped to strings of Pauli

matrices [212,213]. However, the Jordan-Wigner transformation is a non-local mapping and is very

inefficient in more than 1 + 1 dimensions. An alternative is the Bravyi-Kitaev mapping [214, 215],

which reduces the non-locality that is induced. As in analog simulation, there is also the challenge

of discretizing and truncating the infinite-dimensional Hilbert space of the gauge links. Finally,

gauge invariance, which imposes constraints relating the matter and gauge degrees of freedom at

each lattice site, poses a complication. If these constraints are not satisfied, then one is simulating

a theory that contains many undesired non-physical states. One way to solve two challenges at

once is to use these gauge constraints to integrate out the fermionic or gauge degrees of freedom.

This approach may be used to eliminate the fermionic matter in any dimension or it may be used to

eliminate the gauge fields if one is in a single spatial dimension [216, 217].

190

CHAPTER 6

SUMMARY

Lattice QCD, which is a nonperturbative ab initio approach, is our most powerful tool for extracting

predictions from QCD. However, there are significant limitations.

In particular, lattice QCD is

limited to equilibrium physics at zero baryon density. Because of so-called “sign problems”, lattice

QCD is not a viable means for studying QCD dynamics or QCD at finite density. We need a new

approach. Quantum simulation can, in principle, simulate quantum field theories at finite density

and real time with no sign problem. However, the realization of a quantum simulator for QCD will

require siginifant advancements from both the theory and the hardware sides. We need to start with

toy models—spin models, Abelian-Higgs in 1+1 dimensions, the Schwinger model, SU(2) with

fermions, and so on before eventually reaching full QCD.

Theoretical work in the analog simulation of quantum field theories has shown that the Abelian-

Higgs model (i.e. scalar QED) in 1+1 dimensions can be mapped to a Rydberg ladder [1–4].

In this proposed approach, the Abelian-Higgs Hamiltonian is mapped to an angular momentum

Hamiltonian, which in turn is truncated to a finite number (e.g. 5) of angular momentum states,

which in principle can be simulated on a Rydberg ladder.

In the limit of infinite self-coupling and zero gauge coupling, the Abelian-Higgs model reduces

to the classical O(2) spin model. In this dissertation, we study a class of O(2)-like models. We

find interesting behavior already on very small lattices, so we think this model would be a good

candidate for analog simulation before attempting the simulation of full quantum field theories like

the Abelian-Higgs model.

We studied the effects of adding a symmetry breaking term −ℎ𝑞 (cid:205)𝑥 cos(𝑞𝜑𝑥) to the Hamiltonian

of the ordinary classical O(2) model. This adds two continuously tunable parameters which allows

us to explore the effect of symmetry-breaking in O(2)-like models. There is some previous literature

on this model, however, it is generally limited to specific integer values of 𝑞 and specific values of

ℎ𝑞. In this work, we have studied a much larger parameter space, and unlike previous studies, we

considered also noninteger values of 𝑞.

191

When 𝑞 is integer and ℎ𝑞 > 0, the O(2) symmetry is reduced to a Z𝑞 symmetry. In the ℎ𝑞 → ∞

limit, the model reduces to the ordinary 𝑞-state clock model, which for 𝑞 = 2, 3, 4 has a second-

order phase transition and for 𝑞 ≥ 5 has two BKT transitions. In the limit 𝑞 → ∞, the ordinary

O(2) model is recovered. For 𝑞 ≥ 5, the pair of BKT transitions seems to persist to finite ℎ𝑞 > 0.

When ℎ𝑞 → 0, the small-𝛽 transition connects to the BKT transition of the 𝑂 (2) model, whereas

the large-𝛽 transition goes to a different finite value of 𝛽. For 𝑞 = 2, 3, 4, the phase diagram changes

more significantly as the symmetry-breaking parameter ℎ𝑞 is changed. At intermediate ℎ𝑞, there

is still a second-order phase transition in each case, however, the critical exponents vary with ℎ𝑞,

and so the universality class is not the same as in the ℎ𝑞 = ∞ case. For sufficiently small ℎ𝑞 > 0,

there is evidence that the phase transitions become BKT instead of second-order. If true, this is

the only example, that we are aware of, in which one can vary a smooth parameter to go from a

second-order to a BKT transition without breaking a symmetry or introducing a new symmetry to

the model. However, the evidence is not conclusive and future studies on larger lattices should be

done to confirm this.

For noninteger 𝑞, early results suggested smooth “lobes” in the phase diagram reminiscent of

the phase diagram seen in Rydberg atom chains (see Fig. 4.42). However, this initial picture did not

hold up under scrutiny. In the ℎ𝑞 → ∞ limit, we find that for noninteger 𝑞, there is a crossover at

small 𝛽 and an Ising phase transition at larger 𝛽. In the phase diagram, there is a smooth Ising phase

transition line that terminates at the next larger integer, such that the phase diagram looks nearly

periodic from integer to integer, but with an abrupt discontinuity at each integer. For noninteger 𝑞,

this picture seems to persist to all finite ℎ𝑞 > 0. This model vividly illustrates the strong influence

that symmetries exert in these classes of models and even a barely broken symmetry results in

completely different behavior. Thus, in an analog simulation, it may be difficult to keep the system

in the desired universality class unless the symmetries are somehow protected.

In summary, we found that adding symmetry breaking term −ℎ𝑞 (cid:205)𝑥 cos(𝑞𝜑𝑥) to the ordinary

classical O(2) model results in a rich phase diagram, illustrated in Fig. 4.66, containing second-

order phase transitions of various universality classes and BKT transitions. This extended-O(2)

192

model can serve as a playground in which to explore these different kinds of phase transitions

and their relationships. The model interpolates between different Z𝑞 models via the continuously

tunable parameter 𝑞, and so it may be useful for optimizing the various Z𝑞 approximations of 𝑈 (1)

in quantum simulations of Abelian gauge theories.

193

BIBLIOGRAPHY

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

A. Bazavov, Y. Meurice, S.-W. Tsai, J. Unmuth-Yockey, and J. Zhang, “Gauge-invariant
implementation of the Abelian-Higgs model on optical lattices,” Phys. Rev. D 92, 076003
(2015).

J. Zhang, J. Unmuth-Yockey, J. Zeiher, A. Bazavov, S.-W. Tsai, and Y. Meurice, “Quantum
Simulation of the Universal Features of the Polyakov Loop,” Phys. Rev. Lett. 121, 223201
(2018).

Y. Meurice, “Examples of symmetry-preserving truncations in tensor field theory,” Phys.
Rev. D 100, 1, 014506 (2019), arXiv:1903.01918.

Y. Meurice, “Theoretical methods to design and test quantum simulators for the compact
Abelian Higgs model,” Phys. Rev. D 104, 094513 (2021).

B. A. Berg, Markov Chain Monte Carlo Simulations and Their Statistical Analysis: With
Web-Based Fortran Code, World Scientific (2004), ISBN 978-9812389350.

E. W. Montroll, R. B. Potts, and J. C. Ward, “Correlations and Spontaneous Magnetization of
the Two-Dimensional Ising Model,” Journal of Mathematical Physics 4, 2, 308–322 (2004).

C. N. Yang, “The Spontaneous Magnetization of a Two-Dimensional Ising Model,” Phys.
Rev. 85, 808–816 (1952).

R. B. Griffiths, “Peierls Proof of Spontaneous Magnetization in a Two-Dimensional Ising
Ferromagnet,” Phys. Rev. 136, A437–A439 (1964).

K. Huang, Statistical mechanics, Edition 2, John Wiley & Sons (1987), ISBN 978-
0471815181.

[10] R. Peierls, “On Ising’s model of ferromagnetism,” Mathematical Proceedings of the Cam-

bridge Philosophical Society 32, 3, 477–481 (1936).

[11] P. Fendley, Modern Statistical Mechanics (book in progress) URL: https://users.ox.ac.uk/

~phys1116/book.html.

[12] R. J. Baxter, “Potts model at the critical temperature,” Journal of Physics C: Solid State

Physics 6, 23, L445 (1973).

[13] Blatt, M and Wiseman, S. and Domany, E., “Clustering data through an analogy to the Potts
model,” Advances in Neural Information Processing Systems 8 (NIPS 1995) (1995).

[14] S. J. K. Jensen and O. G. Mouritsen, “Is the Phase Transition of the Three-State Potts Model

Continuous in Three Dimensions?” Phys. Rev. Lett. 43, 1736–1739 (1979).

[15]

J. M. Kosterlitz, “The critical properties of the two-dimensional xy model,” Journal of
Physics C: Solid State Physics 7, 6, 1046 (1974).

[16] R. G. Edwards, J. Goodman, and A. D. Sokal, “Multi-grid Monte Carlo (II). Two-dimensional

XY model,” Nuclear Physics B 354, 2, 289–327 (1991).

194

[17]

J. F. Fernández, M. F. Ferreira, and J. Stankiewicz, “Critical behavior of the two-dimensional
XY model: A Monte Carlo simulation,” Phys. Rev. B 34, 292–300 (1986).

[18] R. Gupta and C. F. Baillie, “Critical behavior of the two-dimensional XY model,” Phys. Rev.

B 45, 2883–2898 (1992).

[19] R. Gupta, J. DeLapp, G. G. Batrouni, G. C. Fox, C. F. Baillie, and J. Apostolakis, “Phase

Transition in the 2D XY Model,” Phys. Rev. Lett. 61, 1996–1999 (1988).

[20] M. Hasenbusch, “The two-dimensional XY model at the transition temperature: a high-
precision Monte Carlo study,” Journal of Physics A: Mathematical and General 38, 26, 5869
(2005).

[21] Y. Komura and Y. Okabe, “Large-Scale Monte Carlo Simulation of Two-Dimensional Clas-
sical XY Model Using Multiple GPUs,” Journal of the Physical Society of Japan 81, 11,
113001 (2012).

[22] G. Palma, T. Meyer, and R. Labbé, “Finite size scaling in the two-dimensional 𝑋𝑌 model

and generalized universality,” Phys. Rev. E 66, 026108 (2002).

[23] H. Weber and P. Minnhagen, “Monte Carlo determination of the critical temperature for the

two-dimensional XY model,” Phys. Rev. B 37, 5986–5989 (1988).

[24]

J. F. Yu, Z. Y. Xie, Y. Meurice, Y. Liu, A. Denbleyker, H. Zou, M. P. Qin, J. Chen, and
T. Xiang, “Tensor renormalization group study of classical 𝑋𝑌 model on the square lattice,”
Phys. Rev. E 89, 013308 (2014).

[25] G. Ortiz, E. Cobanera, and Z. Nussinov, “Dualities and the phase diagram of the p-clock

model,” Nuclear Physics B 854, 3, 780–814 (2012).

[26] P. Archambault, S. T. Bramwell, J.-Y. Fortin, P. C. W. Holdsworth, S. Peysson, and J.-F.
Pinton, “Universal magnetic fluctuations in the two-dimensional XY model,” Journal of
Applied Physics 83, 11, 7234–7236 (1998).

[27] P. Archambault, S. T. Bramwell, and P. C. W. Holdsworth, “Magnetic fluctuations in a finite
two-dimensional model,” Journal of Physics A: Mathematical and General 30, 24, 8363
(1997).

[28] S. T. Bramwell and P. C. W. Holdsworth, “Magnetization and universal sub-critical behaviour
in two-dimensional XY magnets,” Journal of Physics: Condensed Matter 5, 4, L53 (1993).

[29] S. T. Bramwell and P. C. W. Holdsworth, “Magnetization: A characteristic of the Kosterlitz-

Thouless-Berezinskii transition,” Phys. Rev. B 49, 8811–8814 (1994).

[30] L. Berthier, P. C. W. Holdsworth, and M. Sellitto, “Nonequilibrium critical dynamics of the
two-dimensional XY model,” Journal of Physics A: Mathematical and General 34, 9, 1805
(2001).

[31] C. Kawabata and K. Binder, “Evidence for vortex formation in Monte Carlo studies of the
two-dimensional XY-model,” Solid State Communications 22, 11, 705–710 (1977).

195

[32] R. Loft and T. A. DeGrand, “Numerical simulation of dynamics in the XY model,” Phys.

Rev. B 35, 8528–8541 (1987).

[33] R. Kenna, “The XY Model and the Berezinskii-Kosterlitz-Thouless Phase Transition,”

(2005), arXiv:cond-mat/0512356.

[34] P. Olsson, “Effective vortex interaction in the two-dimensional XY model,” Phys. Rev. B 46,

14598–14616 (1992).

[35]

J. Tobochnik and G. V. Chester, “Monte Carlo study of the planar spin model,” Phys. Rev. B
20, 3761–3769 (1979).

[36] V. L. Berezinskii, “Destruction of Long-range Order in One-Dimensional and Two-
Dimensional Systems Having a Continuous Symmetry Group I. Classical Sys-
tems,” Sov. Phys.
493 (1971) URL: https://inspirehep.net/files/
3,
f55503250f690969aedfda4ceaf9b4f9.

JETP 32,

[37]

J. M. Kosterlitz and D. J. Thouless, “Ordering, metastability and phase transitions in two-
dimensional systems,” Journal of Physics C: Solid State Physics 6, 7, 1181–1203 (1973).

[38] M. B. Einhorn, R. Savit, and E. Rabinovici, “A physical picture for the phase transitions in

ZN symmetric models,” Nuclear Physics B 170, 1, 16–31 (1980).

[39] R. G. Jha, “Critical analysis of two-dimensional classical XY model,” Journal of Statistical

Mechanics: Theory and Experiment 2020, 8, 083203 (2020).

[40] E. Rastelli, S. Regina, and A. Tassi, “Monte Carlo simulation of a planar rotator model with

symmetry-breaking fields,” Phys. Rev. B 69, 174407 (2004).

[41] E. Rastelli, S. Regina, and A. Tassi, “Monte Carlo simulation for square planar model with

a small fourfold symmetry-breaking field,” Phys. Rev. B 70, 174447 (2004).

[42] D. D. Betts, “The Exact Solution of Some Lattice Statistics Models with Four States Per

Site,” Canadian Journal of Physics 42, 8, 1564–1572 (1964).

[43] S. Elitzur, R. B. Pearson, and J. Shigemitsu, “Phase structure of discrete Abelian spin and

gauge systems,” Phys. Rev. D 19, 3698–3714 (1979).

[44] S. K. Baek and P. Minnhagen, “Non-Kosterlitz-Thouless transitions for the 𝑞-state clock

models,” Phys. Rev. E 82, 031102 (2010).

[45] S. Chatterjee, S. Puri, and R. Paul, “Ordering kinetics in the 𝑞-state clock model: Scaling

properties and growth laws,” Phys. Rev. E 98, 032109 (2018).

[46] M. S. S. Challa and D. P. Landau, “Phase transitions in the six-state vector Potts model in

two dimensions,” Journal of Applied Physics 55, 6, 2429–2431 (1984).

[47]

J. Tobochnik, “Properties of the 𝑞-state clock model for 𝑞 = 4, 5, and 6,” Phys. Rev. B 26,
6201–6207 (1982).

196

[48] Y. Tomita and Y. Okabe, “Probability-changing cluster algorithm for two-dimensional 𝑋𝑌

and clock models,” Phys. Rev. B 65, 184405 (2002).

[49]

J. Chen, H.-J. Liao, H.-D. Xie, X.-J. Han, R.-Z. Huang, S. Cheng, Z.-C. Wei, Z.-Y. Xie, and
T. Xiang, “Phase Transition of the q-State Clock Model: Duality and Tensor Renormaliza-
tion*,” Chinese Physics Letters 34, 5, 050503 (2017).

[50] Y. Chen, Z.-Y. Xie, and J.-F. Yu, “Phase transitions of the five-state clock model on the

square lattice,” Chinese Physics B 27, 8, 080503 (2018).

[51] Z.-Q. Li, L.-P. Yang, Z. Y. Xie, H.-H. Tu, H.-J. Liao, and T. Xiang, “Critical properties of

the two-dimensional 𝑞-state clock model,” Phys. Rev. E 101, 060105 (2020).

[52] C. M. Lapilli, P. Pfeifer, and C. Wexler, “Universality Away from Critical Points in Two-

Dimensional Phase Transitions,” Phys. Rev. Lett. 96, 140603 (2006).

[53] O. Borisenko, V. Chelnokov, G. Cortese, R. Fiore, M. Gravina, and A. Papa, “Phase transi-
tions in two-dimensional 𝑍 (𝑁) vector models for 𝑁 > 4,” Phys. Rev. E 85, 021114 (2012).

[54] O. Borisenko, G. Cortese, R. Fiore, M. Gravina, and A. Papa, “Numerical study of the phase
transitions in the two-dimensional Z(5) vector model,” Phys. Rev. E 83, 041120 (2011).

[55] S. K. Baek, P. Minnhagen, and B. J. Kim, “True and quasi-long-range order in the generalized

𝑞-state clock model,” Phys. Rev. E 80, 060101 (2009).

[56] F. Corberi, E. Lippiello, and M. Zannetti, “Scaling and universality in the aging kinetics of

the two-dimensional clock model,” Phys. Rev. E 74, 041106 (2006).

[57] Y. Leroyer and K. Rouidi, “Dynamical phase diagram of the two-dimensional p-state clock

model,” Journal of Physics A: Mathematical and General 24, 8, 1931 (1991).

[58] M. Suzuki, “Solution of Potts Model for Phase Transition,” Progress of Theoretical Physics

37, 4, 770–772 (1967).

[59] R. Pathria and P. Beale, Statistical Mechanics, Edition 3, Elsevier Science (2011), ISBN

9780081026939.

[60] H. J. Rothe, Lattice gauge theories: an introduction, Edition 3, World Scientific Publishing

Company (2012).

[61] N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller, “Equation
of State Calculations by Fast Computing Machines,” The Journal of Chemical Physics 21, 6,
1087–1092 (1953).

[62] A. Bazavov, “Biased Metropolis-Heatbath Algorithms for Lattice Gauge Theory,” PoS 110

(2005).

[63] A. Bazavov and B. A. Berg, “Heat bath efficiency with a Metropolis-type updating,” Phys.

Rev. D 71, 114506 (2005).

197

[64] A. Bazavov, B. A. Berg, and U. M. Heller, “Biased Metropolis-heat-bath algorithm for
fundamental-adjoint SU(2) lattice gauge theory,” Phys. Rev. D 72, 117501 (2005).

[65] A. Bazavov, B. A. Berg, and H.-X. Zhou, “Application of Biased Metropolis Algorithms:
From protons to proteins,” Mathematics and Computers in Simulation 80, 6, 1056–1067
(2010), fifth IMACS Seminar on Monte Carlo Methods Applications of Computer Algebra
2007 (ACA 2007) special session on Nonstandard Applications of Computer Algebra Com-
putational Biomechanics and Biology, a collection of papers presented at the 1st IMACS
International Conference on the Computational Biomechanics and Biology ICCBB 2007.

[66] C. Gattringer and C. B. Lang, Quantum Chromodynamics on the Lattice: An Introductory

Presentation, Springer (2010), ISBN 978-3642018497.

[67] D. Landau and K. Binder, A guide to Monte Carlo simulations in statistical physics, Cam-

bridge university press (2021).

[68] A. M. Ferrenberg, D. P. Landau, and R. H. Swendsen, “Statistical errors in histogram

reweighting,” Phys. Rev. E 51, 5092–5100 (1995).

[69] A. M. Ferrenberg and R. H. Swendsen, “Optimized Monte Carlo data analysis,” Computers

in Physics 3, 5, 101–104 (1989).

[70] A. M. Ferrenberg and R. H. Swendsen, “New Monte Carlo technique for studying phase

transitions,” Phys. Rev. Lett. 61, 2635–2638 (1988).

[71] W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, Numerical recipes in C,

Edition 2, Cambridge university press Cambridge (1992), ISBN 0-521-43108-5.

[72] K. Levenberg, “A method for the solution of certain non-linear problems in least squares,”

Quarterly of applied mathematics 2, 2, 164–168 (1944).

[73] D. W. Marquardt, “An Algorithm for Least-Squares Estimation of Nonlinear Parameters,”
Journal of the Society for Industrial and Applied Mathematics 11, 2, 431–441 (1963).

[74] M. Bruno and R. Sommer, “On fits to correlated and auto-correlated data,” Computer Physics

Communications 285, 108643 (2023).

[75] N. C. Bartelt and T. L. Einstein, “Finite-size effects on the critical structure factor of the
two-dimensional Ising model,” Journal of Physics A: Mathematical and General 19, 8, 1429
(1986).

[76] M. Droz and A. Malaspinas, “Computation of structure factors by finite-size scaling and
transfer matrix methods,” Journal of Physics C: Solid State Physics 16, 8, L231 (1983).

[77] M. Weigel and W. Janke, “Error estimation and reduction with cross correlations,” Phys.

Rev. E 81, 066701 (2010).

[78] M. S. S. Challa and D. P. Landau, “Critical behavior of the six-state clock model in two

dimensions,” Phys. Rev. B 33, 437–443 (1986).

198

[79]

J. Preskill, “Simulating quantum field theory with a quantum computer,” PoS LAT-
TICE2018, 024 (2018), arXiv:1811.10085.

[80] M. C. Bañuls, R. Blatt, J. Catani, A. Celi, J. I. Cirac, M. Dalmonte, L. Fallani, K. Jansen,
M. Lewenstein, S. Montangero, C. A. Muschik, B. Reznik, E. Rico, L. Tagliacozzo,
K. Van Acoleyen, F. Verstraete, U.-J. Wiese, M. Wingate, J. Zakrzewski, and P. Zoller,
“Simulating lattice gauge theories within quantum technologies,” The European Physical
Journal D 74, 8, 165 (2020).

[81] Y. Meurice, R. Sakai, and J. Unmuth-Yockey, “Tensor lattice field theory for renormalization
and quantum computing,” Rev. Mod. Phys. 94, 025005 (2022), arXiv:2010.06539v2.

[82] A. Ciavarella, N. Klco, and M. J. Savage, “A Trailhead for Quantum Simulation of SU(3)
Yang-Mills Lattice Gauge Theory in the Local Multiplet Basis,” Phys. Rev. D 103, 9, 094501
(2021), arXiv:2101.10227.

[83] Z. Davoudi, I. Raychowdhury, and A. Shaw, “Search for efficient formulations for Hamilto-
nian simulation of non-Abelian lattice gauge theories,” Phys. Rev. D 104, 074505 (2021),
arXiv:2009.11802.

[84] Z. Davoudi, M. Hafezi, C. Monroe, G. Pagano, A. Seif, and A. Shaw, “Towards analog
quantum simulations of lattice gauge theories with trapped ions,” Phys. Rev. Res. 2, 2,
023015 (2020), arXiv:1908.03210.

[85]

J. Bender, E. Zohar, A. Farace, and J. I. Cirac, “Digital quantum simulation of lattice
gauge theories in three spatial dimensions,” New Journal of Physics 20, 9, 093001 (2018),
arXiv:1804.02082.

[86] E. Zohar, J. I. Cirac, and B. Reznik, “Quantum simulations of lattice gauge theories using

ultracold atoms in optical lattices,” Reports on Progress in Physics 79, 1, 014401 (2015).

[87] U.-J. Wiese, “Ultracold quantum gases and lattice systems: quantum simulation of lattice

gauge theories,” Annalen der Physik 525, 10-11, 777–796 (2013).

[88] H. Singh and S. Chandrasekharan, “Qubit regularization of the 𝑂 (3) sigma model,” Phys.

Rev. D 100, 054505 (2019).

[89] H. Lamm and S. Lawrence, “Simulation of Nonequilibrium Dynamics on a Quantum Com-

puter,” Phys. Rev. Lett. 121, 17, 170501 (2018), arXiv:1806.06649.

[90] N. Klco, E. F. Dumitrescu, A. J. McCaskey, T. D. Morris, R. C. Pooser, M. Sanz, E. Solano,
P. Lougovski, and M. J. Savage, “Quantum-classical computation of Schwinger model
dynamics using quantum computers,” Phys. Rev. A 98, 032331 (2018).

[91] D. C. Hackett, K. Howe, C. Hughes, W. Jay, E. T. Neil, and J. N. Simone, “Digitizing Gauge
Fields: Lattice Monte Carlo Results for Future Quantum Computers,” Phys. Rev. A99, 6,
062341 (2019), arXiv:1811.03629.

[92] L. Tagliacozzo, A. Celi, and M. Lewenstein, “Tensor Networks for Lattice Gauge Theories

with Continuous Groups,” Phys. Rev. X 4, 041024 (2014).

199

[93] Y. Liu, Y. Meurice, M. P. Qin, J. Unmuth-Yockey, T. Xiang, Z. Y. Xie, J. F. Yu, and H. Zou,
“Exact Blocking Formulas for Spin and Gauge Models,” Phys. Rev. D88, 056005 (2013),
arXiv:1307.6543.

[94] S. Chandrasekharan and U. J. Wiese, “Quantum link models: A discrete approach to gauge

theories,” Nuclear Physics B 492, 1, 455–471 (1997), arXiv:hep-lat/9609042.

[95] R. Brower, S. Chandrasekharan, and U. J. Wiese, “QCD as a quantum link model,” Phys.

Rev. D60, 094502 (1999), arXiv:hep-th/9704106.

[96]

J. Schwinger, “Gauge Invariance and Mass. II,” Phys. Rev. 128, 2425–2429 (1962).

[97] A. M. Polyakov, “Compact Gauge Fields and the Infrared Catastrophe,” Phys. Lett. B 59,

82–84 (1975).

[98] Y. Meurice, “Discrete aspects of continuous symmetries in the tensorial formulation of

Abelian gauge theories,” Phys. Rev. D 102, 1, 014506 (2020), arXiv:2003.10986.

[99]

J. F. Haase, L. Dellantonio, A. Celi, D. Paulson, A. Kan, K. Jansen, and C. A. Muschik,
“A resource efficient approach for quantum and classical simulations of gauge theories in
particle physics,” Quantum 5, 393 (2021).

[100] J. B. Kogut, “An introduction to lattice gauge theory and spin systems,” Rev. Mod. Phys. 51,

659–713 (1979).

[101] P. Ruján, G. O. Williams, H. L. Frisch, and G. Forgács, “Phase diagrams of two-dimensional

𝑍 (𝑞) models,” Phys. Rev. B 23, 1362–1370 (1981).

[102] K. Kaski and J. D. Gunton, “Universal dynamical scaling in the clock model,” Phys. Rev. B

28, 5371–5373 (1983).

[103] C.-O. Hwang, “Six-state clock model on the square lattice: Fisher zero approach with

Wang-Landau sampling,” Phys. Rev. E 80, 042103 (2009).

[104] T. Surungan, S. Masuda, Y. Komura, and Y. Okabe, “Berezinskii-Kosterlitz-Thouless transi-
tion on regular and Villain types of q-state clock models,” Journal of Physics A: Mathematical
and Theoretical 52, 27, 275002 (2019).

[105] J. Zhang, J. Unmuth-Yockey, J. Zeiher, A. Bazavov, S. W. Tsai, and Y. Meurice, “Quantum
simulation of the universal features of the Polyakov loop,” Phys. Rev. Lett. 121, 22, 223201
(2018), arXiv:1803.11166.

[106] H. Bernien, S. Schwartz, A. Keesling, H. Levine, A. Omran, H. Pichler, S. Choi, A. S.
Zibrov, M. Endres, M. Greiner, V. Vuletić, and M. D. Lukin, “Probing many-body dynamics
on a 51-atom quantum simulator,” Nature 551, 7682, 579–584 (2017), arXiv:1707.04344.

[107] A. Keesling et al., “Quantum Kibble–Zurek mechanism and critical dynamics on a pro-
grammable Rydberg simulator,” Nature 568, 7751, 207–211 (2019), arXiv:1809.05540.

200

[108] R. D. Pisarski, “Remarks on nuclear matter: How an 𝜔0 condensate can spike the speed
of sound, and a model of 𝑍 (3) baryons,” Phys. Rev. D 103, 7, L071504 (2021), arXiv:
2101.05813.

[109] M. E. Fisher, M. N. Barber, and D. Jasnow, “Helicity Modulus, Superfluidity, and Scaling

in Isotropic Systems,” Phys. Rev. A 8, 1111–1124 (1973).

[110] Y. Kumano, K. Hukushima, Y. Tomita, and M. Oshikawa, “Response to a twist in systems
with 𝑍 𝑝 symmetry: The two-dimensional 𝑝-state clock model,” Phys. Rev. B 88, 104427
(2013).

[111] M. Levin and C. P. Nave, “Tensor renormalization group approach to 2D classical lattice

models,” Phys. Rev. Lett. 99, 12, 120601 (2007), arXiv:cond-mat/0611687.

[112] Z. Y. Xie, J. Chen, M. P. Qin, J. W. Zhu, L. P. Yang, and T. Xiang, “Coarse-graining
renormalization by higher-order singular value decomposition,” Phys. Rev. B 86, 4, 045139
(2012), arXiv:1201.1144.

[113] Y. Chen, K. Ji, Z. Y. Xie, and J. F. Yu, “Cross derivative of the Gibbs free energy: A
universal and efficient method for phase transitions in classical spin models,” Phys. Rev. B
101, 165123 (2020).

[114] J. V. José, L. P. Kadanoff, S. Kirkpatrick, and D. R. Nelson, “Renormalization, vortices, and
symmetry-breaking perturbations in the two-dimensional planar model,” Phys. Rev. B 16,
1217–1241 (1977).

[115] T.-L. H. Nguyen and V. T. Ngo, “Study on the critical properties of thin magnetic films using
the clock model,” Advances in Natural Sciences: Nanoscience and Nanotechnology 8, 1,
015013 (2017).

[116] S. T. Bramwell, P. C. W. Holdsworth, and J. Rothman, “Magnetization in Ultrathin Films:.
Critical Exponent 𝛽 for the 2D XY Model with 4-Fold Crystal Fields,” Modern Physics
Letters B 11, 4, 139–148 (1997).

[117] A. Taroni, S. T. Bramwell, and P. C. W. Holdsworth, “Universal window for two-dimensional
critical exponents,” Journal of Physics: Condensed Matter 20, 27, 275233 (2008).

[118] D. Landau, “Non-universal critical behavior in the planar XY-model with fourth order
anisotropy,” Journal of Magnetism and Magnetic Materials 31-34, 1115–1116 (1983).

[119] G. Hu and S. Ying, “Monte Carlo and coarse graining renormalization studies for the XY
model with cubic anisotropy,” Physica A: Statistical Mechanics and its Applications 140, 3,
585–596 (1987).

[120] P. Calabrese and A. Celi, “Critical behavior of the two-dimensional N-component Landau-

Ginzburg Hamiltonian with cubic anisotropy,” Phys. Rev. B 66, 184410 (2002).

[121] A. Chlebicki and P. Jakubczyk, “Criticality of the 𝑂 (2) model with cubic anisotropies from

nonperturbative renormalization,” Phys. Rev. E 100, 052106 (2019).

201

[122] N. Butt, X.-Y. Jin, J. C. Osborn, and Z. H. Saleem, “Moving from continuous to discrete

symmetry in the 2D XY model,” (2022), arXiv:2205.03548.

[123] L. Vanderstraeten, B. Vanhecke, A. M. Läuchli, and F. Verstraete, “Approaching the
Kosterlitz-Thouless transition for the classical 𝑋𝑌 model with tensor networks,” Phys. Rev.
E 100, 062136 (2019).

[124] P. Butera and M. Comi, “Quantitative study of the Kosterlitz-Thouless phase transition in an
XY model of two-dimensional plane rotators: High-temperature expansions to order 𝛽20,”
Phys. Rev. B 47, 11969–11979 (1993).

[125] P. Olsson, “Monte Carlo analysis of the two-dimensional XY model. II. Comparison with
the Kosterlitz renormalization-group equations,” Phys. Rev. B 52, 4526–4535 (1995).

[126] H. Arisue, “High-temperature expansion of the magnetic susceptibility and higher moments
of the correlation function for the two-dimensional 𝑋𝑌 model,” Phys. Rev. E 79, 011107
(2009).

[127] L. Hostetler, J. Zhang, R. Sakai, J. Unmuth-Yockey, A. Bazavov, and Y. Meurice, “Clock
model interpolation and symmetry breaking in O(2) models,” Phys. Rev. D 104, 054505
(2021).

[128] L. Hostetler, J. Zhang, R. Sakai, J. Unmuth-Yockey, A. Bazavov, and Y. Meurice, “Clock
model interpolation and symmetry breaking in O(2) models,” PoS(LATTICE2021)353
(2021), arXiv:2110.05527.

[129] L. Hostetler, R. Sakai, J. Zhang, J. Unmuth-Yockey, A. Bazavov, and Y. Meurice, “Symmetry
Breaking in an Extended-O(2) Model,” PoS(LATTICE2022)014 (2022), arXiv:2212.06893.

[130] S. Morita and N. Kawashima, “Calculation of higher-order moments by higher-order tensor

renormalization group,” Computer Physics Communications 236, 65–71 (2019).

[131] S. Akiyama, Y. Kuramashi, T. Yamashita, and Y. Yoshimura, “Phase transition of four-
dimensional Ising model with higher-order tensor renormalization group,” Phys. Rev. D 100,
054510 (2019).

[132] M. Fishman, S. White, and E. Stoudenmire, “The ITensor Software Library for Tensor

Network Calculations,” SciPost Physics Codebases (2022), arXiv:2007.14822.

[133] H. Ueda, K. Okunishi, K. Harada, R. Krčmár, A. Gendiar, S. Yunoki, and T. Nishino, “Finite-
𝑚 scaling analysis of Berezinskii-Kosterlitz-Thouless phase transitions and entanglement
spectrum for the six-state clock model,” Phys. Rev. E 101, 062111 (2020).

[134] Z.-C. Gu and X.-G. Wen, “Tensor-entanglement-filtering renormalization approach and

symmetry-protected topological order,” Phys. Rev. B 80, 155131 (2009).

[135] G. Sun, T. Vekua, E. Cobanera, and G. Ortiz, “Phase transitions in the Z𝑝 and U(1) clock

models,” Phys. Rev. B 100, 094428 (2019).

202

[136] J. Zhang, Y. Meurice, and S.-W. Tsai, “Truncation effects in the charge representation of the

O(2) model,” Phys. Rev. B 103, 245137 (2021).

[137] K. Binder, “Critical Properties from Monte Carlo Coarse Graining and Renormalization,”
Phys. Rev. Lett. 47, 693–696 (1981) URL: https://link.aps.org/doi/10.1103/PhysRevLett.47.
693.

[138] M. Rader and A. M. Läuchli, “Floating Phases in One-Dimensional Rydberg Ising Chains,”

(2019), arXiv:1908.02068.

[139] A. Lazarides, O. Tieleman, and C. Morais Smith, “Pokrovsky-Talapov model at finite
temperature: A renormalization-group analysis,” Phys. Rev. B 80, 245418 (2009) URL:
https://link.aps.org/doi/10.1103/PhysRevB.80.245418.

[140] M. K. Ramazanov and A. K. Murtazaev, “Phase transitions and critical properties in the
antiferromagnetic Heisenberg model on a layered cubic lattice,” JETP Letters 106, 2, 86–91
(2017).

[141] F. Karsch and S. Stickan, “The three-dimensional, three-state Potts model in an external

field,” Physics Letters B 488, 3, 319–325 (2000).

[142] M. S. S. Challa, D. P. Landau, and K. Binder, “Finite-size effects at temperature-driven
first-order transitions,” Phys. Rev. B 34, 1841–1852 (1986) URL: https://link.aps.org/doi/10.
1103/PhysRevB.34.1841.

[143] A. Mailhot, M. L. Plumer, and A. Caillé, “Finite-size scaling of the frustrated Heisenberg

model on a hexagonal lattice,” Phys. Rev. B 50, 6854–6859 (1994).

[144] J. Lee and J. M. Kosterlitz, “Finite-size scaling and Monte Carlo simulations of first-order

phase transitions,” Phys. Rev. B 43, 3265–3277 (1991).

[145] D. A. Matoz-Fernandez, D. H. Linares, and A. J. Ramirez-Pastor, “Determination of the
critical exponents for the isotropic-nematic phase transition in a system of long rods on
two-dimensional lattices: Universality of the transition,” Europhysics Letters 82, 5, 50007
(2008).

[146] B. Ahrens and A. K. Hartmann, “Critical behavior of the random-field Ising model at and

beyond the upper critical dimension,” Phys. Rev. B 83, 014205 (2011).

[147] S. Biswas, A. Kundu, and A. K. Chandra, “Dynamical percolation transition in the Ising

model studied using a pulsed magnetic field,” Phys. Rev. E 83, 021109 (2011).

[148] B. Ahrens, J. Xiao, A. K. Hartmann, and H. G. Katzgraber, “Diluted antiferromagnets in a
field seem to be in a different universality class than the random-field Ising model,” Phys.
Rev. B 88, 174408 (2013).

[149] Z. Merdan and R. Erdem, “The finite-size scaling study of the specific heat and the Binder
parameter for the six-dimensional Ising model,” Physics Letters A 330, 6, 403–407 (2004).

203

[150] K. S. D. Beach, L. Wang, and A. W. Sandvik, “Data collapse in the critical region using

finite-size scaling with subleading corrections,” (2005), arXiv:cond-mat/0505194.

[151] M. Dolfi, J. Gukelberger, A. Hehn, J. Imriška, K. Pakrouski, T. F. Rønnow, M. Troyer,
I. Zintchenko, F. Chirigati, J. Freire, and D. Shasha, “A model project for reproducible papers:
critical temperature for the Ising model on a square lattice,” (2014), arXiv:1401.2000.

[152] J. Houdayer and A. K. Hartmann, “Low-temperature behavior of two-dimensional Gaussian

Ising spin glasses,” Phys. Rev. B 70, 014418 (2004).

[153] R. A. Dumer and M. Godoy, “Ising model on a 2D additive small-world network,” The

European Physical Journal B 95, 9 (2022).

[154] Mon, K. K., “Finite-size scaling of the 5D Ising model,” Europhys. Lett. 34, 6, 399–404

(1996) URL: https://doi.org/10.1209/epl/i1996-00470-4.

[155] P. O. Mari and I. A. Campbell, “Ising spin glasses: Corrections to finite size scaling, freezing

temperatures, and critical exponents,” Phys. Rev. E 59, 2653–2658 (1999).

[156] G. A. Canova, Y. Levin, and J. J. Arenzon, “Kosterlitz-Thouless and Potts transitions in a

generalized 𝑋𝑌 model,” Phys. Rev. E 89, 012126 (2014).

[157] W. Selke, “Critical Binder cumulant of two-dimensional Ising models,” The European
Physical Journal B - Condensed Matter and Complex Systems 51, 2, 223–228 (2006).

[158] D. Loison, “Binder’s cumulant for the Kosterlitz-Thouless transition,” Journal of Physics:

Condensed Matter 11, 34, L401 (1999).

[159] M. Hasenbusch, “The Binder cumulant at the Kosterlitz–Thouless transition,” Journal of

Statistical Mechanics: Theory and Experiment 2008, 08, P08003 (2008).

[160] G. Cortese, Critical properties of 2𝐷 𝑍 (𝑁) vector models for 𝑁 > 4, Phd thesis, Univer-
sitá della Calabria (2011), available at https://dspace.unical.it/bitstream/handle/10955/5380/
thesis.pdf.

[161] M. Klomfass, U. M. Heller, and H. Flyvbjerg, “Interpolating between Ising, XY-, and

non-linear 𝜎-models,” Nuclear Physics B 360, 2, 264–282 (1991).

[162] U. Wolff, “Simulating the all-order strong coupling expansion III: O(N) sigma/loop models,”

Nuclear Physics B 824, 1, 254 – 272 (2010).

[163] F. Bruckmann, C. Gattringer, T. Kloiber, and T. Sulejmanpasic, “Two-dimensional O(3)
model at nonzero density: From dual lattice simulations to repulsive bosons,” Phys. Rev. D
94, 114503 (2016).

[164] D. J. Gross and F. Wilczek, “Ultraviolet Behavior of Non-Abelian Gauge Theories,” Phys.

Rev. Lett. 30, 1343–1346 (1973).

[165] H. D. Politzer, “Reliable Perturbative Results for Strong Interactions?” Phys. Rev. Lett. 30,

1346–1349 (1973).

204

[166] K. G. Wilson, “Confinement of quarks,” Phys. Rev. D 10, 2445–2459 (1974).

[167] B. Anastasiou, Quantum Field Theory II (2018) URL: https://people.phys.ethz.ch/~babis/

Teaching/QFT1/qft2.pdf.

[168] M. Creutz, “Monte Carlo study of quantized SU(2) gauge theory,” Phys. Rev. D 21, 2308–

2315 (1980).

[169] N. Cabibbo and E. Marinari, “A new method for updating SU(N) matrices in computer

simulations of gauge theories,” Physics Letters B 119, 4, 387–390 (1982).

[170] K. Fabricius and O. Haan, “Heat bath method for the twisted Eguchi-Kawai model,” Physics

Letters B 143, 4, 459–462 (1984).

[171] A. Kennedy and B. Pendleton, “Improved heatbath method for Monte Carlo calculations in

lattice gauge theories,” Physics Letters B 156, 5, 393–399 (1985).

[172] S. Duane, A. Kennedy, B. J. Pendleton, and D. Roweth, “Hybrid Monte Carlo,” Physics

Letters B 195, 2, 216–222 (1987).

[173] T. Matthews and A. Salam, “The Green’s functions of quantized fields,” Il Nuovo Cimento

(1955-1965) 12, 563–565 (1954).

[174] T. Matthews and A. Salam, “Propagators of quantized field,” Il Nuovo Cimento (1955-1965)

2, 120–234 (1955).

[175] I. Horváth, A. Kennedy, and S. Sint, “A new exact method for dynamical fermion computa-
tions with non-local actions,” Nuclear Physics B - Proceedings Supplements 73, 1, 834–836
(1999).

[176] M. Clark and A. Kennedy, “The RHMC algorithm for 2 flavours of dynamical staggered
fermions,” Nuclear Physics B - Proceedings Supplements 129-130, 850–852 (2004), lattice
2003.

[177] R. P. Feynman, “Simulating Physics with Computers,” Int J Theor Phys 21, 467–488 (1982).

[178] I. C. Cloët, M. R. Dietrich, J. Arrington, A. Bazavov, M. Bishof, A. Freese, A. V. Gorshkov,
A. Grassellino, K. Hafidi, Z. Jacob, M. McGuigan, Y. Meurice, Z.-E. Meziani, P. Mueller,
C. Muschik, J. Osborn, M. Otten, P. Petreczky, T. Polakovic, A. Poon, R. Pooser, A. Roggero,
M. Saffman, B. VanDevender, J. Zhang, and E. Zohar, “Opportunities for Nuclear Physics
& Quantum Information Science,” (2019), arXiv:1903.05453.

[179] C. W. Bauer, Z. Davoudi, A. B. Balantekin, T. Bhattacharya, M. Carena, W. A. de Jong,
P. Draper, A. El-Khadra, N. Gemelke, M. Hanada, D. Kharzeev, H. Lamm, Y.-Y. Li, J. Liu,
M. Lukin, Y. Meurice, C. Monroe, B. Nachman, G. Pagano, J. Preskill, E. Rinaldi, A. Rog-
gero, D. I. Santiago, M. J. Savage, I. Siddiqi, G. Siopsis, D. V. Zanten, N. Wiebe, Y. Yamauchi,
K. Yeter-Aydeniz, and S. Zorzetti, “Quantum Simulation for High Energy Physics,” (2022),
arXiv:2204.03381.

205

[180] Y. Meurice, Quantum Field Theory, 2053-2563, IOP Publishing (2021), ISBN 978-0-7503-

2187-7.

[181] E. Lewis, “Using electronic circuits to model simple neuroelectric interactions,” Proceedings

of the IEEE 56, 6, 931–949 (1968).

[182] A. Corkhill, “A Superb Explanatory Device: The MONIAC, an Early Hydraulic Analog

Computer,” University of Melbourne Collections 10, 24–28 (2012).

[183] R. Dardashti, K. P. Y. Thébault, and E. Winsberg, “Confirmation via Analogue Simulation:
What Dumb Holes Could Tell Us about Gravity,” The British Journal for the Philosophy of
Science 68, 1, 55–89 (2017).

[184] R. Dardashti, S. Hartmann, K. Thébault, and E. Winsberg, “Hawking radiation and analogue
experiments: A Bayesian analysis,” Studies in History and Philosophy of Science Part B:
Studies in History and Philosophy of Modern Physics 67, 1–11 (2019).

[185] U.-J. Wiese, “Towards quantum simulating QCD,” Nuclear Physics A 931, 246–256 (2014).

[186] I. Georgescu, S. Ashhab, and F. Nori, “Quantum simulation,” Reviews of Modern Physics

86, 1, 153–185 (2014), arXiv:1308.6253.

[187] M. C. Bañuls and K. Cichy, “Review on novel methods for lattice gauge theories,” Reports

on Progress in Physics 83, 2, 024401 (2020).

[188] I. Bloch, J. Dalibard, and S. Nascimbène, “Quantum simulations with ultracold quantum

gases,” Nature Physics 8, 267–276 (2012).

[189] M. Dalmonte and S. Montangero, “Lattice gauge theory simulations in the quantum infor-

mation era,” Contemporary Physics 57, 3, 388–412 (2016), arXiv:1602.03776.

[190] C. Gross and I. Bloch, “Quantum simulations with ultracold atoms in optical lattices,”

Science 357, 6355, 995–1001 (2017).

[191] R. Blatt and C. F. Roos, “Quantum simulations with trapped ions,” Nature Physics 8, 4,

277–284 (2012).

[192] A. A. Houck, H. E. Türeci, and J. Koch, “On-chip quantum simulation with superconducting

circuits,” Nature Physics 8, 4, 292–299 (2012).

[193] D. Jaksch, C. Bruder, J. I. Cirac, C. W. Gardiner, and P. Zoller, “Cold Bosonic Atoms
in Optical Lattices,” Physical Review Letters 81, 15, 3108–3111 (1998), arXiv:cond-mat/
9805329.

[194] M. Greiner, O. Mandel, T. Esslinger, T. W. Hänsch, and I. Bloch, “Quantum phase transition
from a superfluid to a Mott insulator in a gas of ultracold atoms,” Nature 415, 6867, 39–44
(2002).

[195] S. Notarnicola, E. Ercolessi, P. Facchi, G. Marmo, S. Pascazio, and F. V. Pepe, “Discrete
Abelian gauge theories for quantum simulations of QED,” Journal of Physics A: Mathemat-
ical and Theoretical 48, 30, 30FT01 (2015), arXiv:1503.04340.

206

[196] D. Petcher and D. H. Weingarten, “Monte Ćarlo calculations and a model of the phase
structure for gauge theories on discrete subgroups of SU(2),” Phys. Rev. D 22, 2465–2477
(1980).

[197] G. Bhanot and C. Rebbi, “Monte Carlo simulations of lattice models with finite subgroups

of SU(3) as gauge groups,” Phys. Rev. D 24, 3319–3322 (1981).

[198] H. Grosse and H. Kühnelt, “Phase structure of lattice gauge theories for non-abelian sub-

groups of SU(3),” Physics Letters B 101, 1, 77–81 (1981).

[199] A. Alexandru, P. F. Bedaque, S. Harmalkar, H. Lamm, S. Lawrence, and N. C. Warrington
(NuQS Collaboration), “Gluon field digitization for quantum computers,” Phys. Rev. D 100,
114501 (2019).

[200] E. Gustafson, P. Dreher, Z. Hang, and Y. Meurice, “Benchmarking quantum computers for
real-time evolution of a (1 + 1) field theory with error mitigation,” Quantum Sci. Technol.
6, 045020 (2021), arXiv:1910.09478.

[201] E. Gustafson, Y. Meurice, and J. Unmuth-Yockey, “Quantum simulation of scattering in the

quantum Ising model,” Phys. Rev. D 99, 094503 (2019).

[202] S. P. Jordan, K. S. M. Lee, and J. Preskill, “Quantum Algorithms for Quantum Field

Theories,” Science 336, 6085, 1130–1133 (2012).

[203] S. P. Jordan, K. S. M. Lee, and J. Preskill, “Quantum Computation of Scattering in Scalar
Quantum Field Theories,” Quantum Information and Computation 14, 1014–1080 (2014),
arXiv:1112.4833.

[204] S. P. Jordan, H. Krovi, K. S. M. Lee, and J. Preskill, “BQP-completeness of scattering in

scalar quantum field theory,” Quantum 2, 44 (2018).

[205] N. Klco and M. J. Savage, “Digitization of scalar fields for quantum computing,” Physical

Review A 99, 5 (2019).

[206] N. Klco and M. J. Savage, “Systematically localizable operators for quantum simulations of

quantum field theories,” Physical Review A 102, 1 (2020).

[207] B. Chakraborty, M. Honda, T. Izubuchi, Y. Kikuchi, and A. Tomiya, “Classically Emulated
Digital Quantum Simulation of the Schwinger Model with Topological Term via Adiabatic
State Preparation,” (2022), arXiv:2001.00485.

[208] N. Klco, A. Roggero, and M. J. Savage, “Standard model physics and the digital quantum
thoughts about the interface,” Reports on Progress in Physics 85, 6, 064301

revolution:
(2022).

[209] G. Pederiva, A. Bazavov, B. Henke, L. Hostetler, D. Lee, H.-W. Lin, and A. Shindler,
“Quantum State Preparation for the Schwinger Model,” (2021), arXiv:2109.11859.

[210] G. Pederiva, A. Bazavov, B. Henke, L. Hostetler, D. Lee, H.-W. Lin, and A. Shindler, “Com-

parison of Quantum State Preparation Algorithms for the Schwinger Model,” In progress.

207

[211] P. Jordan and E. Wigner, “Über das Paulische Äquivalenzverbot,” Zeitschrift für Physik 47,

9, 631–651 (1928).

[212] G. Ortiz, J. E. Gubernatis, E. Knill, and R. Laflamme, “Quantum algorithms for fermionic

simulations,” Phys. Rev. A 64, 022319 (2001).

[213] R. Somma, G. Ortiz, J. E. Gubernatis, E. Knill, and R. Laflamme, “Simulating physical

phenomena by quantum networks,” Phys. Rev. A 65, 042323 (2002).

[214] S. B. Bravyi and A. Y. Kitaev, “Fermionic Quantum Computation,” Annals of Physics 298,

1, 210–226 (2002).

[215] J. T. Seeley, M. J. Richard, and P. J. Love, “The Bravyi-Kitaev transformation for quantum
computation of electronic structure,” The Journal of Chemical Physics 137, 22, 224109
(2012).

[216] E. Zohar, “Quantum simulation of lattice gauge theories in more than one space dimen-
sion—requirements, challenges and methods,” Philosophical Transactions of the Royal So-
ciety A: Mathematical, Physical and Engineering Sciences 380, 2216 (2021).

[217] G. Pardo, T. Greenberg, A. Fortinsky, N. Katz, and E. Zohar, “Resource-efficient quantum
simulation of lattice gauge theories in arbitrary dimensions: Solving for Gauss’s law and
fermion elimination,” Phys. Rev. Res. 5, 023077 (2023).

[218] B. Kaufman, “Crystal Statistics. II. Partition Function Evaluated by Spinor Analysis,” Phys.

Rev. 76, 1232–1243 (1949).

[219] P. D. Beale, “Exact Distribution of Energies in the Two-Dimensional Ising Model,” Phys.

Rev. Lett. 76, 78–81 (1996).

[220] R. Häggkvist, A. Rosengren, D. Andrén, P. Kundrotas, P. H. Lundow, and K. Markström,
“Computation of the Ising partition function for two-dimensional square grids,” Phys. Rev.
E 69, 046104 (2004).

[221] H. Au-Yang and J. H. Perk, “Ising correlations at the critical temperature,” Physics Letters

A 104, 3, 131–134 (1984).

[222] B. M. McCoy, “The Romance of the Ising Model,” in K. Iohara, S. Morier-Genoud, and
B. Rémy (Editors) Symmetries, Integrable Systems and Representations, 263–295, Springer
London, London (2013), ISBN 978-1-4471-4863-0.

[223] T. T. Wu, “Theory of Toeplitz Determinants and the Spin Correlations of the Two-

Dimensional Ising Model. I,” Phys. Rev. 149, 380–401 (1966).

208

APPENDIX A

ADDITIONAL DETAILS ON CLASSICAL SPIN SYSTEMS

A.1 Exact Lattice Results for the Ising Model in Two Dimensions

The exact solution of the Ising model is known. Not only do we have exact results for the

Ising model in the infinite volume, but we also have exact results for finite lattices. This makes it

a good test model when developing new Monte Carlo methods for statistical physics and quantum

field theory. For example, in [218], the exact partition function for the Ising model with interaction

strength 𝐽 on an 𝑚 × 𝑛 lattice is given

(cid:16)2 sinh(2𝛽𝐽)

(cid:17) 𝑚𝑛/2 (cid:34) 𝑛
(cid:214)

𝑍 =

1
2

(cid:16)2 cosh (cid:16) 𝑚
2

𝛾2𝑟−1

(cid:17)

+

𝑛
(cid:214)

𝑟=1

𝛾2𝑟

(cid:17)

+

𝑛
(cid:214)

𝑟=1

(cid:16)2 sinh (cid:16) 𝑚
2

𝛾2𝑟

(cid:16)2 sinh (cid:16) 𝑚
2

(cid:17)

𝛾2𝑟−1

(cid:35)

(cid:17)

,

𝑟=1
(cid:16)2 cosh (cid:16) 𝑚
2

+

𝑛
(cid:214)

𝑟=1

where 𝛾𝑟 is defined by the equation

cosh 𝛾𝑟 = coth(2𝛽𝐽) cosh(2𝛽𝐽) − cos (cid:16) 𝑟𝜋
𝑛

(cid:17)

.

(A.1)

(A.2)

This “algorithm” for the exact partition function for the two dimensional Ising model on finite

lattices has been further improved [219, 220] allowing us to calculate the exact lattice partition

function using e.g. Mathematica. From the partition function, one can then calculate things like

the internal energy

and the specific heat

𝐸 = −

𝜕
𝜕 𝛽

ln 𝑍,

𝐶 = −

𝛽2
𝑛𝑚

𝜕𝐸
𝜕 𝛽

.

A.2 Notes on Magnetic Susceptibility

Consider a classical spin system with Hamiltonian

𝐻 = −𝐽 ∑︁
⟨𝑖, 𝑗⟩

(cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 − (cid:174)ℎ · (cid:174)𝑀,

209

(A.3)

(A.4)

(A.5)

where the magnetization vector is

The canonical partition function is

(cid:174)𝑀 =

𝑁
∑︁

𝑖=1

(cid:174)𝑆𝑖.

𝑍 =

∑︁

𝑒−𝛽𝐻𝑘 ,

(A.6)

(A.7)

𝑘
where the sum is over all possible lattice configurations. The expectation value of an observable 𝑂

is then

The Helmholtz free energy is

⟨𝑂⟩ =

(cid:205)𝑘 𝑂 𝑘 𝑒−𝛽𝐻𝑘
𝑍

.

𝐹 = −

1
𝛽

ln 𝑍.

(A.8)

(A.9)

In formal statistical mechanics, a magnetic field (cid:174)ℎ is applied, breaking the Z𝑞 or 𝑂 (𝑁) sym-

metry and giving the magnetization vector a preferred direction. Then the average value of the

magnetization in a given direction 𝑗 is related to the derivative of the Helmholtz free energy 𝐹 with

respect to the component of the applied magnetic field in the same direction

⟨𝑀 𝑗 ⟩ = −

(cid:19)

(cid:18) 𝜕𝐹
𝜕ℎ 𝑗

.

𝑇

Then the isothermal magnetic susceptibility is

𝜒𝑀, 𝑗 =

(cid:19)

(cid:18) 𝜕⟨𝑀 𝑗 ⟩
𝜕ℎ 𝑗

𝑇

= −

(cid:33)

(cid:32) 𝜕2𝐹
𝜕ℎ2
𝑗

.

𝑇

(A.10)

(A.11)

The susceptibility can be determined by studying the effect on magnetization as the applied magnetic

field is varied. For example, we can approximate the first derivative of the magnetization as

𝜒𝑀, 𝑗 ≈

⟨𝑀 𝑗 (ℎ 𝑗,2)⟩ − ⟨𝑀 𝑗 (ℎ 𝑗,1)⟩
ℎ 𝑗,2 − ℎ 𝑗,1

,

(A.12)

where ℎ 𝑗,2 is a weak magnetic field measured along the 𝑗 direction, ℎ 𝑗,1 is in the same direction

but with a small difference in magnitude, and ⟨𝑀 𝑗 (ℎ 𝑗,𝑖)⟩ is the average magnetization when the

applied magnetic field is ℎ 𝑗,𝑖. This is simply a finite difference approximation of the derivative. In

the limit of small ℎ, one should recover the results of spontaneous magnetization.

210

Alternatively, by using Eq. (A.8), we can write

𝜒𝑀, 𝑗 =

𝜕⟨𝑀 𝑗 ⟩
𝜕ℎ 𝑗

=

𝜕
𝜕ℎ 𝑗

(cid:205)𝑘 (𝑀 𝑗 )𝑘 𝑒−𝛽𝐻𝑘
(cid:205)𝑖 𝑒−𝛽𝐻𝑘

(A.13)

where the subscript 𝑘 refers to the 𝑘th possible configuration. We expand the right-hand side using

the quotient rule of derivatives and then use

𝜕
𝜕ℎ 𝑗

∑︁

𝑘

𝑒−𝛽𝐻𝑘 =

𝜕
𝜕ℎ 𝑗

∑︁

𝑘

𝑒−𝛽𝐽𝑆𝑘+𝛽(cid:174)ℎ· (cid:174)𝑀𝑘 =

∑︁

𝑘

𝛽(𝑀 𝑗 )𝑘 𝑒−𝛽𝐽𝑆𝑘+𝛽(cid:174)ℎ· (cid:174)𝑀𝑘

(A.14)

where 𝑆𝑘 = (cid:205)

⟨𝑖, 𝑗⟩

(cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗 measured on the 𝑘th configuration. The partial derivative brings down a

factor of 𝛽 and the 𝑗-th component of (cid:174)𝑀. After simplifying and applying Eq. (A.8), one ends up

with

𝜒𝑀, 𝑗 = 𝛽

(cid:16)

⟨𝑀 2

𝑗 ⟩ − ⟨𝑀 𝑗 ⟩2(cid:17)

.

(A.15)

If we are measuring things along the direction of the applied magnetic field and if the magnetization

vector (cid:174)𝑀 is parallel to the applied magnetic field (cid:174)ℎ, which in general it should be, then we can

replace 𝑀 𝑗 with 𝑀 = | (cid:174)𝑀 | and then

𝜒𝑀 = 𝛽

(cid:16)

⟨𝑀 2⟩ − ⟨𝑀⟩2(cid:17)

.

One can also write the susceptibility as

𝜒𝑀 = 𝛽⟨(𝛿𝑀)2⟩,

(A.16)

(A.17)

where 𝛿𝑀 = 𝑀 − ⟨𝑀⟩. That is, the susceptibility is related to the expectation value of the

“subtracted” magnetization squared.

In this dissertation, we generally use the definition

𝜒𝑀 =

1
𝛽𝑁

𝜕⟨𝑀⟩
𝜕ℎ

1
𝑁

=

(cid:16)

⟨𝑀 2⟩ − ⟨𝑀⟩2(cid:17)

.

(A.18)

In the lattice Monte Carlo approach, we might never apply and vary an external magnetic

field and instead study the spontaneous magnetization directly. Now there is no a priori pre-

ferred direction. Rather, the discrete symmetry, if any, is spontaneously broken when the system

magnetizes.

211

A.3 Spin-spin Correlation Functions

A.3.1

Ising Model

The correlation of spins (within a single lattice configuration) as a function of distance is a

measure of the amount of clustering in the lattice.

In this section, we consider the 2d Ising model with no external magnetic field. To make this

discussion easier to follow, we switch our notation a bit. We let 𝜎𝑖 𝑗 ∈ {−1, 1} be the spin variable

sitting at the site with Cartesian coordinates (𝑖, 𝑗). This notation makes it easier to identify the

nearest neighbors of a site. For example, the four nearest neighbors of 𝜎𝑖, 𝑗 are 𝜎𝑖, 𝑗−1, 𝜎𝑖, 𝑗+1,𝜎𝑖−1, 𝑗 ,

and 𝜎𝑖+1, 𝑗

We define the spin-spin correlation function schematically as1

𝐺 (𝑟) = (cid:10)𝜎0,0𝜎0,𝑟 (cid:11) ,

where 𝜎0,𝑟 is the spin variable 𝑟 units to the right of 𝜎0,0. By translation invariance, we can take

𝜎0,0 to be any spin variable in the lattice, and by isotropy, we can take the other spin variable to be

𝑟 units in any of the four cardinal directions from 𝜎0,0. Note, that each spin variable has exactly

four spin variables at a distance 𝑟—above, below, left, and right. We are not considering diagonal

correlations—only vertical and horizontal. To maximize our statistics, we include all possibile

spin-variables and all four directions, such that we really calculate

𝐺 (𝑟) =

1
4𝑁

∑︁

∑︁

(cid:104)

𝑖

𝑗

𝜎𝑖, 𝑗 𝜎𝑖−𝑟, 𝑗 + 𝜎𝑖, 𝑗 𝜎𝑖+𝑟, 𝑗 + 𝜎𝑖, 𝑗 𝜎𝑖, 𝑗−𝑟 + 𝜎𝑖, 𝑗 𝜎𝑖, 𝑗+𝑟

(cid:105)

,

where 𝑁 is the number of sites in the lattice.

For a fully ordered lattice (i.e. 𝛽 = ∞), all spin variables have the same value and 𝐺 (𝑟) = 1.

For a fully disordered lattice (i.e. 𝛽 = 0), the spin variables have random values and 𝐺 (𝑟) = 0.

Between these extremes, 𝐺 (𝑟) tends to be an exponential decay. This is illustrated in Figure A.1.

For 𝛽 > 𝛽𝑐, the correlation function approaches a nonzero asymptotic value, which turns out to

be 𝑀 2—the spontaneous magnetization squared. Given the periodic boundary conditions, the

1Some authors subtract the magnetization, defining the correlation function as 𝐺 (𝑟) = (cid:10)𝜎0,0𝜎0,𝑟 (cid:11) − (cid:10)𝜎0,0(cid:11)2.

212

Figure A.1 Here we look at the temperature dependence of the correlation function for a 512 × 512
MCMC simulation of the Ising model. At small 𝛽, the lattice is disordered and 𝐺 (𝑟) ≈ 0. At large
𝛽, the lattice is frozen and 𝐺 (𝑟) ≈ 1. At intermediate 𝛽, 𝐺 (𝑟) exhibits exponential decay. Note,
these are illustrative results without error bars.

Figure A.2 Here we calculate the correlation function 𝐺 (𝑟) for the 2d Ising model on lattices of size
1282, 2562, and 5122. Simulations were performed close to the critical value of 𝛽. For comparison,
we include exact values for 𝑁 = 1, 2, 3, 4, 5 from [221]. The line gives the large asymptotic values
from Eq. (A.19). Note, these are illustrative results without error bars.

correlation function is symmetric in the sense that 𝐶 (𝑟) = 𝐶 (𝐿 − 𝑟). Thus, when plotting the

correlation function, typically, only the first half is plotted. I.e. 0 ≤ 𝑟 ≤ 𝐿/2.

At the critical point 𝛽𝑐, we can calculate 𝐺 (𝑟) exactly via Toeplitz determinants. See [222] and

references therein. However, this method of calculating them is inefficient for large 𝑟. The first

several values are given in Au-Yang and Perks [221]. At 𝛽𝑐, we know that the asymptotic (i.e. large

213

𝑟) behavior is [222, 223]

𝐺 (𝑟) ∼

21/12𝑒3𝜁 ′ (−1) cosh1/4(2𝛽𝑐)
𝑟 1/4

≃

0.70338
𝑟 1/4

,

(A.19)

where 𝜁 ′(−1) is the value of the derivative of the Riemann zeta function at −1. Thus, at least for

𝛽 = 𝛽𝑐, for both small and large 𝑟, we can validate our lattice results. See Figure A.2.

A.3.2 Clock Models

As in [25, 50, 56], we define the correlation function

𝐺 (𝑟𝑖 𝑗 ) =

(cid:68) (cid:174)𝑆𝑖 · (cid:174)𝑆 𝑗

(cid:69)

= ⟨cos(𝜃𝑖 − 𝜃 𝑗 )⟩ = ⟨cos 𝜃𝑖 cos 𝜃 𝑗 ⟩ + ⟨sin 𝜃𝑖 sin 𝜃 𝑗 ⟩,

(A.20)

where 𝑟𝑖 𝑗 is the distance between the sites 𝑖 and 𝑗. In this definition, the brakets indicate ensemble

average. That is, it indicates the average over many equilibrium configurations. The correlation

function is translationally invariant, and so it does not depend separately on 𝑖 and 𝑗, but rather

only on the distance 𝑟 between the sites 𝑖 and 𝑗. Thus, we sum it over all 𝑁 sites in the lattice.

We consider only ‘on-axis’ correlations, i.e., we consider only correlations in the horizontal and

vertical directions and not in any diagonal directions. For a given site in a 2D lattice, there are four

sites which are 𝑟 units away—up 𝑟 units, down 𝑟 units, left 𝑟 units, and right 𝑟 units. So in practice,

we calculate the correlation function as

𝐺 (𝑟) =

1
4𝑁

𝑁
∑︁

∑︁

𝑖=1

𝑗

(cid:10)cos(𝜃𝑖 − 𝜃 𝑗 )(cid:11) ,

(A.21)

where 𝑖 goes over all sites in the lattice, and 𝑗 goes over the four relevant sites (up, down, left, and

right) which are 𝑟 units from site 𝑖.

In [43], the correlation function is defined in exponential form as

𝐹 (𝑟𝑖 𝑗 ) =

(cid:68)

𝑒𝑖(𝜃𝑖−𝜃 𝑗 )(cid:69)

.

(A.22)

In the low temperature phase, there is long-range order, so 𝐺 (𝑟𝑖 𝑗 ) is constant.

In the high

temperature phase, there is no long-range order, and 𝐺 (𝑟) decays exponentially with 𝑟.

In the

intermediate temperature phase (which exists for 𝑞 ≥ 5), there is quasi-long-range order, and the

214

correlation function decays as a power of the distance

𝐺 (𝑟𝑖 𝑗 ) ∼ 𝑟 −𝜂
𝑖 𝑗 ,

(A.23)

where 𝜂 is a temperature-dependent exponent.

A.4 Low Temperature Expansion

We now consider the low temperature (large 𝛽) series expansion of the clock model in two

dimensions. This is an approximation that is helpful in pinning down the asymptotic behavior of

the energy density. The idea is to work backwards from the lowest energy configurations. The

internal energy of a given 𝑞-state clock model configuration can be computed as

𝐸 = −

∑︁

⟨𝑖, 𝑗⟩

cos(𝜃𝑖 − 𝜃 𝑗 ),

(A.24)

where the sum is over nearest-neighbors pairs, and 𝜃𝑖 = 2𝜋𝑘/𝑞 with 𝑘 = 0, 1, . . . , 𝑞 − 1. The

partition function is

𝑘
where the sum is over the possible lattice configurations. Once we have the partition function, the

𝑍 =

∑︁

𝑒−𝛽𝐸𝑘 .

(A.25)

energy density is given by

⟨𝐸⟩
𝑁

= −

𝜕
𝜕 𝛽

ln 𝑍
𝑁

.

(A.26)

Keep in mind that Eq. (A.24) is the energy of a single lattice configuration whereas Eq. (A.26)

should be thought of as a function of 𝛽.

For the low temperature series expansion, we start by considering the lowest energy states, and

we expand the partition function as the power series

𝑍 = 𝑔0𝑒−𝛽𝐸0 +𝑔1𝑒−𝛽𝐸1 +𝑔2𝑒−𝛽𝐸2 +· · · = 𝑔0𝑒−𝛽𝐸0

(cid:20)

1 +

𝑔1
𝑔0

𝑒−𝛽(𝐸1−𝐸0) +

𝑔2
𝑔0

𝑒−𝛽(𝐸2−𝐸0) + · · ·

(cid:21)

, (A.27)

where 𝐸𝑘 gives the 𝑘th lowest possible energy, and 𝑔𝑘 is the number of lattice configurations with

that energy. This is only a valid approximation if 𝑒−𝛽(𝐸𝑖−𝐸0) ≪ 1, for all 𝑖 > 0 such that the

first term is a good approximation and the remaining terms are small perturbations which die off

quickly as the order is increased. For the 𝑞-state clock model, we end up with powers of 𝑒−𝜖 𝛽 where

215

𝜖 = 1 − cos(2𝜋/𝑞). This quantity is small for 𝛽 ∼ 2 when 𝑞 is fairly small. However, when 𝑞 → ∞

for constant 𝛽, we get 𝑒−𝜖 𝛽 → 1, and the approximation fails. Thus, this kind of expansion will not

work for the 𝑋𝑌 model, which is the clock model with 𝑞 → ∞.

For the clock model, the lowest possible energy occurs when all spins are aligned. The energy

of such configurations is 𝐸0 = −2𝑁 cos 0 = −2𝑁, where 2𝑁 is the number of nearest neighbor

pairs in a two-dimensional lattice. There are a total of 𝑔0 = 𝑞, configurations with this energy since

there are 𝑞 different directions in which the aligned spins could be pointing.

The next lowest energy occurs when a single spin in the configuration is turned one unit

away from alignment. All other spins remain aligned. The energy of such a configuration is

𝐸1 = −(2𝑁 − 4) cos 0 − 4 cos(2𝜋/𝑞) = −2𝑁 + 4(1 − cos(2𝜋/𝑞)), where (2𝑁 − 4) is the number of

nearest neighbors which remain aligned and 4 is the number of pairs which are out of alignment.

There are 𝑔1 = 2𝑁𝑞, configurations with this energy since the unaligned spin could be at 𝑁 different

places, it could be pointing one unit out of alignment to either the left or the right, and there are 𝑞

directions for the overall magnetization.

Determining the next lowest energy is a bit harder. Is it when a single spin is misaligned by

two units or is it when a pair of spins are misaligned both by a single unit? If we go through

the calculations we find that, for 𝑞 > 3, then the next lowest energy occurs when there are two

spins misaligned by one unit but those two spins must be neighboring and pointing in the same

misaligned direction. In this case, there are a total of (2𝑁 − 6) aligned pairs and 6 misaligned

pairs, so the energy is 𝐸2 = −(2𝑁 − 6) cos 0 − 6 cos(2𝜋/𝑞) = −2𝑁 + 6(1 − cos(2𝜋/𝑞)). The first

misaligned spin could be put at 𝑁 different sites, the second spin could be put at 4 different places

relative to the first, and there are 𝑞 different directions for the overall orientation. Thus, there are

𝑔2 = 4𝑁𝑞, configurations with this energy.

One could go on, but we’ll stop here. We now write our partition function as

𝑍 = 𝑞𝑒2𝑁 𝛽 (cid:104)1 + 2𝑁𝑒−4𝜖 𝛽 + 4𝑁𝑒−6𝜖 𝛽 + · · ·

(cid:105)

,

(A.28)

216

Figure A.3 The clock model for several 𝑞 along with the low temperature series expansion (solid
lines).

where 𝜖 ≡ 1 − cos(2𝜋/𝑞). Taking the logarithm and then differentiating with respect to 𝛽 gives

⟨𝐸⟩
𝑁

= −2 + 𝜖

8𝑢4 + 24𝑢6 + · · ·
1 + 2𝑁𝑢4 + 4𝑁𝑢6 + · · ·

,

where 𝑢 ≡ 𝑒−𝜖 𝛽. For 𝑞 and 𝛽 such that 𝑢 = 𝑒−𝜖 𝛽 is small, one can make the approximation

⟨𝐸⟩
𝑁

≃ −2 + 𝜖

(cid:104)8𝑢4 + 24𝑢6(cid:105)

.

(A.29)

(A.30)

At 𝛽 = 2 for the 2-, 3-, 4-, and 5-state clock models, this seems to be a valid approximation. See

Fig. (A.3). See also the appendix in [47].

217

APPENDIX B

TECHNICAL DETAILS OF SIMULATING THE EXTENDED-𝑂 (2) MODEL

B.1 Effect of the Angle Cutoff in the Extended 𝑞-state Clock Model

In the extended 𝑞-state clock model, the spins are allowed to take the angles 𝜑(𝑘) = 2𝜋𝑘/𝑞,

where 𝑘 is an integer. By restricting the values of the angle to the domain [𝜑0, 𝜑0 + 2𝜋), the values

of 𝑘 must satisfy 𝜑0𝑞/2𝜋 ≤ 𝑘 < 𝜑0𝑞/2𝜋 + 𝑞. Let 𝑞 = ⌊𝑞⌋ + 𝛿𝑞, 𝜑0𝑞/2𝜋 = 𝑝 + 𝜖, 𝑝 is an integer

and 0 ≤ 𝜖, 𝛿𝑞 < 1, then

𝑘 ∈






( 𝑝, 𝑝 + 1, . . . , 𝑝 + ⌊𝑞⌋)

if 𝜖 = 0,

( 𝑝 + 1, 𝑝 + 2, . . . , 𝑝 + ⌊𝑞⌋)

if 𝜖 > 0, 𝜖 + 𝛿𝑞 < 1 ,

( 𝑝 + 1, 𝑝 + 2, . . . , 𝑝 + ⌊𝑞⌋ + 1)

if 𝜖 > 0, 𝜖 + 𝛿𝑞 ≥ 1 .

(B.1)

Because the action only depends on the angular distance, we actually have two cases: 𝑘 =

0, 1, 2, . . . , ⌊𝑞⌋ (case 1), 𝑘 = 0, 1, 2, . . . , ⌊𝑞⌋ − 1 (case 2). This defines a particular model that

interpolates between the integer 𝑞’s of the ordinary 𝑞-state clock model. If 𝜑0 = 0, we are in case

1. For this model, the interpolation is smooth (in the sense that the thermodynamic curves change

smoothly) as an integer 𝑞 is approached from below. In this case, the allowed angles feature a cutoff

at ⌊𝑞⌋2𝜋/𝑞 that breaks the periodicity. One could choose the allowed angles differently and go to

case 2, where the allowed angles feature a cutoff at 2𝜋( ⌊𝑞⌋ − 1)/𝑞 that breaks the periodicity, and

the interpolation is smooth as an integer 𝑞 is approached from above.

One could remove the cutoff and restore a form of periodicity by allowing angles

𝜑(𝑘) =

2𝜋𝑘
𝑞

,

𝑘 = 0, 1, 2, . . . , ∞.

However, in this case, for a rational 𝑞 = 𝑟/𝑠, where 𝑟/𝑠 is a reduced fraction, the model is the

ordinary 𝑟-state clock model since 2𝜋𝑘/(𝑟/𝑠) = 2𝜋𝑘 𝑠/𝑟 and 𝑘 𝑠 is an integer. Thus, removing the

cutoff would destroy the smooth interpolation in 𝑛 < 𝑞 ≤ 𝑛 + 1 for integer 𝑛. For any irrational 𝑞,

this model without cutoff becomes equivalent to the ∞-state clock model, that is, the 𝑋𝑌 model.

218

Figure B.1 The specific heat for the extended 𝑞-state clock model with 𝑞 = 3.141592654. Results
are from Monte Carlo on 4×4 lattices. The different curves correspond to different allowed domains
for the spin angles 𝜑. The specific heat of the 𝑋𝑌 model is included for reference. As the upper
limit of the domain goes to infinity, we get the 𝑋𝑌 model. Statistical error bars are omitted since
they are smaller than the line thickness. Dashed lines indicate regions where we have data but we
do not have the uncertainty fully under control.

In Fig. B.1 we explore the effect of increasing the cutoff for the model with 𝑞 = 3.141592654 ≈

𝜋. For example, for the case 𝜑 ∈ [0, 2𝜋), the allowed angles are 2𝜋𝑘/𝜋 = 2𝑘 with 𝑘 = 0, 1, 2, 3,

whereas for the case 𝜑 ∈ [0, 4𝜋), we have 𝑘 = 0, 1, . . . , 6.

In all of these cases there remains

a Z2 symmetry, and we expect the model to have an Ising transition at very large 𝛽 for certain

cutoffs. As the upper limit of the domain is moved to infinity (i.e. as the cutoff is removed), this

Ising transition moves to infinity and the model becomes the 𝑋𝑌 model. A minor detail is that

3.141592654 = 3141592654/1000000000 is in fact a rational number, so one would actually get

the 3141592654-state clock model if the cutoff were removed completely. However, it would be

indistinguishable from the 𝑋𝑌 model for practical purposes.

B.2 Validating TRG with MC in the Extended 𝑞-state Clock Model

Whereas Monte Carlo methods are well understood in the context of classical spin models,

TRG is a relatively new approach. We validate TRG results at small 𝛽 and small volume using

exact and Monte Carlo results. Exact results can be computed for 𝑞 = 2 (Ising model) and 𝑞 = 4

(two coupled Ising models). We use Monte Carlo to validate the TRG results for other (including

219

0.00.51.01.52.0β0.00.20.40.60.81.01.21.4Cϕ=[0,2π)ϕ=[0,4π)ϕ=[0,6π)ϕ=[0,10π)ϕ=[0,20π)ϕ=[0,50π)XYModelFigure B.2 An example comparison of the energy density from exact calculation and that from TRG
for 𝑞 = 4.0. The top panel shows the energy density for lattices of size 4 × 4, 8 × 8, and 16 × 16.
The three panels on the bottom show the difference between exact and TRG results. TRG shows
deviations from exact results near the phase transition. Here 𝐷bond = 64.

Figure B.3 Same as Fig. B.2, but for the specific heat. TRG shows deviations from exact results
near the peak, however, this is mostly due to discretization error from the derivative.

fractional) values of 𝑞.

In Fig. B.2, we show that the energy density from TRG agrees very well with the exact

calculation, and that the tiny discrepancy appears only around the critical point.

In TRG, the

specific heat is calculated by taking a finite difference derivative of the energy. In Fig. B.3, we

compare the specific heat from TRG with the exact values for 𝑞 = 4.0. TRG deviates from the exact

220

Figure B.4 An example comparison of TRG and Monte Carlo measurements of the energy density
for 𝑞 = 4.9. The top panel shows the energy density for lattices of size 4 × 4, 8 × 8, and 16 × 16.
The three panels on the bottom show the difference between Monte Carlo and TRG results. Here,
Monte Carlo is taken to be the baseline. Here 𝐷bond = 64.

Figure B.5 Same as Fig. B.4, but for the specific heat. TRG shows deviations from Monte Carlo
results near the peak, however, this is mostly due to discretization error from the derivative.

results near the peak of the specific heat, but this deviation is mostly due to the discretization error

from the derivative.

Exact solutions are not known for fractional-𝑞, so we validate TRG by comparing with results

from Monte Carlo at small 𝛽 and small volume. For example, Fig. B.4 shows that the discrepancy

221

𝑃(𝜑)

0

2𝜋
𝑞

4𝜋
𝑞

6𝜋
𝑞

· · ·

2𝜋⌊𝑞⌋
𝑞

𝜑

Figure B.6 The distribution Eq. (B.2) of angles in the clock model when 𝜑0 = 0 and 𝛽 = 0.

between the energy density from TRG and that from Monte Carlo is only of order 10−4. In Fig. B.5,

we compare the specific heat from TRG with that of Monte Carlo for 𝑞 = 4.9. TRG deviates from

the Monte Carlo results near the peak of the specific heat. However, this deviation is again almost

entirely due to discretization error from the derivative.

B.3 The Need for the 𝜀-shift in the Extended-𝑂 (2) Model

In Section 4.3.4, we noted that to match the clock model in the ℎ𝑞 → ∞ limit of the Extended-

𝑂 (2) model, we need to shift the angles by a small 𝜀, such that 𝜑 ∈ [−𝜀, 2𝜋 − 𝜀).

To understand the reason for this, we must look more closely at the spin distributions. For the

clock model, it is useful to consider the distribution of spins at 𝛽 = 0. At this temperature, one

can think of the spins as being selected uniformly from the discrete distribution Eq. (4.60). We

can write such a distribution as a “Dirac comb”. For example, for the case 𝜑0 = 0, the angles are

distributed as

𝑃𝑐𝑙𝑜𝑐𝑘
𝑞,𝜑0=0(𝜑) ∼

⌊𝑞⌋
∑︁

(cid:18)

𝛿

𝜑 −

2𝜋𝑘
𝑞

(cid:19)

,

(B.2)

𝑘=0
on a finite domain (in this case in a domain from 0 to 2𝜋⌊𝑞⌋/𝑞). See Fig. B.6. When 𝛽 > 0 one

can still think of the angles as being selected from this Dirac comb, but now some peaks are more

likely to be sampled than others.

For the Extended-𝑂 (2) model, when 𝛽 is fixed and ℎ𝑞 is large, the second term in Eq. (4.61)

dominates the energy. The energy wants to be minimized, and this occurs when cos(𝑞𝜑𝑥) = 1,

which occurs when 𝑞𝜑𝑥 = 2𝜋𝑘 or 𝜑𝑥 = 2𝜋𝑘/𝑞. Thus, when ℎ𝑞 is large, the second term in

Eq. (4.61) has the effect of enforcing Eq. (4.60). To make this more quantitative, it is useful to

222

𝑃(𝜑)

ℎ𝑞 = 8
ℎ𝑞 = 1

− 2𝜋
𝑞

0

2𝜋
𝑞

4𝜋
𝑞

· · ·

𝜑

Figure B.7 A generic example plot of the distribution Eq. (B.3) for ℎ𝑞 = 1 and ℎ𝑞 = 8 with 𝛽 = 1.
After choosing appropriate end points and taking the limit ℎ𝑞 → ∞, this becomes the Dirac comb
distribution of the clock model angles.

consider the spin distribution

This is plotted in Fig. B.7.

𝑃𝑒𝑥𝑡𝑂2
𝑞,𝜑0

(𝜑) ∼ 𝑒 𝛽ℎ𝑞 cos(𝑞𝜑).

(B.3)

We claim that in the limit ℎ𝑞 → ∞ the Extended-O(2) model defined by Eq. (4.61) and (4.62) is

equivalent to the clock model defined by Eq. (4.59) and (4.60). However, there is a subtlety. We’ve

shown numerically that for some values of 𝜑0 (an example being 𝜑0 = 0) the Extended-O(2) model

converges to the clock model in the limit ℎ𝑞 → ∞ but only if the spins are shifted by some 𝜀

𝜑0 − 𝜀 ≤ 𝜑 ∈ R < 𝜑0 − 𝜀 + 2𝜋.

(B.4)

Compare Fig. 4.40, which used 𝜀 = 0 and Fig. 4.41, which used 𝜀 = 𝜋(1 − ⌊𝑞⌋/𝑞). We obtain

completely different critical behavior depending on the choice of 𝜀.

To understand this discrepancy, we need to consider what happens when one enforces boundaries

[𝜑0, 𝜑0 + 2𝜋) on the distribution Eq. (B.3) and then takes the limit ℎ𝑞 → ∞. When ℎ𝑞 is increased,

the peaks in the distribution become sharper, but their position does not change. The discrepancy

occurs when the boundary 𝜑0 or 𝜑0 + 2𝜋 coincides with the center of a peak1 2𝜋𝑘/𝑞. This occurs

for example at 𝜑 = 0 when one has the left boundary at 𝜑0 = 0 as illustrated in Fig. B.8. No

matter how large ℎ𝑞 is made in this example, one never recovers the Dirac comb of the clock model

1When 𝑞 ∈ Z, periodicity is recovered, and so if a peak occurs at one boundary then another peak occurs at the
other boundary and the discrepancy does not occur. I.e. the half peak missing from the left boundary is recovered at
the other boundary. In short, one does not have to worry about this discrepancy when 𝑞 ∈ Z.

223

𝑃(𝜑)

0

2𝜋
4.5

4𝜋
4.5

6𝜋
4.5

2𝜋

8𝜋
4.5

𝜑

Figure B.8 A plot of the distribution Eq. (B.3) for 𝑞 = 4.5 and generic ℎ𝑞. If we naively choose the
angles in [0, 2𝜋), we get a model that does not match the clock model in the ℎ𝑞 → ∞ limit. No
matter how large we make ℎ𝑞 (i.e. no matter how sharp we make the peaks), we are always cutting
off half of that first peak, and so we don’t get the Dirac comb of Eq. (B.2) in the ℎ𝑞 → ∞ limit.

𝑃(𝜑)

0

−𝜀

2𝜋
4.5

4𝜋
4.5

6𝜋
4.5

𝜑

8𝜋
4.5
2𝜋 − 𝜀

Figure B.9 Same as the previous picture, but now we shift the angle domain from [0, 2𝜋) to
[−𝜀, 2𝜋 − 𝜀), so that the domain includes all of the peaks. Now it is obvious that we recover the
Dirac comb of Eq. (B.2) in the ℎ𝑞 → ∞ limit. In this example, ℎ𝑞 = 8 and 𝜀 = 0.35. However, one
can use much smaller 𝜀 and still include most of the peaks if one makes ℎ𝑞 sufficiently large.

because one is always cutting off half of the first peak. However, if one shifts the boundaries to

[−𝜀, 2𝜋 − 𝜀), as shown in Fig. B.9 then one recovers the Dirac comb of the clock model when

ℎ𝑞 → ∞.

To match the clock model in the ℎ𝑞 → ∞ limit, it should be sufficient to choose any 𝜀 such that

𝑃𝑒𝑥𝑡𝑂2
𝑞,𝜑0

(𝜑) −−−−−→
ℎ𝑞→∞

𝑃𝑐𝑙𝑜𝑐𝑘
𝑞,𝜑0 (𝜑),

(B.5)

where for the clock model, angles are selected from [𝜑0, 𝜑0 + 2𝜋), but for the Extended-O(2) model,

they are selected from [𝜑0 − 𝜀, 𝜑0 − 𝜀 + 2𝜋). For the rest of this section, we assume 𝜑0 = 0. In that

224

case, the condition is satisfied for

0 < 𝜀 < 2𝜋

(cid:18)

1 −

(cid:19)

.

⌊𝑞⌋
𝑞

(B.6)

This is simply the condition that all of the desired clock angles occur strictly between −𝜀 and

−𝜀 + 2𝜋. Note, that for 𝑞 ∈ Z there is no discrepancy and one can use 𝜀 = 0.

What value of 𝜀 should we use? One could just choose a constant value like 𝜀 = 0.1. However,

any constant value will be unsafe for some values of 𝑞. For example, the condition Eq. (B.6) is not

satisfied for 4 < 𝑞 < 4.06469 when 𝜀 = 0.1, so for these values of 𝑞, the Extended-O(2) model

would fail to reproduce the clock model results in the ℎ𝑞 → ∞ limit. A better approach might be to

choose 𝜀 so that the ⌈𝑞⌉ peaks of the distribution 𝑃𝑒𝑥𝑡𝑂2
𝑞,𝜑0

(𝜑) are centered in the domain [−𝜀, 2𝜋 − 𝜀).

This implies

𝜀 = 𝜋

(cid:18)

1 −

(cid:19)

.

⌊𝑞⌋
𝑞

(B.7)

For example, for 𝑞 = 4.5, we have 5 allowed angles in the clock model and 5 peaks in 𝑃𝑒𝑥𝑡𝑂2
𝑞,𝜑0

(𝜑)

located at 2𝜋𝑘/4.5 with 𝑘 = 0, 1, 2, 3, 4. The first peak is at 𝜑 = 0, and the fifth peak is at

𝜑 = 2𝜋 · 4/4.5. Thus the value of 𝜀 that centers the five peaks within the domain is 𝜀 =

2𝜋(1 − 4/4.5)/2 ≈ 0.35.

B.4 FSS Results at Integer 𝑞

Table B.1 Critical exponents are tabulated for integer 𝑞 = 2, 3, 4, 5, 6 and finite ℎ𝑞 = 0.1, 1.0. The
𝑄-value is a goodness-of-fit measure defined in Eq. (3.69). For 𝑞 = 5, 6, there appear to be two
BKT transitions as is the case for ℎ𝑞 = ∞. Here, we include data only for the transition at small
inverse temperature. Empty cells indicate that it was not possible to obtain a trustworthy fit.

𝑞

2.0
2.0
2.0
2.0
2.0
2.0
2.0

ℎ𝑞
0.1
0.1
0.1
0.1
0.1
1.0
1.0

Exponent
1/𝜈
𝛼/𝜈
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
1/𝜈
𝛼/𝜈

Fit Form 𝐿𝑚𝑖𝑛
32
Eq. (3.70)
32
Eq. (3.75)
32
Eq. (3.73)
32
Eq. (3.72)
32
Eq. (3.74)
32
Eq. (3.70)
32
Eq. (3.75)

𝐿𝑚𝑎𝑥 Q-value
0.196
256
0.498
256
0.386
256
0.617
256
0.198
256
0.722
256
0.011
256

𝜒2/𝑑𝑜 𝑓
1.483
0.878
1.058
0.711
1.476
0.571
2.991

Exponent

0.958(49)
0.020(17)
1.780(16)
-0.306(81)
1.713(13)
0.978(68)
-0.010(16)

225

2.0
2.0
2.0

3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0

4.0
4.0
4.0
4.0
4.0
4.0
4.0
4.0
4.0
4.0

5.0
5.0
5.0
5.0
5.0
5.0

6.0
6.0
6.0
6.0
6.0
6.0

1.0
1.0
1.0

0.1
0.1
0.1
0.1
0.1
1.0
1.0
1.0
1.0
1.0

0.1
0.1
0.1
0.1
0.1
1.0
1.0
1.0
1.0
1.0

0.1
0.1
0.1
1.0
1.0
1.0

0.1
0.1
0.1
1.0
1.0
1.0

𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
1/𝜈
𝛼/𝜈
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
1/𝜈
𝛼/𝜈
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
1/𝜈
𝛼/𝜈
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
1/𝜈
𝛼/𝜈
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂
𝛾/𝜈
−𝛽/𝜈
2 − 𝜂

Table B.1 (cont’d)

Eq. (3.73)
Eq. (3.72)
Eq. (3.74)

Eq. (3.70)
Eq. (3.71)
Eq. (3.73)
Eq. (3.72)
Eq. (3.74)
Eq. (3.70)
Eq. (3.71)
Eq. (3.73)
Eq. (3.72)
Eq. (3.74)

Eq. (3.70)
Eq. (3.75)
Eq. (3.73)
Eq. (3.72)
Eq. (3.74)
Eq. (3.70)
Eq. (3.75)
Eq. (3.73)
Eq. (3.72)
Eq. (3.74)

Eq. (3.73)
Eq. (3.72)
Eq. (3.74)
Eq. (3.73)
Eq. (3.72)
Eq. (3.74)

Eq. (3.73)
Eq. (3.72)
Eq. (3.74)
Eq. (3.73)
Eq. (3.72)
Eq. (3.74)

32

256

0.532

0.828

1.745(19)

32

32
32
32

32
32
32
32

32

32
32
32
32
32
32
32
32
32
32

32
32
32
32
32
32

32
32
32
32

256

256
256
256

256
256
256
256

256

256
256
256
256
256
256
256
256
256
256

256
256
256
256
256
256

256
256
256
256

0.07

0.858
0.029
0.264

0.852
0.822
0.402
0.873

0.407

0.942
0
0.53
0.09
0.398
0.167
0.53
0.568
0.39
0.073

0.039
0.067
0.028
0.042
0.06
0.354

0.678
0.009
0.214
0.867

2.06

0.384
2.514
1.302

0.392
0.436
1.032
0.361

1.749(14)

1.521(57)
0.538(48)
1.962(16)

1.6349(93)
1.236(40)
0.385(21)
1.743(16)

1.021

1.705(16)

0.241
10.332
0.831
1.922
1.035
1.576
0.83
0.777
1.049
2.032

2.379
2.083
2.541
2.339
2.148
1.117

0.629
3.08
1.432
0.371

0.40(14)
-0.120(19)
1.728(16)
-0.31(31)
1.7430(85)
0.834(96)
-0.1349(90)
1.742(14)
-0.65(51)
1.7468(93)

1.727(15)
-0.35(22)
1.7284(72)
1.722(14)
-0.44(24)
1.7347(94)

1.710(11)
-0.15(16)
1.7120(86)
1.726(15)

32

256

0.096

1.889

1.7346(92)

226

Table B.2 Here we fit the peak (or inflection point) locations to the form Eq. (3.78) to estimate the
infinite-volume critical point 𝛽𝑐 and the critical exponent 𝜈. Empty cells indicate that it was not
possible to obtain a trustworthy fit.

𝑞

2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0

3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0

4.0
4.0
4.0
4.0
4.0
4.0
4.0
4.0
4.0
4.0

ℎ𝑞
0.1
0.1
0.1
0.1
0.1
1.0
1.0
1.0
1.0
1.0

0.1
0.1
0.1
0.1
0.1
1.0
1.0
1.0
1.0
1.0

0.1
0.1
0.1
0.1
0.1
1.0
1.0
1.0
1.0
1.0

From
𝑑𝑈𝑀/𝑑𝛽|𝑚𝑎𝑥
𝐶𝑉 |𝑚𝑎𝑥
𝜒𝑀 |𝑚𝑎𝑥
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
𝑑𝑈𝑀/𝑑𝛽|𝑚𝑎𝑥
𝐶𝑉 |𝑚𝑎𝑥
𝜒𝑀 |𝑚𝑎𝑥
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
𝑑𝑈𝑀/𝑑𝛽|𝑚𝑎𝑥
𝐶𝑉 |𝑚𝑎𝑥
𝜒𝑀 |𝑚𝑎𝑥
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
𝑑𝑈𝑀/𝑑𝛽|𝑚𝑎𝑥
𝐶𝑉 |𝑚𝑎𝑥
𝜒𝑀 |𝑚𝑎𝑥
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
𝑑𝑈𝑀/𝑑𝛽|𝑚𝑎𝑥
𝐶𝑉 |𝑚𝑎𝑥
𝜒𝑀 |𝑚𝑎𝑥
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
𝑑𝑈𝑀/𝑑𝛽|𝑚𝑎𝑥
𝐶𝑉 |𝑚𝑎𝑥
𝜒𝑀 |𝑚𝑎𝑥
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥

𝐿𝑚𝑖𝑛
32
32
32
32
32
32
32
32
32
32

𝐿𝑚𝑎𝑥 Q-value
0.119
256
0.016
256
0.256
256
0.172
256
0.179
256
0.123
256
0.164
256
0.842
256
0.963
256
0.778
256

𝜒2/𝑑𝑜 𝑓
1.767
2.816
1.32
1.561
1.534
1.752
1.59
0.407
0.194
0.497

1.706
0.276
2.507
1.545
1.429
1.415
1.558
1.905
1.764
2.855

1.607

1.387
1.972
0.412
0.216

2.21
0.935
0.479

32
32
32
32
32
32
32
32
32
32

32

32
32
32
32

32
32
32

256
256
256
256
256
256
256
256
256
256

256

256
256
256
256

256
256
256

0.133
0.924
0.03
0.176
0.215
0.22
0.173
0.093
0.12
0.015

0.158

0.229
0.082
0.839
0.954

0.053
0.46
0.791

227

𝜈

1.13(10)
1.03(11)
1.054(54)
1.014(80)
1.066(45)
0.979(73)
1.04(16)
0.991(55)
0.908(82)
1.012(42)

0.791(35)
0.863(41)
0.811(19)
0.756(29)
1.168(44)
0.808(31)
0.909(47)
0.760(21)
0.760(27)
0.823(18)

𝛽𝑐
0.8775(12)
0.87726(38)
0.87753(48)
0.87734(49)
0.87816(76)
0.65440(53)
0.65442(27)
0.65453(28)
0.65423(28)
0.65482(40)

0.99873(93)
1.00073(74)
0.99924(45)
0.99878(54)
1.0047(16)
0.82568(18)
0.82603(13)
0.82574(10)
0.82575(11)
0.82603(19)

2.26(59)

1.098(27)

1.96(12)
2.05(26)
1.88(13)
1.34(17)

1.0797(54)
1.082(12)
1.0772(67)
0.9918(47)

1.251(47)
1.52(26)
1.334(67)

0.9883(13)
0.9952(60)
0.9912(20)

Table B.3 Here we fit the peak (or inflection point) locations to the form Eq. (3.84) to estimate the
infinite-volume critical point 𝛽𝑐 of the small-𝛽 phase transition.

𝑞

5.0
5.0
5.0
5.0
5.0
5.0

6.0
6.0
6.0
6.0
6.0
6.0

From
ℎ𝑞
0.1
𝜒𝑀 |𝑚𝑎𝑥
0.1
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
0.1 𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
1.0
𝜒𝑀 |𝑚𝑎𝑥
1.0
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
1.0 𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
0.1
𝜒𝑀 |𝑚𝑎𝑥
0.1
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
0.1 𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥
1.0
𝜒𝑀 |𝑚𝑎𝑥
1.0
⟨𝑀⟩|𝑖𝑛 𝑓 𝑙
1.0 𝐹 ( (cid:174)𝑞)|𝑚𝑎𝑥

𝐿𝑚𝑖𝑛
32
32
32
32
32
32

32
32
32
32
32
32

𝐿𝑚𝑎𝑥 Q-value
0.167
256
0.037
256
0.26
256
0.571
256
0.078
256
0.939
256

256
256
256
256
256
256

0.11
0.012
0.348
0.209
0.339
0.206

𝜒2/𝑑𝑜 𝑓
1.579
2.402
1.312
0.774
2.007
0.246

1.819
2.971
1.127
1.445
1.143
1.456

𝛽𝑐
1.1308(77)
1.143(23)
1.133(13)
1.1267(96)
1.133(22)
1.126(15)

1.1237(76)
1.123(13)
1.128(12)
1.125(11)
1.099(16)
1.118(13)

228