\hfil{\bf\LARGE in the weak- and strong-coupling regimes}\par
\vskip100pt
\hfil a PhD thesis by\par
\medskip
\hfil{\bf\Large Ian Jauslin}\par
\medskip
\hfil2015
\vskip100pt
\hfil Supervised by {\bf Alessandro Giuliani}\par
\vfill
\eject
\tableofcontents
\vfill
\eject
\hfil{\bf\LARGE Acknowledgements}\par
\bigskip
\hfil in order of appearance
\hugeskip
\it
\indent I started my PhD in the fall of 2012, at the University of Rome ``Sapienza''. Over the past three years, I have had the good fortune to meet a bunch of wonderful people (throughout this document, the word ``bunch'' will be used to mean ``more than one can count on one's fingers, provided one does not have too many fingers'') who have helped me, each in their own way, take these first few steps in the world of academic research. I would like to take a moment to give thanks to all of these, in my own way.
\bigskip
\indent Let me start with my parents, Hans-Rudolf Jauslin and Lucy Moser-Jauslin (in alphabetical order), who have consistently supported me through all of my endeavors. It may sound easy but it really isn't. They have provided advice at every fork in the road, all the while allowing me to make my own path. Always available for discussion, be it about my latest obsession with Joan of Arc, my repeated failures at flipping an omelette without getting any egg on the ceiling, algebraic field extensions of $\mathbb Q$, or the absence of joint probabilities in quantum mechanics, they have provided me with a sympathetic ear and an enlightened mouth.
\indent My brother and sister, Andreas and Tania (in alphabetical order) have also been of great help. Both have a fantastically open, imaginative and bright mind, and have made every family reunion a wonderful experience (in the literal sense, as in ``full of wonders'').
\indent I'm not going to go through the entire clique of cousins, uncles and grand-parents, because I have to stop at some point. I would however like to mention my cousin Madi, with whom I have had countless late night talks, about everything and anything. Her extremely inquisitive mind, producing a seemingly unending stream of usually unexpected and often wise ideas, has lead me to as much introspection, outrospection, and every other kind of spection I could possibly need.
\indent I would also like to acknowledge the unwavering support I have received from my lifelong friend, Arthur, whom I met in kindergarten, and has been by my side ever since (if you're not impressed, note that that includes adolescence).
\indent I've often been asked why I came to Rome for my PhD. I met Giovanni Gallavotti in Paris, at a series of lectures he was giving on applying renormalization group techniques to classical mechanics problems, based on a reinterpretation of KAM theory. His outlook on science immediately resonated with me: studying real, physical phenomena, while understanding the underlying mathematical structure precisely and thoroughly. It was exactly what I was looking for. I came to Rome for the first six months of 2011, and worked with Alessandro Giuliani on the model of bilayer graphene discussed below. I adored the city, with its many ruins, delicious food and great (though capricious at times) weather. But most of all, I reveled in the methodology of the Roman Renormalization Group Group (I don't think anyone has ever called it that, but I like it!) (at the time, composed of Giuseppe Benfatto, Giovanni Gallavotti, Guido Gentile, Alessandro Giuliani, Vieri Mastropietro). Physics with convergent expansions! Endless, but {\rm explicit} computations that often bleed over from late afternoon into late night. Thorough discussions at the blackboard where the titles of ``student'' and ``professor'' dissolve into a flurry of ideas. And trees, so many trees!
\indent When time came to decide on a PhD advisor, I asked Alessandro, and definitively moved to Rome. I have never looked back.
\bigskip
\indent Alessandro has been an outstanding PhD advisor. He has always been extremely nice and patient, even when I would make the most stupid mistakes. He has introduced me to several very interesting problems, and, by his extensive knowledge of the subject and an explicit and personal understanding of the methods involved, has been a wonderful teacher.
\indent When I first arrived in Rome in 2011, Alessandro had a PhD student, Serena Cenatiempo, and a postdoc, Rafael Greenblatt. Both were extremely helpful in my introduction to mathematical physics, and making me feel welcome in what was, after all, a foreign place. To this day, every one of our meetings turns into an interesting and lively discussion. Around the start of my PhD, Michele Correggi joined Alessandro's group, bringing his good humor and understanding of the inner workings of quantum mechanics to the lunch table. Later on, Niels Benedikter and Martin Lohmann took short term positions in the group. We had many conversations on the meaning of the renormalization group, mathematical physics, and research at large, which have helped me keep my ideas straight. Of particular mention, is Giovanni Antinucci, another PhD student of Alessandro's, who started a year after I did. Always eager to talk about mathematical physics, he often asks for advice on his work and offers advice on mine. Since I am the type of person that needs someone to bounce ideas off of in order to keep my error rate at acceptable levels, I have benefited a lot from having Giovanni around.
\bigskip
\indent About half-way through my PhD, I started to branch out into other areas of mathematical physics. Margherita Disertori visited Alessandro and me in Rome several times, to discuss a model of hard rods in three dimensions. We kicked ideas around for a few months, and had a great time at it.
\indent In June 2014, I started a collaboration with Giuseppe Benfatto and Giovanni Gallavotti, the results of which are discussed in this thesis. Since we started collaborating, I have had the opportunity to work rather intensely with Giovanni Gallavotti, which has been extremely enriching and enjoyable, and am very grateful for it.
\indent In the fall of 2014, I was invited by Giovanni Gallavotti to spend six weeks at Rutgers University. There, I met Joel Lebowitz, and started discussing phase transitions in systems of hard objects with him. My time in Rutgers was idyllic, largely due to Joel's friendliness, as well as that of the wonderful mathematical physicists of Rutgers.
\indent In June 2015, Elliott Lieb visited Alessandro in Rome, and we started to work on a problem involving expressing monomer-dimer partition functions as Pfaffians. In August, Elliott invited me to Princeton and I had a very productive, and enjoyable time. I am very thankful to both Elliott and Joel for being so welcoming, and the many scientific discussions we have had.
\bigskip
\indent When I arrived in Rome, in 2011, I did not know anyone, and did not speak a word of Italian. Since then, I made some great friends, who have helped me keep my sanity for the past three years.
\indent I would first like to mention the PhD students at the mathematics department of La Sapienza, Emanuela, Giulia, Lorenzo and Lucrezia. I often burst into the PhD office, spewing twelve of the worst puns that have ever been uttered in an academic institution, every minute, and they refrain from throwing things at me, which shows admirable restraint. Despite their lack of knowledge about French pastries, we have shared many pleasant moments.
\indent When I first arrived in Rome, my mother told me of family friends that lived there. I wrote them, and was invited to dinner. I was greeted by Wendy and Bruno, and their children Lily and Leo. Throughout my stay in Rome, a deep friendship developed between us. Bruno, very unfortunately, died in February 2014. We spent a lot of quality time together, enjoying extraordinary home cooked meals, discussing art and life, traveling. But most of all, with another close friend, Beno\^it, we played a lot of music. Chamber music. Wendy would play the violin, Beno\^it the cello, and I the guitar (find the odd element in this trio...), Lily would occasionally join on the violin, playing Vivaldi solos faster than I could listen to them. We would then spend long evenings, Beno\^it, Wendy and I, talking about mostly anything. If my stay in Rome was pleasant, they made it wonderful.
\vfill
\eject
\rm
\setcounter{page}{1}
\pagestyle{plain}
\indent The structure of this document is roughly the following. It starts out with an introductory part, which is a fairly informal discussion, meant to convey the basic concepts that the renormalization group technique is based on, and how the specific problems considered in this thesis fit into this big picture. The main body of work of this thesis is contained in the two following parts, in which two systems, specifically a weakly interacting electronic model of bilayer graphene, and a hierarchical $s$-$d$ model, are studied in full detail, using renormalization group techniques. In the introductory part, the discourse has been kept light and accessible, at the occasional expense of precision, and the two main parts of the thesis are much more explicit and precise, although, somewhat technical.
\indent In this introductory part, we will describe the renormalization group technique in general terms, focusing on the ideas it relies on. The range of applications of renormalization group methods is broad, covering quantum field theory~\-[\cite{Wi65}], equilibrium statistical mechanics, classical~\-[\cite{DJ69}] and quantum~\-[\cite{BG90}], classical mechanics~\-[\cite{Ga94b}] and beyond... In order not to get entangled in an overly general discussion, we will mostly focus on applications to quantum statistical mechanics, and, when needed, specialize the discussion to many-Fermion lattice systems, even though most underlying ideas apply to all of the previously mentioned fields.
\bigskip
\indent Before we begin our discussion, a comment on the vocabulary involved in renormalization group methods is in order. The maturation of the renormalization group technique was a long and involved process, and the vernacular that was developed does not always reflect the modern point of view.
\indent The term ``renormalization group'' itself is a good example of this fact: in the following, there will be no talk of ``normalizations'' and little mention of ``groups''. According to K.~\-Wilson~\-[\cite{Wi83}], it was first introduced by E.C.G.~\-Stueckelberg and A.~\-Petermann~\-[\cite{SP53}] in the context of Quantum ElectroDynamics. While it had already been established~\-[\cite{To48}, \cite{Sc49b}, \cite{Fe49b}, \cite{Dy49}] that by correcting the values of the charge $e$ and mass $m$ of the electron, the pesky divergences that appeared in the theory could be removed, E.C.G.~\-Stueckelberg and A.~\-Petermann noticed that there is actually a continuum of possible values for $e$ and $m$ that accomplish this regularization, related to each other by a group of rescaling transformations. They called these groups ``groupes de normalisation''. M.~\-Gell-Mann and F.E.~\-Low~\-[\cite{GL54}] independently understood that these different values are related to the behavior of the system at different energy scales. The point of view that will be adopted in this document is close to that developed by K.~\-Wilson~\-[\cite{Wi65}], in which the ``renormalization'' is understood as a map from the {\it effective Hamiltonian} on a given energy scale to that on another.
\indent A term that will be used extensively below is ``non-perturbative'', which can be found to mean two different (related) things in the literature. In renormalization group methods, one typically makes use of power series, which we will call ``perturbative expansions''. When these series are shown to be convergent, the discussion is said to be ``non-perturbative'', in the sense that it goes beyond a perturbative discussion. This is the sense in which the term is to be understood in this document. However, ``non-perturbative'' is also used to mean ``beyond the radius of convergence'' of these power series. In order to avoid confusion, we will call such a regime of parameters the {\it strong-coupling} regime.
\indent The renormalization group has been extensively studied since the late 1940's, and has spawned several variants, which are not always aptly named. The point of view we have adopted in this document, is sometimes called the ``Wilsonian'' renormalization group. A central goal of the discussion below is to be {\it non-perturbative}, and rigorously prove the convergence of the power series we manipulate. This has led us to use the qualifiers ``rigorous'', ``non-perturbative'' and even ``exact'' to describe our take on the renormalization group. However, there is another renormalization group-based method, called the ``functional renormalization group''~\-[\cite{Ro12}] that also uses the ``non-perturbative'' and ``exact'' adjectives. Their use of ``non-perturbative'' corresponds to our ``strong-coupling''.
\bigskip
\indent We are now ready to delve into the discussion of the core concepts of the renormalization group technique.
\indent The starting point of the discussion is a Hamiltonian, $H$, which can be written as $H=H_0+V$, where $H_0$ is {\it integrable}. In the context we are focusing on, $H$ describes the dynamics of a many-electron system, and is an operator on a Fermionic Fock space, $H_0$ is typically the kinetic term and $V$ an interaction between electrons. The concept of {\it integrability}, in this case, stands for the fact that the moments of the {\it free Gibbs measure}
Using the integrability of $H_0$, one can express the right side of~\-(\ref{eqtrotter}) explicitly, which, {\it provided the formal expansion in~\-{\rm(\ref{eqtrotter})} converges}, yields an adequate expression for the partition function $Z$. This is where things get tricky: in a number of models of physical interest, not only is the formal expansion in~\-(\ref{eqtrotter}) divergent, but every one of its terms is formally infinite.
\bigskip
\indent The fundamental question, which the renormalization group technique was devised to answer, is whether this formal divergence is an artifact of the method, or whether it carries physical importance. This is, by no means, an innocent question. Infinities, or rather, singularities, {\it do} exist and are, in some cases, fundamental to understand the physical phenomena in which they show up. The vortices one observes in turbulent flows of fluids are an obvious example. Phase transitions in statistical mechanics are singular events, whose roughness is directly observable, perhaps most strikingly in the so-called critical opalescence one observes when liquid and gas lose their distinction. The long debate over the stability of the Solar System provides a more subtle example of the importance of taking divergences seriously in physics. By neglecting the divergence of the series they were manipulating, J.L.~\-Lagrange and P.S.~\-Laplace were led to believe that the Solar System is stable, overlooking the singular effects brought forth by near-resonances, which were later shown~\-[\cite{La89}] to destabilize the system.
\indent How the importance of the divergence of these series was realized is an edifying story (see~\-[\cite{Ba94}] for details and references). In 1884, king Oscar~\-II of Sweden appointed G.~\-Mittag-Leffler to organize a prize that would reward an important mathematical discovery. After some debate, Mittag-Leffler invited C.~\-Hermite and K.~\-Weierstrass to propose four problems that would be worth the prize. The first of these was to find a convergent series expansion for the trajectories of the $n$-body problem (i.e. $n$ point masses interacting via Newton's law of gravitation). A young H.~\-Poincar\'e took home the prize in January 1889 for solving this problem in a simplified setting, namely the {\it restricted} 3-body problem. Shortly after the prize was announced, H.~\-Gyld\'en, an established astronomer at the Royal Swedish Academy, claimed that he had proved Poincar\'e's result two years earlier, which set off a bitter dispute between the Academy, which sided with Gyld\'en, and Mittag-Leffler, who persisted in upholding the originality of Poincar\'e's work. In an ironic twist, in June 1889, while the debate over who had priority over this result raged on, L.E.~\-Phragm\'en, the editor in charge of Poincar\'e's memoir, which was to be published in the Acta Mathematica, discovered an error in the proof. Upon re-inspection, Poincar\'e understood that this error was indeed deep, and that the series expansions he had computed for the restricted 3-body problem, for which Gyld\'en was fighting so hard to be credited, are divergent! In the subsequent memoir he published, Poincar\'e described the implications of this fact: of the trajectories he had constructed, some (the perturbations of {\it homoclinic} orbits), which would have been very regular had the series converged, were actually quite erratic. This was the first construction of a chaotic trajectory in a Hamiltonian system, and laid the first stone on top of which Chaos Theory was to be built.
\bigskip
\indent A fundamental idea of the renormalization group is that~\-(\ref{eqtrotter}) is not the right expansion to look at. Essentially, the terms in~\-(\ref{eqtrotter}) diverge because of singularities of $\left<\cdot\right>_0$ in the long- and short-distance regimes, and $V$ describes the interaction between fields on {\it all} length scales at once. Instead, the idea of the renormalization group, at least in K.~\-Wilson's formulation~\-[\cite{Wi65}], is to view the system as a {\it sequence} of {\it effective} Hamiltonians $H^{(h)}=H_0^{(h)}+V^{(h)}$ for $h\in\mathbb Z$, which {\it approximate} the behavior of the system on distances of the order of $2^{-h}$. The effective Hamiltonians $H^{(h)}$ are computed by iterating a map called the {\it beta function}, which maps $H^{(h)}$ to $H^{(h-1)}$.
\indent The beta function is {\it expressed as a power series} in $V^{(h)}$. Since $V^{(h)}$ only describes the interaction on scale $2^{-h}$, the resulting formal expansion is well defined, and, provided $V^{(h)}$ is small enough in an appropriate norm, is either convergent (as in the purely Fermionic cases studied in this thesis), or an asymptotic series. The main issue to address is to ensure that $V^{(h)}$ is indeed {\it small enough}. This task may seem daunting a priori: after all, in most cases, the effective potentials $V^{(h)}$ are parametrized by an infinite number of parameters (called {\it running coupling constants}), so that controlling the iterates of the beta function involves controlling the flow of an infinite-dimensional dynamical system, which, to top it all off, is defined by series. However, by computing the effect of a {\it rescaling} transformation $x\mapsto\lambda x$ on the free Gibbs measure $\left<\cdot\right>_0$, it is possible to bound some of the terms of $V^{(h)}$ by simple dimensional arguments. Such terms have been given the name {\it irrelevant} (which is a rather unfortunate denomination: the fact that these terms are well-behaved a priori by no means affects their physical relevance, and their effects {\it must} be taken into account to get good quantitative predictions). If the number of terms in $V^{(h)}$ that are {\it not} irrelevant is {\it finite}, then the system is said to be {\it renormalizable}. In this case, the size of the effective potential $V^{(h)}$ can be controlled by studying the flow of a {\it finite-dimensional} dynamical system, which can be constructed from the beta function by neglecting the irrelevant terms. It may be worth insisting that this does {\it not} mean that the irrelevant terms are negligible, but merely that their contributions do not need to be checked explicitly in order to control the flow of the running coupling constants.
\indent The point of this technique is that, for systems that are amenable to it, it provides a robust method to compute physical observables as convergent or asymptotic series. The method is {\it robust} in the sense that if small irrelevant terms are added to $V$, the analysis can essentially be carried out in the same way, and only the numerical values one can compute for the physical observables will change.
\indent It, however, has some important limitations, which have, so far, prevented it from being used successfully to study some interesting phenomena, such as high-$T_c$ superconductivity, or quantum gravity. First, for systems that are not renormalizable, renormalization group techniques are extremely hard to make sense of. Second, the approach described above is based on perturbative expansions, which, if $V^{(h)}$ is not small enough, breaks down completely. There are commonplace models in which $V^{(h)}$ is not small, even if the interaction $V$ is. For example, in the case of a 2- or 3-dimensional electron gas, it is expected that the Coulomb interaction plays a substantial role in the large-distance behavior of the system, so much so that the very concept of an individual electron loses its meaning, and should be replaced by {\it Cooper pairs}. Such considerations (which, it may be worth noting, have so far not been proved rigorously) are at the basis of the celebrated BCS theory of superconductivity. In such cases, the system is said to be in the {\it strong-coupling} regime, and it is not yet clear how to replace the perturbative expansions that usually appear in renormalization group techniques.
\bigskip
\indent The system that is studied in detail in part~\-\ref{partbilayer} is in the {\it weak-coupling} regime, and provides an explicit example in which the ideas sketched above are worked out, and yield rigorous results. The discussion in part~\-\ref{partkondo} is a first step towards a possible approach to study systems in the {\it strong-coupling} regime using renormalization group techniques.
\bigskip
\indent In part~\-\ref{partbilayer}, we study a model for the electrons of bilayer graphene at half-filling, which is a 2-dimensional crystal of carbon atoms, in which we consider a short-range (exponentially decaying) interaction between electrons, which is thought to correspond to a screened Coulomb interaction. A more thorough introduction to this system will be given below. Here, we will merely discuss some key properties of the model, and the difficulties and simplicities that result from them.
\indent First, the particles that are modeled are electrons, which satisfy Fermionic statistics. There is a well-established formalism to study interacting Fermionic field theories by renormalization group techniques, which was developed by G.~\-Benfatto and G.~\-Gallavotti~\-[\cite{BG90}] (and is built on a solid base of seminal works in constructive quantum field theory). The Fermionic nature of the model will turn out to be a great advantage, in that the power series that we will introduce below as part of the renormalization group technique, are {\it convergent}. In interacting Bosonic field theories, the terms of the power series produced by renormalization group techniques typically grow as the factorial of the power, and may only, in the best of cases, be shown to be asymptotic series (as in~\-[\cite{EMS75}, \cite{MS77}]). Instead, in interacting Fermionic theories, due to the signs that appear when Fermionic fields are exchanged, the terms of the power series can be bounded more efficiently. Technically, this comes from the fact that while multi-dimensional Gaussian integrals yield permanents, Gaussian Grassmann integrals yield determinants.
\indent The model which we will study is a lattice model. The physical interpretation underlying this assumption is that the electrons are bound to an atom of the crystal, and occasionally tunnel to a neighboring atom. This fact, which is not an artifact of the model, but, rather, is rooted in the physics of the system, substantially simplifies the renormalization group treatment. Indeed, the lattice structure regularizes the singularities due to short-distance effects, since there effectively is {\it no} distance shorter than the inter-atom spacing in the model, so we only need to worry about the long-distance behavior. In the renormalization group vernacular, this means that there is {\it no ultraviolet problem}. This is a common property of problems in condensed matter physics, which have an underlying physical lattice regularization.
\indent After all is said and done, the model is {\it superrenormalizable}. This statement requires some clarification. While there is no ultraviolet problem, there is an {\it infrared problem}: the non-interacting Hamiltonian $H_0$ can be diagonalized in Fourier space, and it turns out that there are 8 momenta at which an eigenvalue vanishes. These momenta, called {\it Fermi points}, generate singularities in the free Gibbs measure $\left<\cdot\right>_0$, which are called {\it infrared} singularities. The scaling properties of $H_0$, which, as was mentioned above, are crucial to the renormalization group analysis, depend on the way that these eigenvalues approach 0 as the momentum is varied. As it turns out, in bilayer graphene, this behavior changes as the momentum gets closer to the Fermi points: there are 3 regimes (actually, 4, but this will not be discussed here) (see figure~\-\ref{bilayerfigbands} for a graphical representation): a first one, far from the Fermi points, in which the eigenvalues decrease linearly, followed by another, closer to the Fermi points, in which they behave quadratically, and, finally, a third regime, that extends all the way down to the Fermi points, and in which, as in the first, they behave linearly. In the first and third regimes, every term of the effective potentials $V^{(h)}$ is {\it irrelevant}. This implies that the size of the running coupling constants can be controlled by purely dimensional arguments. In this case, the system is called {\it superrenormalizable}. The real challenge in this model is to control the running coupling constants in the second regime, in which the scaling behavior of $H_0$ is such that the quartic terms in $V^{(h)}$ are {\it marginal} (not irrelevant). A priori, there could be a risk that the system slip into the strong-coupling regime because of the marginal couplings, which is a concern that can even be backed up by a heuristic computation~\-[\cite{Va10}]. However, and this is our main result, one can show that by optimizing the treatment of the first superrenormalizable regime, the running couplings can be controlled in the marginal regime as well, under a specific set of hypotheses, see theorem~\-\ref{bilayertheomain}. In order to arrive at this conclusion, one needs to maintain control over the crossovers between these different regimes, for which our approach to the renormalization group has proved to be particularly well suited.
\indent The final ingredient of the model is that the Fermi points mentioned above are indeed {\it points}. Generically, in 2-dimensional systems, the infrared singularities would form a Fermi {\it curve}, which adds difficulties to the renormalization group analysis (see e.g.~\-[\cite{BGM06}] or~\-[\cite{FKT04}]). In the case of bilayer graphene (at half-filling, which is the regime which we are studying here), this degeneracy of the Fermi curve is enforced by symmetries. As a consequence, since the symmetries are preserved by the beta function, the infrared singularities of $H_0^{(h)}$ remain point-like for all $h$. However, this is only true because we are neglecting some extra complexity in our model: as was mentioned earlier the electrons are assumed to tunnel from one atom to its neighbors, but if one considers tunnelings to non-nearest neighbor atoms, then one of the symmetries of our model would be violated, and the Fermi curve would no longer be degenerate. If one were to include these extra tunnelings, one would then have a fourth regime, beyond the third, in which the 1-dimensional nature of 6 of the 8 Fermi points would become apparent. However, this effect is thought to only manifest itself at very low temperatures, so our model should adequately represent the system down to these low temperatures, and in particular, our result on the control of the marginal regime is in no way diminished by this fact.
\indent To sum up, the ingredients of our bilayer graphene model that simplify its renormalization group treatment are the Fermionic statistics, the lattice structure, the superrenormalizability of the first and third regimes, and the degeneracy of the Fermi curve. The main difficulty of the analysis lies in the control of the marginal running coupling constants in the second regime, which we are able to control because of optimal control over the superrenormalizable regimes.
\bigskip
\indent In part~\-\ref{partkondo}, we lay the groundwork for a technique that might be of some use to study strongly-coupled systems using rigorous renormalization group techniques.
\indent In the 1990's, a variant of the renormalization group technique , called the {\it functional renormalization group}, and based on a formulation by J.~\-Polchinski~\-[\cite{Po84}] and, independently, G.~\-Gallavotti~\-[\cite{Ga85}], was developed to study systems in the strong-coupling regime, see~\-[\cite{Ro12}] for a review. Essentially, the functional renormalization group technique relies on the same perturbative expansions as in standard implementations of the renormalization group, but these series are truncated in a particular way. It was found that this method yields predictions that are in good agreement with experiments, even when applied to strongly-coupled systems, most notably quantum chromodynamics~\-[\cite{Br09}]. However, as of now, it is not understood why this truncation yields good results: in the strong-coupling regime, the parameters appearing in these series are far beyond their radius of convergence, which makes it difficult to control the remainder of the truncation.
\indent In part~\-\ref{partkondo}, we investigate an alternative approach, based on studying {\it hierarchical} approximations of strongly-coupled Fermionic models. Hierarchical models have been used to prove results on their non-hierarchical counterparts in many other settings, for instance in the Ising model with long-range interactions~\-[\cite{Dy69}], the $\varphi^4$ model in 2 and 3 dimensions~\-[\cite{BCe78}, \cite{BCe80}], and the $\nabla\varphi^4$ model in $\geqslant2$ dimensions~\-[\cite{GK81}, \cite{GK82}]. Their use in Fermionic models is limited to the Gross-Neveu model~\-[\cite{Do91}] in the weak-coupling regime, which had previously been understood using non-perturbative renormalization group methods~\-[\cite{GK85}]. So far, to our knowledge, an investigation of strongly-coupled Fermionic hierarchical models has not been attempted.
\indent Without going into details (for which we refer the reader to part~\-\ref{partkondo}), the {\it hierarchical approximation} of a model is an alternative model that shares the same {\it scaling} properties as the original model. As was already stated above, the scaling properties are essential to renormalization group treatments, and the rough idea of the hierarchical approximation is that since the hierarchical model has the same scaling behavior as its non-hierarchical counterpart, its renormalization group analysis will be closely related to that of the non-hierarchical model. In addition, the ingredients that are neglected in the hierarchical approximation are, at every length scale $2^h$, the correlations over larger distances than $2^h$ and fluctuations over shorter lengths than $2^h$. These are, in some sense, expected to be small.
\indent One of the motivations for considering Fermionic hierarchical models to study strong-coupling effects, is that they are {\it exactly solvable} from a renormalization group point of view: the beta function can be computed {\it exactly} and {\it explicitly}, without any series expansions. This means that Fermionic hierarchical models can be studied rigorously using renormalization group techniques, whether they are in the strong- or weak-coupling regime.
\medskip
\indent In part~\-\ref{partkondo}, we study a hierarchical approximation of the $s-d$ model (also called the Kondo model). This model was introduced to study the effect of magnetic impurities in conductors, namely to understand why, at low temperatures, the resistivity of magnetically polluted metals increases as the temperature decreases. The $s-d$ model describes a collection of non-interacting electrons, living on a 1-dimensional discrete chain of atoms, interacting with a single magnetic impurity localized at one of the sites of the chain. By carrying out computations using a Born approximation (which is not controlled), J.~\-Kondo showed~\-[\cite{Ko64}] that the interaction between the impurity and the electrons could qualitatively change the conducting behavior of the electrons. Our aim is to study this effect using renormalization group methods. The range of temperatures we are interested in is that in which the interaction between the impurity and the electrons qualitatively changes the behavior of the electrons, in other words, we are interested in the strong-coupling regime.
\indent The $s-d$ model was studied using renormalization group methods by P.~\-Anderson~\-[\cite{An70}] and K.~\-Wilson~\-[\cite{Wi75}], though the methods used in these references are perturbative or numerical. The endgame of our analysis is to study the $s-d$ model analytically and non-perturbatively, although, so far, we have only studied a hierarchical version of the model. We have investigated the strong-coupling regime of the hierarchical $s-d$ model by iterating an explicit beta function, expressed in terms of rational functions, and found that the qualitative behavior of the hierarchical model matches up with that of the non-hierarchical $s-d$ model.
\indent The methods we used to perform this analysis are the following. The {\it exact} computation of the beta function was carried out with the help of a computer program, {\it meankondo}~\-[\cite{Ja15}], written for the purpose of this computation. {\it Meankondo} takes the expression of the {\it propagator} of the hierarchical $s-d$ model as input, and outputs the beta function, computed symbolically. It is a rather flexible tool, and can be used to compute the beta function for a wide variety of Fermionic hierarchical model. The beta function was then iterated numerically. Note that there is a fundamental difference between the numerical treatment in~\-[\cite{Wi75}] and the one carried out here: in~\-[\cite{Wi75}], the beta function itself is computed numerically, whereas here, it is merely the iteration of the beta function that is carried out numerically. Studying this iteration means studying a finite-dimensional dynamical system, involving only non-singular rational functions. The results presented in part~\-\ref{partkondo} could probably be proved analytically, or, as an alternative, a computer assisted proof could easily be worked out. By ``computer assisted proof'', we mean that the truncation errors made by the numerical iteration could easily be controlled. We have not carried this out, because it seemed more cumbersome than necessary, given the clarity of the numerical results.
\indent This result on the hierarchical $s-d$ model is quite intriguing: this model can be studied exactly by a renormalization group method in the strong-coupling regime and seems to reproduce the same qualitative behavior of the non-hierarchical $s-d$ model. An enticing question is whether the $s-d$ model can be seen as a perturbation of this, or another, hierarchical approximation. We are currently investigating this direction, as well as considering other strongly-coupled Fermionic models which could be studied via hierarchical approximations.
\bigskip
\indent Before moving on to the main body of this thesis, let me make one last remark on the motivations for turning heuristic arguments into mathematically precise statements.
\indent The argument has already been made that important physical phenomena can be overlooked by neglecting convergence issues. This is a pragmatic argument, that new and interesting physics can come from rigorous mathematical analysis. There also is a deeper, more fundamental motivation: a rigorous mathematical description of natural phenomena is an integral part of the scientific method. The reasoning is simple: if one accepts that the goal of physics is to model natural phenomena as mathematical objects that can be manipulated in order to produce measurable predictions, then mathematical rigor is clearly part of the process. Of course, foregoing rigor can be very useful to save time and to keep focus on the truly physical aspects of a problem, but making results mathematically precise remains an important step that cannot be cast aside.
\vfill
\eject
\references
{\small
\BBlography{bibliography-intro.BBlog}
}
\vfill
\eject
\part{The ground state construction of bilayer graphene}\label{partbilayer}
\indent We will now turn to the first model studied in this thesis, namely a model for the electrons of bilayer-graphene at half filling, with a weak short-range interaction. The discussion in this part is taken from~[\cite{GiJ15}], and is the fruit of a collaboration with Alessandro Giuliani.
\section{Introduction}
\label{bilayerintroduction}
\indent Graphene, a one-atom thick layer of graphite, has captivated a large part of the scientific community for the past decade. With good reason: as was shown by A.~Geim's team, graphene is a stable two-dimensional crystal with very peculiar electronic properties~[\cite{bilayerNGe04}]. The mere fact that a two-dimensional crystal can be synthesized, and manipulated, at room temperature without working inside a vacuum~[\cite{bilayerGe11}] is, in and of itself, quite surprising. But the most interesting features of graphene lay within its electronic properties. Indeed, electrons in graphene were found to have an extremely high mobility~[\cite{bilayerNGe04}], which could make it a good candidate to replace silicon in microelectronics; and they were later found to behave like massless Dirac Fermions~[\cite{bilayerNGe05}, \cite{bilayerZTe05}], which is of great interest for the study of fundamental Quantum Electro-Dynamics. These are but a few of the intriguing features~[\cite{bilayerGN07}] that have prompted a lively response from the scientific community.\par
\indent These peculiar electronic properties stem from the particular energy structure of graphene. It consists of two energy bands, that meet at exactly two points, called the {\it Fermi points}~[\cite{bilayerWa47}]. Graphene is thus classified as a {\it semi-metal}: it is not a {\it semi-conductor} because there is no gap between its energy bands, nor is it a {\it metal} either since the bands do not overlap, so that the density of charge carriers vanishes at the Fermi points. Furthermore, the bands around the Fermi points are approximately conical~[\cite{bilayerWa47}], which explains the masslessness of the electrons in graphene, and in turn their high mobility.\par
\bigskip
\indent Graphene is also interesting for the mathematical physics community: its free energy and correlation functions, in particular its conductivity, can be computed non-perturbatively using constructive Renormalization Group (RG) techniques~[\cite{bilayerGM10}, \cite{bilayerGMP11}, \cite{bilayerGMP12}], at least if it is at {\it half-filling}, the interaction is {\it short-range} and its strength is {\it small enough}. This is made possible, again, by the special energy structure of graphene. Indeed, since the {\it propagator} (in the quantum field theory formalism) diverges at the Fermi points, the fact that there are only two such singularities in graphene instead of a whole line of them (which is what one usually finds in two-dimensional theories), greatly simplifies the RG analysis. Furthermore, the fact that the bands are approximately conical around the Fermi points, implies that a short-range interaction between electrons is {\it irrelevant} in the RG sense, which means that one need only worry about the renormalization of the propagator, which can be controlled.\par
\indent Using these facts, the formalism developed in~[\cite{bilayerBG90}] has been applied in~[\cite{bilayerGM10}, \cite{bilayerGMP12}] to express the free energy and correlation functions as convergent series.\par
\indent Let us mention that the
case of Coulomb interactions is more difficult, in that the effective interaction is marginal in an RG sense. In this case, the theory has been constructed at all orders in renormalized perturbation theory
[\cite{bilayerGMP10}, \cite{bilayerGMP11b}], but a non-perturbative construction is still lacking.
\par
\bigskip
\indent In the present work, we shall extend the results of~[\cite{bilayerGM10}] by performing an RG analysis of half-filled {\it bilayer} graphene with short-range interactions. Bilayer graphene consists of two layers of graphene in so-called {\it Bernal} or {\it AB}
stacking (see below). Before the works of A.~Geim et al.~[\cite{bilayerNGe04}], graphene was mostly studied in order to understand the properties of graphite, so it was natural to investigate the properties of multiple layers of
graphene, starting with the bilayer~[\cite{bilayerWa47}, \cite{bilayerSW58}, \cite{bilayerMc57}]. A common model for hopping electrons on graphene bilayers is the so-called {\it Slonczewski-Weiss-McClure} model,
which is usually studied by retaining only certain hopping terms, depending on the energy regime one is interested in: including more hopping terms corresponds to probing the system at lower energies.
The fine structure of the Fermi surface and the behavior of the
dispersion relation around it depends on which hoppings are considered or, equivalently, on the range of energies under inspection.
\indent In a first approximation, the energy structure of bilayer graphene is similar to that of the monolayer: there are only two Fermi points,
and the dispersion relation is approximately conical around them. This picture is valid for energy scales larger than the transverse hopping between the two layers, referred to in the following as the {\it first regime}.
At lower energies, the effective dispersion relation around the two Fermi points appears to be approximately {\it parabolic}, instead of conical. This implies that the effective mass of the electrons in bilayer graphene does not vanish, unlike those in the monolayer, which has been observed
experimentally~[\cite{bilayerNMe06}].\par
\indent From an RG point of view, the parabolicity implies that the electron interactions are {\it marginal} in bilayer graphene, thus making the RG analysis non-trivial. The flow of the effective couplings
has been studied by O.~Vafek~[\cite{bilayerVa10}, \cite{bilayerVY10}], who has found that it diverges logarithmically, and has identified the most divergent channels,
thus singling out which of the possible quantum instabilities are dominant (see also~[\cite{bilayerTV12}]).
However, as was mentioned earlier, the assumption of parabolic dispersion relation is only an approximation,
valid in a range of energies between the scale of the transverse hopping and a second threshold, proportional to the cube of the transverse hopping (asymptotically, as this hopping goes to zero).
This range will be called the {\it second regime}.\par
\indent By studying the smaller energies in more detail, one finds~[\cite{bilayerMF06}] that around each of the Fermi points, there are three extra Fermi points, forming a tiny equilateral triangle around the original ones. This is referred to in the literature as {\it trigonal warping}. Furthermore, around each of the now eight Fermi points, the energy bands are approximately conical. This means that, from an RG perspective, the
logarithmic divergence studied in [\cite{bilayerVa10}] is cut off at the energy scale where the conical nature of the eight Fermi points becomes observable (i.e. at the end of the second regime). At lower energies
the electron interaction is irrelevant in the RG sense, which implies that the flow of the effective interactions remains bounded at low energies. Therefore, the analysis of [\cite{bilayerVa10}] is meaningful
only if the flow of the effective constants has grown significantly in the second regime.
\par
\indent However, our analysis shows that the flow of the effective couplings in this regime does not grow at all, due to their smallness after integration over the first regime,
which we quantify in terms both of the bare couplings and of the transverse hopping. This puts into question the physical relevance of the ``instabilities'' coming from the logarithmic divergence in the second regime,
at least in the case we are treating, namely small interaction strength and small interlayer hopping.\par
\indent The transition from a normal phase to one with broken symmetry as the interaction strength is increased from small to intermediate values was studied in~[\cite{bilayerCTV12}] at second order in perturbation theory. Therein, it was found that while at small bare couplings the infrared flow is convergent, at larger couplings it tends to increase, indicating a transition towards an {\it electronic nematic state}.\par
\indent Let us also mention that the third regime is not believed to give an adequate description of the system at arbitrarily small energies: at energies smaller than a third threshold (proportional to the fourth power of the transverse hopping) one finds~[\cite{bilayerPP06}] that the six extra Fermi points around the two original ones, are actually microscopic ellipses. The analysis of the thermodynamic properties of the system in this regime (to be called the fourth regime) requires new ideas and techniques, due to the extended nature of the singularity, and goes beyond the scope of this paper. It may be possible to adapt the ideas of~[\cite{bilayerBGM06}] to this regime, and we hope to come back to this issue in a future publication.
\par
\bigskip
\indent To summarize, at weak coupling and small transverse hopping, we can distinguish four energy regimes: in the first, the system behaves like two uncoupled monolayers, in the second, the energy bands are approximately parabolic, in the third, the trigonal warping is taken into account and the bands are approximately conical, and in the fourth, six of the Fermi points become small curves. We shall treat the first, second and third regimes,
which corresponds to retaining only the three dominant Slonczewski-Weiss-McClure hopping parameters.
Informally, we will prove that {\it the interacting half-filled system is analytically close to the non-interacting one} in these regimes, and that the effect of the interaction is merely to renormalize the
hopping parameters. The proof depends on a sharp multiscale control of the crossover regions separating one regime from the next.
\par
\bigskip
\indent We will now give a quick description of the model, and a precise statement of the main result of the present work, followed by a brief outline of its proof.\par
\subsection{Definition of the model}
\label{bilayerintromodelsec}
\indent We shall consider a crystal of bilayer graphene, which is made of two honeycomb lattices in {\it Bernal} or {\it AB} stacking, as shown in figure~\ref{bilayerbaregraph}. We can identify four inequivalent types of sites in the lattice, which we denote by $a$, $\tilde b$, $\tilde a$ and $b$ (we choose this peculiar order for practical reasons which will become apparent in the following).\par
\caption{\ta\ and \tb\ represent atoms of type $a$ and $b$ on the lower layer and \tat\ and \tbt\ represent atoms of type $\tilde a$ and $\tilde b$ on the upper layer. Full lines join nearest neighbors within the lower layer and dashed lines join nearest neighbors within the upper layer.}
where the {\it free Hamiltonian}$\mathcal H_0$ plays the role of a kinetic energy for the electrons, and the {\it interaction Hamiltonian}$\mathcal H_I$ describes the interaction between electrons.\par
\bigskip
\indent$\mathcal H_0$ is given by a {\it tight-binding} approximation, which models the movement of electrons in terms of {\it hoppings} from one atom to the next. There are four inequivalent types of hoppings which we shall consider here, each of which will be associated a different {\it hopping strength}$\gamma_i$. Namely, the hoppings between neighbors of type $a$ and $b$, as well as $\tilde a$ and $\tilde b$ will be associated a hopping strength $\gamma_0$; $a$ and $\tilde b$ a strength $\gamma_1$; $\tilde a$ and $b$ a strength $\gamma_3$; $\tilde a$ and $a$, and $\tilde b$ and $b$ a strength $\gamma_4$ (see figure~\ref{bilayerintergraph}). We can thus express $H_0$ in {\it second quantized} form in {\it momentum space} at {\it zero chemical potential} as~[\cite{bilayerWa47}, \cite{bilayerSW58}, \cite{bilayerMc57}]
in which $\hat a_k$, $\hat{\tilde b}_k$, $\hat{\tilde a}_k$ and $\hat b_k$ are {\it annihilation operators} associated to atoms of type $a$, $\tilde b$, $\tilde a$ and $b$, $k\equiv(k_x,k_y)$, $\hat\Lambda$ is the {\it first Brillouin zone}, and $\Omega(k):=1+2e^{-i\frac32k_x}\cos\left(\frac{\sqrt3}2k_y\right)$.
These objects will be properly defined in section~\ref{bilayermodelsec}. The $\Delta$ parameter in $H_0$ models a shift in the chemical potential around atoms of type $a$ and $\tilde b$~[\cite{bilayerSW58}, \cite{bilayerMc57}]. We choose the energy unit in such a way that $\gamma_0=1$. The hopping strengths have been measured experimentally in graphite~[\cite{bilayerDD02}, \cite{bilayerTDD77}, \cite{bilayerMMD79}, \cite{bilayerDDe79}] and in bilayer graphene samples~[\cite{bilayerZLe08}, \cite{bilayerMNe07}]; their values are given in the following table:
We notice that the relative order of magnitude of $\gamma_3$ and $\gamma_4$ is quite different in graphite and in bilayer graphene. In the latter, $\gamma_1$ is somewhat small, and $\gamma_3$ and $\gamma_4$ are of the same order, whereas $\Delta$ is of the order of $\gamma_1^2$. We will take advantage of the smallness of the hopping strengths and treat $\gamma_1=:\epsilon$ as a small parameter: we fix
{\bf Remark:} The symbols used for the hopping parameters are standard. The reason why $\gamma_2$ was omitted is that it refers to next-to-nearest layer hopping in graphite.
In addition, for simplicity, we have neglected the intra-layer next-to-nearest neighbor hopping $\gamma_0'\approx0.1\gamma_1$, which is known to play an analogous role to $\gamma_4$ and $\Delta$ [\cite{bilayerZLe08}].
\par\bigskip
\begin{figure}
\hfil\includegraphics{Figs/hoppings.pdf}\par
\caption{The different types of hopping. From top-left to bottom-right: $a\leftrightarrow b$, $\tilde a\leftrightarrow\tilde b$, $a\leftrightarrow\tilde b$, $b\leftrightarrow\tilde a$, $a\leftrightarrow\tilde a$, $b\leftrightarrow\tilde b$. Atoms of type $a$ and $\tilde a$ are represented by spheres and those of type $b$ and $\tilde b$ by cubes; the interaction is represented by solid red (color online) cylinders; the interacting atoms are displayed either in purple or in blue.}\label{bilayerintergraph}\end{figure}
\indent The interactions between electrons will be taken to be of {\it extended Hubbard} form, i.e.
where $n_x:=\alpha_x^\dagger\alpha_x$ in which $\alpha_x$ is one of the annihilation operators $a_x$, $\tilde b_x$, $\tilde a_x$ or $b_x$; the sum over $(x,y)$ runs over all pairs of atoms in the lattice; $v$ is a short range interaction potential (exponentially decaying); $U$ is the {\it interaction strength} which we will assume to be small.\par
where $\mathbf x_1:=(t_1,x_1)$ and $\mathbf x_2:=(t_2,x_2)$ includes an extra {\it imaginary time} component, $t_{1,2}\in[0,\beta)$, which is introduced in order to compute $Z$ and Gibbs averages,
We denote the Fourier transform of $\check s_2(\mathbf x)$ (or rather of its anti-periodic extension in imaginary time for $t$'s beyond $[0,\beta)$) by $s_2(\mathbf k)$ where $\mathbf k:=(k_0,k)$, and $k_0\in\frac{2\pi}{\beta}(\mathbb Z+\frac12)$.\par
\subsection{Non-interacting system}
\indent In order to state our main results on the interacting two-point Schwinger function,
it is useful to first review the scaling properties of the non-interacting one,
discussion of the structure of its singularities in momentum space.
\bigskip
\point{\bf Non-interacting Fermi surface.} If $H_0(k)$ is not invertible, then $s_2^{(0)}(0,k)$ is divergent. The set of quasi-momenta $\mathcal F_0:=\{k,\ \det H_0(k)=0\}$ is called the non-interacting {\it Fermi surface}
at zero chemical potential, which has the following structure: it contains two isolated points located at
around each of which there are three very small curves that are approximately elliptic (see figure~\ref{bilayerfigferm}). The whole singular set $\mathcal F_0$ is contained within
two small circles (of radius $O(\epsilon^2)$), so that on scales larger than $\epsilon^2$, $\mathcal F_0$ can be approximated by just two points, $\{p_{F,0}^\pm\}$, see figure~\ref{bilayerfigferm}. As we zoom in, looking at smaller
scales, we realize that each small circle contains four Fermi points: the central one, and three secondary points around it, called $\{p_{F,j}^\pm,\ j\in\{1,2,3\}\}$.
A finer zoom around the secondary points reveals that they are actually curves of size $\epsilon^3$. \par
\bigskip
\begin{figure}
\hfil\includegraphics{Figs/figferm.pdf}\par
\caption{Schematic representation of the Fermi points. Each dotted square represents a zoom into the finer structure of the Fermi points. The secondary Fermi points are labeled as indicated in the figure. In order not to clutter the drawing, only one of the zooms around the secondary Fermi point was represented.
}\label{bilayerfigferm}\end{figure}\par
\point{\bf Non-interacting Schwinger function.} We will now make the statements about approximating the Fermi surface more precise, and discuss the behavior of $s_2^{(0)}$ around its singularities. We will identify four regimes in which the Schwinger function behaves differently.\par
\bigskip
\subpoint{\bf First regime.} One can show that, if $\mathbf p_{F,0}^\pm:=(0,p_{F,0}^\pm)$, and
s_2^{(0)}(\mathbf p_{F,0}^\pm+\mathbf k' )=\left(\mathfrak L_{\mathrm{I}}\hat A(\mathbf p_{F,0}^\pm+\mathbf k' )\right)^{-1}\left(\mathds1+O(\|\mathbf k' \|_{\mathrm{I}},\epsilon\|\mathbf k' \|_{\mathrm{I}}^{-1})\right)
\label{bilayerfreeschwino}\end{equation}
in which
$\mathfrak L_{\mathrm{I}}\hat A$ is a matrix, independent of $\gamma_1$, $\gamma_3$, $\gamma_4$ and $\Delta$, whose eigenvalues vanish {\it linearly} around $\mathbf p_{F,0}^\pm$ (see figure~\ref{bilayerfigbands}b). We thus identify a {\it first regime}:
$$\epsilon\ll\|\mathbf k' \|_{\mathrm{I}}\ll1$$
in which the error term in~(\ref{bilayerfreeschwino}) is {\it small}. In this first regime, $\gamma_1$, $\gamma_3$, $\gamma_4$ and $\Delta$ are negligible, and the Fermi surface is approximated by $\{p_{F,0}^\pm\}$, around which the Schwinger function diverges {\it linearly}.\par
\bigskip
\begin{figure}
\hfil\includegraphics{Figs/bands.pdf}\par
\caption{Eigenvalues of $H_0(k)$. The sub-figures b,c,d are finer and finer zooms around one of the Fermi points.}
s_2^{(0)}(\mathbf p_{F,0}^\pm+\mathbf k' )=\left(\mathfrak L_{\mathrm{II}}\hat A(\mathbf p_{F,0}^\pm+\mathbf k' )\right)^{-1}\left(\mathds1+O(\epsilon^{-1}\|\mathbf k' \|_{\mathrm{II}},\epsilon^{3/2}\|\mathbf k' \|_{\mathrm{II}}^{-1/2})\right)
\label{bilayerfreeschwint}\end{equation}
in which $\mathfrak L_{\mathrm{II}}\hat A$ is a matrix, independent of $\gamma_3$, $\gamma_4$ and $\Delta$. Two of its eigenvalues vanish {\it quadratically} around $\mathbf p_{F,0}^\pm$ (see figure~\ref{bilayerfigbands}c) and two are bounded away from 0. The latter correspond to {\it massive} modes, whereas the former to {\it massless} modes. We thus identify a {\it second regime}:
$$\epsilon^3\ll\|\mathbf k' \|_{\mathrm{II}}\ll\epsilon$$
in which $\gamma_3$, $\gamma_4$ and $\Delta$ are negligible, and the Fermi surface is approximated by $\{p_{F,0}^\pm\}$, around which the Schwinger function diverges {\it quadratically}.\par
\bigskip
\subpoint{\bf Third regime.} If $\mathbf p_{F,j}^\pm:=(0,p_{F,j}^\pm)$, $j=0,1,2,3$, and
in which $\mathfrak L_{\mathrm{III},j}\hat A$ is a matrix, independent of $\gamma_4$ and $\Delta$, two of whose eigenvalues vanish {\it linearly} around $\mathbf p_{F,j}^\pm:=(0,p_{F,j}^\pm)$ (see figure~\ref{bilayerfigbands}d) and two are bounded away from 0. We thus identify a {\it third regime}:
in which $\gamma_4$ and $\Delta$ are negligible, and the Fermi surface is approximated by $\{p_{F,j}^\pm\}_{j\in\{0,1,2,3\}}$, around which the Schwinger function diverges {\it linearly}.
\par
\bigskip
{\bf Remark:} If $\gamma_4=\Delta=0$, then the error term $O(\epsilon^{4}\|\mathbf k'_{j}\|_{\mathrm{III}}^{-1})$ in (\ref{bilayerfreeschwinth}) vanishes identically, which allows us to extend the third regime to all momenta satisfying
\indent We now state the main theorem, whose proof will occupy the rest of the paper. Roughly, our result is that as long as $|U|$ and $\epsilon$ are small enough and $\gamma_4=\Delta=0$ (see the remarks following the statement for an explanation of why this is assumed), the free energy and the two-point Schwinger function are well defined in the thermodynamic and zero-temperature limit $|\Lambda|,\beta\to\infty$, and that the two-point Schwinger function is analytically close to that with $U=0$. The effect of the interaction is shown to merely {\it renormalize} the constants of the non-interacting Schwinger function.\par
If $\gamma_4=\Delta=0$, then there exists $U_0>0$ and $\epsilon_0>0$ such that for all $|U|<U_0$ and $\epsilon<\epsilon_0$, the specific ground state energy
and, $\forall\mathbf k\in\mathcal B_\infty\setminus\{\tilde{\mathbf p}_{F,j}^\omega\}_{\omega=\pm,j=0,1,2,3}$, the thermodynamic and zero-temperature limit of the two-point Schwinger function, $\lim_{\beta\to\infty}\lim_{|\Lambda|\to\infty}s_2(\mathbf k)$, exists and is analytic in $U$.
\endtheo
\bigskip
{\bf Remarks}:
\def\itemizepenalty{10000}
\begin{itemize}
\item The theorem requires $\gamma_4=\Delta=0$. As we saw above, those quantities play a negligible role in the non-interacting theory as long as we do not move beyond the third regime. This suggests that the theorem should hold with $\gamma_4,\Delta\neq0$ under the condition that $\beta$ is not too large,
i.e., smaller than $(\mathrm{const}.)\ \epsilon^{-4}$. However, that case presents a number of extra technical complications, which we will spare the reader.
\item The conditions that $|U|<U_0$ and $\epsilon<\epsilon_0$ are independent, in that we do not require any condition on the relative values of $|U|$ and $\epsilon$. Such a result calls for tight bounds on the integration over the first regime. If we were to assume that $|U|\ll\epsilon$, then the discussion would be greatly simplified, but such a condition would be artificial, and we will not require it be satisfied. L.~Lu~[\cite{bilayerLu13}] sketched
the proof of a result similar to theorem~\ref{bilayertheomain}, without discussing the first two regimes, which requires such an artificial condition on $U/\epsilon$.
The renormalization of the secondary Fermi points is also ignored in that reference.
\end{itemize}
\def\itemizepenalty{0}
\bigskip
\indent In addition to theorem~\ref{bilayertheomain}, we will prove that the dominating part of the two point Schwinger function is qualitatively the same as the non-interacting one, with renormalized constants. This result is detailed in theorems~\ref{bilayertheoo}, \ref{bilayertheot} and~\ref{bilayertheoth} below, each of which refers to one of the three regimes.\par
\bigskip
\point{\bf First regime.} Theorem~\ref{bilayertheoo} states that in the first regime, the two-point Schwinger function behaves at dominant order like the non-interacting one with renormalized factors.\par
\bigskip
\theo{Theorem}\label{bilayertheoo}
Under the assumptions of theorem~\ref{bilayertheomain}, if $C\epsilon\leqslant\|\mathbf k-\mathbf p_{F,0}^\omega\|_{\mathrm I}\leqslant C^{-1}$ for a suitable $C>0$, then, in the thermodynamic and zero-temperature limit,
\begin{equation} r(\mathbf p^\omega_{F,0}+\mathbf k')=O\big(\big(1+|U||\log\|\mathbf k' \|_{\mathrm{I}}|\big)\|\mathbf k' \|_{\mathrm{I}},\epsilon\|\mathbf k' \|_{\mathrm{I}}\big),\end{equation}
and, for $(k_0,k_{x}',k_{y}'):=\mathbf k-\mathbf p^\omega_{F,0}$,
\begin{equation}
\bar k_0:=z_1 k_0,\quad
\tilde k_0:=\tilde z_1 k_0,\quad
\bar\xi:=\frac{3}{2}v_1(ik'_x+\omega k'_y)
\label{bilayerrcco}\end{equation}
in which $(\tilde z_1,z_1,v_1)\in\mathbb{R}^3$ satisfy
\begin{equation}
|1-\tilde z_1|\leqslant C_1|U|,\quad
|1-z_1|\leqslant C_1|U|,\quad
|1-v_1|\leqslant C_1|U|
\label{bilayerineqrcco}\end{equation}
for some constant $C_1>0$ (independent of $U$ and $\epsilon$).
\endtheo
\bigskip
{\bf Remarks}:
\def\itemizepenalty{10000}
\begin{itemize}
\item The singularities of $s_2$ are approached linearly in this regime.
\item By comparing (\ref{bilayerschwino}) with its non-interacting counterpart~(\ref{bilayerfreepropzo}), we see that the effect of the interaction is to {\it renormalize} the constants in front of $k_0$ and $\xi$ in~(\ref{bilayerfreepropzo}).
\item The {\it inter-layer correlations}, that is the $\{a,b\}\times\{\tilde a,\tilde b\}$ components of the dominating part of $s_2(\mathbf k)$ vanish. In this regime, the Schwinger function of bilayer graphene behave like that of two independent graphene layers.
\end{itemize}
\def\itemizepenalty{0}
\bigskip
\point{\bf Second regime.} Theorem~\ref{bilayertheot} states a similar result for the second regime. As was mentioned earlier, two of the components are {\it massive} in the second (and third) regime, and we first perform a change of variables to isolate them, and state the result on the massive and massless components, which are denoted below by $\bar s_2^{(M)}$ and $\bar s_2^{(m)}$ respectively.\par
\bigskip
\theo{Theorem}
\label{bilayertheot}
Under the assumptions of the theorem~\ref{bilayertheomain}, if $C\epsilon^3\leqslant\|\mathbf k-\mathbf p_{F,0}^\omega\|_{\mathrm{II}}\leqslant C^{-1}\epsilon$ for a suitable $C>0$, then, in the thermodynamic and zero-temperature limit,
and, for $(k_0,k_{x}',k_{y}'):=\mathbf k-\mathbf p^\omega_{F,0}$,
\begin{equation}
\bar\gamma_1:=\tilde m_2 \gamma_1,\quad
\bar k_0:=z_2 k_0,\quad
\bar\xi:=\frac{3}{2}v_2(ik'_x+\omega k'_y)
\label{bilayerrcct}\end{equation}
in which $(\tilde m_2,z_2,v_2)\in\mathbb{R}^3$ satisfy
\begin{equation}
|1-\tilde m_2|\leqslant C_2|U|,\quad
|1-z_2|\leqslant C_2|U|,\quad
|1-v_2|\leqslant C_2|U|
\label{bilayerineqrcct}\end{equation}
for some constant $C_2>0$ (independent of $U$ and $\epsilon$).
\endtheo
\bigskip
{\bf Remarks}:
\def\itemizepenalty{10000}
\begin{itemize}
\item The {\it massless} components $\{\tilde a,b\}$ are left invariant under the change of basis that block-diagonalizes $s_2$. Furthermore, $M$ is {\it small} in the second regime, which implies that the {\it massive} components are {\it approximately}$\{a,\tilde b\}$.
\item As can be seen from~(\ref{bilayerschwintcomps}), the {\it massive} part $\bar s_2^{(M)}$ of $s_2$ is not singular in the neighborhood of the Fermi points, whereas the {\it massless} one, i.e. $\bar s_2^{(m)}$, is.
\item The massless components of $s_2$ approach the singularity {\it quadratically} in the spatial components of $\mathbf k$.
\item Similarly to the first regime, by comparing~(\ref{bilayerschwintcomps}) with~(\ref{bilayerfreepropzt}), we find that the effect of the interaction is to {\it renormalize} constant factors.
\end{itemize}
\def\itemizepenalty{0}
\bigskip
\point{\bf Third regime.} Theorem~\ref{bilayertheoth} states a similar result as theorem~\ref{bilayertheot} for the third regime, though the discussion is made more involved by the presence of the extra Fermi points.\par
\bigskip
\theo{Theorem}
\label{bilayertheoth}
For $j=0,1$, under the assumptions of theorem~\ref{bilayertheomain}, if $\|\mathbf k-\tilde{\mathbf p}_{F,j}^\omega\|_{\mathrm{III}}\leqslant C^{-1}\epsilon^3$ for a suitable $C>0$, then
for some constant $C_{3}>0$ (independent of $U$ and $\epsilon$).
\endtheo
\bigskip
Theorem~\ref{bilayertheoth} can be extended to the neighborhoods of $\tilde{\mathbf p}_{F,j}^\omega$ with $j=2,3$,
by taking advantage of the symmetry of the system under rotations of angle $2\pi/3$:\par
\medskip
\delimtitle{\bf Extension to $j=2,3$}
For $j=2,3$, under the assumptions of theorem~\ref{bilayertheomain}, if $\|\mathbf k-\tilde{\mathbf p}_{F,j}^\omega\|_{\mathrm{III}}\leqslant C^{-1}\epsilon^3$ for a suitable $C>0$, then
where $T(k_0,k_x,k_y)$ denotes the rotation of the $k_x$ and $k_y$ components by an angle $2\pi/3$, $\mathcal T_{(k_0,k_x,k_y)}:=e^{-i(\frac32k_x-\frac{\sqrt3}2k_y)\sigma_3}$, and $\tilde{\mathbf p}_{F,4}^{-}\equiv\tilde{\mathbf p}_{F,1}^{-}$.
\endtheo
\bigskip
{\bf Remarks}:
\def\itemizepenalty{10000}
\begin{itemize}
\item The remarks below theorem~\ref{bilayertheot} regarding the massive and massless fields hold here as well.
\item The massless components of $s_2$ approach the singularities {\it linearly}.
\item By comparing~(\ref{bilayerschwinth}) with (\ref{bilayerfreepropzth}) and~(\ref{bilayerfreepropoth}), we find that the effect of the interaction is to {\it renormalize} the constant factors.
\end{itemize}
\def\itemizepenalty{0}
\subsection{Sketch of the proof}
\indent In this section, we give a short account of the main ideas behind the proof of theorem~\ref{bilayertheomain}.\par
\bigskip
\point{\bf Multiscale decomposition.} The proof relies on a {\it multiscale} analysis of the model, in which the free energy and Schwinger function are expressed as successive integrations over individual scales.
Each scale is defined as a set of $\mathbf k$'s contained inside an annulus at a distance of $2^h$ for $h\in\mathbb Z$ around the singularities located at $\mathbf p_{F,j}^\omega$.
The positive scales correspond to the ultraviolet regime, which we analyze in a multiscale fashion because of the (very mild) singularity of the free propagator at equal imaginary times.
It may be possible to avoid the decomposition by employing ideas in the spirit of [\cite{bilayerSS08}]. The negative scales are treated differently, depending on the regimes they belong to (see below),
and they contain the essential difficulties of the problem, whose nature is intrinsically infrared.
\par
\bigskip
\point{\bf First regime.} In the first regime, i.e. for $-1\gg h\gg h_\epsilon:=\log_2\epsilon$, the system behaves like two uncoupled graphene layers, so the analysis carried out in~[\cite{bilayerGM10}] holds. From a renormalization group perspective, this regime is {\it super-renormalizable}: the scaling dimension of diagrams with $2l$ external legs is $3-2l$, so that only the two-legged diagrams are relevant whereas all of the others are irrelevant (see section~\ref{bilayerpowercountingsec} for precise definitions of scaling dimensions, relevance and irrelevance). This allows us to compute a strong bound on four-legged contributions:
whereas a naive power counting argument would give $|U|2^{h}$ (recall that with our conventions $h$ is negative).\par
\bigskip
\indent The super-renormalizability in the first regime stems from the fact that the Fermi surface is 0-dimensional and that $H_0$ is linear around the Fermi points. While performing the multiscale integration, we deal with the two-legged terms by incorporating them into $H_0$, and one must therefore prove that by doing so, the Fermi surface remains 0-dimensional and that the singularity remains linear. This is guaranteed by a symmetry argument, which in particular shows the invariance of the Fermi surface.\par
\bigskip
\point{\bf Second regime.} In the second regime, i.e. for $3h_\epsilon\ll h\ll h_\epsilon$, the singularities of $H_0$ are quadratic around the Fermi points, which changes the {\it power counting} of the renormalization group analysis: the scaling dimension of $2l$-legged diagrams becomes $2-l$ so that the two-legged diagrams are still relevant, but the four-legged ones become marginal. One can then check~[\cite{bilayerVa10}] that they are actually marginally relevant, which means that their contribution increases proportionally to $|h|$. This turns out not to matter: since the second regime is only valid for $h\gg3h_\epsilon$, $|\hat W_4^{(h)}|$ may only increase by $3|h_\epsilon|$, and since the theory is super-renormalizable in the first regime, there is an extra factor $2^{h_\epsilon}$ in $\hat W_4^{(h_\epsilon)}$, so that $\hat W_4^{(h)}$ actually increases from $2^{h_\epsilon}$ to $3|h_\epsilon|2^{h_\epsilon}$, that is to say it barely increases at all if $\epsilon$ is small enough.\par
\bigskip
\indent Once this essential fact has been taken into account, the renormalization group analysis can be carried out without major difficulties. As in the first regime, the invariance of the Fermi surface is guaranteed by a symmetry argument.\par
\bigskip
\point{\bf Third regime.} In the third regime, i.e. for $h\ll3h_\epsilon$, the theory is again super-renormalizable (the scaling dimension is $3-2l$). There is however an extra difficulty with respect to the first regime, in that the Fermi surface is no longer invariant under the renormalization group flow, but one can show that it does remain 0-dimensional, and that the only effect of the multiscale integration is to move $p_{F,j}^\omega$ along the line between itself and $p_{F,0}^\omega$.\par
\subsection{Outline}
\indent The rest of this paper is devoted to the proof of theorem~\ref{bilayertheomain} and of theorems~\ref{bilayertheoo}, \ref{bilayertheot} and~\ref{bilayertheoth}. The sections are organized as follows.
\begin{itemize}
\item In section~\ref{bilayerthemodelsec}, we define the model in a more explicit way than what has been done so far; then we show how to compute the free energy and Schwinger function using a Fermionic path integral formulation and a {\it determinant expansion}, due to Battle, Brydges and Federbush~[\cite{bilayerBF78}, \cite{bilayerBF84}], see also [\cite{bilayerBK87}, \cite{bilayerAR98}]; and finally we discuss the symmetries of the system.
\item In section~\ref{bilayerproppropsec}, we discuss the non-interacting system. In particular, we derive detailed formulae for the Fermi points and for the asymptotic behavior of the propagator around its singularities.
\item In section~\ref{bilayerschemesec}, we describe the multiscale decomposition used to compute the free energy and Schwinger function.
\item In section~\ref{bilayertreeexpsec}, we state and prove a {\it power counting} lemma, which will allow us to compute bounds for the effective potential in each regime. The lemma is based on the Gallavotti-Nicol\`o tree expansion~[\cite{bilayerGN85}, \cite{bilayerGN85b}], and follows~[\cite{bilayerBG90}, \cite{bilayerGM01}, \cite{bilayerGi10}]. We conclude this section by showing how to compute the two-point Schwinger function from the effective potentials.
\item In section~\ref{bilayeruvsec}, we discuss the integration over the {\it ultraviolet regime}, i.e. scales $h>0$.
\item In sections~\ref{bilayerosec}, \ref{bilayertsec} and~\ref{bilayerthsec}, we discuss the multiscale integration in the first, second and third regimes,
and complete the proofs of theorem~\ref{bilayertheomain}, as well as of theorems \ref{bilayertheoo}, \ref{bilayertheot}, \ref{bilayertheoth}.
\end{itemize}
\section{The model}
\label{bilayerthemodelsec}
\hfil\framebox{\bf From this point on, we set $\gamma_4=\Delta=0$.}
\bigskip
\indent In this section, we define the model in precise terms, re-express the free energy and two-point Schwinger function in terms of Grassmann integrals and truncated expectations, which we will subsequently explain how to compute, and discuss the symmetries of the model and their representation in this formalism.\par
\subsection{Precise definition of the model}
\label{bilayermodelsec}
\indent In the following, some of the formulae are repetitions of earlier ones, which are recalled for ease of reference. This section complements section~\ref{bilayerintromodelsec},
where the same definitions were anticipated in a less verbose form. The main novelty lies in the momentum-real space correspondence, which is
made explicit.
\par
\bigskip
\point{\bf Lattice.} As mentioned in section~\ref{bilayerintroduction}, the atomic structure of bilayer graphene consists in two honeycomb lattices in so-called {\it Bernal} or {\it AB} stacking, as was shown in figure~\ref{bilayerbaregraph}. It can be constructed by copying an elementary cell at every integer combination of
where we have chosen the unit length to be equal to the distance between two nearest neighbors in a layer (see figure~\ref{bilayercellgraph}). The elementary cell consists of four atoms at the following coordinates
given relatively to the center of the cell. $c$ is the spacing between layers; it can be measured experimentally, and has a value of approximately 2.4~[\cite{bilayerTMe92}].\par
\caption{decomposition of the crystal into elementary cells, represented by the blue (color online) rhombi. There are four atoms in each elementary cell: \ta of type $a$ at $(0,0,0)$, \tbt of type $\tilde b$ at $(0,0,c)$, \tat of type $\tilde a$ at $(-1,0,c)$ and \tb of type $b$ at $(0,0,c)$.}
where $L$ is a positive integer that determines the size of the crystal, that we will eventually send to infinity, with periodic boundary conditions. We introduce the intra-layer nearest neighbor vectors:
It is defined in such a way that $\forall x\in\Lambda$, $\forall k\in\hat\Lambda$,
$$e^{ikxL}=1.$$
Since the third component of vectors in $\hat\Lambda$ is always 0, we shall drop it and write vectors of $\hat\Lambda$ as elements of $\mathbb{R}^2$. In the limit $L\to\infty$, the
set $\hat\Lambda$ tends to the torus $\hat\Lambda_\infty=\mathbb R^2/(\mathbb Z G_1+\mathbb Z G_2)$, also called the {\it Brillouin zone}.
\par
\point{\bf Hamiltonian.} Given $ x\in\Lambda$, we denote the Fermionic annihilation operators at atoms of type $a$, $\tilde b$, $\tilde a$ and $b$
within the elementary cell centered at $x$ respectively by $a_{ x}$, $\tilde b_{ x}$, $\tilde a_{ x-\delta_1}$ and $ b_{ x+\delta_1}$. The corresponding creation operators are their adjoint operators.\par
\bigskip
\indent We recall the Hamiltonian~(\ref{bilayerfullham})
$$\mathcal H=\mathcal H_0+\mathcal H_I$$
where $\mathcal H_0$ is the {\it free Hamiltonian} and $\mathcal H_I$ is the {\it interaction Hamiltonian}.\par
\bigskip
\subpoint{\bf Free Hamiltonian.} As was mentioned in section~\ref{bilayerintroduction}, the free Hamiltonian describes the {\it hopping} of electrons from one atom to another. Here, we only consider the hoppings $\gamma_0,\gamma_1,\gamma_3$, see figure~\ref{bilayerintergraph}, so that $\mathcal H_0$ has the following expression in $x$ space:
where $|\Lambda|=L^2$, $\hat A_k$ is a column vector, whose transpose is $\hat A_k^T=(\hat a_{ k},\hat{\tilde b}_{ k},\hat{\tilde a}_{ k},\hat{b}_{ k})$,
where the $d_\alpha$ are the vectors that give the position of each atom type with respect to the centers of the lattice $\Lambda$: $d_a:=0,\ d_{\tilde b}:=(0,0,c),\ d_{\tilde a}:=(0,0,c)-\delta_1,\ d_b:=\delta_1$ and $v$ is a bounded, rotationally invariant function, which decays exponentially fast to zero at infinity. In our spin-less case, we can assume without loss of generality that $v(0)=0$.
\subsection{Schwinger function as Grassmann integrals and expectations}
\label{bilayertracespathintssec}
\indent The aim of the present work is to compute the {\it specific free energy} and the {\it two-point Schwinger function}. These quantities are defined for finite lattices by
in which $(\alpha,\alpha')\in\mathcal A^2:=\{a,\tilde b,\tilde a,b\}^2$; $\mathbf x_{1,2}=(t_{1,2},x_{1,2})$ with $t_{1,2}\in[0,\beta)$; $\alpha_{\mathbf x}=e^{\mathcal Ht}\alpha_xe^{-\mathcal Ht}$; and $\mathbf T$ is the {\it Fermionic time ordering operator} defined in~(\ref{bilayertimeorderdef}). Our strategy essentially consists in deriving convergent expansions for $f_\Lambda$ and $\check s$, uniformly in $|\Lambda|$ and $\beta$, and then to take $\beta,|\Lambda|\to\infty$.\par
\bigskip
\indent However, the quantities on the right side of~(\ref{bilayerfreeen}) and~(\ref{bilayerschwindef}) are somewhat difficult to manipulate. In this section, we will re-express $f_\Lambda$ and $\check s$ in terms of {\it Grassmann integrals} and {\it expectations}, and show how such quantities can be computed using a {\it determinant expansion}. This formalism will lay the groundwork for the procedure which will be used in the following to express $f_\Lambda$ and $\check s$ as series, and subsequently prove their convergence.\par
\bigskip
\point{\bf Grassmann integral formulation.} We first describe how to express~(\ref{bilayerfreeen}) and~(\ref{bilayerschwindef}) as Grassmann integrals. The procedure is well known and details can be found in many references, see e.g. [\cite{bilayerGM10}, appendix~B] and [\cite{bilayerGi10}] for a discussion adapted to the case of graphene, or [\cite{bilayerGM01}] for a discussion adapted to general low-dimensional Fermi systems,
or [\cite{bilayerBG95}] and [\cite{bilayerSa13}] and references therein for an even more general picture. \par
\bigskip
\subpoint{\bf Definition.} We first define a Grassmann algebra and an integration procedure on it. We move to Fourier space: for every $\alpha\in\mathcal A:=\{a,\tilde b,\tilde a,b\}$, the operator $\alpha_{(t,x)}$ is associated
with $k_0\in2\pi\beta^{-1}(\mathbb Z+1/2)$ (notice that because of the $1/2$ term, $k_0\neq0$ for finite $\beta$). We notice that $\mathbf k\in\mathcal B_{\beta,L}:=(2\pi\beta^{-1}(\mathbb Z+1/2))\times\hat\Lambda$ varies in an infinite set. Since this will cause trouble when defining Grassmann integrals, we shall impose a cutoff $M\in\mathbb N$: let $\chi_0(\rho)$ be a smooth compact support function that returns $1$ if $\rho\leqslant1/3$ and $0$ if $\rho\geqslant2/3$, and let
To every $(\hat\alpha_{\mathbf k},\hat\alpha_{\mathbf k}^\dagger)$ for $\alpha\in\mathcal A$ and $\mathbf k\in\mathcal B_{\beta,L}^*$, we associate a pair of {\it Grassmann variables}$(\hat\psi_{\mathbf k,\alpha}^-,\hat\psi_{\mathbf k,\alpha}^+)$, and we consider the finite Grassmann algebra (i.e. an algebra in which the $\hat\psi$ anti-commute with each other) generated by the collection $\{\hat\psi_{\mathbf k,\alpha}^\pm\}_{\mathbf k\in\mathcal B_{\beta,L}^*}^{\alpha\in\mathcal A}$. We define the Grassmann integral
as the linear operator on the Grassmann algebra whose action on a monomial in the variables $\hat\psi^\pm_{{\bf k},\alpha}$ is $0$ except if said monomial is $\prod^{\alpha\in{\mathcal A}}_{{\bf k}\in{\mathcal B}_{\beta,L}^*}\hat\psi^-_{{\bf k},\alpha}\hat\psi^+_{{\bf k},\alpha}$ up to a permutation of the variables, in which case the value of the integral is determined using
along with the anti-commutation of the $\hat\psi$.\par
\bigskip
\indent In the following, we will express the free energy and Schwinger function as {\it Grassmann integrals}, specified by a {\it propagator} and a {\it potential}.
The propagator is a $4\times4$ complex matrix $\hat g(\mathbf k)$, supported on some set $\mathcal B\subset\mathcal B_{\beta,L}^*$, and is associated with
Gaussian Grassmann integrals satisfy the following {\it addition principle}: given two propagators $\hat g_1$ and $\hat g_2$, and any polynomial $\mathfrak P(\psi)$ in the Grassmann variables,
where $f_{0,\Lambda}$ is the free energy in the $U=0$ case and, using the symbol $\int d\mathbf x$ as a shorthand for $\int_{0}^\beta dt \sum_{x\in\Lambda}$,
Notice that the expression of $\mathcal V(\psi)$ in~(\ref{bilayer2.3a}) is very similar to that of $\mathcal H_I$, with an added imaginary time $(x_0,y_0)$ and the $\alpha_{\mathbf x}$ replaced by $\psi_{\mathbf x,\alpha}$, except that $(\alpha_\mathbf x^\dagger\alpha_\mathbf x-1/2)$ becomes $\psi_{\mathbf x,\alpha}^+\psi_{\mathbf x,\alpha}^-$. Roughly, the reason why we ``drop the 1/2'' is because of the difference between the anti-commutation rules of $\alpha_\mathbf x$ and $\psi_{\mathbf x,\alpha}$ (i.e., $\{\alpha_\mathbf x,\alpha_\mathbf x^\dagger\}=1$, vs. $\{\psi_{\mathbf x,\alpha}^+,\psi_{\mathbf x,\alpha}^-\}=0$).
More precisely, taking $\mathbf x=(x_0,x)$ with $x_0\in(-\beta,\beta)$,
it is easy to check that the limit as $M\to\infty$ of $g_{\leqslant M}(\mathbf x):=\int P_{\leqslant M}(d\psi)\psi^-_\mathbf x\psi^+_{\bf0}$ is equal to $\check s(\mathbf x)$, if $\mathbf x\neq{\bf0}$,
and equal to $\check s({\bf0})+1/2$, otherwise. This extra $+1/2$ accounts for the ``dropping of the 1/2" mentioned above.\par
\bigskip
\subpoint{\bf Two-point Schwinger function.} The two-point Schwinger function can be expressed as a Grassmann integral as well: under the condition that
is analytic in $U$ uniformly in $M$, a fact we will also check a posteriori, then one can prove (see e.g. [\cite{bilayerGM10}, appendix~B]) that the two-point Schwinger function can be written as
where $\hat J_{\mathbf k,\alpha}^-$ and $\hat J_{\mathbf k,\alpha'}^+$ are extra Grassmann variables introduced for the purpose of the computation (note here that the Grassmann integral over the variables $\hat J_{\mathbf k,\alpha_1}^-,\hat J_{\mathbf k,\alpha_2}^+$ acts as a functional derivative with respect to the same variables, due to the Grassmann integration/derivation rules).
\point{\bf Expectations.} We have seen that the free energy and Schwinger function can be computed as Grassmann integrals, it remains to see how one computes such integrals.
in which $(\mathcal V_1,\cdots,\mathcal V_N)$ is a collection of commuting polynomials and the index $_{\leqslant M}$ refers to the propagator of $P_{\leqslant M}(d\psi)$. A similar formula holds for~(\ref{bilayersc}).\par
\bigskip
\indent The purpose of this rewriting is that we can compute truncated expectations in terms of a {\it determinant expansion}, also known as the Battle-Brydges-Federbush formula~[\cite{bilayerBF78}, \cite{bilayerBF84}], which expresses it as the determinant of a Gram matrix. The advantage of this writing is that, provided we first re-express the propagator $\hat g_{\leqslant M}(\mathbf k)$ in $\mathbf x$-space, the afore-mentioned Gram matrix can be bounded effectively (see section~\ref{bilayerpowercountingsec}). We therefore first define an $\mathbf x$-space representation for $\hat g(\mathbf k)$:
The determinant expansion is given in the following lemma, the proof of which can be found in [\cite{bilayerGM01}, appendix~A.3.2], [\cite{bilayerGi10}, appendix~B].\par
\bigskip
\theo{Lemma}\label{bilayerdetexplemma}
\indent Consider a family of sets $\mathbf P=(P_1,\cdots,P_s)$ where every $P_j$ is an ordered collection of Grassmann variables, we denote the product of the elements in $P_j$ by $\Psi_{P_j}:=\prod_{\psi\in P_j}\psi$.\par
\smallskip
\indent We call a pair $(\psi_{\mathbf x,\alpha}^-,\psi_{\mathbf x',\alpha'}^+)\in\mathbf P^2$ a {\it line}, and define the set of {\it spanning trees}$\mathbf T(\mathbf P)$ as the set of collections $T$ of lines that are such that upon drawing a vertex for each $P_i$ in $\mathbf P$ and a line between the vertices corresponding to $P_i$ and to $P_j$ for each line $(\psi_{\mathbf x,\alpha}^-,\psi_{\mathbf x',\alpha'}^+)\in T$ that is such that $\psi_{\mathbf x,\alpha}^-\in P_i$ and $\psi_{\mathbf x',\alpha'}^+\in P_j$, the resulting graph is a tree that connects all of the vertices.\par
\smallskip
\indent For every spanning tree $T\in\mathbf T(\mathbf P)$, to each line $l=(\psi_{\mathbf x,\alpha}^-,\psi_{\mathbf x',\alpha'}^+)\in T$ we assign a {\it propagator}$g_l:=g_{\alpha,\alpha'}(\mathbf x-\mathbf x')$.\par
\smallskip
\indent If $\mathbf P$ contains $2(n+s-1)$ Grassmann variables, with $n\in\mathbb N$, then there exists a probability measure $dP_T(\mathbf t)$ on the set of $n\times n$ matrices of the form $\mathbf t=M^TM$ with $M$ being a matrix whose columns are unit vectors of $\mathbb R^n$, such that
where $\sigma_T\in\{-1,1\}$ and $G^{(T)}(\mathbf t)$ is an $n\times n$ complex matrix each of whose components is indexed by a line $l\not\in T$ and is given by
$$G^{(T)}_{l}(\mathbf t)=\mathbf t_lg_l$$
(if $s=1$, then $\mathbf T(\mathbf P)$ is empty and both the sum over $T$ and the factor $\sigma_T\prod_{l\in T}g_l$ should be dropped from the right side of~(\ref{bilayerdetexp})).
\endtheo
\bigskip
\indent Lemma~\ref{bilayerdetexplemma}\ gives us a formal way of computing the right side of~(\ref{bilayergrasstruncexp}). However, proving that this formal expression is correct, in the sense that it is not divergent, will require a control over the quantities involved in the right side of~(\ref{bilayerdetexp}), namely the propagator $g_{\leqslant M}$.
Since, as was discussed in the introduction, $g_{\leqslant M}$ is singular, controlling the right side of~(\ref{bilayergrasstruncexp}) is a non-trivial task that will require a multiscale analysis described in section~\ref{bilayerschemesec}.\par
\subsection{Symmetries of the system}
\label{bilayersymsec}
\indent In the following, we will rely heavily on the symmetries of the system, whose representation in terms of Grassmann variables is now discussed.\par
\bigskip
\indent A {\it symmetry} of the system is a map that leaves {\it both}
\point{\bf$2\pi/3$ rotation.} Let $T\mathbf k:=(k_0,e^{-i\frac{2\pi}3\sigma_2}k)$, $l_2:=(3/2,-\sqrt3/2)$ and $\mathcal T_{\mathbf k}:=e^{-i(l_2\cdot k)\sigma_3}$, the mapping
\indent In section~\ref{bilayertracespathintssec}, we showed how to express the free energy and the two-point Schwinger function as a formal series of truncated expectations~(\ref{bilayergrasstruncexp}). Controlling the convergence of this series is made difficult by the fact that the propagator $\hat g_{\leqslant M}$ is singular, and will require a finer analysis. In this section, we discuss which are the singularities of $\hat g_{\leqslant M}$ and how it behaves close to them, and identify three regimes in which the propagator behaves differently.\par
\subsection{Fermi points}
\indent The free propagator is singular if $k_0=0$ and $k$ is such that $H_0(k)$ is not invertible. The set of such $k$'s is called the {\it Fermi surface}.
In this subsection, we study the properties of this set. We recall the definition of $H_0$ in (\ref{bilayerhmat}),
$$ H_0( k):=
-\left(\begin{array}{*{4}{c}}
0&\gamma_1&0&\Omega^*(k)\\
\gamma_1&0&\Omega(k)&0\\
0&\Omega^*(k)&0&\gamma_3\Omega(k)e^{3ik_x}\\
\Omega(k)&0&\gamma_3\Omega^*(k)e^{-3ik_x}&0
\end{array}\right)$$
so that, using corollary~\ref{bilayerAnokzprop} (see appendix~\ref{bilayerinversapp}),
It is then straightforward to compute the solutions of $\det H_0(k)=0$ (see appendix~\ref{bilayerfermiapp} for details): we find that as long as $0<\gamma_1\gamma_3<2$, there are 8 Fermi points:
The points $p_{F,j}^\omega$ for $j=1,2,3$ are labeled as per figure~\ref{bilayerfigferm}.\par
\subsection{Behavior around the Fermi points}
\indent In this section, we compute the dominating behavior of $\hat g(\mathbf k)$ close to its singularities, that is close to $\mathbf p_{F,j}^\omega:=(0,p_{F,j}^\omega)$. We recall that $\hat A(\mathbf k):=(-ik_0\mathds1+H_0(k))$ and $\hat g(\mathbf k)=\chi_0(2^{-M}|k_0|)\hat A^{-1}(\mathbf k)$.
\bigskip
\point{\bf First regime.} We define $k':=k-p_{F,0}^\omega=(k'_x,k'_y)$, $\mathbf k':=(k_0,k')$. We have
so that, by using (\ref{bilayerdetAexpr}) with $(\mathfrak a,\mathfrak b,\mathfrak c,\mathfrak x,\mathfrak z)=-(\gamma_1,\Omega(k),\gamma_3\Omega(k)e^{3ik_x}, k_0,k_0)$,
for suitable constants $\kappa_1, \bar\kappa_0>0$, then the remainder term in (\ref{bilayerdetao}) is smaller than the explicit term, so that~(\ref{bilayerdetao}) is adequate in this regime, which we call the ``first regime''.
\indent We now compute the dominating part of $\hat A^{-1}$ in this regime. The computation is carried out in the following way: we neglect terms of order $\gamma_1$, $\gamma_3$ and $|k'|^2$ in $\hat A$, invert the resulting matrix using~(\ref{bilayerinvAexpr}), prove that this inverse is bounded by $(\mathrm{const}.)\ \|\mathbf k'\|_{\mathrm{I}}^{-1}$, and deduce a bound on the error terms. We thus find
Note that, recalling that the basis in which we wrote $A^{-1}$ is $\{a,\tilde b,\tilde a,b\}$, each graphene layer is decoupled from the other in the dominating part of~(\ref{bilayerfreepropzo}).\par
\bigskip
\point{\bf Ultraviolet regime.} The regime in which $\|\mathbf k'\|_{\mathrm{I}}\geqslant\bar\kappa_0$ for both $\omega=\pm$, and is called the {\it ultraviolet} regime.
For such $\mathbf k'=:\mathbf k-\mathbf p_{F,0}^\omega$, one easily checks that
\point{\bf Second regime.} We now go beyond the first regime: we assume that $\|\mathbf k'\|_{\mathrm I}\leqslant\kappa_1\epsilon$ and, using again (\ref{bilayereq:omega}) and (\ref{bilayerdetAexpr}), we write
for suitable constants $\kappa_2,\bar\kappa_1>0$, then the remainder in~(\ref{bilayerdetat}) is smaller than the explicit term, and we thus define the ``second regime'', for which~(\ref{bilayerdetat}) is appropriate.\par
\bigskip
\indent We now compute the dominating part of $\hat A^{-1}$ in this regime. To that end, we define the dominating part $\mathfrak L_{\mathrm{II}}\hat A$ of $\hat A$ by neglecting the terms of order $\gamma_3$ and $|k'|^2$ in $\hat A$ as well as the elements $\hat A_{aa}$ and $\hat A_{\tilde b\tilde b}$ (which are both equal to $-ik_0$), block-diagonalize it using proposition~\ref{bilayerblockdiagprop} (see appendix~\ref{bilayerdiagapp}) and invert it:
in which the bound should be understood as follows: the upper-left element in~(\ref{bilayerLA-1}) is the bound on the upper-left $2\times2$ block of $(\mathfrak L_{\mathrm{II}}\hat A)^{-1}$, and similarly for the upper-right, lower-left and lower-right. Using this bound in $$\hat A^{-1}(\mathbf k)=
{\bf Remark:} Using the explicit expression for $\hat A^{-1}(\mathbf p_{F,0}^\omega+\mathbf k')$ obtained by applying proposition~\ref{bilayermatinvprop} (see appendix~\ref{bilayerinversapp}), one can show that the error term on the right side
of (\ref{bilayerfreepropzt})
can be improved to $O(\epsilon^{-1}\|\mathbf k'\|_{\mathrm{II}},\epsilon^{3/2}\|\mathbf k'\|_{\mathrm{II}}^{-1/2}))$. Since we will not need this improved bound in the following,
we do not belabor further details.
\par
\bigskip
\point{\bf Intermediate regime.} In order to derive~(\ref{bilayerfreepropzt}), we assumed that $\|\mathbf k'\|_{\mathrm{II}}<\bar\kappa_1\epsilon$ with $\bar\kappa_1$ small enough. In the intermediate regime defined by $\bar\kappa_1\epsilon<\|\mathbf k'\|_{\mathrm{II}}$ and $\|\mathbf k'\|_{\mathrm I}<\kappa_1\epsilon$, we have that $\|\mathbf k'\|_{\mathrm I}\sim\|\mathbf k'\|_{\mathrm{II}}\sim\epsilon$ (given two positive functions $a(\epsilon)$ and $b(\epsilon)$, the symbol $a\sim b$ stands for $c b\leqslant a\leqslant C b$ for some universal constants $C>c>0$). Moreover,
which is identical to the bound at the end of the first regime and at the beginning of the second.\par
\bigskip
\point{\bf Third regime.} We now probe deeper, beyond the second regime, and assume that $\|\mathbf k'\|_{\mathrm{II}}\leqslant\kappa_2\epsilon^3$.
Since we will now investigate the regime in which $|k'|<(\mathrm{const}.)\ \epsilon^2$, we will need to consider all the Fermi points $p_{F,j}^\omega$ with $j\in\{0,1,2,3\}$.\par
\bigskip
\subpoint{\bf Around $\mathbf p_{F,0}^\omega$} We start with the neighborhood of $\mathbf p_{F,0}^\omega$:
The computation of the dominating part of $\hat A^{-1}$ in this regime around $\mathbf p_{F,0}^\omega$ is similar to that in the second regime, but for the fact that we only neglect the terms of order $|k'|^2$ in $\hat A$ as well as the elements $\hat A_{aa}$ and $\hat A_{\tilde b\tilde b}$. In addition, the terms that are of order $\epsilon^{-3}\|\mathbf k'\|_{\mathrm{III}}^2$ that come out of the computation of the dominating part of $\hat A$ in block-diagonal form are also put into the error term. We thus find
\subpoint{\bf Around $\mathbf p_{F,1}^\omega$} We now discuss the neighborhood of $\mathbf p_{F,1}^\omega$. We define $k'_{1}:=k-p_{F,1}^\omega=(k'_{1,x},k'_{1,y})$ and $\mathbf k'_{1}:=(k_0,k'_{1})$. We have
where $\bar\kappa_2$ can be assumed to be the same as in (\ref{bilayerthdef}) without loss of generality.
The dominating part of $\hat A^{-1}$ in this regime around $\mathbf p_{F,1}^\omega$ is similar to that around $\mathbf p_{F,0}^\omega$, except that we neglect the terms of order $\epsilon^2k'_{1}$ in $\hat A$ as well as the elements $\hat A_{aa}$ and $\hat A_{\tilde b\tilde b}$. As around $\mathbf p_{F,0}^\omega$, the terms of order $\epsilon^{-3}\|\mathbf k'_{1}\|_{\mathrm{III}}^2$ are put into the error term. We thus find
\subpoint{\bf Around $\mathbf p_{F,j}^\omega$} The behavior of $\hat g(\mathbf k)$ around $p_{F,j}^\omega$ for $j\in\{2,3\}$ can be deduced from~(\ref{bilayerfreepropoth}) by using the symmetry (\ref{bilayerarotation})
under $2\pi/3$ rotations: if we define $k'_{j}:=k-p_{F,j}^\omega=(k'_{j,x},k'_{j,y})$, $\mathbf k'_{j}:=(k_0,k'_{j})$ then, for $j=2,3$ and $\omega\pm$,
One can prove (see appendix~\ref{bilayerboundproptthapp}) that injecting~(\ref{bilayerinttth}) into~(\ref{bilayerdetatth}) implies that $|\det\hat A|\geqslant(\mathrm{const}.)\ \epsilon^8$, which in turn implies that
and identified an {\it ultraviolet} regime and three {\it infrared} regimes in which the free propagator $\hat g(\mathbf k)$ behaves differently:
\begin{itemize}
\item for $\|\mathbf k'\|_{\mathrm{I}}>\bar\kappa_0$, (\ref{bilayerboundfreepropuv}) holds.
\item for $\kappa_1\epsilon<\|\mathbf k'\|_{\mathrm I}<\bar\kappa_0$, (\ref{bilayerfreepropzo}) holds.
\item for $\kappa_2\epsilon^3<\|\mathbf k'\|_{\mathrm{II}}<\bar\kappa_1\epsilon$, (\ref{bilayerfreepropzt}) holds.
\item for $\|\mathbf k'\|_{\mathrm{III}}<\bar\kappa_2\epsilon^3$, (\ref{bilayerfreepropzth}) holds, for $\|\mathbf k'_{1}\|_{\mathrm{III}}<\bar\kappa_{2}\epsilon^3$, (\ref{bilayerfreepropoth}) holds,
and similarly for the $j=2,3$ cases.
\end{itemize}
\section{Multiscale integration scheme}
\label{bilayerschemesec}
\indent In this section, we describe the scheme that will be followed in order to compute the right side of~(\ref{bilayergrasstruncexp}). We will first define a {\it multiscale decomposition} in each regime which will play an essential role in showing that the formal series in~(\ref{bilayergrasstruncexp}) converges. In doing so, we will define {\it effective} interactions and propagators, which will be defined in $\mathbf k$-space, but since we wish to use the determinant expansion in lemma~\ref{bilayerdetexplemma}\ to compute and bound the effective truncated expectations, we will have to define the effective quantities in $\mathbf x$-space as well. Once this is done, we will write bounds for the propagator in terms of scales.\par
\subsection{Multiscale decomposition}
\label{bilayermultiscalesec}
\indent We will now discuss the scheme we will follow to compute the Gaussian Grassmann integrals in terms of which the free energy and two-point Schwinger function were expressed in~(\ref{bilayerfreeengrass}) and~(\ref{bilayerschwinform}). The main idea is to decompose them into scales, and compute them one scale at a time. The result of the integration over one scale will then be considered as an {\it effective theory} for the remaining ones.\par
\bigskip
\indent Throughout this section, we will use a smooth cutoff function $\chi_0(\rho)$, which returns 1 for $\rho\leqslant1/3$ and 0 for $\rho\geqslant2/3$.\par
\point{\bf Ultraviolet regime.} Let $\bar{\mathfrak h}_0:=\lfloor\log_2(\bar\kappa_0)\rfloor$ (in which $\bar\kappa_0$ is the constant that appeared after~(\ref{bilayernormsdef}) which defines the inferior bound of the ultraviolet regime). For $h\in\{\bar{\mathfrak h}_0,\cdots,M\}$ and $h'\in\{\bar{\mathfrak h}_0+1,\cdots,M\}$, we define
\point{\bf First regime.} We now decompose the first regime into scales. The main difference with the ultraviolet regime is that we incorporate the quadratic part of the effective potential into the propagator at each step of the multiscale integration. This is necessary to get satisfactory bounds later on. The propagator will therefore be changed, or {\it dressed}, inductively at every scale, as discussed below.\par
\bigskip
\indent Let $\mathfrak h_1:=\lceil\log_2(\kappa_1\epsilon)\rceil$ (in which $\kappa_1$ is the constant that appears after~(\ref{bilayernormsdef}) which defines the inferior bound of the first regime), and $\|\cdot\|_{\mathrm I}$ be the norm defined in~(\ref{bilayernormsdef}). We define for $h\in\{\mathfrak h_1,\cdots,\bar{\mathfrak h}_0\}$ and $h'\in\{\mathfrak h_1+1,\cdots,\bar{\mathfrak h}_0\}$,
in which $\mathcal Q^{(h)}$ is quadratic in the $\psi$, $\bar{\mathcal V}^{(h)}$ is at least quartic and $\bar{\mathcal W}^{(h)}$ has no terms that are both quadratic in $\psi$ and constant in $\hat J_{\mathbf k,\underline\alpha}$; and $\bar{\mathcal E}^T_{h+1}$ is the truncated expectation defined from the Gaussian measure $P_{\hat{\bar g}_{h+1,+}}(d\psi_+^{(h+1)})P_{\hat{\bar g}_{h+1,-}}(d\psi_-^{(h+1)})$; in which $\hat{\bar g}_{h+1,\omega}$ is the {\it dressed propagator} and is defined as follows. Let $\hat W_2^{(h)}(\mathbf k)$ denote the {\it kernel} of $\mathcal Q^{(h)}$ i.e.
(remark: the $_\omega$ index in $\hat\psi_{\mathbf k,\omega,\alpha}^\pm$ is redundant since given $\mathbf k$, it is defined as the unique $\omega$ that is such that $\mathbf k\in\mathcal B_{\beta,L}^{(\leqslant h,\omega)}$; it will however be needed when defining the $\mathbf x$-space counterpart of $\hat\psi_{\mathbf k,\omega,\alpha}^\pm$ below). We define $\hat{\bar g}_{h,\omega}$ and $\hat{\bar g}_{\leqslant h,\omega}$ by induction: $\hat{\bar g}_{\leqslant\bar{\mathfrak h}_0,\omega}({\bf k}):=(\hat g^{-1}_{\leqslant\bar{\mathfrak h}_0,\omega}(\mathbf k)+\hat W^{(\bar{\mathfrak h}_0)}_{2}(\mathbf k))^{-1}$ and, for $h\in\{\mathfrak h_1+1,\ldots,\bar{\mathfrak h}_0\}$,
in which $f_{\leqslant h,\omega}^{-1}(\mathbf k)$ is equal to $1/f_{\leqslant h,\omega}(\mathbf k)$ if $f_{\leqslant h,\omega}(\mathbf k)\neq0$ and to $0$ if not.\par
\bigskip
\indent The dressed propagator is thus defined so that
in which $\bar P_{\leqslant h}\equiv P_{\hat{\bar g}_{\leqslant h,+}}(d\psi_+^{(\leqslant h)})P_{\hat{\bar g}_{\leqslant h,-}}(d\psi_-^{(\leqslant h)})$. Equation~(\ref{bilayerinddresspropo}) can be expanded into a more explicit form: for $h'\in\{\mathfrak h_1+1,\ldots,\bar{\mathfrak h}_0\}$ and $h\in\{\mathfrak h_1,\cdots,\bar{\mathfrak h}_0\}$,
where $\bar{\mathfrak h}_1:=\lfloor\log_2(\bar\kappa_1\epsilon)\rfloor$, from which we define $\hat{\bar g}_{\mathfrak h_1,\omega}(\mathbf k)$ and $\hat{\bar g}_{\leqslant\bar{\mathfrak h}_1,\omega}(\mathbf k)$ in the same way as in~(\ref{bilayerinddresspropexpo}) with
The analogue of (\ref{bilayerfreeengrassoh}) holds here as well.\par
\bigskip
\point{\bf Second regime.} We now define a multiscale decomposition for the integration in the second regime. Proceeding as we did in the first regime, we define $\mathfrak h_2:=\lceil\log_2(\kappa_2\epsilon^3)\rceil$, for $h\in\{\mathfrak h_2,\cdots,\bar{\mathfrak h}_1\}$ and $h'\in\{\mathfrak h_2+1,\cdots,\bar{\mathfrak h}_1\}$, we define
where $\bar{\mathfrak h}_2:=\lfloor\log_2(\bar\kappa_2\epsilon^3)\rfloor$ from which we define $\hat{\bar g}_{\mathfrak h_2,\omega}(\mathbf k)$ and $\hat{\bar g}_{\leqslant\bar{\mathfrak h}_2,\omega}(\mathbf k)$ in the same way as in~(\ref{bilayerinddresspropexpo}) with
The analogue of (\ref{bilayerfreeengrassoh}) holds here as well.\par
\bigskip
\point{\bf Third regime.} There is an extra subtlety in the third regime: we will see in section~\ref{bilayerthsec} that the singularities of the dressed propagator are slightly different from those of the bare (i.e. non-interacting) propagator: at scale $h$ the effective Fermi points $\mathbf p_{F,j}^{\omega}$ with $j=1,2,3$ are moved to $\tilde{\mathbf p}_{F,j}^{(\omega,h)}$, with
The central Fermi points, $j=0$, are left invariant by the interaction. For notational uniformity we set $\tilde{\mathbf p}_{F,0}^{(\omega,h)}\equiv\mathbf p_{F,0}^{\omega}$.
Keeping this in mind, we then proceed in a way reminiscent of the first and second regimes: let $\mathfrak h_\beta:=\lfloor\log_2(\pi/\beta)\rfloor$, for $h\in\{\mathfrak h_\beta,\cdots,\bar{\mathfrak h}_2\}$ and $h'\in\{\mathfrak h_\beta+1,\cdots,\bar{\mathfrak h}_2\}$, we define
\point{\bf Last scale.} Recalling that $|k_0|\geqslant\pi/\beta$, the smallest possible scale is $\mathfrak h_\beta:=\lfloor\log_2(\pi/\beta)\rfloor$. The last integration is therefore that on scale $h=\mathfrak h_\beta+1$, after which, we are left with
The subsequent sections are dedicated to the proof of the fact that both $F_{\mathfrak h_\beta}$ and $\mathcal W^{(\mathfrak h_\beta)}$ are analytic in $U$, uniformly in $L$, $\beta$ and $\epsilon$. We will do this by studying each regime, one at a time, performing a {\it tree expansion} in each of them in order to bound the terms of the series (see section~\ref{bilayertreeexpsec} and following).\par
\subsection{$\mathbf x$-space representation of the effective potentials}
\label{bilayerxspacesec}
\indent We will compute the truncated expectations arising in~(\ref{bilayereffpotuv}), (\ref{bilayereffpotWuv}), (\ref{bilayereffpotoh}) and~(\ref{bilayereffpotWoh}) using a determinant expansion (see lemma~\ref{bilayerdetexplemma}) which, as was mentioned above, is only useful if the propagator and effective potential are expressed in $\mathbf x$-space. We will discuss their definition in this section. We restrict our attention to the effective potentials $\mathcal V^{(h)}$ since, in order to compute the two-point Schwinger function in the regimes we are interested in, we will not need to express the kernels of $\mathcal W^{(h)}$ in $\mathbf x$-space.
\par
\bigskip
\point{\bf Ultraviolet regime.} We first discuss the ultraviolet regime, which differs from the others in that the propagator does not depend on the index $\omega$. We write $\mathcal V^{(h)}$ in terms of its {\it kernels} (anti-symmetric in the exchange of their indices), defined as
{\bf Remark}: From (\ref{bilayerhatWdefo}), $\hat W_{2l,\underline\alpha}^{(h)}(\underline{\mathbf k})$ is not defined for $\mathbf k_i\not\in\mathcal B_{\beta,L}^{(\leqslant h)}$, however, one can easily check that~(\ref{bilayerxspaceWuv}) holds for any extension of $\hat W_{2l,\underline\alpha}^{(h)}$ to $\mathcal B_{\beta,L}^{2l-1}$, thanks to the compact support properties of
$\psi^{(\leqslant h)}$ in momentum space. In order to get satisfactory bounds on $W_{2l,\underline\alpha}^{(h)}(\underline{\mathbf x})$, that is in order to avoid Gibbs phenomena, we define the extension of $\hat W_{2l,\underline\alpha}^{(h)}(\underline{\mathbf k})$ similarly to~(\ref{bilayerhatWdefo}) by relaxing the condition that $\psi^{(\leqslant h)}$ is supported on $\mathcal B_{\beta,L}^{(\leqslant h)}$ and iterating~(\ref{bilayereffpotuv}). In other words, we let $\hat W_{2l,\underline\alpha}^{(h)}(\underline{{\bf k}})$ for $\underline{{\bf k}}\in\mathcal B_{\beta,L}^{2l-1}$ be the kernels of $\mathcal V^{*(h)}$ defined inductively by
in which $\{\hat\Psi_{\mathbf k,\alpha}\}_{\mathbf k\in\mathcal B_{\beta,L},\alpha\in\mathcal A}$ is a collection of {\it external fields} (in reference to the fact that, contrary to $\psi^{(\leqslant h)}$, they have a non-compact support in momentum space). The use of this specific extension can be justified {\it ab-initio} by re-defining the cutoff function $\chi$ in such a way that its support is $\mathbb{R}$, e.g. using exponential tails that depend on a parameter $\epsilon_\chi$ in such a way that the support tends to be compact as $\epsilon_\chi$ goes to 0. Following this logic, we could first define $\hat W$ using the non-compactly supported cutoff function and then take the $\epsilon_\chi\to0$ limit, thus recovering (\ref{bilayerextendedVuv}). Such an approach is dicussed in~[\cite{bilayerBM02}]. From now on, with some abuse of notation, we shall identify $\mathcal V^{*(h)}$ with $\mathcal V^{(h)}$ and denote them by the same symbol $\mathcal V^{(h)}$, which is justified by the fact that their kernels are (or can be chosen, from what said above, to be) the same.\par
\bigskip
\point{\bf First and second regimes.} We now discuss the first and second regimes (the third regime is very slightly different in that the index $\omega$ is complemented by an extra index $j$ and the Fermi points are shifted). Similarly to~(\ref{bilayerhatWdefo}), we define the {\it kernels} of $\bar{\mathcal V}$:
where $\mathcal B_{\beta,L}^{(\leqslant h,\underline\omega)}=\mathcal B_{\beta,L}^{(\leqslant h,\omega_1)}\times\cdots\times\mathcal B_{\beta,L}^{(\leqslant h,\omega_{2l})}$. Note that the kernel $\hat W_{2l,\underline\alpha}^{(h)}$ is independent of $\underline\omega$, which can be easily proved using the symmetry $\omega_i\mapsto-\omega_i$. The $\mathbf x$-space expression for $\hat\psi_{\mathbf k,\alpha,\omega}^{(\leqslant h)\pm}$ is
{\bf Remark}: Unlike $\hat\psi_{\mathbf k,\alpha,\omega}$, the $_\omega$ index in $\psi_{\mathbf x,\alpha,\omega}^{(\leqslant h)\pm}$ is {\it not} redundant. Keeping track of this dependence is required to prove properties of $\hat W_{2l}(\mathbf k)$ and $\hat{\bar g}_h(\mathbf k)$ close to $\mathbf p_{F,0}^\omega$ while working in $\mathbf x$-space. Such considerations were first discussed in~[\cite{bilayerBG90}] in which $\psi_{\mathbf x,\alpha,\omega}$ were called {\it quasi-particle} fields.\par
\bigskip
\indent We then define the propagator in $\mathbf x$-space:
As in the ultraviolet, the definition of $\hat W_{2l,\underline\alpha}^{(h)}(\underline{\mathbf k})$ is extended to $\mathcal B_{\beta,L}^{2l-1}$ by defining it as the kernel of $\mathcal V^{*(h)}$:
in which $\{\hat\Psi_{\mathbf k,\alpha}\}_{\mathbf k\in\mathcal B_{\beta,L},\alpha\in\mathcal A}$ is a collection of {\it external fields}. The definition~(\ref{bilayerxspaceWo}) suggests a definition for $\bar A_{h,\omega}$ (see~(\ref{bilayerAdefo}) and~(\ref{bilayerAdeft})):
\point{\bf Third regime.} We now turn our attention to the third regime. As discussed in section~\ref{bilayermultiscalesec}, in addition to there being an extra index $j$, the Fermi points are also shifted in the third regime. The kernels of $\bar{\mathcal V}$ and $\mathcal Q$ are defined as in~(\ref{bilayerhatWdeft}), but with $\omega$ replaced by $(\omega,j)$. The $\mathbf x$-space representation of $\hat\psi_{\mathbf k,\alpha,\omega,j}^{(\leqslant h)\pm}$ is defined as
and the $\mathbf x$-space expression of the propagator and the kernels of $\bar{\mathcal V}$ and $\mathcal Q$ are defined by analogy with the first regime:
\indent Before moving along with the tree expansion, we first compute a bound on $\hat g_{h}$ in the different regimes, which will be used in the following.\par
\bigskip
\point{\bf Ultraviolet regime.} We first study the ultraviolet regime, i.e. $h\in\{1,\cdots,M\}$.\par
in which $\partial_{k_0}$ denotes the discrete derivative with respect to $k_0$ and, with a slightly abusive notation, $\partial_k$ the discrete derivative with respect to either $k_1$ or $k_2$. Indeed the derivatives over $k$ land on $ik_0\hat A^{-1}$, which does not change the previous estimate, and the derivatives over $k_0$ either land on $f_h$, $1/(ik_0)$, or $ik_0\hat A^{-1}$, which yields an extra $2^{-h}$ in the estimate.\par
\medskip
{\bf Remark}: The previous argument implicitly uses the Leibnitz rule, which must be used carefully since the derivatives are discrete. However, since the estimate is purely dimensional,
we can replace the discrete discrete derivative with a continuous one without changing the order of magnitude of the resulting bound.\par
\bigskip
\subpoint{\bf Configuration space bounds.} We now prove that the inverse Fourier transform of $\hat g_{h}$
where $\hat\Lambda_\infty=\{t_1 G_1+t_2 G_2: t_i\in[0,1)\}$ is the limit as $L\to\infty$ of $\hat\Lambda$, see (\ref{bilayerlae}) and following lines. The dimensional estimates one finds using this continuum approximation are the same as those using (\ref{bilayer5.41bis}) therefore, integrating (\ref{bilayer5.42bis}) 7 times by parts and using (\ref{bilayerestgkuv}) we find
in which we again used the slightly abusive notation of writing $\partial_{\mathbf k}$ to mean any derivative with respect to $k_0$, $k_1$ or $k_2$. Equation~(\ref{bilayerestgko}) then follows from similar considerations as those in the ultraviolet regime.\par
\bigskip
\subpoint{\bf Configuration space bounds.} We estimate the real-space counterpart of $\hat g_{h,\omega}$,
which follows from very similar considerations as the ultraviolet estimate.\par
\bigskip
\point{\bf Second regime.} We treat the second regime, i.e. $h\in\{\mathfrak h_2+1,\cdots,\bar{\mathfrak h}_1\}$ in a very similar way (we skip the intermediate regime which can be treated in the same way as either the first or second regimes):
\indent In this section, we shall define the Gallavotti-Nicol\`o tree expansion~[\cite{bilayerGN85}, \cite{bilayerGN85b}], and show how it can be used to compute bounds for the $\mathfrak e_h$, $\mathcal V^{(h)}$,
$\mathcal Q^{(h)}$ and $\bar{\mathcal V}^{(h)}$ defined above in~(\ref{bilayereffpotuv}) and (\ref{bilayereffpotoh}), using the estimates~(\ref{bilayerestguv}), (\ref{bilayerestgo}), (\ref{bilayerestgt}) and~(\ref{bilayerestgth}). We follow~[\cite{bilayerBG90}, \cite{bilayerGM01}, \cite{bilayerGM10}].
We conclude the section by showing how to compute the terms in $\bar{\mathcal W}^{(h)}$ that are quadratic in $\hat J_{\mathbf k,\underline\alpha}$ from $\mathcal V^{(h)}$ and $\hat{\bar g}_h$.\par
\bigskip
\indent The discussion in this section is meant to be somewhat general, in order to be applied to the ultraviolet, first, second and third regimes (except for lemma~\ref{bilayerpowercountinglemma} which does not apply to the ultraviolet regime).\par
\subsection{Gallavotti-Nicol\`o Tree expansion}
\indent In this section, we will define a tree expansion to re-express equations {\it of the type}
for $h\in\{h_2^*,\cdots,h_1^*-1\}$ (in the ultraviolet regime $h_2^*=\bar{\mathfrak h}_0$, $h_1^*=M$; in the first $h_2^*=\mathfrak h_1$, $h_1^*=\bar{\mathfrak h}_0$; in the second $h_2^*=\mathfrak h_2$, $h_1^*=\bar{\mathfrak h}_1$;
and in the third, $h_2^*=\mathfrak h_\beta$, $h_1^*=\bar{\mathfrak h}_2$), with
($q=1$ in the ultraviolet regime and $q=2$ in the first, second and third) in which $\underline\varpi$ and $\underline{\mathbf x}$ are shorthands for $(\varpi_1,\cdots,\varpi_{2l})$ and $(\mathbf x_1,\cdots,\mathbf x_{2l})$; $\varpi$ denotes a collection of indices: $(\alpha,\omega)$ in the first and second regimes, $(\alpha,\omega,j)$ in the third, and $(\alpha)$ in the ultraviolet; and $W_{2l,\underline\varpi}^{(h)}(\underline{\mathbf x})$ is a function that only depends on the differences $\mathbf x_i-\mathbf x_j$.
The propagator associated with $\mathcal E^T_{h+1}$ will be denoted $g_{(h+1),(\varpi,\varpi')}(\mathbf x-\mathbf x')$ and is to be interpreted as the dressed propagator $\bar g_{(h+1,\omega),(\alpha,\alpha')}$ in the first and second regimes,
and as $\bar g_{(h+1,\omega,j),(\alpha,\alpha')}$ in the third. Note in particular that in the first and second regimes the propagator is diagonal in the $\omega$ indices, and is diagonal in $(\omega,j)$ in the third. In all cases, we write
where $\mathbf p^{(h+1)}_{\varpi}$ should be interpreted as ${\bf0}$ in the ultraviolet regime, as $\mathbf p_{F,0}^\omega$ in the first and second, and as $\tilde{\mathbf p}_{F,j}^{(\omega,h+1)}$ in the third, see (\ref{bilayereq4.20bis}).
\par
\bigskip
{\bf Remark}: The usual way of computing expressions of the form~(\ref{bilayertreeindeq}) is to write the right side as a sum over Feynman diagrams. The tree expansion detailed below provides a way of identifying the sub-diagrams that scale in the same way (see the remark at the end of this section). In the proofs below, there will be no mention of Feynman diagrams, since a diagramatic expansion would yield insufficient bounds.\par
\bigskip
\indent We will now be a bit rough for a few sentences, in order to carry the main idea of the tree expansion across: equation~(\ref{bilayertreeindeq}) is an inductive equation for the $\mathcal V^{(h)}$, which we will pictorially think of as the {\it merging} of a selection of $N$ potentials $\mathcal V^{(h+1)}$ via a truncated expectation. If we iterate~(\ref{bilayertreeindeq}) all the way to scale $h^*_2$, then we get a set of {\it merges} that {\it fit} into each other, creating a tree structure. The sum over the choice of $N$'s at every step will be expressed as a sum over Gallavotti-Nicol\`o trees, which we will now define precisely.
Given a scale $h\in\{h^*_2,\cdots,h^*_1-1\}$
and an integer $N\geqslant1$, we define the set $\mathcal T_{N}^{(h)}$ of Gallavotti-Nicol\`o (GN) trees as a set of {\it labeled} rooted trees with $N$ leaves in the following way.
\begin{itemize}
\item We define the set of unlabeled trees inductively: we start with a {\it root}, that is connected to a node $v_0$ that we will call the {\it first node}
of the tree; every node is assigned an ordered set of child nodes. $v_0$ must have at least one child, while the other nodes may be childless.
We denote the parent-child partial ordering by $v'\prec v$ ($v'$ is the parent of $v$). The nodes that have no children are called {\it leaves} or {\it endpoints}. By convention, the root is not considered to be a node, but we will still call it the parent of $v_0$.
\item Each node is assigned a {\it scale label}$h'\in\{h+1,\cdots,h^*_1+1\}$ and the root is assigned the {\it scale label}$h$, in such a way that the children of the root or of a node on scale $h'$ are on scale $h'+1$ (keep in mind that it is possible for a node to have a single child).
\item The leaves whose scale is $\leqslant h_1^*$ are called {\it local}. The leaves on scale $h_1^*+1$ can either be {\it local} or {\it irrelevant} (see figure~\ref{bilayertreefig}).
\item Every local leaf must be preceeded by a {\it branching node}, i.e. a node with at least two children. In other words, every local leaf must have at least one sibling.
\item We denote the set of nodes of a tree $\tau$ by $\bar{\mathfrak V}(\tau)$, the set of nodes that are not leaves by $\mathfrak V(\tau)$ and the set of leaves by $\mathfrak E(\tau)$.
\end{itemize}
\bigskip
{\bf Remark}: Local leaves are called ``local'' because those nodes are usually applied a {\it localization} operation (see e.g. [\cite{bilayerBG95}]). In the present case, such a step is not needed, due to the super-renormalizable nature of the first and third regimes.\par
\caption{example of a tree on scale $h$ up to scale $h^*_1+1=h+5$ with 11 leaves, 5 of which are local and 6 irrelevant. Local leaves are represented as empty circles, whereas irrelevant leaves are represented as full circles.}
\label{bilayertreefig}\end{figure}
\indent Every node of a Gallavotti-Nicol\`o tree $\tau$ corresponds to a truncated expectation of effective potentials of the form~(\ref{bilayertreeindeq}). If one expands the product of factors of the form $(\psi_{\mathbf x,\varpi}^{(\leqslant h)\pm}+\psi_{\mathbf x,\varpi}^{(h+1)\pm})$ in every term in the right side of~(\ref{bilayertreeindeq}), then one finds a sum over {\it choices} between $\psi^{(\leqslant h)}$ and $\psi^{(h)}$ for every $(\mathbf x,\varpi,\pm)$. We will express this sum as a sum over a set of {\it external field labels} (corresponding to the labels of $\psi^{(\leqslant h)}$ which are called external because they can be factored out of the truncated expectation) defined in the following way. Given an integer $\ell_0\geqslant q$, whose purpose will become clear in lemma~\ref{bilayerpowercountinglemma}\ (we will choose $\ell_0$ to be $=1$ in the ultraviolet regime, and $=2,3,2$ in the first, second, third infrared regimes, respectively), a tree $\tau\in\mathcal T_N^{(h)}$ whose endpoints are denoted by $(v_1,\cdots,v_N)$, as well as a collection of integers $\underline l_\tau:=(l_{v_1},\cdots,l_{v_N})\in\mathbb N^N$ such that $l_{v_i}\geqslant q$ and, if $v_i$ is a local leaf, $l_{v_i}<\ell_{0}$ (in particular, if $\ell_0=q$ there are no local leaves), we introduce an ordered collection of {\it fields}, i.e. triplets
where $L:=l_{v_1}+\cdots+l_{v_N}$. We then define the set of {\it external field labels} of each endpoint $v_i$ as the following ordered collections of integers
We define the set $\mathcal P_{\tau,\underline l_\tau,\ell_0}$ of {\it external field labels} compatible with a tree $\tau\in\mathcal T_N^{(h)}$ as the set of all the collections $\mathbf P=\{P_v\}_{v\in\mathfrak V(\tau)}$ where $P_v$ are themselves collections of integers that satisfy the following constraints:
\begin{itemize}
\item For every $v\in\mathfrak V(\tau)$ whose children are $(v_1,\cdots,v_s)$, $P_v\subset P_{v_1}\cup\cdots\cup P_{v_s}$ in which, by convention, if $v_i$ is an endpoint then $P_{v_i}=I_{v_i}$; and the order of the elements of $P_v$ is that of $P_{v_1}$ through $P_{v_s}$ (in particular the integers coming from $P_{v_1}$ precede those from $P_{v_2}$ and so forth).
\item For all $v\in\mathfrak V(\tau)$, $P_v$ must contain as many even integers as odd ones (even integers correspond to fields with a $-$, and odd ones to a $+$).
\item If $v$ has more than one child, then $P_v\neq P_{v'}$ for all $v'\succ v$
\item For all $v\in\bar{\mathfrak V}(\tau)\setminus\{v_0\}$ which is not a local leaf, the cardinality of $P_v$ must satisfy $|P_v|\geqslant2\ell_0$.
\end{itemize}
Furthemore, given a node $v$ whose children are $(v_1,\cdots,v_s)$, we define $R_v:=\bigcup_{i=1}^sP_{v_i}\setminus P_v$.\par
\bigskip
\indent We associate a {\it value} to each node $v$ of such a tree in the following way. If $v$ is a leaf, then its value is
where $|P_v|$ denotes the cardinality of $P_v$, and $\underline\varpi_v$ and $\underline{\mathbf x}_v$ are the {\it field labels} (i.e. elements of $F$) specified by the indices in $P_v$. If $v$ is not a leaf and $R_v\neq\emptyset$, then its value is
where $\mathbf T(\mathbf R_v)$, $g_{(h_v),l}$, $dP_{T_v}(\mathbf t)$ and $G^{(T_v,h_v)}$ are defined as in lemma~\ref{bilayerdetexplemma}\ with $g$ replaced by $g_{h_v}$, and if the children of $v$ are denoted by $(v_1,\cdots,v_s)$, then $\mathbf R_v:=(P_{v_1}\setminus P_v,\ldots, P_{v_s}\setminus P_v)$. If $v$ is not a leaf and $R_v=\emptyset$, then it has exactly one child and we let its value be $\rho_v=1$.
\par
\theo{Lemma}\label{bilayertreexplemma}
Equation~(\ref{bilayertreeindeq}) can be re-written as
where $\underline l_\tau:=(l_{v_1},\cdots,l_{v_N})$ (see above),
$\underline\varpi_\tau$ and $\underline{\mathbf x}_\tau$ are the field labels in $F$,
$s_v$ is the number of children of $v$, $\rho_v$ was defined above in~(\ref{bilayervalend}) and~(\ref{bilayervalmid}), $v_0$ is the first node of $\tau$ and
where $\epsilon_i$ is the third component of the $i$-th triplet in $F$.
\endtheo
{\bf Remark}: The sum over $\mathbf P\in\mathcal P_{\tau,\underline l_\tau,\ell_0}$ is a sum over the assignement of $P_v$ for nodes that are not endpoints. The sets $I_v$ are not summed over, instead they are fixed by $\underline l_\tau$. Furthermore, if $\mathcal P_{\tau,\underline l_\tau,\ell_0}=\emptyset$ (e.g. if $\ell_0=q$ and $\tau$ contains local leaves), then the sum should be interpreted as 0.\par
\bigskip
By injecting~(\ref{bilayervalmid}) into~(\ref{bilayertreeexp}), we can re-write
where $\mathbf T(\tau)$ is the set of collections of $(T_v\in\mathbf T(\mathbf R_v))_{v\in\mathfrak V(\tau)}$. Moreover, while $\rho^{(T_v)}_v$ was defined in~(\ref{bilayervalmid}) if $v\in\mathfrak V(\tau)$, it stands for $\rho_v$ if $v\in\mathfrak E(\tau)$ (note that in this case $T_v=\emptyset$).\par
\bigskip
\indent\underline{Idea of the proof}: The proof of this lemma can easily be reconstructed from the schematic description below. We do not present it in full detail here because its
proof has already been discussed in several references, among which~[\cite{bilayerBG95}, \cite{bilayerGM01}, \cite{bilayerGi10}].\par
\bigskip
\indent The lemma follows from an induction on $h$, in which we write the truncated expectation in the right side of~(\ref{bilayertreeindeq}) as
which yields a sum over the choices between $\psi^{(\leqslant h)}$ and $\psi^{(h+1)}$, with each choice corresponding to an instance of $P_v$: each $\psi^{(\leqslant h)\epsilon}_{\mathbf x,\varpi}$ ``creates'' the element $(\mathbf x,\varpi,\epsilon)$ in $P_v$. The remaining truncated expectation is then computed by applying lemma~\ref{bilayerdetexplemma}. Finally, the $W_{2l_j,\underline\varpi_j}^{(h+1)}$ with $l_j<\ell_0$ are left as such, and yield a {\it local leaf} in the tree expansion, the others are expanded using the inductive hypothesis.\par
\bigskip
{\bf Remark}: For readers who are familiar with Feynman diagram expansions, it may be worth pointing out that a Gallavotti-Nicol\`o tree paired up with a set of external field labels $\mathbf P$
represents a class of labeled
Feynman diagrams (the labels being the scales attached to the lines, or equivalently to the propagators)
with similar scaling properties. In fact, given a labeled Feynman diagram, one defines a tree and a set of external field labels by the following procedure. For every $h$, we define the {\it clusters on scale $h$} as the connected components of the diagram one obtains by removing the lines with a scale label that is $< h$. We assign a node with scale label $h$ to every cluster on scale $h$. The set $P_v$ contains the indices of the legs of the Feynman diagram that exit the corresponding cluster. If a cluster on scale $h$ contains a cluster on scale $h+1$, then we draw a branch between the two corresponding nodes. See figure~\ref{bilayerfeyntreefig} for an example.\par
\indent Local leaves correspond to clusters that have {\it few} external legs. They are considered as ``black boxes'': the clusters on larger scales contained inside them are discarded.\par
\indent A more detailed discussion of this correspondence can be found in~[\cite{bilayerGM01}, section~5.2] among other references.
\caption{Example of a labeled Feynman diagram and its corresponding tree. Three clusters, denoted by $\mathbf v_1$, $\mathbf v_2$ and $\mathbf v_3$, on scale $h_1$, $h_2$ and $h_4$ respectively, are explicitely drawn as dotted ellipses. There are 4 more clusters (2 on scale $h_1$, 1 on scale $h_2$ and 1 on scale $h_3$) which are not represented. The scales are drawn in different colors (color online): {\color{\colorfi}red} for $h_5$, {\color{\colorf}orange} for $h_4$, {\color{\colorth}yellow} for $h_3$, {\color{\colort}green} for $h_2$ and {\color{\coloro}blue} for $h_1$.}
\label{bilayerfeyntreefig}\end{figure}
\subsection{Power counting lemma}
\label{bilayerpowercountingsec}
\indent We will now state and prove the {\it power counting lemma}, which is an important step in bounding the elements in the tree expansion~(\ref{bilayertreeexpT}) in the first, second and third regimes.\par
\bigskip
\indent In the following, we will use a slightly abusive notation: given $\underline{\mathbf x}=(\mathbf x_1,\ldots,\mathbf x_{n})$, we will write $\underline{\mathbf x}^m$ to mean ``any of the products of the following form''
$$x_{j_1,i_1}\cdots x_{j_m,i_m}$$
where $i_\nu\in\{0,1,2\}$ indexes the components of $\mathbf x$ and $j_\nu\in\{1,\cdots,n\}$ indexes the components of $\underline{\mathbf x}$. We will also denote the translate of $\underline{\mathbf x}$ by $\mathbf y$ by $\underline{\mathbf x}-\mathbf y\equiv(\mathbf x_1-\mathbf y,\ldots,\mathbf x_{n}-\mathbf y)$. Furthermore, given $\underline{\mathbf x}^m$, we define the vector $\underline m$ whose $i$-th component is the number of occurrences of $x_{\cdot,i}$ in the product $\underline{\mathbf x}^m$ (note that $m_0+m_1+m_2=m$).
\par
\bigskip
\indent The power counting lemma will be stated as an inequality on the so-called {\it beta function} of the renormalization group flow, defined as
In terms of the tree expansion~(\ref{bilayertreeexp}), $B_{2l}^{(h)}$ is the sum of the contributions to $W_{2l}^{(h)}$ whose field label assignment $\mathbf P$ is such that every node $v\in\mathfrak V(\tau)\setminus\{v_0\}$ that is connected to the root by a chain of nodes with only one child satisfies $|P_v|>2l$. We denote the set of such field label assignments by $\tilde{\mathcal P}_{\tau,\underline l_\tau,\ell_0}$ for any given $\tau$, $\underline l_\tau$ and $\ell_0$. In other words, $B_{2l}^{(h)}$ contains all the contributions that have at least one propagator on scale $h+1$. If $l<q$, then all the contributions have a propagator on scale $h+1$, so $B_{2l}=W_{2l}$.\par
\bigskip
\theo{Lemma}\label{bilayerpowercountinglemma}
Assume that the propagator $g_{(h),(\varpi,\varpi')}(\mathbf x-\mathbf x')$ can be written as in (\ref{bilayer6.3}).
where $C_1$, $C_2$ and $C_3$ are constants, independent of $c_g$, $c_k$, $C_g$, $C_G$ and $h$.
\endtheo
\bigskip
{\bf Remarks}: Here are a few comments about this lemma.
\begin{itemize}
\item Combining this lemma with~(\ref{bilayerbetafundef}) yields a bound on $W_{2l,\varpi}^{(h)}(\underline{\mathbf x})$. In particular, if $l\geqslant\ell_0$ and $h<h^*_1$, then
\item The lemma cannot be used in this form in the ultraviolet regime, since in that case the right side of
(\ref{bilayer6.ell0}) is infinite, because $c_k=c_g=1$.
In the ultraviolet we will need to re-organize the tree expansion, in order to derive convergent bounds on the series, as discussed in section~\ref{bilayeruvsec} below.
\item The lemma gives a bound on the $m$-th derivative of $\hat W^{(h)}_{2l,\underline\varpi}(\underline{\mathbf k})$, which we will need in order to write the dominating behavior of the two-point Schwinger function as stated in theorems~\ref{bilayertheoo}, \ref{bilayertheot}, \ref{bilayertheoth}; however, we will never need to take $m$ larger than $3$, which is important because the bound~(\ref{bilayerpowercountingl}),
if generalized to larger values of $m$, would diverge faster than $m!$ as $m\to\infty$.
\item Recall that the propagator $g_{h}$ appearing in the statement should be interpreted as
the dressed propagator $\bar g_{h}$ in the first, second and third regimes. Since $\bar g_{h}$ depends on $W_{2l,\underline\varpi}^{(h')}$ for $h'\geqslant h$, we will have to apply the lemma inductively, proving at each step that the dressed propagator satisfies the bounds~(\ref{bilayerassumprop}).
\item Similarly, the bounds~(\ref{bilayerassumW}) will have to be proved inductively.
\item In this lemma, the purpose of $\ell_0$, which up until now may have seemed like an arbitrary definition, is made clear. In fact, the condition that $\ell_0>c_k/(c_k-c_g)$ implies that $c_k-(c_k-c_g)|P_v|/2<0$, $\forall v\in\mathfrak V(\tau)\setminus\{v_0\}$. If this were not the case, then the weight of each tree $\tau$ could increase with the size of the tree, making the right side of~(\ref{bilayerpowercountingl}) divergent.
\item The combination $c_k-(c_k-c_g)|P_v|/2$ is called the {\it scaling dimension} of the cluster $v$. Under the assumptions of the lemma,
the scaling dimension is negative, $\forall v\in\mathfrak V(\tau)\setminus\{v_0\}$.
The clusters with non-negative scaling dimensions are necessarily leaves, and condition (\ref{bilayerassumW})
corresponds to the requirement that we can control the size of these dangerous clusters.
Essentially, what this lemma shows is that the only terms that are potentially problematic are those with non-negative scaling dimension.
This prompts the following definitions: a node with negative scaling dimension will be called {\it irrelevant}, one with vanishing scaling dimension {\it marginal} and one with positive
scaling dimension {\it relevant}.
\item We will show that in the first and third regimes $c_k=3$ and $c_g=1$, so that the scaling dimension is $3-|P_v|$. Therefore, the nodes with $|P_v|=2$ are relevant whereas all the others are irrelevant. In the second regime,
$c_k=2$ and $c_g=1$, so that the scaling dimension is $2-|P_v|/2$. Therefore, the nodes with $|P_v|=2$ are relevant, those with $|P_v|=4$ are marginal, and all other nodes are irrelevant.
\item The purpose of the factor $\mathfrak F_h(\underline m)$ is to take into account the dependence of the order of magnitude of the different components $k_0$, $k_1$ and $k_2$ in the different regimes. In other words, as was shown in~(\ref{bilayerestguv}), (\ref{bilayerestgo}), (\ref{bilayerestgt}) and~(\ref{bilayerestgth}), the effect of multiplying $g$ by $x_{j,i}$ depends on $i$, which is a fact the lemma must take into account.
\item The reason why we have stated this bound in $\mathbf x$-space is because of the estimate of $\det(G^{(h_v,T_v)})$ detailed below, which is very inefficient in $\mathbf k$-space.
\end{itemize}
\bigskip
\indent\underline{Proof}: The proof proceeds in five steps: first we estimate the determinant appearing in~(\ref{bilayervalmid}) using the Gram-Hadamard inequality; then we perform a change of variables in the integral over $\underline{\mathbf x}_\tau$ in the right side of~(\ref{bilayertreeexpT}) in order to re-express it as an integration on differences $\mathbf x_i-\mathbf x_j$; we then decompose $(\underline{\mathbf x}-\mathbf x_{2l})^m$; and then compute a bound, which we re-arrange; and finally we use a bound on the number of spanning trees $\mathbf T(\tau)$ to conclude the proof.\par
\bigskip
\point{\bf Gram bound.} We first estimate $|\det G^{(T_v,h_v)}|$.\par
\bigskip
\subpoint{\bf Gram-Hadamard inequality.} We shall make use of the Gram-Hadamard inequality, which states that the determinant of a matrix $M$ whose components are given by $M_{i,j}=\mathbf A_i\odot\mathbf B_j$ where $(\mathbf A_i)$ and $(\mathbf B_i)$ are vectors in some Hilbert space with scalar product $\odot$ (writing $M$ as a scalar product is called writing it in {\it Gram form}) can be bounded by
The proof of this inequality is based on applying a Gram-Schmidt process to turn $(\mathbf A_i)$ and $(\mathbf B_i)$ into orthonormal families, at which point the inequality follows trivially.
We recall that
$G^{(T_v,h_v)}$ is an $(n_v-(s_v-1))\times(n_v-(s_v-1))$ matrix in which $s_v$ denotes the number of children of $v$ and
if we denote the children of $v$ by $(v_1,\cdots,v_{s_v})$, then $n_v=|R_v|/2=(\sum_{i=1}^{s_v}|P_{v_i}|-|P_v|)/2$. Its components are of the form $\mathbf t_\ell g_{(h_v),\ell}$ (see lemma~\ref{bilayerdetexplemma}), with $\mathbf t_{(i,j)}=u_i\cdot u_j$ in which the $u_i$ are unit vectors.\par
\bigskip
\subpoint{\bf Gram form.} We now put $(g_{(h),(\alpha,\alpha')}(\mathbf x-\mathbf x'))_{(\mathbf x,\alpha),(\mathbf x',\alpha')}$ in Gram form by using the $\mathbf k$-space representation of $g_{h}$ in~(\ref{bilayer6.3}). Let $\mathcal H=\ell_2(\mathcal B_{\beta,L}\times\{a,\tilde b,\tilde a,b\})$ denote the Hilbert space of square summable sequences indexed by $(\mathbf k,\alpha)\in\mathcal B_{\beta,L}\times\{a,\tilde b,\tilde a,b\}$. For every $h\in\{h^*_2,\cdots,h^*_1-1\}$ and $(\mathbf x,\alpha)\in([0,\beta)\times\Lambda)\times\{a,\tilde b,\tilde a,b\}$, we define a pair of vectors $(\mathbf A_\alpha^{(h)}(\mathbf x),\mathbf B_\alpha^{(h)}(\mathbf x))\in\mathcal H^2$ by
where $\hat\lambda_{\alpha'}^{(h)}(\mathbf k)$ denotes the $\alpha$-th eigenvalue of $\sqrt{\hat g_h^\dagger(\mathbf k)\hat g_{h}(\mathbf k)}$ (i.e. the {\it singular values} of $\hat g_{h}(\mathbf k)$) and $\hat V^{(h)}(\mathbf k)$ and $\hat U^{(h)}(\mathbf k)$ are unitary matrices that are such that
where $\odot$ denotes the scalar product on $\mathcal H$. Furthermore, recalling that $|\hat g_{h}(\mathbf k)|$ is the operator norm of $\hat g_{h}(\mathbf k)$, so that $|\hat g_{h}(\mathbf k)|=\mathrm{max}\ \mathrm{spec}\sqrt{\hat g_h^\dagger(\mathbf k)\hat g_{h}(\mathbf k)}$, we have
\point{\bf Change of variables.} We change variables in the integration over $\underline{\mathbf x}_\tau$. For every $v\in\bar{\mathfrak V}(\tau)$, let $P_{v}=:(j_1^{(v)},\cdots,j_{2l_{v}}^{(v)})$. We recall that a spanning tree $T\in\mathbf T(\tau)$ is a diagram connecting the fields specified by the $I_v$'s for $v\in\mathfrak E(\tau)$: more precisely, if we draw a vertex for each $v\in\mathfrak E(\tau)$ with $|I_v|$ half-lines attached to it that are labeled by the elements of $I_v$, then $T\in\mathbf T(\tau)$ is a pairing of some of the half-lines that results in a tree called a {\it spanning tree} (not to be confused with a Gallavotti-Nicol\`o tree) (for an example, see figure~\ref{bilayerspanningfig}). The vertex $v_r$ of a spanning tree that contains the {\it last external field}, i.e. that is such that $j_{2l_{v_0}}^{(v_0)}\in I_{v_r}$, is defined as its root, which allows us to unambiguously define a parent-child partial order, so that we can dress each branch with an arrow that is directed away from the root. For every $v\in\mathfrak E(\tau)$ that is not the root of $T$, we define $J^{(v)}\in I_v$ as the index of the field in which $T${\it enters}, i.e. the index of the half-line of $T$ with an arrow pointing towards $v$. We also define $J^{(v_r)}:=j_{2l_{v_0}}^{(v_0)}$.
We have thus defined $(\sum_{v\in\mathfrak E(\tau)}|I_v|)-1$ variables $\mathbf z$, so that we are left with $\mathbf x_{J^{(v_r)}}$, which we call $\mathbf x_0$. It follows directly from the definitions that
the change of variables from $\underline{\mathbf x}_\tau$ to $\{\mathbf x_0,\{\mathbf z_{j}\}_{j\in{\mathcal I}_{\tau}\setminus\{J^{(v_r)}\}}\}$, where ${\mathcal I}_{\tau}=\bigcup_{v\in\mathfrak E(\tau)}I_v$, has Jacobian equal to $1$. \par
\bigskip
\begin{figure}
\hfil\includegraphics{Figs/spanningTree.pdf}\par
\caption{example of a spanning tree with $s_v=5$ and $|P_{v_1}|=|P_{v_2}|=|P_{v_3}|=|P_{v_4}|=4$, $|P_{v_5}|=6$; whose root is $v_2$.}
\label{bilayerspanningfig}\end{figure}
\point{\bf Decomposing $(\underline{\mathbf x}-\mathbf x_{2l})^m$} We now decompose the $(\underline{\mathbf x}-\mathbf x_{2l})^m$ factor in (\ref{bilayerpowercountingl}) in the following way (note that in terms of the indices in $P_{v_0}$, $\mathbf x_{2l}\equiv\mathbf x_{J^{(v_r)}}$):
$(\underline{\mathbf x}-\mathbf x_{2l})^m$ is a product of terms of the form $(x_{j,i}-x_{J^{(v_r)},i})$ which we rewrite as a sum of $z_{j',i}$'s for $v\in\mathfrak E(\tau)${\it on the path from $J^{(v_r)}$ to $j$}, a concept we will now make more precise. $j$ and $J^{(v_r)}$ are in $I_{v(j)}$ and $I_{v_r}$ respectively, where $v(j)$ is the unique node in $\mathfrak E(\tau)$ such that $j\in I_{v(j)}$. There exists a unique sequence of lines of $T$ that links $v_r$ to $v(j)$, which we denote by $((j_1,j_1'),\cdots,(j_\rho,j_\rho'))$, the convention being that the line
$(j,j')$ is oriented from $j$ to $j'$. The {\it path from $J^{(v_r)}$ to $j$} is the sequence $\mathbf z_{j_1}, \mathbf z_{j_1'},\mathbf z_{j_2},\cdots$ and so forth, until $j$ is reached. We can therefore write
\point{\bf Bound in terms of number of spanning trees.} Let us now turn to the object of interest, namely the left side of (\ref{bilayerpowercountingl}). It follows from
(\ref{bilayerdefW}) and (\ref{bilayertreeexpT}) that
Therefore, using the bound~(\ref{bilayerboundGt}), the change of variables defined above and the decomposition of $(\underline{\mathbf x}-\mathbf x_{2l})^m$ described above, we find
(we recall that by definition, if $v\in\mathfrak E(\tau)$, $I_v=P_v$ and $|I_v|=2l_v$) in which we inject~(\ref{bilayerassumprop}) and~(\ref{bilayerassumW}) to find
so that by injecting~(\ref{bilayerboundnrspan}) into~(\ref{bilayerboundt}), we find~(\ref{bilayerpowercountingl}), with $C_2=c_2^2c_3$ and $C_3=c_3^{-1}$.\penalty10000\hfill\penalty10000$\square$\par
\subsection{Schwinger function from the effective potential}
\label{bilayerschwinsec}
\indent In this section we show how to compute $\bar{\mathcal W}^{(h)}$ in a similarly general setting as above: consider
for $h\in\{h_2^*,\cdots,h_1^*-1\}$. This discussion will not be used in the ultraviolet regime, so we can safely discard the cases in which the propagator is not renormalized. Unlike~(\ref{bilayertreeindeq}), it is necessary to separate the $\alpha$ indices from the $(\omega,j)$ indices, so we write the propagator of $\mathcal E_{h+1}^T$ as $g_{(h+1,\varpi),(\alpha,\alpha')}$ where $\varpi$ stands for $\omega$ in the first and second regimes, and $(\omega,j)$ in the third. \par
\bigskip
\indent We now rewrite the terms in the right side of~(\ref{bilayerWindeq}) in terms of the effective potential $\mathcal V^{(h)}$. Let
Note that the terms in $\mathcal X^{(h)}$ are either linear or quadratic in $\hat J_{\mathbf k,\underline\alpha}$, simply because the two $J$ variables we have at our disposal, $\hat J_{{\bf k},\alpha_1}^+,\hat J^-_{{\bf k},\alpha_2}$, are Grassmann variables and square to zero. We define the functional derivative of $\mathcal V^{(h)}$ with respect to $\hat\psi^{\pm}_{\mathbf k,\alpha}$:
for some $s_{\alpha_1,\alpha_2}^{(h^*_1)}(\mathbf k)$, $q_{\alpha,\alpha'}^{\pm(h^*_1)}(\mathbf k)$, $\bar G_{\alpha,\alpha'}^{(h^*_1)}({\bf k})$. Then (\ref{bilayerexprdomorderren})
holds for $h\in\{h^*_2,\ldots,h^*_1-1\}$ as well, with
in which the sums over $\alpha$ are sums over the indices of $g$.
\endtheo
\bigskip
\indent The (inductive) proof of lemma~\ref{bilayerschwinlemma}\ is straightforward, although it requires some bookkeeping, and is left to the reader.\par
\bigskip
{\bf Remark}: It follows from~(\ref{bilayerfreeengrassend}) and~(\ref{bilayerschwinform}) that the two-point Schwinger function $s_2(\mathbf k)$ is given by $s_2(\mathbf k)=s^{(\mathfrak h_\beta)}(\mathbf k)$ (indeed, once all of the fields have been integrated, $\mathcal X^{(\mathfrak h_\beta)}=\hat J_{\mathbf k}^+s^{(h)}(\mathbf k)\hat J_{\mathbf k}^-$). Therefore~(\ref{bilayerindsdefren}) is an inductive formula for the two-point Schwinger function.\par
\section{Ultraviolet integration}
\label{bilayeruvsec}
\indent We now detail the integration over the ultraviolet regime. We start from the tree expansion in the general form discussed in section~\ref{bilayertreeexpsec}, with $q=\ell_0=1$
and $h^*_1=M$; note that by construction these trees have no local leaves.
As mentioned in the first remark after lemma~\ref{bilayerpowercountinglemma}, we cannot apply that lemma to prove convergence of the tree expansion: however,
as we shall see in a moment, a simple re-organization of it will allow to derive uniformly convergent bounds.
We recall the estimates~(\ref{bilayerestguv}) and~(\ref{bilayergrambounduv}) of $\hat g_{h}$ in the ultraviolet regime: for $m_0+m_k\leqslant3$
\indent We now move on to the power counting estimate. The first remark to be made is that the values of the leaves have a much better dimensional estimate than the one assumed in lemma~\ref{bilayerpowercountinglemma}. In fact,
the value of any leaf, called $W^{(M)}_{4,\underline\alpha}
\point{\bf Resumming trivial branches.} Next, we re-sum the branches of Gallavotti-Nicol\`o trees that are only followed by a single endpoint: the naive dimensional bound
on the value of these branches tends to diverge logarithmically as $M\to\infty$, but one can easily exhibit a cancellation that improves their estimate, as explained below. Consider a tree $\tau$ made of a single branch, with a root on scale $h$ and a single leaf on scale $M+1$ with value $W^{(M)}_4$. The 4-field kernel
associated with such a tree is $K_{4,\underline\alpha}^{(h)}(\underline{\mathbf x}):=W^{(M)}_{4,\underline\alpha}(\underline{\mathbf x})$.
The 2-field kernel associated with $\tau$, once summed over the choices of $P_v$ and over the field labels it indexes for $h+1<h_v\leqslant M$, keeping $P_{v_0}$ and its field labels fixed,
If one were to bound the right side of~(\ref{bilayerWtline}) term by term in the sum over $h'$ using the dimensional estimates on the propagator (see ~(\ref{bilayerestguv}) and following),
one would find a {\it logarithmic} divergence for $\int d\mathbf x|K_{2,(\alpha,\alpha')}^{(h)}(\mathbf x)|$, i.e. a bound proportional to $M-h$. However, the right side of~(\ref{bilayerWtline}) depends on propagators evaluated at $x_0=0$ (because $w(\mathbf x)$ is proportional to $\delta(x_0)$), so we can use an improved bound on the propagator $g_{h'}$:
the dominant terms in $\hat g_{h}(\mathbf k)$ are odd in $k_0$, so they cancel when considering
for some constant $\mathfrak C_4$. We then re-organize the right side of~(\ref{bilayertreeexp}) by:
\begin{enumerate}
\item summing over the set of {\it contracted} trees $\tilde{\mathcal T}_N^{(h)}$, which is defined like $\mathcal T_N^{(h)}$ but for the fact that every node $v\succ v_0$ that is not an endpoint must have at least two endpoints following it, and the endpoints can be on any scale in $[h+2, M+1]$;
\item re-defining the value of the endpoints to be $\tilde\rho_v=K^{(h_v-1)}_{2l_v}$, with $l_v=1,2$.
\end{enumerate}
\bigskip
\point{\bf Contracted tree expansion.} We can now estimate the ``contracted tree'' expansion, by
repeating the steps of the proof of lemma~\ref{bilayerpowercountinglemma}, thus finding
for two constants $c_1$ and $c_2$ in which the sum over $\underline l_\tau$ is a sum over the $l_v\in\{1,2\}$. It then follows from the following equation
We proceed by induction: if $v_0$ denotes the first node of $\tau$ (i.e. the node immediately following the root), $(v_1,\cdots,v_s)$ its children, and $(\tau_1,\cdots,\tau_s)$ the sub-trees with first node $(v_1,\cdots,v_s)$, then
We can re-express the sum over $\tau$ as a sum over trees with no scale labels that are such that each node that is not a leaf has at least two children, and a sum over scale labels:
in which $\mathcal T_N^*$ denotes the set of unlabeled rooted trees with $N$ endpoints and $\mathbf H_h(\tau^*)$ denotes the set of scale labels {\it compatible} with $\tau^*$. Therefore
for some constant $C_{T,1}$, in which we used the fact that $|\mathfrak V(\tau^*)|\leqslant N$. Furthermore, it is a well known fact that $\sum_{\tau^*}1\leqslant4^N$ (see e.g. [\cite{bilayerGM01}, lemma~A.1], the proof is based on constructing an injective map to the set of random walks with $2N$ steps: given a tree, consider a walker that starts at the root, and then travels over branches towards the right until it reaches a leaf, and then travels left until it can go right again on a different branch). Therefore
\subpoint{\bf Conclusion of the proof.} Therefore, by combining~(\ref{bilayerboundP}) and~(\ref{bilayerboundT}) with the trivial estimate $\sum_{\underline l_\tau}1\leqslant2^N$, we find
\begin{equation}
\mathfrak T_N\leqslant (\mathrm{const.})^N.
\label{bilayerboundtreeuv}\end{equation}
Equation~(\ref{bilayerboundtreeuv}) trivially holds for $N=1$ as well. If we inject~(\ref{bilayerboundtreeuv}) into~(\ref{bilayerboundprerefuv}) we get:
\indent We now study the first regime. We consider the tree expansion in the general form discussed in section~\ref{bilayertreeexpsec}, with $h^*_1=\bar{\mathfrak h}_0$ and $q=\ell_0=2$, so that there are no local leaves, i.e., all leaves are irrelevant, on scale $\bar{\mathfrak h}_0+1$. Recall that the truncated expectation $\mathcal E^T_{h+1}$ in the right side of (\ref{bilayertreeindeq}) is with respect to the dressed propagator $\bar g_{h+1}$ in (\ref{bilayerinddresspropexpo}), so that (\ref{bilayertreeindeq}) is to be interpreted as (\ref{bilayereffpotoh}). A non trivial aspect of the analysis is that we do not have a priori bounds on the dressed propagator, but just on the ``bare'' one
$g_{h,\omega}$, see (\ref{bilayerestgo}), (\ref{bilayergramboundo}). The goal is to show inductively on $h$ that the same qualitative bounds are valid for $\bar g_{h,\omega}$, namely
\point{\bf Bounding the sum on trees.} First, we notice that the sum over $\underline l_\tau$ can be written as a sum over $l_1,\cdots,l_N$, so that it can be moved before $\sum_\tau$.
and since $\ell_0=2$, $|P_v|\geqslant4$ for every node $v$ that is not the first node or a leaf, so that $3-|P_v|\leqslant-|P_v|/4$. Now, if $N\geqslant2$, then given $\tau$, let $v^*_\tau$ be the node with at least two children that is closest to the root, and $h_\tau^*$ its scale. Using the fact that $|P_v|\geqslant2l+2$ for all $v\prec v_\tau^*$ and the fact that $\tau$ has at least two branches on scales $\geqslant h_\tau^*$, we have
Consider now the case with $l=1$. If $N=1$ then the sum over $\tau$ is trivial, i.e., $\mathcal T_1^{(h)}$ consists of a single element, and the sum over ${\mathbf P}$ can be bounded as
where $v'$ is, if it exists, the leftmost node such that $|P_v|>4$, in which case $4-|P_v|\leqslant-|P_v|/3$; otherwise, we interpret the product over $v$ as 1. Proceeding as in the case $l\geqslant2$, we bound the right side of (\ref{bilayer9.16}) by
\begin{equation} 2^h C^{l_1}\sum_{h_{v'}=h+2}^{0} 2^{2\theta h_{v'}}\leqslant 2^h C' C^{l_1}.\label{bilayer9.r8}\end{equation}
If $N\geqslant2$, then we denote by $\tau^*$ the subtree with $v^*_\tau:v^*$ as first node, and $\tau'_{}$ the linear tree with root on scale $h$ and the endpoint on scale $h^*$, so that $\tau\tau'\cup\tau^*$.
The sum in the last parentheses can be bounded as in the case $l\geqslant2$, yielding $C^{\sum_il_i}2^{2\theta h^*}$. The remaining sum can be bounded as in
(\ref{bilayer9.16})-(\ref{bilayer9.r8}) so that, in conclusion,
\pspoint{\bf$l=1$.} Therefore, if $l=1$, (\ref{bilayerpowercountingol}) becomes (we recall that $q=2>1$ so that $B_2=W_2$, see~(\ref{bilayerbetafundef}))
whose inverse Fourier transform is denoted by $\bar A^{(h,\omega)}$. Note that (\ref{bilayerpco}) on its own does not suffice to prove~(\ref{bilayerassumdpropo}) because the bound on
that it would yield is $(\mathrm{const}.)\ |U|$ whereas on the support of $f_{h,\omega}$, $\hat g^{-1}\sim2^h$, which we cannot
compare with $|U|$ unless we impose an $\epsilon$-dependent smallness condition on $U$, which we do not want.
In addition, even if~(\ref{bilayerWing}) were bounded by $(\mathrm{const}.)\ |U|2^h$, we would have to face an extra difficulty to bound $\bar g$ in $\mathbf x$-space: indeed, the naive approach we have used so far (see e.g.~(\ref{bilayerestguv})) to bound
would require a bound on $\partial^n_{\bf k}\hat{\bar g}_{h,\omega}({\bf k})$ with $n>m+3$ (we recall that the integral over $\mathbf x$ is $3$-dimensional), which would in turn require an estimate on
for $h'>h$, which we do not have (and if we tried to prove it by induction, we would immediately find that the estimate would be required to be uniform in $n$, which we cannot expect to be true).\par
\bigskip
\indent In order to overcome both of the previously mentioned difficulties, we will expand $\hat W_2^{(h')}$ at first order around $\mathbf p_{F,0}^\omega$. The contributions up to first order in ${\bf k}-\mathbf p_{F,0}^\omega$ will be called the {\it local part} of $\hat W_2^{(h')}$. Through symmetry considerations, we will write the local part in terms of constants which we can control, and then use~(\ref{bilayerpco}) to bound the remainder. In particular, we will prove that $\hat W_2^{(h)}(\mathbf p_{F,0}^\omega)=0$ from which we will deduce an improved bound for~(\ref{bilayerWing}). Furthermore, since the $\mathbf k$-dependance of the local part is explicit, we will be able to bound all of its derivatives and bound $\bar g$ in $\mathbf x$-space.\par
\bigskip
\point{\bf Local and irrelevant contributions.} We define a {\it localization} operator:
where $\delta(\mathbf x):=\delta(x_0)\delta_{x_1,0}\delta_{x_2,0}$ and in the second term, as usual, the derivative with respect to $x_1$ and $x_2$ is discrete; as well as
the corresponding {\it irrelevator}:
\begin{equation}
\mathcal R:=\mathds1-\mathcal L.
\label{bilayerRodef}\end{equation}
The action of $\mathcal L$ on functions on $\mathbf k$-space is
(up to finite size corrections coming from the fact that $L<\infty$ that do not change the dimensional estimates computed in this section and that we neglect for the sake simplicity)
{\bf Remark}: The reason why $\mathcal L$ is defined as the first order Taylor expansion, is that its role is to separate the {\it relevant} and {\it marginal} parts of $\hat W_2^{(h')}$ from the {\it irrelevant} ones. Indeed, we recall the definition of the {\it scaling dimension} associated to a kernel $\hat W_2^{(h')}$ (see one of the remarks after lemma~\ref{bilayerpowercountinglemma})
$$c_k-(c_k-c_g)=1$$
which, roughly, means that $\hat W_2^{(h')}$ is bounded by $2^{(c_k-(c_k-c_g))h'}=2^{h'}$. As was remarked above, this bound is insufficient since it does not constrain $\sum_{h'\geqslant h}\hat W_2^{(h')}$ to be smaller than $2^h\sim\hat g^{-1}$. Note that, while $\hat W_2^{(h')}({\bf k})$ is bounded by $2^{h'}$, irrespective of ${\bf k}$,
$(\mathbf k-\mathbf p_{F,0}^\omega)\cdot\partial_{\mathbf k}\hat W_2^{(h')}({\bf k})$ has an improved dimensional bound, proportional to $2^{h-h'}2^{h'}$, where $2^h\sim |{\bf k}-\mathbf p_{F,0}^\omega|$;
in this sense, we can think of the operator $(\mathbf k-\mathbf p_{F,0}^\omega)\cdot\partial_{\mathbf k}$ as scaling like $2^{h-h'}$. Therefore, the remainder of the first order Taylor expansion is bounded by $2^{2(h-h')}2^{h'}=2^{2h-h'}$ and thereby has a scaling dimension of $-1$ (with respect to $h'$). Thus, by defining $\mathcal L$ as the first order Taylor expansion, we take the focus away from the remainder, which can be bounded easily because it is irrelevant (i.e., it has negative scaling dimension), and concentrate our attention on the relevant and marginal contributions of $\hat W_2^{(h')}$. See~[\cite{bilayerBG95}, chapter~8] for details.\par
(we can put in the $(f_{\leqslant h+1,\omega}({\bf k})-f_{\leqslant h-2,\omega}({\bf k}))$ factor for free because of the initial $f_{h,\omega}(\mathbf k)$).\par
\bigskip
\point{\bf Local part.} We first compute $\mathcal L\hat{\bar A}_{h,\omega}(\mathbf k)$.\par
\bigskip
\subpoint{\bf Non-interacting components.} As a first step, we write the local part of the free inverse propagator as
\subpoint{\bf Interacting components.} We now turn to the terms coming from the interaction. We first note that $\mathcal V^{(h')}$ satisfies the same symmetries as the {\it initial} potential $\mathcal V$ (\ref{bilayer2.3a}), listed in section~\ref{bilayersymsec}. Indeed, $\mathcal V^{(h')}$ is a function of $\mathcal V$ and a quantity similar to~(\ref{bilayerhzdef}) but with an extra cutoff function, which satisfies the symmetries~(\ref{bilayeraglobaluo}) through~(\ref{bilayerainversiont}). Therefore
This imposes a number of restrictions on $\mathcal L\hat W_2^{(h')}$: indeed, it follows from propositions~\ref{bilayersymprop} and~\ref{bilayersymfourprop} (see appendix~\ref{bilayerconstWtapp}) that, since
with $(\tilde\zeta_{h'}, \tilde\mu_{h'}, \tilde\nu_{h'},\zeta_{h'},\nu_{h'})\in\mathbb{R}^5$. Furthermore, it follows from (\ref{bilayerpco}) that if $h'\leqslant\bar{\mathfrak h}_0$, then
\subpoint{\bf Dominant part of $\mathcal L\hat{\bar A}_{h,\omega}$} Furthermore, we notice that the terms proportional to $\tilde m_h$ or $\tilde v_h$ are sub-dominant:
Before bounding $\sigma_1$, we compute the inverse of~(\ref{bilayerLAodom}): using proposition~\ref{bilayermatinvprop} (see appendix~\ref{bilayerinversapp}), we find that if we define
\point{\bf Irrelevant part.} We now focus on the remainder term $\mathcal R\hat{\bar A}_{h,\omega}(\mathbf k)\,\mathbb L\hat{\bar{\mathfrak g}}_{[h],\omega}(\mathbf k)$ in (\ref{bilayergseplino}), which we now show to be small. The estimates are carried out in $\mathbf x$ space.
{\bf Remark}: The estimate (\ref{bilayerineqRtg}), as compared to the dimensional estimate without $\mathcal R$, is
better by a factor $2^{2(h-h')}$. This is a fairly general argument, and could be repeated with $\mathbb L\bar{\mathfrak g}_{[h],\omega}$ replaced by the inverse Fourier transform of $f_{h,\omega}$:
\point{\bf Conclusion of the proof.} The proof of the first of~(\ref{bilayerassumdpropo}) is then completed by injecting~(\ref{bilayerLAodom}), (\ref{bilayerboundsigmaoo}), (\ref{bilayersepLAodom}), (\ref{bilayerboundzvtop}) and (\ref{bilayerineqRAo}) into~(\ref{bilayergseplino}) and its corresponding $\mathbf x$-space representation. The second of~(\ref{bilayerassumdpropo}) follows from the first.\par
\subsection{Two-point Schwinger function}
\label{bilayerschwinosec}
\indent We now compute the dominant part of the two-point Schwinger function for $\mathbf k${\it well inside} the first regime, i.e.
so that if $h\not\in\{h_{\bf k},h_{{\bf k}}-1\}$, then $f_{h,\omega}({\bf k})=0$.\par
\bigskip
\point{\bf Schwinger function in terms of dressed propagators.} Since $h_{\mathbf k}\leqslant\bar{\mathfrak h}_0$, the source term $\hat J_{\mathbf k,\alpha_1}^+\hat\psi_{\mathbf k,\alpha_1}^-+\hat\psi_{\mathbf k,\alpha_2}^+\hat J_{\mathbf k,\alpha_2}^-$ is constant with respect to the ultraviolet fields, so that the effective source term $\mathcal X^{(h)}$ defined in (\ref{bilayerQdef}) is given, for $h=\bar{\mathfrak h}_0$, by
Therefore, we can compute $\mathcal X^{(h)}$ for $h\in\{\mathfrak h_1,\cdots,\bar{\mathfrak h}_0-1\}$ inductively using lemma~\ref{bilayerschwinlemma}. By using the fact that the support of $\hat{\bar g}_{h,\omega}$ is compact, we find that $\bar G^{(h)}({\bf k})$ no longer depends on $h$ as soon as $h\leqslant h_{\mathbf k}-2$, i.e.,
where all the functions in the right side are evaluated at ${\bf k}$. Note that in order to get the two-point function defined in section~\ref{bilayerintroduction}, we must integrate down to $h=\mathfrak h_\beta$: $s_2({\bf k})=s^{(\mathfrak h_\beta)}({\bf k})$. This requires an analysis of the second and third regimes (see sections~\ref{bilayerschwintsec} and~\ref{bilayerschwinthsec} below). We thus find
in which $\hat W_2^{(h')}$ with $h'\in\{\bar{\mathfrak h}_2+1,\cdots,\mathfrak h_2-1\}\cup\{\bar{\mathfrak h}_1+1,\cdots,\mathfrak h_1-1\}$ should be interpreted as 0.\par
\bigskip
\point{\bf Bounding the error terms.} We then use~(\ref{bilayerassumdpropo}), (\ref{bilayerpco}) as well as the bound
in which $\tilde z_{h_{\mathbf k}}$, $z_{h_{\mathbf k}}$ and $v_{h_{\mathbf k}}$ were defined in~(\ref{bilayersummedzvmo}) and satisfy (see~(\ref{bilayerboundzvtop}))
where $\tilde C_1^{(z)}$, $C_1^{(z)}$ and $C_1^{(v)}$ are constants (independent of $h_{\mathbf k}$, $U$ and $\epsilon$). Finally the error term $\sigma'$ is bounded using~(\ref{bilayerineqRAo}) and~(\ref{bilayerboundsigmaoo})
\point{\bf Proof of theorem~\ref{bilayertheoo}} We now conclude the proof of theorem~\ref{bilayertheoo}, {\it under the assumption} (\ref{bilayersumWo}): we define
which we inject into~(\ref{bilayerschwinxprgo}), which, in turn, combined with~(\ref{bilayerschwinexpro}), (\ref{bilayerboundsigmao}), (\ref{bilayerboundsigmalk}) and~(\ref{bilayerboundsigmapo}) yields~(\ref{bilayerschwino}).\par
\subsection{Intermediate regime: first to second}
\label{bilayerotsec}
\point{\bf Integration over the intermediate regime.} The integration over the intermediate regime between scales $\mathfrak h_1$ and $\bar{\mathfrak h}_1$ can be performed in a way that is entirely analogous to that in the bulk of the first regime, with the difference that it is performed in a single step. The outcome is that, in particular, the effective potential on scale $\bar{\mathfrak h}_1$ satisfies
an estimate analogous to (\ref{bilayerpco}) (details are left to the reader):
where $(i,j)$ are in {\it different layers}, i.e. $(\alpha,\alpha')\in\{a,b\}\times\{\tilde a,\tilde b\}$ or $(\alpha,\alpha')\in\{\tilde a,\tilde b\}\times\{a,b\}$, $h'\geqslant\bar{\mathfrak h}_1$. Note that since $W_{4,(\alpha_1,\alpha_1',\alpha_2,\alpha_2')}^{(M)}$ is proportional to $\delta_{\alpha_1,\alpha_1'}\delta_{\alpha_2,\alpha_2'}$, any contribution to $W_{2,\omega,(\alpha,\alpha')}^{(h',2)}$ must contain at least one propagator between different layers, i.e. $\bar g_{(h'',\omega),(\bar\alpha,\bar\alpha')}$ with $h'<h''\leqslant\bar{\mathfrak h}_0$ or $g_{(h''),(\bar\alpha,\bar\alpha')}$ with $h''\geqslant0$, and $(\bar\alpha,\bar\alpha')\in\{a,b\}\times\{\tilde a,\tilde b\}\cup\{\tilde a,\tilde b\}\times\{a,b\}$. This can be easily proved using the fact that if the inter-layer hoppings were neglected (i.e. $\gamma_1=\gamma_3=0$), then the system would be symmetric under
which would imply that $W_{2,\omega,(\alpha,\alpha')}^{(h')}=0$. The presence of at least one propagator between different layers allows us to obtain a dimensional gain, induced by an improved estimate on each such propagator. To prove an improved estimate on the inter-layer propagator, let us start by considering the bare one, $g_{(h'',\omega),(\bar\alpha,\bar\alpha')}$ with $(\bar\alpha,\bar\alpha')\in\{a,b\}\times\{\tilde a,\tilde b\}\cup\{\tilde a,\tilde b\}\times\{a,b\}$ and $h'<h''\leqslant\bar{\mathfrak h}_0$ (similar considerations are valid for the ultraviolet counterpart): using the explicit expression (\ref{bilayerfreeprop}) it is straightforward to check that it is bounded as in (\ref{bilayerestgo}), (\ref{bilayerestgko}), times an extra factor $\epsilon2^{-h''}$. We now proceed as in section~\ref{bilayerpowercountingosec} and prove by induction that the same dimensional gain is associated with the {\it dressed} propagator $\bar g_{(h'',\omega),(\bar\alpha,\bar\alpha')}$, with $(\bar\alpha,\bar\alpha')\in\{a,b\}\times\{\tilde a,\tilde b\}\cup\{\tilde a,\tilde b\}\times\{a,b\}$, and, therefore, with (\ref{bilayer9.45}) itself. \par
\bigskip
\subpoint{\bf Trees with a single endpoint.} We first consider the contributions $\mathfrak A_{2,\omega,(\alpha,\alpha')}^{(h')}$ to $W_{2,\omega,(\alpha,\alpha')}^{(h')}$ from trees $\tau\in\mathcal T_1^{(h)}$ with a single endpoint. The $\mathfrak F_h(\underline m)$ factor in the estimate~(\ref{bilayerboundpre}) can be removed for these contributions using the fact that they have an empty spanning tree (i.e. $\mathbf T(\tau)=\emptyset$), which implies that the $\mathbf z^m$'s in the right side of~(\ref{bilayerineqpcexpl}) are all $\underline{\mathbf z}^{(v)}$'s and not $\mathbf z_\ell$'s, and can be estimated dimensionally by a constant instead of $\mathfrak F_h(\underline m)$. Therefore, combining this fact with the gain associated to the propagator, we find that for all $m\leqslant3$,
\subpoint{\bf Trees with at least two endpoints.} We now consider the contributions $\mathfrak B_{2,\omega,(\alpha,\alpha')}^{(h')}$ to $W_{2,\omega,(\alpha,\alpha')}^{(h')}$ from trees with $\geqslant2$ endpoints. Let $v^*_\tau$ be the node that has at least two children that is closest to the root and let $h^*_\tau$ be its scale. Repeating the reasoning leading to~(\ref{bilayerpowercountingnoto}), and using the fact that the $\mathbf x^m$ falls on a node on scale $\geqslant h^*_\tau$, we find
\indent Combining~(\ref{bilayerimprovestoneep}) and~(\ref{bilayerimprovestbetaep}), and repeating the argument in section~\ref{bilayerdressproposec}, we conclude the proof of the desired improvement on the estimate of $\bar g$, and that
\indent We now perform the multiscale integration in the second regime. As in the first regime, we shall inductively prove that $\bar g_{h,\omega}$ satisfies the same estimate as $g_{h,\omega}$ (see~(\ref{bilayerestgt}) and~(\ref{bilayergramboundt})): for all $m\leqslant3$,
$C_g=(\mathrm{const}.)\ $ and $C_G=(\mathrm{const}.)\ 2^{h_\epsilon}$.\par
\bigskip
{\bf Remark}: As can be seen from~(\ref{bilayerboundfreepropzt}), different components of $g_{h,\omega}$ scale in different ways. In order to highlight this fact, we call the $\{a,\tilde b\}$ components {\it massive} and the $\{\tilde a,b\}$ components {\it massless}. It follows from~(\ref{bilayerboundfreepropzt}) that the $L_1$ norm of the massive-massive sub-block of $g_{h,\omega}(\mathbf x)$ is bounded by $(\mathrm{const}.)\ 2^{-h_\epsilon}$ (instead of $2^{-h}$, compare
with (\ref{bilayerassumdpropt})) and that the massive-massless sub-blocks are bounded by $(\mathrm{const}.)\ 2^{-(h+h_\epsilon)/2}$. In the following, in order to simplify the discussion, we will ignore these improvements, even though the bounds we will thus derive for the non-local corrections may not be optimal.\par
\bigskip
\indent In addition, in order to apply lemma~\ref{bilayerpowercountinglemma}, we have to ensure that hypothesis~(\ref{bilayerassumW}) is satisfied, so we will also prove a bound on the 4-field kernels by induction ($\ell_0=3$ in this regime, so~(\ref{bilayerassumW}) must be satisfied by the 4-field kernels): for all $m\leqslant3$,
where $C_\mu'$ is a constant that will be defined below. Note that in this regime,
$$\ell_0=3>\frac{c_k}{c_k-c_g}=2$$
as desired.
\subsection{Power counting in the second regime}
\label{bilayerpcosec}
\point{\bf Power counting estimate.} It follows from lemma~\ref{bilayerpowercountinglemma}\ and~(\ref{bilayerboundot}) that for all $m\leqslant3$ and some $c_1,c_2>0$,
\point{\bf Bounding the sum on trees.} By repeating the computation that leads to~(\ref{bilayerboundtreeuv}), noticing that if $\ell_0=3$, then for $v\in\mathfrak V(\tau)$ we have
for some $c_4>0$. Note that, if $l=2$, the contribution with $N=1$ to the left side admits an improved bound of the form $c_4\mathfrak F_h(\underline m)2^{\theta' h}|U|^2$, which is better
than the corresponding term in the right side of (\ref{bilayer10.12}). This implies
for some $c_6>0$, which, in particular, recalling that in the second regime $h_\epsilon-h\leqslant-2h_\epsilon+C$, for some constant $C$ independent of $\epsilon$,
{\bf Remark}: The estimates (\ref{bilayerpowercountingt}) and (\ref{bilayerineqtreesumt}) imply the convergence of
the tree expansion (\ref{bilayertreeexpT}), thus providing a convergent expansion of $W_{2l,\underline\omega,\underline\alpha}^{(h)}$ in $U$.
\bigskip
{\bf Remark}: The first of~(\ref{bilayerpowercountingtt}) exhibits a tendency to grow {\it logarithmically}
in $2^{-h}$. This is not an artifact of the bounding procedure: indeed the second-order flow, computed in~[\cite{bilayerVa10}], exhibits the same logarithmic growth.
However, the presence of the $\epsilon$ factor in front of $(h_\epsilon-h)\leqslant2|\log\epsilon|$ ensures this growth is benign:
it is cut off before it has a chance to be realized.
\par
\subsection{The dressed propagator}
\label{bilayerdressproptsec}
\indent We now turn to the inductive proof of (\ref{bilayerassumdpropt}). We recall that (see~(\ref{bilayerAdeft}))
We will separate the {\it local} part of $\bar A$ from the remainder by using the localization operator defined in~(\ref{bilayerWreldefo}) (see the remark at the end of this section for an explanation of why we can choose the same localization operator as in the first regime even though the scaling dimension is different) and rewrite~(\ref{bilayerdresspropt}) as
Similarly to the first regime, we now compute $\mathcal L\hat{\bar A}_{h,\omega}(\mathbf k)$ and bound $\mathcal R\hat{\bar A}_{h,\omega}(\mathbf k)\mathbb L\hat{\bar{\mathfrak g}}_{[h],\omega}(\mathbf k)$. We first write the local part of the non-interacting contribution:
\point{\bf Local part.} The symmetries discussed in the first regime (see~(\ref{bilayerinvsymsWo}) and~(\ref{bilayerpfzsyms})) still hold in this regime, so that~(\ref{bilayerWrelo}) still holds:
with $(\tilde\zeta_{h'}, \tilde\mu_{h'}, \tilde\nu_{h'},\zeta_{h'},\nu_{h'})\in\mathbb{R}^5$. Furthermore, it follows from~(\ref{bilayerpct}) that if $h'\leqslant\bar{\mathfrak h}_1$, then
in which $\tilde\zeta_{h'}$, $\tilde\mu_{h'}$, $\tilde\nu_{h'}$, $\zeta_{h'}$ and $\nu_{h'}$ with $h'\in\{\bar{\mathfrak h}_1+1,\cdots,\mathfrak h_1-1\}$ are to be interpreted as 0.
By injecting~(\ref{bilayerboundzvtt}) through~(\ref{bilayerboundvot}) into~(\ref{bilayersummedzvmt}), we find
\point{\bf Dominant part of $\mathcal L\hat{\bar A}_{h,\omega}$} Furthermore, we notice that the terms proportional to $\tilde z_h$ or $\tilde v_h$ are sub-dominant:
Before bounding $\sigma_3$, we compute the inverse of~(\ref{bilayerLAtdom}), which is elementary once it is put in block-diagonal form: using proposition~\ref{bilayerblockdiagprop} (see appendix~\ref{bilayerdiagapp}), we find that if we define
in which the bound should be understood as follows: the upper-left element in~(\ref{bilayerboundinvLAtdom}) is the bound on the upper-left $2\times2$ block of $\mathfrak L\hat{\bar A}_{h,\omega,0}^{-1}$, and similarly for the upper-right, lower-left and lower-right. In turn, (\ref{bilayerboundinvLAtdom}) implies
\point{\bf Irrelevant part.} The irrelevant part is bounded in the same way as in the first regime (see~(\ref{bilayerineqRtg})): using~(\ref{bilayerLAt}) and the bounds~(\ref{bilayerboundzvtt}) through~(\ref{bilayerboundvot}), we find that for $m\leqslant3$ and ${\mathfrak h}_2\leqslant h\leqslant h'\leqslant\bar{\mathfrak h}_1$,
\point{\bf Conclusion of the proof.} The proof of~(\ref{bilayerassumdpropt}) is then concluded by injecting~(\ref{bilayerLAtdom}), (\ref{bilayerboundsigmatht}), (\ref{bilayersepLAtdom}) and~(\ref{bilayerineqRAt}) into~(\ref{bilayergseplint}).\par
\bigskip
{\bf Remark}: By following the rationale explained in the remark following~(\ref{bilayerWrelk}), one may notice that the ``correct'' localization operator in the second regime is different from that in the first. Indeed, in the second regime, $(k-p_{F,0}^\omega)\partial_k$ scales like $2^{\frac12(h-h')}$ instead of $2^{h-h'}$ in the first. This implies that the remainder of the first order Taylor expansion of $\hat W_2^{(h')}$ is bounded by $2^{h}$ instead of $2^{2h-h'}$ in the first regime, and is therefore {\it marginal}. However, this is not a problem in this case since the effect of the ``marginality'' of the remainder is to produce the $|h|$ factor in~(\ref{bilayerineqRAt}), which, since the second regime is cut off at scale $3h_\epsilon$ and the integration over the super-renormalizable first regime produced an extra $2^{h_\epsilon}$ (see~(\ref{bilayerineqRAt})), is of little consequence. If one were to do things ``right'', one would define the localization operator for the {\it massless} fields as the Taylor expansion to {\it second} order in $k$ and first order in $k_0$, and find that the $|h|$ factor in (\ref{bilayerineqRAt}) can be dropped. We have not taken this approach here, since it complicates the definition of $\mathcal L$ (which would differ between massive and massless blocks) as well as the symmetry discussion that we used in~(\ref{bilayerLAt}).\par
\subsection{Two-point Schwinger function}
\label{bilayerschwintsec}
\indent We now compute the dominant part of the two-point Schwinger function for $\mathbf k${\it well inside} the second regime, i.e.
so that if $h\not\in\{h_{\bf k},h_{{\bf k}}-1\}$, then $f_{h,\omega}({\bf k})=0$.\par
\bigskip
\point{\bf Schwinger function in terms of dressed propagators.} Recall that the two-point Schwinger function can be computed in terms of the effective source term $\mathcal X^{(h)}$ defined in (\ref{bilayerQdef}), see the comment after lemma~\ref{bilayerschwinlemma}.
Since $h_{\mathbf k}\leqslant\bar{\mathfrak h}_1$, $\mathcal X^{(h)}$ is left invariant by the integration over the ultraviolet and the first regime, in the sense that $\mathcal X^{(\bar{\mathfrak h}_1)}=\mathcal X^{(\bar{\mathfrak h}_0)}$, with $\mathcal X^{(\bar{\mathfrak h}_0)}$ given by (\ref{bilayerinitXo}). We can therefore compute $\mathcal X^{(h)}$ for $h\in\{\mathfrak h_2,\cdots,\bar{\mathfrak h}_1-1\}$ inductively using lemma~\ref{bilayerschwinlemma}, and find, similarly to~(\ref{bilayerschwinexpro}), that
which can be proved as follows: using (\ref{bilayerdresspropt}), we write $\hat{\bar g}_{h_{\mathbf k},\omega}=f_{h_{\bf k}}\hat{\bar A}_{h_{\bf k},\omega}^{-1}$ and
Now, by~(\ref{bilayerpct}), we see that $|\hat W_{2}^{(h_{\bf k}-1)}({\bf k})\hat{\bar A}_{h_{{\bf k}},\omega}^{-1}({\bf k})|\leqslant(\mathrm{const}.)\ 2^{h_\epsilon}$, which implies (\ref{bilayerineqginvgt}). By inserting
(\ref{bilayerineqginvgt}), (\ref{bilayerpct}) and~(\ref{bilayerassumdpropt}) into~(\ref{bilayersigmatdef}), we obtain
\point{\bf Dominant part of the dressed propagators.} We now compute $\hat{\bar g}_{h_{\mathbf k},\omega}+\hat{\bar g}_{h_{\mathbf k}-1,\omega}$: it follows from~(\ref{bilayergseplint}), (\ref{bilayersepLAtdom}) and~(\ref{bilayerinvLAtdom}) that
where $\bar M_{{h_{\mathbf k}}}$, $\bar a_{{h_{\mathbf k}}}^{(M)}$ and $\bar a_{{h_{\mathbf k}}}^{(m)}$ were defined in~(\ref{bilayerMt}) and~(\ref{bilayerbat}), and the error term $\sigma'$ can be bounded using~(\ref{bilayerineqRAt}) and~(\ref{bilayerboundsigmatht}):
\point{\bf Proof of theorem~\ref{bilayertheot}} We now conclude the proof of theorem~\ref{bilayertheot}, {\it under the assumption} (\ref{bilayersumWt}). We define
(i.e. the inverse of the matrix on the right side of~(\ref{bilayerschwinxprgtp}), whose explicit expression is similar to the right side of~(\ref{bilayerLAtdom})), and
We inject~(\ref{bilayerapproxBht}) into~(\ref{bilayerschwinxprgtp}), which we then combine with~(\ref{bilayerschwinexprt}), (\ref{bilayerboundsigmat}), (\ref{bilayerboundsigmalkt}) and~(\ref{bilayerboundsigmapt}), and find an expression for $s_2$ which is similar to the right side of~(\ref{bilayerschwinxprgtp}) but with $h_{\mathbf k}$ replaced by $\mathfrak h_2$. This concludes the proof of~(\ref{bilayerschwint}). Furthermore, the estimate~(\ref{bilayerineqrcct}) follows from~(\ref{bilayerboundzvttp}), which concludes the proof of theorem~\ref{bilayertheot}.\par
\bigskip
\point{\bf Partial proof of (\ref{bilayersumWo})} Before moving on to the third regime, we bound part of the sum on the left side of~(\ref{bilayersumWo}), which we recall was assumed to be true to prove~(\ref{bilayerschwino}) (see section~\ref{bilayerschwinosec}). It follows from~(\ref{bilayerpct}) that
\indent In the intermediate regime, we integrate over the first scales for which the effect of the extra Fermi points $\mathbf p_{F,j}^\omega$ cannot be neglected. As a consequence, the local part of $\hat{\bar A}_{\mathfrak h_2,\omega}(\mathbf k)$ is not dominant, so that the proof of the inductive assumption~(\ref{bilayerassumdpropt}) for $h=\mathfrak h_2$ must be discussed anew. In addition, we will see that dressing the propagator throughout the integrations over the first and second regimes will have shifted the Fermi points away from $\mathbf p_{F,j}^\omega$ by a small amount. Such an effect has not been seen so far because the position of $\mathbf p_{F,0}^\omega$ is fixed by symmetry.\par
\bigskip
\point{\bf Power counting estimate.} We first prove that
The proof is slightly different from the proof in section~\ref{bilayerdressproptsec}: instead of splitting $\hat{\bar g}_{\mathfrak h_2,\omega}$ according to~(\ref{bilayergseplint}), we rewrite it as
(this decomposition suggests that the dominant part of $\hat{\bar A}_{\mathfrak h_2,\omega}$ is $\hat A+\mathcal L\hat{\bar{\mathfrak W}}_{\mathfrak h_2,\omega}$ instead of $\mathcal L\hat{\bar A}_{\mathfrak h_2,\omega}$) in which we recall that $\hat A\equiv\hat{\bar A}_{\mathfrak h_2,\omega}\big|_{U=0}$,
We want to estimate the behavior of (\ref{bilayergseplintth}) in $\mathcal B_{\beta,L}^{(\mathfrak h_2,\omega)}$, which we recall is a ball with four holes around each $\mathbf p_{F,j}^{\omega}$, $j=0,1,2,3$. The splitting in (\ref{bilayergseplintth}) is convenient in that it is easy to see that $\hat A(\mathbf k)+\mathcal L\hat{\bar{\mathfrak W}}_{\mathfrak h_2,\omega}(\mathbf k)$ satisfies the
same estimates as $\hat A({\bf k})$; in particular, via proposition~\ref{bilayermatinvprop} (see appendix~\ref{bilayerinversapp}), we see that $\det\big(\hat A(\mathbf k)+\mathcal L\hat{\bar{\mathfrak W}}_{\mathfrak h_2,\omega}(\mathbf k)\big)\geqslant\det\hat A({\bf k})\cdot(1+O(U))$ on $\mathcal B_{\beta,L}^{(\mathfrak h_2,\omega)}$, so that for all $n\leqslant7$ and $\mathbf k\in\mathcal B_{\beta,L}^{(\mathfrak h_2,\omega)}$,
The proof of~(\ref{bilayerassumdproptth}) is then concluded by injecting~(\ref{bilayerineqLAtth}) and~(\ref{bilayerineqRAtth}) into~(\ref{bilayergseplintth}). We can then use the discussion in section~\ref{bilayerpcosec} to bound
\point{\bf Shift in the Fermi points.} We now discuss the shift of the Fermi points, and show that $\hat{\bar g}_{\leqslant\mathfrak h_2,\omega}$ has {\it at least} 8 singularities: $\mathbf p_{F,0}^\omega$ and $\tilde{\mathbf p}_{F,j}^{(\omega,\mathfrak h_2)}$ for $j\in\{1,2,3\}$ where
(note that~(\ref{bilayershiftttht}) follows immediately from the rotation symmetry~(\ref{bilayerarotation}), so we can restrict our discussion to $j=1$).
\bigskip
{\bf Remark}: Actually, we could prove in this section that $\hat{\bar g}_{\leqslant\mathfrak h_2,\omega}$ has {\it exactly} 8 singularities, but this fact follows automatically from the discussion in section~\ref{bilayerthsec}, for the same reason that the proof that the splittings~(\ref{bilayergseplino}) and~(\ref{bilayergseplint}) are well defined in the first and second regimes implies that no additional singularity can appear in those regimes. Since the third regime extends to $h\to-\infty$, proving that the splitting~(\ref{bilayergseplint}) is well defined in the third regime will imply that there are 8 Fermi points.\par
\bigskip
\indent We will be looking for $\tilde{\mathbf p}_{F,1}^{(\omega,\mathfrak h_2)}$ in the form~(\ref{bilayershiftttho}). In particular, its $k_0$ component vanishes, so that, by corollary~\ref{bilayerAnokzprop} (see appendix~\ref{bilayerinversapp}), $\Delta_{\mathfrak h_2}$ solves
In order to solve~(\ref{bilayerDdeftthN}), we can use a Newton iteration, so we expand $\hat{\bar D}_{\mathfrak h_2,\omega}$ around $0$: it follows from the symmetries~(\ref{bilayeraverticalreflection}) and~(\ref{bilayerahorizontalreflection}) that
with $(M_{\mathfrak h_2},Y_{\mathfrak h_2})\in\mathbb{R}^2$, independent of $\omega$. Furthermore by injecting~(\ref{bilayerpco}) and~(\ref{bilayerpct}) into~(\ref{bilayerDdeftthN}), we find that
Therefore, by using a Newton scheme, one finds a root $\Delta_{\mathfrak h_2}$ of (\ref{bilayerDdeftthN}) and, by~(\ref{bilayerestYMtth}) and~(\ref{bilayerboundRttth}),
This concludes the proof of~(\ref{bilayershiftttho}) and~(\ref{bilayerineqshifttth}).\par
\section{Third regime}
\label{bilayerthsec}
\indent Finally, we perform the multiscale integration in the third regime. Similarly to the first and second regimes, we prove by induction that $\bar g_{h,\omega,j}$ satisfies the same estimate as $g_{h,\omega,j}$ (see~(\ref{bilayerestgth}) and~(\ref{bilayergramboundth})): for all $m\leqslant3$,
$C_g=(\mathrm{const}.)\ $ and $C_G=(\mathrm{const}.)\ 2^{-2h_\epsilon}$.\par\bigskip
{\bf Remark}: As in the second regime, the estimates~(\ref{bilayerassumdpropth}) are not optimal because the massive components scale differently from the massless ones.\par
\bigskip
\indent Like in the first regime,
$$\ell_0=2>\frac{c_k}{c_k-c_g}=\frac{3}{2}.$$
\subsection{Power counting in the third regime}
\point{\bf Power counting estimate.} By lemma~\ref{bilayerpowercountinglemma} and (\ref{bilayerboundtth}), we find that for all $m\leqslant3$
\point{\bf Bounding the sum of trees.} We then bound the sum over trees as in the first regime (see~(\ref{bilayerineqtreesumf}) and~(\ref{bilayerineqtreesumo})): if $l\geqslant2$ then for $\theta\in(0,1)$ and recalling that $\bar{\mathfrak h}_2=3h_\epsilon+{\rm const}$,
\point{\bf$j=0$ case.} We first study the $j=0$ case, which is similar to the discussion in the second regime. We use the localization operator defined in~(\ref{bilayerWreldefo}) and split $\hat{\bar g}_{h,\omega,0}$ in the same way as in~(\ref{bilayergseplint}). We then compute $\mathcal L\hat W_2^{(h')}$ and bound $\mathcal R\hat{\bar A}_{h,\omega,0}\mathbb L\hat{\bar{\mathfrak g}}_{[h],\omega,0}$.
\bigskip
\subpoint{\bf Local part.} The symmetry considerations of the first and second regime still hold (see~(\ref{bilayerinvsymsWo}) and~(\ref{bilayerpfzsyms})) so that~(\ref{bilayerWrelo}) still holds:
with $(\tilde\zeta_{h'}, \tilde\mu_{h'}, \tilde\nu_{h'},\zeta_{h'},\nu_{h'})\in\mathbb{R}^5$. The estimates~(\ref{bilayerboundzvtt}) through~(\ref{bilayerboundvot}) hold, and it follows from~(\ref{bilayerpcth}) that if $h'\leqslant\bar{\mathfrak h}_2$, then
\subpoint{\bf Dominant part of $\mathcal L\hat{\bar A}_{h,\omega,0}$} Furthermore, we notice that the terms proportional to $\tilde z_h$ are sub-dominant:
Before bounding $\sigma_4$, we compute the inverse of~(\ref{bilayerLAthdom}) by block-diagonalizing it using proposition~\ref{bilayerblockdiagprop} (see appendix~\ref{bilayerdiagapp}): if we define
\begin{equation}
\bar k_0:=z_hk_0,\quad
\bar\gamma_1:=\tilde m_h\gamma_1,\quad
\tilde\xi:=\tilde v_h\xi,\quad
\bar\xi:=v_h\xi
\label{bilayerbardefsth}\end{equation}
then for $\mathbf k\in\mathcal B_{\beta,L}^{(h,\omega,0)}$,
in which the bound should be understood as follows: the upper-left element in~(\ref{bilayerboundinvLAthdom}) is the bound on the upper-left $2\times2$ block of $\mathfrak L\hat{\bar A}_{h,\omega,0}^{-1}$, and similarly for the upper-right, lower-left and lower-right. In turn, (\ref{bilayerboundinvLAthdom}) implies
\subpoint{\bf Irrelevant part.} We now bound $\mathcal RW_{2,\omega,0}^{(h')}\ast\mathbb L\bar{\mathfrak g}_{[h,\omega,0]}$ in the same way as in the second regime, and find that for $m\leqslant3$, if $h\leqslant h'\leqslant\bar{\mathfrak h}_0$, then
This concludes the proof of~(\ref{bilayerassumdpropth}) for $j=0$.\par
\bigskip
\point{\bf$j=1$ case.} We now turn to the case $j=1$ ($j=2,3$ will then follow by using the $2\pi/3$-rotation symmetry). Again, we split $\hat{\bar g}_{h,\omega,1}$ in the same way as in~(\ref{bilayergseplint}), then we compute $\mathcal L\hat W_2^{(h')}$ and bound $\mathcal R\hat{\bar A}_{h,\omega,1}\mathbb L\hat{\bar{\mathfrak g}}_{[h],\omega,1}$.
Before computing $\mathcal L\hat{\bar A}_{h,\omega,1}$ and bounding $\mathcal R\hat{\bar A}_{h,\omega,1}\mathbb L\hat{\bar{\mathfrak g}}_{[h],\omega,1}$, we first discuss the shift in the Fermi points $\tilde{\mathbf p}_{F,1}^{(\omega,h)}$ (i.e., the singularities of
$\hat{\bar A}_{h,\omega,1}^{-1}({\bf k})$ in the vicinity of ${\mathbf p}_{F,1}^{(\omega,h)}$),
due to the renormalization group flow.\par
\bigskip
\subpoint{\bf Shift in the Fermi points.} We compute the position of the shifted Fermi points in the form
We expand $\hat{\bar D}_{h,\omega,1}$ around $\Delta_{h+1}$: it follows from the symmetries~(\ref{bilayeraverticalreflection}) and~(\ref{bilayerahorizontalreflection}) that
so that, by injecting~(\ref{bilayerpcth}), (\ref{bilayerpco}) and~(\ref{bilayerpct}) into~(\ref{bilayerDdefthN}) and using the symmetry structure of $\hat{\bar A}_{h,\omega,1}({\bf k})$ (which imply, in particular, that
$|\hat{\bar A}_{h,\omega,1}({\bf k})|\leqslant(\mathrm{const}.)\ \epsilon$ in $\mathcal B_{\beta,L}^{(\leqslant h,\omega,1)}$), we find
Therefore, by using a Newton scheme, we compute $\Delta_{h}$ satisfying~(\ref{bilayerDdefthN}) and, by~(\ref{bilayerestMth}), (\ref{bilayerestYth}) and~(\ref{bilayerboundRtth}),
This concludes the proof of~(\ref{bilayershifttho}) and~(\ref{bilayerineqshiftth}).\par
\bigskip
\subpoint{\bf Local part.} We now compute $\mathcal L\hat{\bar A}_{h,\omega,1}$. The computation is similar to the $j=0$ case, though it is complicated slightly by the presence of constant terms in $\hat{\bar A}_{h,\omega,1}$. Recall the $\mathbf x$-space representation of $\bar A_{h,\omega,1}$~(\ref{bilayerAhojx}). The localization operator has the same definition as~(\ref{bilayerWreldefo}), but because of the shift by $\tilde{\mathbf p}_{F,1}^{(\omega,h)}$ in the Fourier transform, its action in $\mathbf k$-space becomes
In order to avoid confusion, we will denote the localization operator in $\mathbf k$ space around $\tilde{\mathbf p}_{F,1}^{(\omega,h)}$ by $\hat{\mathcal L}_h$.\par
\bigskip
\subsubpoint{\bf Non-interacting local part.} As a preliminary step, we discuss the action of $\hat{\mathcal L}$ on the {\it undressed} inverse propagator $\hat A({\bf k})$. Let us first
\subsubpoint{\bf Local part of $\hat W_2$.} We now turn our attention to $\hat{\mathcal L}_h\hat W_2^{(h')}$. In order to reduce the size of the coming equations, we split $\hat W_2^{(h')}$ into $2\times2$ blocks:
$$\hat W_2^{(h')}=:\left(\begin{array}{*{2}{c}}
\hat W_2^{(h')\xi\xi}&\hat W_2^{(h')\xi\phi}\\
\hat W_2^{(h')\phi\xi}&\hat W_2^{(h')\phi\phi}\\
\end{array}\right).$$
The symmetry structure around $\tilde{\mathbf p}_{F,1}^{(\omega,h)}$ is slightly different from that around $\mathbf p_{F,0}^\omega$. Indeed~(\ref{bilayerinvsymsWo}) still holds, but $\tilde{\mathbf p}_{F,1}^{(\omega,h)}$ is not invariant under rotations, so that~(\ref{bilayerpfzsyms}) becomes
with $(\mu_{h',1}^{ff'},\zeta_{h',1}^{ff'},\nu_{h',1}^{ff'},\varpi_{h',1}^{ff'})\in\mathbb{R}^4$.
In addition, by using the parity symmetry, it follows from~(\ref{bilayereqrotsym}) (see appendix~\ref{bilayerconstWtapp}) that the $\xi\phi$ block is equal to the $\phi\xi$ block.
Furthermore, it follows from~(\ref{bilayerpcth}) that for $h'\leqslant\bar{\mathfrak h}_2$,
and because $\hat W_{2}^{(h')}(\mathbf p_{F,0}^\omega)=0$ and $|\tilde{\mathbf p}_{F,1}^{(\omega,h)}-\mathbf p_{F,0}^\omega|\leqslant(\mathrm{const}.)\ 2^{2h_\epsilon}$, by expanding $\hat W_2^{(h')}$ to first order around $\mathbf p_{F,0}^\omega$, we find that it follows from~(\ref{bilayerpct}) that
\subsubpoint{\bf Dominant part of $\hat{\mathcal L}_h\hat{\bar A}_{h,\omega,1}$} Finally, we notice that the terms in~(\ref{bilayerLAthj}) that are proportional to $z_{h,1}^{\xi\xi}$, $z_{h,1}^{\xi\phi}$ or $K_{h,1}^{\xi\xi}$ are subdominant:
Before bounding $\sigma_{4,1}$, we compute the inverse of~(\ref{bilayerLAthjdom}) by block-diagonalizing it using proposition~\ref{bilayerblockdiagprop} (see appendix~\ref{bilayerdiagapp}): if we define
in which the bound should be understood as follows: the upper-left element in~(\ref{bilayerboundinvLAthjdom}) is the bound on the upper-left $2\times2$ block of $\hat{\mathfrak L}_h\hat{\bar A}_{h,\omega,1}^{-1}$, and similarly for the upper-right, lower-left and lower-right. In turn, using (\ref{bilayerboundinvLAthjdom}) we obtain
\subpoint{\bf Irrelevant part.} Finally, we are left with bounding $\mathcal R\bar A_{h,\omega,1}\mathbb L\bar{\mathfrak g}_{[h],\omega,1}$, which we show is small. The bound
is identical to~(\ref{bilayerineqRAth}): indeed, it follows from~(\ref{bilayerinvLAthjdom}) and (\ref{bilayerboundinvLAthjdom}) that for all $m\leqslant3$,
where $T$ and $\mathcal T_{\mathbf k}$ were defined above~(\ref{bilayerarotation}), and $\tilde{\mathbf p}_{F,4}^{(-,h)}\equiv\tilde{\mathbf p}_{F,1}^{(-,h)}$.
\subsection{Two-point Schwinger function}
\label{bilayerschwinthsec}
\indent We now compute the dominant part of the two-point Schwinger function for $\mathbf k${\it well inside} the third regime, i.e.
so that if $h\not\in\{h_{\bf k},h_{{\bf k}}-1\}$, then $f_{h,\omega,j}({\bf k})=0$.\par
\bigskip
\point{\bf Schwinger function in terms of dressed propagators.} Recall that the two-point Schwinger function can be computed in terms of the effective source term $\mathcal X^{(h)}$, see (\ref{bilayerQdef}) and comment after Lemma~\ref{bilayerschwinlemma}.
Since $h_{\mathbf k}\leqslant\bar{\mathfrak h}_2$, $\mathcal X^{(h)}$ is left invariant by the integration over the ultraviolet, the first and the second regimes, in the sense that $\mathcal X^{(\bar{\mathfrak h}_2)}=\mathcal X^{(\bar{\mathfrak h}_0)}$, with $\mathcal X^{(\bar{\mathfrak h}_0)}$ given by (\ref{bilayerinitXo}). Therefore, we can compute $\mathcal X^{(h)}$ for $h\in\{\mathfrak h_\beta,\cdots,\bar{\mathfrak h}_2-1\}$ inductively using lemma~\ref{bilayerschwinlemma}, and find, similarly to~(\ref{bilayerschwinexpro}) and~(\ref{bilayerschwinexprt}), that
\point{\bf Dominant part of the dressed propagators.} We now compute $\hat{\bar g}_{h_{\mathbf k},\omega,j}+\hat{\bar g}_{h_{\mathbf k}-1,\omega,j}$.\par
\bigskip
\subpoint{\bf$j=0$ case.} We first treat the case $j=0$. It follows from (the analogue of) (\ref{bilayergseplint}), (\ref{bilayersepLAthdom}) and (\ref{bilayerinvLAthdom}), that
where $\bar M_{{h_{\mathbf k}},0}$, $\bar a_{{h_{\mathbf k}},0}^{(M)}$ and $\bar a_{{h_{\mathbf k}},0}^{(m)}$ were defined in~(\ref{bilayerMth}) and~(\ref{bilayerbath}), and the error term $\sigma'_0$ can be bounded using~(\ref{bilayerineqRAth}) and~(\ref{bilayerboundsigmafth}):
where $\bar M_{{h_{\mathbf k}},1}$, $\bar a_{{h_{\mathbf k}},1}^{(M)}$ and $\bar a_{{h_{\mathbf k}},1}^{(m)}$ were defined in~(\ref{bilayerMthj}) and~(\ref{bilayerbathj}), and the error term $\sigma'_1$ can be bounded using~(\ref{bilayerineqRAthj}) and~(\ref{bilayerboundsigmaftho}):
\subpoint{\bf$j=2,3$ cases.} The cases with $j=2,3$ follow from the $2\pi/3$-rotation symmetry~(\ref{bilayerarotation}) (see~(\ref{bilayerWreltht})).\par
\bigskip
\point{\bf Proof of theorem~\ref{bilayertheoth}} We now conclude the proof of theorem~\ref{bilayertheoth}. We focus our attention on $j=0,1$ since the cases with $j=2,3$ follow by symmetry. Similarly to section~\ref{bilayerschwintsec}, we define
(i.e. the inverse of the matrix on the right side of~(\ref{bilayerschwinxprgthp}) for $j=0$, (\ref{bilayerschwinxprgthjp}) for $j=1$, whose explicit expression is similar to the right side of~(\ref{bilayerLAthdom}) and~(\ref{bilayerLAthjdom})), and
We inject~(\ref{bilayerapproxBhth}) into~(\ref{bilayerschwinxprgthp}) and~(\ref{bilayerschwinxprgthjp}), which we then combine with~(\ref{bilayerschwinexprth}), (\ref{bilayerboundsigmalkth}), (\ref{bilayerboundsigmapth}) and~(\ref{bilayerboundsigmapthj}), and find an expression for $s_2$ which is similar to the right side of~(\ref{bilayerschwinxprgthp}) and~(\ref{bilayerschwinxprgthjp}) but with $h_{\mathbf k}$ replaced by $\mathfrak h_2$. This concludes the proof of~(\ref{bilayerschwinth}). Furthermore, the estimate~(\ref{bilayerineqrccthj}) follows from~(\ref{bilayerboundzvtthp}) and~(\ref{bilayersboundmzvwthj}) as well as (\ref{bilayerLfreeAo}) and~(\ref{bilayerineqshiftth}), which concludes the proof of theorem~\ref{bilayertheoth}.\par
\bigskip
\point{\bf Proof of (\ref{bilayersumWo}) and (\ref{bilayersumWt})} In order to conclude the proofs of theorems~\ref{bilayertheoo} and~\ref{bilayertheot} as well as theorem~\ref{bilayertheomain}, we still have to bound the sums on the left side of~(\ref{bilayersumWo}) and of~(\ref{bilayersumWt}), which we recall were assumed to be true to prove~(\ref{bilayerschwino}) and~(\ref{bilayerschwint}) (see sections~\ref{bilayerschwinosec} and~\ref{bilayerschwintsec}). It follows from~(\ref{bilayerpcth}) that
This, along with~(\ref{bilayerboundtsumW}) concludes the proofs of~(\ref{bilayersumWo}) and~(\ref{bilayersumWt}), and thus concludes the proof of theorems~\ref{bilayertheoo}, \ref{bilayertheot} and~\ref{bilayertheoth} as well as theorem~\ref{bilayertheomain}.\par
\section{Conclusion}
\label{bilayerconcsec}
\indent We considered a tight-binding model of bilayer graphene describing spin-less Fermions hopping on two hexagonal layers in Bernal stacking, in the presence of short range interactions. We assumed that only three hopping parameters are different from zero (those usually called $\gamma_0,\gamma_1$ and $\gamma_3$ in the literature), in which case the Fermi surface at half-filling degenerates to a collection of 8 Fermi points. Under a smallness assumption on the interaction strength $U$ and on the transverse hopping $\epsilon$, we proved by rigorous RG methods that the specific ground state energy and correlation functions in the thermodynamic limit are analytic in $U$, uniformly in $\epsilon$. Our proof requires a detailed analysis of the crossover regimes from one in which the two layers are effectively decoupled, to one where the effective dispersion relation is approximately parabolic around the central Fermi points (and the inter-particle interaction is effectively marginal), to the deep infrared one, where the effective dispersion relation is approximately conical around each Fermi points (and the inter-particle interaction is effectively irrelevant). Such an analysis, in which the influence of the flow of the effective constants in one regime has crucial repercussions in lower regimes, is, to our knowledge, novel.\par
\bigskip
\indent We expect our proof to be adaptable without substantial efforts to the case where $\gamma_4$ and $\Delta$ are different from zero, as in (\ref{bilayerrelge}), the intra-layer next-to-nearest neighbor hopping $\gamma_0'$ is $O(\epsilon)$, the chemical potential is $O(\epsilon^3)$, and the temperature is larger than (const.)$\epsilon^4$. At smaller scales, the Fermi set becomes effectively one-dimensional, which thoroughly changes the scaling properties. In particular, the effective inter-particle interaction becomes marginal, again, and its flow tends to grow logarithmically. Perturbative analysis thus breaks down at exponentially small temperatures in $\epsilon$ and in $U$, and it should be possible to rigorously control the system down to such temperatures. Such an analysis could prove difficult, because it requires fine control on the geometry of the Fermi surface, as in [\cite{bilayerBGM06}] and in [\cite{bilayerFKT04}, \cite{bilayerFKT04b}, \cite{bilayerFKT04c}], where the Fermi liquid behavior of a system of interacting electrons was proved, respectively down to exponentially small and zero temperatures, under different physical conditions. We hope to come back to this issue in the future.\par
\bigskip
\indent Another possible extension would be the study of crossover effects on other physical observables, such as the conductivity, in the spirit of [\cite{bilayerMa11}]. In addition, it would be interesting to study the case of three-dimensional Coulomb interactions, which is physically interesting in describing {\it clean} bilayer graphene samples, i.e. where screening effects are supposedly negligible. It may be possible to draw inspiration from the analysis of [\cite{bilayerGMP10}, \cite{bilayerGMP11b}] to construct the ground state, order by order in renormalized perturbation theory. The construction of the theory in the second and third regimes would pave the way to understanding the universality of the conductivity in the deep infrared, beyond the regime studied in [\cite{bilayerMa11}].\par
\bigskip
\vfill\eject
\appendix
\section{Computation of the Fermi points}
\label{bilayerfermiapp}
\indent In this appendix, we prove~(\ref{bilayerfermdef}).\par
the solutions for $k\in\hat\Lambda_\infty$ (see~(\ref{bilayerlae}) and following lines for the definition of $\hat\Lambda$ and $\hat\Lambda_\infty$) of
\point If $S=\sin((3/2)k_x)=0$, then $k_x\in\{0,2\pi/3\}$. Furthermore, since $k\in\hat\Lambda_\infty$, if $k_x=0$ then $k_y=0$, which is not a solution of~(\ref{bilayereqfermpts}) as long as $G<3$. Therefore $k_x=2\pi/3$, so that $C=-1$, and $Y$ solves
\indent In this appendix, we give the explicit expression of the determinant and the inverse of matrices that have the form of the inverse free propagator.
The result is collected in the following proposition and corollary, whose proofs are straightforward, brute force, computations.
In particular, $A$ is invertible if and only if $\mathfrak b^2\neq\mathfrak a\mathfrak c^*$.\par
\endtheo
\section{Block diagonalization}
\label{bilayerdiagapp}
\indent In this appendix, we give the formula for block-diagonalizing $4\times4$ matrices, which is useful to separate the massive block from the massless one.
The result is collected in the following proposition, whose proof is straightforward.
\par
\bigskip
\theo{Proposition}\label{bilayerblockdiagprop}
Given a $4\times4$ complex matrix $B$, which can be written in block-form as
\section{Bound of the propagator in the II-III intermediate regime}
\label{bilayerboundproptthapp}
\indent In this appendix, we prove the assertion between~(\ref{bilayerdetatth}) and~(\ref{bilayerboundfreepropztth}), that is that the determinant of the inverse propagator is bounded below by $(\mathrm{const}.)\ \epsilon^8$ in the intermediate regime between the second and third regimes. Using the symmetry under $k_x\mapsto-k_x$ and under $2\pi/3$ rotations, we restrict our discussion to $\omega=+$ and $k_y-p_{F,0,y}^+>0$. In a coordinate frame centered at $p_{F,0}^+$, we denote with some abuse of notation
${\bf k}'_{+,0}=(k_0,k_x,k_y)$ and $p_{F,1}^+=(0,D\bar\epsilon^2)$, where $\bar\epsilon\frac32\gamma_3$
and $D=\frac{8}{27}\frac{\gamma_1}{\gamma_3}(1+O(\epsilon^2))$ (see (\ref{bilayereq:3.2bis})). Note that
$D>0$ is uniformly bounded away from 0 for $\bar\epsilon$ small (recall that $\gamma_1=\epsilon$ and $\gamma_3=0.33\epsilon$). In these coordinates, we restrict to $k_y>0$, and
the first and third conditions in (\ref{bilayerinttth}) read
where $\bar\kappa\bar\kappa_2(\frac{2\epsilon}{3\gamma_3})^3$. The second condition in (\ref{bilayerinttth}) implies that $(k_x^2+k_y^2)\leqslant(\mathrm{const}.)\ \epsilon^2$, in which case the
desired bound (that is, $|\det\hat A|\geqslant(\mathrm{const}.)\ \epsilon^8$, with $\det\hat A$ as in (\ref{bilayerdetatth})) reads
If $|k_0|>\bar\kappa\bar\epsilon^3/2$, then $l>\bar\kappa^2\bar\epsilon^8/4$ from which~(\ref{bilayerineqdeta}) follows. If $|k_0|\leqslant\bar\kappa\bar\epsilon^3/2$, then
If $|k_x|>(1/4\sqrt3)\bar\kappa\bar\epsilon^2$, then, using the fact that $k_y>0$, $l>\alpha(1/48)D^2\bar\kappa^2\bar\epsilon^8$ from which~(\ref{bilayerineqdeta}) follows. If $|k_x|\leqslant(1/4\sqrt3)\bar\kappa\bar\epsilon^2$, then
and $l>\alpha((3\sqrt{105}-1)^2/2304)\bar\kappa^4\bar\epsilon^8$ from which~(\ref{bilayerineqdeta}) follows.\penalty10000\hfill\penalty10000$\square$\par
\section{Symmetries}
\label{bilayersymapp}
\indent In this appendix, we prove that the symmetries listed in~(\ref{bilayeraglobaluo}) through~(\ref{bilayerainversiont}) leave $h_0$ and $\mathcal V$ invariant. We first recall
from which one easily concludes that $\mathcal V$ is invariant under~(\ref{bilayerarotation}).\penalty10000\hfill\penalty10000$\square$\par
\bigskip
\point{\bf Complex conjugation.} Follows immediately from $\Omega(-k)=\Omega^*(k)$ and $v(-k)=v^*(k)$.\penalty10000\hfill\penalty10000$\square$\par
\bigskip
\point{\bf Vertical reflection.} Follows immediately from $\Omega(R_vk)=\Omega(k)$ and $v(R_vk)=v(k)$ (since the second component of $d_\alpha$ is 0).\penalty10000\hfill\penalty10000$\square$\par
\bigskip
\point{\bf Horizontal reflection.} We have $\Omega(R_hk)=\Omega^*(k)$, $\sigma_1A^{\xi\xi}({\bf k})\sigma_1=A^{\xi\xi}({\bf k})$,
where $\pi_h$ is the permutation that exchanges $a$ with $\tilde b$ and $\tilde a$ with $b$, from which the invariance of $\mathcal V$ follows immediately.\penalty10000\hfill\penalty10000$\square$\par
\bigskip
\point{\bf Parity.} We have $\Omega(Pk)=\Omega^*(k)$ so that $\big[A^{\xi\phi}(P\mathbf k)\big]^T=A^{\xi\phi}(\mathbf k)$, $\big[A^{\phi\phi}(P\mathbf k)\big]^T=A^{\phi\phi}(\mathbf k)$, $\big[A^{\xi\xi}(P\mathbf k)\big]^T=A^{\xi\xi}(\mathbf k)$. Therefore $h_0$ is mapped to
which is equal to $h_0$ since exchanging $\hat\psi^-$ and $\hat\psi^+$ adds a minus sign. The invariance of $\mathcal V$ follows from the remark that under parity
$\hat\psi^+_{{\bf k}_1,\alpha}\hat\psi^-_{{\bf k}_2,\alpha}\mapsto\hat\psi^+_{P{\bf k}_2,\alpha}\hat\psi^-_{P{\bf k}_1,\alpha}$, and $\hat v(k_1-k_2)=\hat v(P(k_2-k_1))$.\penalty10000\hfill\penalty10000$\square$\par
from which the invariance of $h_0$ follows immediately. The invariance of $\mathcal V$ is trivial.\penalty10000\hfill\penalty10000$\square$\par
\section{Constraints due to the symmetries}
\label{bilayerconstWtapp}
\indent In this appendix we discuss some of the consequences of the symmetries listed in section~\ref{bilayersymsec} on $\hat W_2^{(h)}(\mathbf k)$ and its derivatives.\par
\bigskip
\indent We recall the definitions of the symmetry transformations from section~\ref{bilayersymsec}:
Furthermore, given a $4\times4$ matrix $\mathbf M$ whose components are indexed by $\{a,\tilde b,\tilde a,b\}$, we denote the sub-matrix with components in $\{a,\tilde b\}^2$ by $\mathbf M^{\xi\xi}$, that with $\{\tilde a,b\}^2$ by $\mathbf M^{\phi\phi}$, with $\{a,\tilde b\}\times\{\tilde a,b\}$ by $\mathbf M^{\xi\phi}$ and with $\{\tilde a,b\}\times\{a,\tilde b\}$ by $\mathbf M^{\phi\xi}$. In addition, given a complex matrix $M$, we denote its component-wise complex conjugate by $M^*$ (which is not to be confused with its adjoint $M^\dagger$).\par
\bigskip
\theo{Proposition}\label{bilayersymprop}
Given a $2\times2$ complex matrix $M(\mathbf k)$ parametrized by $\mathbf k\in\mathcal B_\infty$ (we recall that $\mathcal B_\infty$ was defined above the statement of theorem~\ref{bilayertheomain} in section~\ref{bilayermaintheosec}) and a pair of points $(\mathbf p_{F}^+,\mathbf p_{F}^-)\in\mathcal B_\infty^2$,
Given a $4\times4$ complex matrix $\mathbf M(\mathbf k)$ parametrized by $\mathbf k\in\mathcal B_\infty$ and two points $(\mathbf p_{F}^+,\mathbf p_{F}^-)\in\mathcal B_\infty^2$, if $\forall(f,f')\in\{\xi,\phi\}^2$ and $\forall\omega\in\{-,+\}$,
with $(\mu^{ff'}, \zeta^{ff'}, \nu^{ff'}, \varpi^{ff'})\in\mathbb R^{4}$ independent of $\omega$,
and $\forall\mathbf k\in\mathcal B_\infty$
\begin{equation}
\mathbf M(\mathbf k)=\mathbf M^T(P\mathbf k)
\label{bilayerinvPM}\end{equation}
and
\begin{equation}
\mathbf p_{F}^\omega=P\mathbf p_{F}^{-\omega}
\label{bilayerinvPpf}\end{equation}
then
\begin{equation}
\mu^{\phi\xi}=\mu^{\xi\phi},\quad
\zeta^{\phi\xi}=\zeta^{\xi\phi},\quad
\nu^{\phi\xi}=\nu^{\xi\phi},\quad
\varpi^{\phi\xi}=\varpi^{\xi\phi}.
\label{bilayereqrotsymo}\end{equation}
Furthermore, if $\mathbf p_{F}^\omega=(0,\frac{2\pi}3,\omega\frac{2\pi}{3\sqrt3})$ and (recalling that $\mathcal T_{\mathbf k}=e^{-i(l_2\cdot k)\sigma_3}$, with $l_2=(3/2,-\sqrt3/2)$)
Furthermore, recalling that $\partial_{k_1}\mathbf M^{\phi\phi}=\nu^{\phi\phi}\sigma_2$ and $\partial_{k_2}\mathbf M^{\phi\phi}=\omega\varpi^{\phi\phi}\sigma_1$,
$\partial_{k_i}\mathbf M^{\xi\xi}(\mathbf p_{F}^\omega)=0$, that is $\nu^{\xi\xi}=\varpi^{\xi\xi}=0$.\penalty10000\hfill\penalty10000$\square$\par
\vfill
\eject
\references
{\small
\BBlography{bibliography-bilayer.BBlog}
}
\vfill
\eject
\part{Kondo effect in a hierarchical $s-d$ model}\label{partkondo}
\indent We will now turn to the second model studied in this thesis, namely a hierarchical version of the $s-d$ (or {\it Kondo}) model. The discussion in this part is a closely linked to that appearing in~[\cite{kondoBGJ15}, \cite{kondoGJ15}], and is the fruit of a collaboration with Giuseppe Benfatto and Giovanni Gallavotti. The model studied in~[\cite{kondoBGJ15}] is a variant of the $s-d$ model, called the {\it Andrei} model. A short discussion of the hierarchical $s-d$ model was later published in~[\cite{kondoGJ15}]. Here, we reproduce the discussion in~[\cite{kondoBGJ15}], but present the result in~[\cite{kondoGJ15}] on the $s-d$ model rather than the Andrei model.
\section{Introduction}
\label{kondosecintro}
\indent Although at high temperature the resistivity of most metals is an increasing function of the temperature, experiments carried out since the early XX\textsuperscript{th} century have shown that in metals containing trace amounts of magnetic impurities (i.e. copper polluted by iron), the resistivity has a minimum at a small but positive temperature, below which the resistivity decreases as the temperature increases. One interesting aspect of such a phenomenon, is that it has been measured in samples of copper with iron impurities at a concentration as small as 0.0005\ [\cite{kondoKo05}], which raises the question of how such a minute perturbation can produce such an effect. Kondo introduced a toy model in 1964, see~(\ref{kondoeqhamkondo}) below, to understand such a phenomenon, and computed electronic scattering amplitudes at third order in the Born approximation scheme [\cite{kondoKo64}], and found that the effect may stem from an antiferromagnetic coupling between the impurities (called ``localized spins'' in [\cite{kondoKo64}]) and the electrons in the metal. The existence of such a coupling had been proposed by Anderson [\cite{kondoAn61}], who had named the model the ``$s-d$ model''.
\indent Kondo's theory attracted great attention and its scaling properties and connection to $1D$ Coulomb gases were understood [\cite{kondoDy69}, \cite{kondoAn70}, \cite{kondoAYH70}] (the obstacle to a complete understanding of the model (with $\lambda_0<0$) being what would later be called the growth of a relevant coupling) when in a seminal paper, published in 1975 [\cite{kondoWi75}], Wilson addressed and solved the problem by constructing a sequence of Hamiltonians that adequately represent the system on ever increasing length scales. Using ideas from his formulation of the renormalization group, Wilson showed, by a combination of numerical and perturbative methods, that only few (three) terms in each Hamiltonian, need to be studied in order to account for the Kondo effect (or rather, a related effect on the magnetic susceptibility of the impurities, see below).
\indent The strong-coupling nature of the effect manifests itself in Wilson's formalism by the presence of a non-trivial fixed point in the renormalization group flow, at which the corresponding effective theory behaves in a way that is qualitatively different from the non-interacting one. Wilson has studied the system around the non-trivial fixed point by perturbative expansions, but the intermediate regime (in which perturbation theory breaks down) was studied by numerical methods. In fact, when using renormalization group techniques to study systems with non-trivial fixed points, oftentimes one cannot treat strong-coupling regimes analytically. The hierarchical $s-d$ model, which will be discussed below, is an exception to this rule: indeed, we will show that the physical properties of the model can be obtained by iterating an {\it explicit} map, computed analytically, and called the {\it beta function}, whereas, in the current state of the art, the beta function for the full (non-hierarchical) $s-d$ model can only be computed numerically.
\indent In this paper, we present a hierarchical version of the $s-d$ model, whose renormalization group flow equations can be written out {\it exactly}, with no need for perturbative methods, and show that the flow admits a non-trivial fixed point. In this model, the transition from the fixed point can be studied by iterating an {\it explicit} map, which allows us to compute reliable numerical values for the {\it Kondo temperature}, that is the temperature at which the Kondo effect emerges, which is related to the number of iterations required to reach the non-trivial fixed point from the trivial one. This temperature has been found to obey the expected scaling relations, as predicted in [\cite{kondoWi75}].
\medskip
\indent It is worth noting that the $s-d$ model (or rather a linearized continuum version of it) was shown to be exactly solvable by Andrei [\cite{kondoAn80}] at $h=0$, as well as at $h\ne0$, [\cite{kondoAFL83}], using Bethe Ansatz, who proved the existence of a Kondo effect in that model. The aim of the present work is to show how the Kondo effect can be understood as coming from a non-trivial fixed point in a renormalization group analysis (in the context of a hierarchical model) rather than a proof of the existence of the Kondo effect, which has already been carried out in [\cite{kondoAn80}, \cite{kondoAFL83}].
\medskip
\section{$s-d$ model and main results}\label{kondosecmodel}
\indent Consider a {\it 1-dimensional} Fermi gas of spin-1/2 ``electrons'', and a spin-1/2 Fermionic ``impurity'', with {\it no} interactions. It is well known that:
\begin{enumerate}
\item the magnetic susceptibility of the impurity diverges as $\beta=\frac1{k_B T}\to\infty$ while
\item the total susceptibility per particle of the electron gas ({\it i.e.}\ the response to a field acting on the whole sample) [\cite{kondoKi76}] is finite at zero temperature.
\end{enumerate}
\medskip
\indent The question that will be addressed in this work is whether a small coupling of the impurity Fermion with the electron gas can change this behavior, that is whether the susceptibility of the impurity interacting with the electrons diverges or not.
\indent To that end we will study a model inspired by the $s-d$ Hamiltonian, which is an operator that acts on $\mathcal F_L\otimes\mathbb C^2$ in which $\mathcal F_L$ is the Fock space of a length-$L$ chain of spin-1/2 Fermions (the electrons) and $\mathbb C^2$ is the state space for the two-level impurity. The expression of the Hamiltonian, in the presence of a magnetic field of amplitude $h$ in the direction $\bm\omega\equiv(\bm\omega_1,\bm\omega_2,\bm\omega_3)$, is
\item$c_\alpha^\pm(x)\,\alpha=\uparrow,\downarrow$ are creation and annihilation operators acting on the electrons,
\item$\sigma^j\equiv\tau^j,\,j=1,2,3$, are the Pauli matrices, ($\tau^j$ acts on the impurity),
\item$x$ is on the unit lattice and $-{L}/2$, ${L}/2$ are identified (periodic boundary),
\item$\Delta f(x)= f(x+1)-2f(x)+f(x-1)$ is the discrete Laplacian,
\item$\bm\omega=(\bm\omega_1,\bm\omega_2,\bm\omega_3)$ is the direction of the field, which is a norm-1 vector.
\item the $-1$ term in $H_0$ is the chemical potential, set to $-1$ (half-filling) for convenience.
\end{enumerate}
\medskip
\indent The model will be said to exhibit a {\it Kondo effect} if, no matter how small the coupling $\lambda_0$ is, as long as it is {\it antiferromagnetic} ({\it i.e.}\ $\lambda_0<0$), the susceptibility {\it remains finite and positive as $\beta\to\infty$ and continuous as $h\to0$}, while it diverges in presence of a ferromagnetic ({\it i.e.}\ $\lambda_0>0$) coupling.
\medskip
{\bf Remark}: In the present work, the {\it Kondo effect} is defined as an effect on the susceptibility of the impurity, and not on the resistivity of the electrons of the chain, which, we recall, was Kondo's original motivation [\cite{kondoKo64}]. The reason for this is that the magnetic susceptibility of the impurity is easier to compute than the resistivity of the chain, but still exhibits a non-trivial effect, as discussed by Wilson~[\cite{kondoWi75}].
\medskip
\indent Here the same questions will be studied in a hierarchical model defined below. The interest of this model is that various observables can be computed by iterating a map, which is explicitly computed and called the ``beta function'', involving few (six) variables, called ``running couplings''. The possibility of computing the beta function exactly for general Fermionic hierarchical models has been noticed and used in [\cite{kondoDo91}].
\medskip
{\bf Remark}: The hierarchical $s-d$ model {\it will not be an approximation} of~(\ref{kondoeqhamkondo}). It is a model that illustrates a simple mechanism for the control of the growth of relevant operators in a theory exhibiting a Kondo effect.
\medskip
\indent The reason why the Kondo effect is not easy to understand is that it occurs in the strong-coupling regime in which the impurity susceptibility in the interacting model is qualitatively different from its non-interacting counterpart. In the sense of the renormalization group it exhibits several ``relevant'', ``marginal'' and ``irrelevant'' running couplings: this makes any naive perturbative approach hopeless because all couplings become large ({\it i.e.}\ at least of O(1)) at large scale, no matter how small the interaction is, as long as $\lambda_0<0$, and thus leave the perturbative regime. It is among the simplest cases in which asymptotic freedom {\it does not occur}. Using the fact that the beta function of the hierarchical model can be computed exactly, its strong-coupling regime can easily be investigated.
\medskip
\indent In the sections below, we will define the hierarchical $s-d$ model and show numerical evidence for the following claims (in principle, such claims could be proved using computer-assisted methods, though, since the numerical results are very clear and stable, it may not be worth the trouble).
\smallskip
{\it If the interactions between the electron spins and the impurity are antiferromagnetic } ({\it i.e.}\ $\lambda_0<0$ in our notations), then
\begin{enumerate}
\item The {\it existence of a Kondo effect} can be proved in spite of the lack of asymptotic freedom and formal growth of the effective Hamiltonian away from the trivial fixed point, {\it because the beta function can be computed exactly} (in particular non-pertubatively).
\item In addition, there exists an inverse temperature $\beta_K=2^{n_K(\lambda_0)}$ called the {\it Kondo} inverse temperature, such that the Kondo effect manifests itself for $\beta>\beta_K$. Asymptotically as $\lambda_0\to0$, $n_K(\lambda_0)=c_1|\lambda_0|^{-1}+O(1)$. The Kondo inverse temperature $\beta_K$ roughly corresponds to the limit at which perturbation theory breaks down.
\item Denoting the magnetic field by $h$, if $h>0$ and $\beta_Kh\ll1$, the flow of the running couplings tends to a trivial fixed point ($h$-independent but different from $0$) which is reached on a scale $r(h)$ which, {asymptotically} as $h\to0$, is $r(h)=c_r \log h^{-1}+O(1)$.
\end{enumerate}
\medskip
{\it The picture is completely different in the ferromagnetic case,} in which the susceptibility diverges at zero temperature and the flow of the running couplings is not controlled by the non trivial fixed point.
\medskip
{\bf Remark}: Unlike in the model studied by Wilson [\cite{kondoWi75}], the $T=0$ nontrivial fixed point is {\it not} infinite in the hierarchical $s-d$ model: this shows that the Kondo effect can, in some models, be somewhat subtler than a rigid locking of the impurity spin with an electron spin [\cite{kondoNo74}].
\medskip
\indent Technically this is one of the few cases in which functional integration for Fermionic fields is controlled by a non-trivial fixed point and can be performed rigorously and applied to a concrete problem.
\medskip
{\bf Remark:}\listparpenalty
\begin{enumerate}
\item It is worth stressing that in a system consisting of two classical spins with coupling $\lambda_0$ the susceptibility at $0$ field is $4\beta(1+e^{-2\beta\lambda_0})^{-1}$, hence it vanishes at $T=0$ in the antiferromagnetic case and diverges in the ferromagnetic and in the free case. Therefore this simple model does {\it not} exhibit a Kondo effect.
\item In the exactly solvable XY model, which can be shown to be equivalent to a spin-less analogue of~(\ref{kondoeqhamkondo}), the susceptibility can be shown to diverge in the $\beta\to\infty$ limit, see~[\cite{kondoBGJ15}, appendix~7] for details. The XY model therefore does {\it not} exhibit a Kondo effect either.
\end{enumerate}\unlistparpenalty
\section{Functional integration in the $s-d$ model}\label{kondosecfunint}
\indent In [\cite{kondoWi75}], Wilson studies the $s-d$ model using renormalization group techniques in a Hamiltonian context. In the present work, our aim is to reproduce, in a simpler model, analogous results using a formalism based on functional integrals.
\indent In this section, we give a rapid review of the functional integral formalism we will use, following~\-[\cite{kondoBG90}, \cite{kondoSh94}]. We will not attempt to reproduce all technical details, since it will merely be used as an inspiration for the definition of the hierarchical model in section~\ref{kondosechierk}.
\indent We introduce an extra dimension, called {\it imaginary time}, and define new creation and annihilation operators:
for $\alpha\in\{\uparrow,\downarrow\}$, to which we associate anti-commuting {\it Grassmann variables}:
\begin{equation}
c_\alpha^\pm(x,t)\longmapsto\psi_\alpha^\pm(x,t).
\label{kondoeqfermgrass}\end{equation}
Functional integrals are expressed as ``Gaussian integrals'' over the Grassmann variables (this means that all integrals will be defined and evaluated via the ``Wick rule''):
\indent By a direct computation [\cite{kondoBG90},~(2.7)], we find that in the limit $L,\beta\to\infty$, if $e(k):=(1-\cos k)-1\equiv-\cos k$ (assuming the Fermi level is set to $1$, {\it i.e.}\ the Fermi momentum to $\pm\frac\pi2$) then
If $\beta,L$ are finite, $\int\,\frac{dk_0 dk}{(2\pi)^2}$ in~(\ref{kondoeqpropk}) has to be understood as $\frac1\beta\sum_{k_0}\frac1L \sum_k$, where $k_0$ is the ``Matsubara momentum'' $k_0=\frac\pi{\beta}+\frac{2\pi}\beta n_0$, $n_0\in\mathbb Z$, $|n_0|\leqslant\frac12\beta$, and $k$ is the linear momentum $k=\frac{2\pi}L n$, $n\in[-L/2,L/2-1]\cap\mathbb Z$.
\medskip
\indent In the functional representation, the operator $V$ of~(\ref{kondoeqhamkondo}) is substituted with the following function of the Grassmann variables~(\ref{kondoeqfermgrass}):
$V(t)$ is thus defined as an operator on the Hilbert space of the impurity, that is, on $\mathbb C^2$, with coefficients that are linear combinations of Grassmann fields. Notice that $V(t)$ only depends on the fields located at the site $x=0$. This is important because it will allow us to reduce the problem to a 1-dimensional one [\cite{kondoAY69}, \cite{kondoAYH70}].
\indent The average of a physical observable $F$ localized at $x=0$, which is a polynomial in the fields $\psi_{\alpha}^\pm(0,t)$, will be denoted by
in which $P_0(d\psi)$ is the Gaussian Grassmannian measure over the fields $\psi_\alpha^\pm(0,t)$ localized at the site $0$ and with propagator $g_{\alpha}(0,\tau)$, the trace is taken over the Hilbert space of the impurity ($\mathbb C^2$), and $Z$ is a normalization factor called the {\it partition function}:
\indent The propagators can be split into scales by introducing a smooth cutoff function $\chi$ which is different from $0$ only on $(\frac14,1)$ and, denoting $N_\beta:=\log_2\beta$, is such that $\sum_{m=-N_\beta}^\infty\chi(2^{-2m}z^2)=1$ for all $|z|\in[\frac{\pi}{\beta},N_\beta]$. Let
where $m_0$ is an integer of order one (see below).
\medskip
{\bf Remark}: The $\omega=\pm$ label refers to the ``quasi particle'' momentum $\omega p_F$, where $p_F$ is the Fermi momentum. The usual approach [\cite{kondoBG90}, \cite{kondoSh94}] is to decompose the field $\psi$ into quasi-particle fields:
indeed, the introduction of quasi particles [\cite{kondoBG90}, \cite{kondoSh94}], is key to separating the oscillations on the Fermi scale $p_F^{-1}$ from the propagators thus allowing a ``naive'' renormalization group analysis of Fermionic models in which multiscale phenomena are important (as in the theory of the ground state of interacting Fermions [\cite{kondoBG90}, \cite{kondoBGe94}], or as in the $s-d$ model). In this case, however, since the fields are evaluated at $x=0$, such oscillations play no role, so we will not decompose the field.
\medskip
\indent We set $m_0$ to be small enough ({\it i.e.}\ negative enough) so that $2^{m_0}p_F\leqslant1$ and introduce a first {\it approximation}: we neglect $g^{[\mathrm{uv}]}$ and replace $e(k)$ in~(\ref{kondoeqpropk}) by its first order Taylor expansion around $\omega p_F$, that is by $\omega k$. As long as $m_0$ is small enough, for all $m\leqslant m_0$ the supports {of the two functions $\chi(2^{-2m}((k-\omega\pi/2)^2+k_0^2))$, $\omega=\pm1$}, which appear in the first of~(\ref{kondoeqpropdcmp}) do not intersect, and approximating $e(k)$ by $\omega k$ is reasonable. We shall hereafter fix $m_0=0$ thus avoiding the introduction of a further length scale and keeping only two scales when no impurity is present.
\indent Since we are interested in the {\it infrared} properties of the system, we consider such approximations as minor and more of a {\it simplification} rather than an approximation, since the ultraviolet regime is expected to be trivial because of the discreteness of the model in the operator representation.
\indent After this approximation, the propagator of the model reduces to
{\bf Remark:} by \label.(\ref{kondoeqpropscale}) this is equivalent to stating that the propagator associated to the $\psi^{[m]}$ field is $2^{-m}g^{[m]}$.
\indent Notice that the function $g^{[0]}(0,t)$ decays faster than any power as $t$ tends to $\infty$ (as a consequence of the smoothness of the cut-off function $\chi$), so that at any fixed scale $m\leqslant0$, fields $\psi^{[m]}$ that are separated in time by more than $2^{-m}$ can be regarded as (almost) independent.
\medskip
\indent The decomposition into scales allows us to express the quantities in~(\ref{kondoeqavggrass}) inductively (see~(\ref{kondoeqeffpotrec})). For instance the partition function $Z$ is given by
\indent In this section, we define a hierarchical $s-d$ model, localized at $x=0$ (the location of the impurity), inspired by the discussion in the previous section and the remark that the problem of the Kondo effect is reduced there to the evaluation of a functional integral over the fields $\psi(x,t)${\it with $x\equiv0$}. The hierarchical model is a model that is represented using a functional integral, that shares a few features, namely the scaling properties, with the functional integral described in section~\ref{kondosecfunint}, which are essential to the Kondo effect. Therein the fields $\psi^{[m]}$ evaluated at $x=0$ are assumed to be constant in $t$ on scale $2^{-m}, m=0,-1,-2\ldots$, and the propagator $g^{[m]}(0,\tau)$ with large Matsubara momentum $k_0$ are neglected ($g^{[uv]}=0$ in~(\ref{kondoeqpropdcmp})).
\indent The hierarchical $s-d$ model is defined by introducing a family of {\it hierarchical fields} and specifying a {\it propagator} for each pair of fields. The average of any monomial of fields is then computed using the Wick rule.
\indent As a preliminary step, we pave the time axis $\mathbb R$ with boxes of size $2^{-m}$ for every $m\in\{0,-1,\ldots,-N_\beta\}$. To that end, we define the set of {\it boxes on scale $m$} as
\indent Given a box $\Delta\in\mathcal Q_m$, we define $t_\Delta$ as the center of $\Delta$; conversely, given a point $t\in\mathbb R$, we define $\Delta^{[m]}(t)$ as the (unique) box on scale $m$ that contains $t$.
\indent A naive approach would then be to define the hierarchical model in terms of the fields $\psi^{[m]}_{t_\Delta}$, and neglect the propagators between fields in different boxes, but, as we will see below, such a model would be trivial (all propagators would vanish because of Fermi statistics).
\indent Instead, we further decompose each box into two {\it half boxes}: given $\Delta\in\mathcal Q_m$ and $\eta\in\{-,+\}$, we define
for $m< 0$ and similarly for $m=0$. Thus $\Delta_{-}$ is the lower half of $\Delta$ and $\Delta_{+}$ the upper half.
\indent The elementary fields used to define the hierarchical $s-d$ model will be {\it constant on each half-box} and will be denoted by $\psi_\alpha^{[m]\pm}(\Delta_{\eta})$ for $m\in\{0,-1,\cdots,$$-N_\beta\}$, $\Delta\in\mathcal Q_m$, $\eta\in\{-,+\}$, $\alpha\in\{\uparrow,\downarrow\}$.
\medskip
\indent We now define the propagators associated to $\psi$. The idea is to define propagators that are {\it similar} [\cite{kondoWi65}, \cite{kondoWi70}, \cite{kondoDy69}], in a sense made more precise below, to the non-hierarchical propagators defined in~(\ref{kondoeqprop}). Bearing that in mind, we compute the value of the non-hierarchical propagators between fields at the centers of two half boxes: given a box $\Delta\in\mathcal Q_{0}$ and $\eta\in\{-,+\}$, let $\delta:=2^{-1}$ denote the distance between the centers of $\Delta_-$ and $\Delta_+$, we get
in which $a$ and $b$ are constants. We define the hierarchical propagators, drawing inspiration from~(\ref{kondoeqpropcoarse}). In an effort to make computations more explicit, we set $a=b\equiv1$ and define
for $m\in\{0,-1,\cdots,-N_\beta\}$, $\eta\in\{-,+\}$, $\Delta\in\mathcal Q_m$, $\alpha\in\{\downarrow,\uparrow\}$. All other propagators are $0$. Note that if we had not defined the model using half boxes, all the propagators in~(\ref{kondoeqpropcoarse}) would vanish, and the model would be trivial.
\medskip
\indent In order to link back to the non-hierarchical model, we define the following quantities: for all $t\in\mathbb R$,
(recall that $m\leqslant0$ and $\Delta^{[m]}(t)\supset\Delta^{[m+1]}(t)$). The hierarchical model for the on-site Kondo effect so defined is such that the propagator on scale $m$ between two fields vanishes unless both fields belong to the same box and, at the same time, to two different halves within that box. In addition, given $t$ and $t'$ that are such that $|t-t'|>2^{-1}$, there exists one and only one scale $m_{(t-t')}$ that is such that $\Delta^{[m_{(t-t')}]}(t)=\Delta^{[m_{(t-t')}]}(t')$ and $\Delta^{[m_{(t-t')}+1]}(t)\neq\Delta^{[m_{(t-t')}+1]}(t')$. Therefore $\forall(t,t')\in\mathbb R^2$, $\forall(\alpha,\alpha')\in\{\uparrow,\downarrow\}^2$,
\indent The non-hierarchical analog of~(\ref{kondoeqprophiercmp}) is (we recall that $\left<\,\cdot\,\right>_K$ was defined in~(\ref{kondoeqavggrass}))
from which we see that the hierarchical model boils down to neglecting the $m'$ that are ``wrong'', that is those that are different from $m_{(t-t')}$, and approximating $g^{[m]}_{\psi}$ by $\mathrm{sign}(t-t')$.
\medskip
\indent The physical observables $F$ considered here will be polynomials in the hierarchical fields; their averages, by analogy with~(\ref{kondoeqavggrass}) and~(\ref{kondoeqZgrass}), will be
in which $\psi^\pm_\alpha(0,t)$ is now defined in~(\ref{kondoeqfieldhier}).
\medskip
\indent Note that since the model defined above only involves fields localized at the impurity site, that is at $x=0$, we only have to deal with $1$-dimensional Fermionic fields. {\it This does not mean} that the lattice supporting the electrons plays no role: on the contrary it will show up, and in an essential way, because the ``dimension'' of the electron field will be different from that of the impurity, as made already manifest by the factor $2^m\displaystyle\mathop{\longrightarrow}_{m\to-\infty}0$ in~(\ref{kondoeqprophiercmp}).
\medskip
\indent Clearly several properties of the non-hierarchical propagators,~(\ref{kondoeqpropapprox}), are not reflected in~(\ref{kondoeqprophiercmp}), though their {\it scaling} was preserved. However it will be seen that even so simplified the model exhibits a ``Kondo effect'' in the sense outlined in section~\ref{kondosecintro}.
\section{Beta function for the partition function.}\label{kondosecbetapart}
\indent In this section, we show how to compute the partition function $Z$ of the hierarchical $s-d$ model (see~(\ref{kondoeqhieravg})), and introduce the concept of a {\it renormalization group flow} in this context. We will first restrict the discussion to the $h=0$ case, in which $V=V_0$; the case $h\ne0$ is discussed in section~\ref{kondosecbetakondo}.
\indent The computation is carried out in an inductive fashion by splitting the averages in~(\ref{kondoeqhieravg}) into partial averages over the fields on scale $m$. Given $m\in\{0,-1,\cdots,-N_\beta\}$, we define $\left<\,\cdot\,\right>_m$ as the partial average over $\psi^{[m]\pm}_{\alpha}(\Delta_\eta)$ for $\alpha\in\{\uparrow,\downarrow\}$, $\Delta\in\mathcal Q_m$ and $\eta\in\{-,+\}$, as well as
\indent Notice that the fields $\psi_{\alpha}^{[\leqslant m-1]\pm}(\Delta)$ play (temporarily) the role of {\it external fields} as they do not depend on the index $\eta$, and are therefore independent of the half box in which the {\it internal fields}$\psi_{\alpha}^{[\leqslant m]\pm}(\Delta_\eta)$ are defined. In addition, by iterating~(\ref{kondoeqhierfieldind}), we can rewrite~(\ref{kondoeqfieldhier}) as
in which $c^{[m-1]}\in\mathbb R$ is a constant and the constant term in $W^{[m-1]}$ is 1. By a straightforward induction, we then find that $Z$ is given again by~(\ref{kondoeqZmultiscale}) with the present definition of $c^{[m]}$ (see~(\ref{kondoeqeffpothier})).
\medskip
\indent We will now prove by induction that the hierarchical $s-d$ model defined above is {\it exactly solvable}, in the sense that~(\ref{kondoeqeffpothier}) can be written out {\it explicitly} as a {\it finite} system of equations. To that end it will be shown that $W^{[m]}$ can be parameterized by only two real numbers, $\bm\ell^{[m]}=(\ell^{[m]}_0,\ell^{[m]}_1)\in\mathbb R^2$ and, in the process, the equation relating $\bm\ell^{[m]}$ and $\bm\ell^{[m-1]}$ (called the {\it beta function}) will be computed:
which, by the anti-commutation of the Grassmann variables $\psi^{[\leqslant0]\pm}_{\alpha}$ implies that a product containing three or more $A_\eta^{[\leqslant0]i}$ vanishes, as well as
Therefore, by injecting~(\ref{kondoeqalgO}) into~(\ref{kondoeqtrotthier}), we find that $W^{[0]}$ can be written as in~(\ref{kondoeqeffpotform}) with $\bm\ell^{[0]}=(\lambda_0,\lambda_0^2/4)$.
\indent We then compute $W^{[m-1]}$ using~(\ref{kondoeqeffpothier}) and show that it can be written as in~(\ref{kondoeqeffpotform}). We first notice that the propagator in~(\ref{kondoeqprophier}) is diagonal in $\Delta$, and does not depend on the value of $\Delta$, therefore, we can split the averaging over $\psi^{[m]}(\Delta_\pm)$ for different $\Delta$. We thereby find that
\indent We then compute the average, which is a somewhat long computation, although finite (see appendix~\ref{kondoappids} for the main shortcuts). We find that
\indent This concludes the proof of~(\ref{kondoeqeffpotform}), and provides an explicit map, defined in~(\ref{kondoeqhierbeta}) and which we denote by $\mathcal R$, that is such that $\bm\ell^{[m]}=\mathcal R^{|m|}\bm\ell^{[0]}$. Finally, the $c^{[m]}$ appearing in~(\ref{kondoeqeffpothier}) is given by
\begin{equation}
c^{[m]}=(C^{[m+1]})^{2^{N_\beta+m}}.
\label{kondoeqhiercst}\end{equation}
\medskip
\indent The dynamical system defined by the map $\mathcal R$ in~(\ref{kondoeqhierbeta}) admits two non trivial fixed points, at $\bm\ell=\mathbf0$ and $\bm\ell=\bm\ell^*$ (see appendix~\ref{kondoappfixed} for a proof) with
where $x_0\approx0.15878626704216...$ is the real root of $4-19x-22x^2-107x^3=0$. A numerical analysis shows that, if the initial data $\lambda_0$ is small and $<0$, then the flow converges to $\bm\ell^*$, whereas it converges to $\mathbf0$ if $\lambda_0\geqslant0$.
\medskip
{\bf Remark}: Proving that the flow converges to $\bm\ell^*$ analytically is complicated by the somewhat contrived expression of $\bm\ell^*$. It is however not difficult to prove that if the flow converges, then it must go to $\bm\ell^*$ (see appendix~\ref{kondoappfixed}). Since the numerical iterations of the flow converge quite clearly, we will not attempt a full proof of the convergence to the fixed point.
\section{Beta function for the Kondo effect}\label{kondosecbetakondo}
\indent In this section, we discuss the Kondo effect in the hierarchical model: {\it i.e.}\ the phenomenon that as soon as the interaction is strictly repulsive ({\it i.e.}\ $\lambda_0<0$) the susceptibility of the impurity at zero temperature remains positive and finite, although it can become very large for small coupling. The problem will be rigorously reduced to the study of a dynamical system, extending the map $\bm\ell\to\mathcal R\bm\ell$ in~(\ref{kondoeqhierbeta}). The value of the susceptibility follows from the iterates of the map, as explained below. The computation will be performed numerically; a rigorous computer assisted analysis of the flow appears possible, but we have not attempted it because the results are very stable and clear.
\medskip
\indent We introduce a magnetic field of amplitude $h\in\mathbb R$ and direction $\bm\omega\in\mathcal S_2$ (in which $\mathcal S_2$ denotes the $2$-sphere) acting on the impurity. As a consequence, the potential $V$ becomes
\begin{equation}
V_h(t)=V_0(t)
-h\sum_{j\in\{1,2,3\}}\bm\omega_j\tau^j
\label{kondoeqpoth}\end{equation}
\indent The corresponding partition function is denoted by $Z_h$ and the free energy of the system by $f_h:=-\beta^{-1}\log Z_h$. The {\it impurity susceptibility} is then defined as
\indent The $h$-dependent potential and the constant term, {\it i.e.}\ $W^{[m]}_h$ and $c^{[m]}_h$, are then defined in the same way as in~(\ref{kondoeqeffpothier}), in terms of which,
\indent We compute $c^{[m]}_h$ in the same way as in section~\ref{kondosecbetapart}. Because of the extra term in the potential in~(\ref{kondoeqpoth}), the number of running coupling constants increases to six: indeed we prove by induction that $W^{[m]}_h$ is parametrized by six real numbers, $\bm\ell^{[m]}_h=(\ell_{0,h}^{[m]},\ell_{1,h}^{[m]},\ell_{4,h}^{[m]},\ell_{5,h}^{[m]},\ell_{6,h}^{[m]},\ell_{7,h}^{[m]})\in\mathbb R^6$ (the numbering is meant to recall that in~[\cite{kondoBGJ15}]):
where $\tilde h:=h/2$ (see appendix~\ref{kondoappinit} for a proof).
\indent For $m<0$, the computation follows the same general lines as in section~\ref{kondosecbetapart}, although the computation is longer, but can be performed easily using a computer ([\cite{Ja15}], see appendix~\ref{kondoappmeankondo}). The result of the computation is a map $\widetilde{\mathcal R}$ which maps $\ell_{n,h}^{[m]}$ to $\ell_{n,h}^{[m-1]}$, as well as the expression for the constant $C_h^{[m]}$. Their explicit expression is somewhat long, and is deferred to~(\ref{kondoeqbetasusc}).
\medskip
\indent By~(\ref{kondoeqhiercst}) and~(\ref{kondoeqfreeench}), we rewrite~(\ref{kondoeqsuscdef}) as
\indent In addition, the derivatives of $C_h^{[m]}$ can be computed exactly using the flow in~(\ref{kondoeqbetasusc}): indeed $\partial_hC_h^{[m]}=\partial_{\bm\ell} C_h^{[m]}\cdot\partial_h\bm\ell_h^{[m]}$ and similarly for $\partial^2_hC_h^{[m]}$, and $\partial_h\bm\ell_h^{[m]}$ can be computed inductively by deriving $\widetilde{\mathcal R}(\bm\ell)$:
and similarly for $\partial_h^2\bm\ell_h^{[m]}$. Therefore, using~(\ref{kondoeqbetasusc}) and its derivatives, we can inductively compute $\chi(\beta,h)$.
\medskip
\indent By a numerical study which produces results that are stable and clear we find the following results.
\medskip
\point If $\lambda_0\equiv\ell_0<0$ and $h=0$, then the flow tends to a nontrivial, $\lambda$-independent, fixed point $\bm\ell^*$ (see figure~\ref{kondofigbetaplot}).
\caption{Plot of $\frac{\bm\ell}{\bm\ell^*}$ as a function of the iteration step $N_\beta$ for $\lambda_0\equiv\ell_0=-0.02$. The {\it marginal} coupling $\ell_0$ (number 0, {\color{iblue}blue}) tends to its fixed value, closely followed by the irrelevant coupling $\ell_1$ (number 1, {\color{ired}red}).}
\label{kondofigbetaplot}
\end{figure}
\indent We define $n_j(\lambda_0)$ for $j=0,1$ as the step of the flow at which the right-discrete derivative of $\ell_j/\ell_j^*$ with respect to the step $N_\beta$ is largest. The reason for this definition is that, as $\lambda_0$ tends to $0$, the flow of $\ell_j$ tends to a step function, so that for each component $j$ the scale $n_j$ is a good measure of the number of iterations needed for that component to reach its fixed value. The {\it Kondo temperature}$\beta_K$ is defined as $2^{n_0(\lambda_0)}$, and is the temperature at which the non-trivial fixed point is reached by all components. For small $\lambda_0$, we find that (see figure~\ref{kondofigasymnj}), for $j=0,1,3$,
\caption{Phase diagram of the flow. There are two fixed points: $\bm\ell^*$ (which is linearly stable and represented by a red star) and $\bm\ell^*_0$ (which has one linearly unstable direction and one quadratically marginal and is represented by a green cross).}
\label{kondofigbetaphase}
\end{figure}
\indent When the running coupling constants are at $\bm\ell^*$, the susceptibility remains finite as $\beta\to\infty$ and positive, whereas when they are at $\bm\ell_0^*$, it grows linearly with $\beta$.
\indent In addition, when $\lambda_0<0$ the flow escapes along the unstable direction towards $\bm\ell^*$ after $n_K(\lambda_0)$ steps. The susceptibility is therefore finite for $\lambda_0<0$ (see figure~\ref{kondofigsuscbeta} (which may be compared to the exact solution [\cite{kondoAFL83}, figure~3])).
\indent If $\lambda_0>0$, then the flow approaches $\bm\ell^*_0$ from the $\lambda_0>0$ side, which is marginally stable, so the flow never leaves the vicinity of $\bm\ell_0^*$ and the susceptibility diverges as $\beta\to\infty$.
\caption{plot of $\chi(\beta,0)$ as a function of $\log_2\beta$ for $\lambda_0=-0.28$.}
\label{kondofigsuscbeta}
\end{figure}
\point We now discuss the flow at $h>0$ and address the question of continuity of the susceptibility in $h$ as $h\to0$. If $\lambda_0 <0$ and $h\ll\beta_K^{-1}=2^{-n_K(\lambda_0)}$, $\ell_0,\ell_1$ first behave similarly to the $h=0$ case and tend to the same fixed point $\bm\ell^*${and stay there until}$\ell_4$ through $\ell_7$ become large enough, after which the flow tends to a fixed point in which, $\ell_5=2$ and $\ell_j=0$ for $j\neq5$ (see figure~\ref{kondofigbetasuscplot}).
\caption{plot of $\frac{\bm\ell}{\bm\ell^*}$ as a function of the iteration step $N_\beta$ for $\lambda_0=-0.25$ and $h=2^{-40}$. Here $\ell_0^*,\ell_1^*$ are the components of the non-trivial fixed point $\bm\ell^*$ and $\ell_4^*$ through $\ell_7^*$ are the values reached by $\ell_4$ through $\ell_7$ of largest absolute value. The flow behaves similarly to that at $h=0$ until $\ell_4$ through $\ell_7$ become large, at which point the couplings decay to 0, except for $\ell_5$.}
\label{kondofigbetasuscplot}
\end{figure}
\indent Setting the initial conditions for the flow as $\ell_j=\ell_j^*$ for $j=0,1$ and $\ell_5=2\tanh(\tilde h)$, we define $ r_j(h)$ for $j=0,1,4,5,6,7$ as the step of the flow at which the discrete derivative of $\ell_j/\ell_j^*$ is respectively smallest (that is most negative) and largest. Thus $ r_j(h)$ measures when the flow leaves $\bm\ell^*$. We find that (see figure~\ref{kondofigasymrj}) for small $h$,
\begin{equation}
r_j(h)=c_r\log_2 h^{-1}+O(1),\quad c_r\approx2.6.
\label{kondoeqscalerj}\end{equation}
\medskip
\indent Note that the previous picture only holds if $r_j(h)\gg\log_2(\beta_K)$, that is $\beta_Kh\ll1$.
\medskip
\indent The susceptibility at $0<h\ll\beta_K^{-1}$ is continuous in $h$ as $h\to0$ (see figure~\ref{kondofigsusch}). This, combined with the discussion in point (2) above, implies that the hierarchical $s-d$ model exhibits a Kondo effect.
\caption{plot of $\chi(\beta,h)$ for $h\leqslant10^{-6}$ at $\lambda_0=-0.28$ and $\beta=2^{20}$ (so that the largest value for $\beta h$ is $\sim1$).}
\label{kondofigsusch}
\end{figure}
\section{Concluding remarks}\label{kondosecconc}
\point The hierarchical $s-d$ model defined in section~\ref{kondosechierk} is a well defined statistical mechanics model, for which the partition function and correlation functions are unambiguously defined and finite as long as $\beta$ is finite. In addition, since the magnetic susceptibility of the impurity can be rewritten as a correlation function:
$\chi(\beta,0)$ is a thermodynamical quantity of the model.
\bigskip
\point The qualitative behavior of the renormalization group flow is unchanged if all but the relevant and marginal running coupling constants ({\it i.e.}\ six constants out of nine) of the beta functions of section~\ref{kondosechierk},\ref{kondosecbetapart} are neglected ({\it i.e.}\ set to $0$ at every step of the iteration). In particular, we still find a Kondo effect.
\bigskip
\point In the hierarchical model defined in section~\ref{kondosechierk}, quantities other than the magnetic susceptibility of the impurity can be computed, although all observables must only involve fields localized at $x=0$. For instance, the response to a magnetic field acting on all sites of the Fermionic chain as well as the impurity cannot be investigated in this model, since the sites of the chain with $x\neq0$ are not accounted for.
\medskip
\subpoint We have attempted to extend the definition of the hierarchical model to allow observables on the sites of the chain at $x\neq0$, by paving the space-time plane with square boxes (instead of paving the time axis with intervals, see section~\ref{kondosechierk}), defining hierarchical fields for each quarter box and postulating a propagator between them by analogy with the non-hierarchical model. The magnetic susceptibility of the impurity is defined as the response to a magnetic field acting on every site of the chain and on the impurity, to which the susceptibility of the non-interacting chain is subtracted. We have found, iterating the flow numerically, that for such a model {\it there is no Kondo effect}, that is the impurity susceptibility diverges as $\beta$ when $\beta\to\infty$.
\medskip
\subpoint A second approach has yielded better results, although it is not completely satisfactory. The idea is to incorporate the effect of the magnetic field $h$ acting on the Fermionic chain into the propagator of the non-hierarchical model, after which the potential $V$ only depends on the site at $x=0$, so that the hierarchical model can be defined in the same way as in section~\ref{kondosechierk} but with {\it an $h$-dependent propagator}. In this model, we have found that {\it there is a Kondo effect}.
\vfill
\eject
\appendix
\section{Some identities.}\label{kondoappids}
\indent In this appendix, we state three relations used to compute the flow equation~(\ref{kondoeqhierbeta}), which follow from a patient algebraic meditation:
where the lower case $\mathbf a$ denotes $\left<\,\mathbf A_1\,\right>\equiv\left<\,\mathbf A_2\,\right>$ and $a^{j_1,j_2}=\left<\,\psi^+_1\sigma^{j_1}\sigma^{j_2}\psi^-_1\,\right> =\left<\,\psi^+_2\sigma^{j_1}\sigma^{j_2}\psi^-_2\,\right>$.
in which we dropped the $^{[m]}$ exponent on the right side. By considering the linearized flow equation (around $\ell_j=0$), we find that $\ell_0,\ell_4,\ell_6$ are {\it marginal}, $\ell_2${\it relevant} and $\ell_1,\ell_7${\it irrelevant}. The consequent linear flow is {\it very different} from the full flow discussed in section~\ref{kondosecbetakondo}.
\section{Fixed points at $h=0$}\label{kondoappfixed}
\indent We first compute the fixed points of~(\ref{kondoeqhierbeta}). If $\ell_0=0$, then $\ell_1=0$. If $\ell_0\not=0$, then the equation for the fixed point of~(\ref{kondoeqhierbeta}) becomes
In addition, it follows from~(\ref{kondoeqhierbeta}) that, if $\lambda_0<0$, then (recall that $\ell_0^{[0]}=\lambda_0$ and $\ell_1^{[0]}=\lambda_0^2/2$)
\begin{equation}
\ell_0^{[m]}<0,\quad
0\leqslant\ell_1^{[m]}<\frac1{12}
\label{kondoeqineqfixedcv}\end{equation}
for all $m\leqslant0$, which implies that the set $\{\bm\ell\ |\ \ell_0<0,\ \ell_1\geqslant0\}$ is stable under the flow. In addition, if $\ell_0^{[m]}>-\frac23$, then $\ell_0^{[m-1]}<\ell_0^{[m]}$, so that the flow cannot converge to $\bm\ell_0^*$. Therefore if the flow converges, then it converges to $\bm\ell^*$.
\section{Initial condition for the flow}\label{kondoappinit}
\indent In this appendix, we prove~(\ref{kondoeqinitcd}), starting from~(\ref{kondoeqtrotthier}):
\section{Asymptotic behavior of $n_j(\lambda_0)$ and $r_j(h)$}\label{kondoappasymn}
\indent In this appendix, we show plots to support the claims on the asymptotic behavior of $n_j(\lambda_0)$ (see~(\ref{kondoeqscalenj}), figure~\ref{kondofigasymnj}) and $r_j(h)$ (see~(\ref{kondoeqscalerj}), figure~\ref{kondofigasymrj}). The plots below have error bars which are due to the fact that $n_j(\lambda_0)$ and $r_j(h)$ are integers, so their value could be off by $\pm1$.
\caption{plot of $n_j(\lambda_0)|\lambda_0|$ for $j=0$ ({\color{iblue}blue}, color online) and $j=1$ ({\color{ired}red}) as a function of $|\log_{10}|\lambda_0||$. This plot confirms~(\ref{kondoeqscalenj}).}
\caption{plot of $r_j(h)|\log_2(h)||$ as a function of $|\log_{2}(h)|$. This plot confirms~(\ref{kondoeqscalerj}).}
\label{kondofigasymrj}
\end{figure}
\section{{\expandonce\tt meankondo}: a computer program to compute flow equations}\label{kondoappmeankondo}
\indent The computation of the flow equation~(\ref{kondoeqbetasusc}) is quite long, but elementary, which makes it ideally suited for a computer. We therefore wrote a program, called {\tt meankondo}~[\cite{Ja15}], and used it to carry out this computation. One interesting feature of {\tt meankondo} is that it has been designed in a {\it model-agnostic} way, that is, unlike its name might indicate, it is not specific to the $s-d$ model and can be used to compute and manipulate flow equations for a wide variety of Fermionic hierarchical models. It may therefore be useful to anyone studying such models, so we have thoroughly documented its features and released the source code under an Apache~2.0 license. See {\color{blue}\url{http://ian.jauslin.org/software/meankondo}} for details.