# UC San Diego UC San Diego Electronic Theses and Dissertations

# Title

Digital Enhancement Techniques for Digital Fractional-N Phase-Locked Loops

Permalink

https://escholarship.org/uc/item/7fg252ss

Author

Alvarez-Fontecilla, Cristian Enrique

Publication Date 2021

Peer reviewed|Thesis/dissertation

### UNIVERSITY OF CALIFORNIA SAN DIEGO

Digital Enhancement Techniques for Digital Fractional-N Phase-Locked Loops

A dissertation submitted in partial satisfaction of the requirements for the degree

Doctor of Philosophy

in

Electrical Engineering (Electronic Circuits and Systems)

by

# Cristián Enrique Álvarez Fontecilla

Committee in charge:

Professor Ian A. Galton, Chair Professor Peter M. Asbeck Professor Gert Cauwenberghs Professor Drew A. Hall Professor Laurence B. Milstein

2021

Copyright

Cristián Enrique Álvarez Fontecilla, 2021

All rights reserved.

The dissertation of Cristián Enrique Álvarez Fontecilla is approved, and it is acceptable in quality and form for publication on microfilm and electronically:

Chair

University of California San Diego

2021

### DEDICATION

To Castor Barracuda.

# TABLE OF CONTENTS

| Signature Pageiii                                                                    |
|--------------------------------------------------------------------------------------|
| Dedicationiv                                                                         |
| Table of Contents                                                                    |
| List of Figuresviii                                                                  |
| List of Tablesxii                                                                    |
| Acknowledgementsxiii                                                                 |
| Vitaxvi                                                                              |
| Abstract of the Dissertationxvii                                                     |
| Chapter 1 Multi-Rate DEM with Mismatch-Noise Cancellation for DCOs in Digital PLLs 1 |
| I. Introduction 1                                                                    |
| II. Background Information                                                           |
| III. Effects of FCE Mismatches                                                       |
| IV. FCE Mismatch Model 10                                                            |
| V. Multi-rate DEM11                                                                  |
| A. Starting Point: Single-Rate Segmented DEM11                                       |
| B. Extension to Multi-Rate Segmented DEM13                                           |
| VI. Adaptive FCE Mismatch Noise Cancellation17                                       |
| A. MNC Sequence Application17                                                        |
| B. FCE Mismatch Error Measurement                                                    |
| C. MNC Coefficients Estimation                                                       |
| VII. Simulation Results                                                              |
| Appendix A                                                                           |
| Appendix B                                                                           |

| Acknowledgements                                                                  |
|-----------------------------------------------------------------------------------|
| Figures                                                                           |
| Chapter 2 Delta-Sigma FDC Enhancements for FDC-Based Digital Fractional-N PLLs 48 |
| I. Introduction                                                                   |
| II. $\Delta\Sigma$ FDC Digital Fractional- <i>N</i> PLL Overview                  |
| A. $\Delta\Sigma$ FDC-Based PLL                                                   |
| B. Original $\Delta\Sigma$ FDC Architecture                                       |
| C. Original $\Delta\Sigma$ FDC Issues                                             |
| III. Improved $\Delta\Sigma$ FDC                                                  |
| A. Proposed $\Delta\Sigma$ FDC Architecture                                       |
| B. Proposed $\Delta\Sigma$ FDC Features                                           |
| IV. Digital Gain Calibration Technique60                                          |
| A. Effects of $\Delta\Sigma$ FDC Forward Path Gain Error                          |
| B. Proposed Digital Gain Calibration Technique62                                  |
| C. Convergence Analysis                                                           |
| D. Gain Calibration Technique for CP-Based $\Delta\Sigma$ FDCs                    |
| V. Simulation Results                                                             |
| VI. Conclusion                                                                    |
| Acknowledgements                                                                  |
| Figures70                                                                         |
| Tables                                                                            |
| References                                                                        |
| Chapter 3 Spectral Breathing and its Mitigation in Digital Fractional-N PLLs      |
| I. Introduction                                                                   |
| II. Background Information                                                        |

| A. Conventional DCO Frequency Control Technique |  |
|-------------------------------------------------|--|
| B. MR-DEM and MNC Techniques                    |  |
| III. Implementation Details                     |  |
| A. MR-DEM Encoder                               |  |
| B. MNC Logic                                    |  |
| C. DCO FCE Banks                                |  |
| IV. Measurement Results                         |  |
| V. Conclusion                                   |  |
| Acknowledgements                                |  |
| Figures                                         |  |
| Tables                                          |  |
| References                                      |  |

# LIST OF FIGURES

| Figure 1: Conventional frequency control technique for an LC-based DCO                                                       |
|------------------------------------------------------------------------------------------------------------------------------|
| Figure 2: Example waveforms related to (15) and (16) for an FCE input bit sequence of 1, 0,                                  |
| 1, 1, 0                                                                                                                      |
| Figure 3: Segmented DEM encoder example                                                                                      |
| Figure 4: (a) Segmenting switching block, (b) non-segmenting switching block, and (c)                                        |
| switching sequence generator                                                                                                 |
| Figure 5: Multi-rate DEM encoder example                                                                                     |
| Figure 6: Slow DEM encoder example                                                                                           |
| Figure 7: Details of the second-order digital $\Delta\Sigma$ modulator                                                       |
| Figure 8: Fractional path of the example multi-rate DEM encoder shown in Fig. 5 modified to                                  |
| accommodate $e_{\text{MNC}}[p_t]$                                                                                            |
| Figure 9: General form of a digital fractional- <i>N</i> PLL                                                                 |
| Figure 10: (a) Synchronization circuit used at DCO input, and (b) illustration of the clock                                  |
| signals within the DCO, $p_t$ , and $n_t = g(p_t)$ for $f_{\text{fast}} = 4.5 f_{\text{ref}}$                                |
| Figure 11: (a) Digital fractional-N PLL with multi-rate DEM and MNC, (b) details of the                                      |
| MNC logic, and (c) details of each switching sequence residue estimator                                                      |
| Figure 12: Example frequency transitions normalized to $\Delta_i/2$ versus time over $T_{\text{fast}} = 8T_{\text{PLL}}$ for |
| six different FCEs                                                                                                           |
| Figure 13: Simulated PLL phase noise PSD versus frequency                                                                    |
| Figure 14: MNC coefficient error evolution over time for $K_a = 2^{-3}$ and $K_b = 2^{-5}$                                   |

| Figure 15: MNC coefficient error evolution over time for MNC gains that change over time.                                                                                                                                                                                                                                                            |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Initially $K_a = 2^{-1}$ and $K_b = 2^{-2}$ , and after 3.5.107 reference periods $K_a = 2^{-6}$ and $K_b =$                                                                                                                                                                                                                                         |
| 2 <sup>-6</sup>                                                                                                                                                                                                                                                                                                                                      |
| Figure 16: High-level block diagram of a second-order $\Delta\Sigma$ FDC-based digital fractional-N                                                                                                                                                                                                                                                  |
| PLL with quantization noise cancellation (QNC)70                                                                                                                                                                                                                                                                                                     |
| Figure 17: (a) Simplified block diagram of the DMRO-based $\Delta\Sigma$ FDC described in [11], and                                                                                                                                                                                                                                                  |
| (b) simplified block diagram of the proposed $\Delta\Sigma$ FDC                                                                                                                                                                                                                                                                                      |
| Figure 18: Signal processing equivalents of the $\Delta\Sigma$ FDCs shown in Fig. 17(a) and Fig. 17(b)                                                                                                                                                                                                                                               |
| when they are locked71                                                                                                                                                                                                                                                                                                                               |
| Figure 19: Example timing diagram of (a) the original $\Delta\Sigma$ FDC and (b) the proposed $\Delta\Sigma$ FDC.                                                                                                                                                                                                                                    |
|                                                                                                                                                                                                                                                                                                                                                      |
|                                                                                                                                                                                                                                                                                                                                                      |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with (79), where $Q_r$ and $Q_c$ are                                                                                                                                                                                                                                              |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)                                                                                                                                       |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$                          |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$<br>FDC and ( <b>79</b> ) |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$<br>FDC and ( <b>79</b> ) |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$<br>FDC and ( <b>79</b> ) |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$<br>FDC and ( <b>79</b> ) |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$<br>FDC and ( <b>79</b> ) |
| Figure 20: (a) Behavioral model of the proposed $\Delta\Sigma$ FDC with ( <b>79</b> ), where $Q_r$ and $Q_c$ are<br>replaced by the additive error sources $e_{qr}[n]$ and $\hat{e}_q[n]$ , respectively, and (b)<br>linearized model of the $\Delta\Sigma$ FDC-based PLL shown in Fig. 16 with the proposed $\Delta\Sigma$<br>FDC and ( <b>79</b> ) |

ix

| Figure 24 | : Simulated PL | L phase noise | PSD with   | and without       | gain calib | oration ena | abled for | four |
|-----------|----------------|---------------|------------|-------------------|------------|-------------|-----------|------|
|           | combinations   | of PLL bandw  | ridth (BW) | and $\delta^{-1}$ |            |             |           | 76   |

Figure 26: Normalized histogram of u(t) pulse width minus  $T_{\bar{u}}$  calculated via (**73**) for the original FDC [(a) and (c)] and the proposed FDC [(b) and (d)], where  $\alpha = 0.001008987426758$  for (a) and (b) and  $\alpha = 0.401008987426758$  for (c) and (d)...77

| Figure 32: MNC logic bit-level implementation.              |  |
|-------------------------------------------------------------|--|
| Figure 33: Integer and fractional FCE banks layout.         |  |
| Figure 34: Measured PLL phase noise at $f_{PLL} = 6.56$ GHz |  |

| Figure 35: Measured PLL phase noise at $f_{PLL} = 6.56$ GHz for (a) MR-DEM enabled and MNC |
|--------------------------------------------------------------------------------------------|
| disabled, (b) MR-DEM and MNC (stat. branch only) enabled, and (c) MR-DEM                   |
| and MNC (both stat. and ISI branches) enabled 106                                          |
| Figure 36: Measured PLL phase noise at $f_{PLL} = 6.56$ GHz with MR-DEM enabled for (a)    |

MNC disabled, and (b) MNC (both stat. and ISI branches) enabled. ..... 107

### LIST OF TABLES

| Table 1: Parameters used for the simulation.  | .78 |
|-----------------------------------------------|-----|
|                                               |     |
| Table 2: PNR digital block power consumption. | 108 |

#### ACKNOWLEDGEMENTS

I am grateful to my advisor, Professor Ian Galton, for his support, advice, and mentorship. I believe his problem-solving skills and his approach to engineering are remarkable, and his style leaves a mark on everyone who works with him.

I am grateful to my parents for giving me the opportunities they gave me and raising me the way they did. Thanks to them I got the tools necessary to embark on this trip. I am grateful to Pedro Unda, my school teacher, whose passion for teaching and physics lead me to study electrical engineering. I am grateful to Ángel Abusleme, my master's advisor, who played a key role in my graduate career. I am also grateful to all my family for their support. My motherin-law deserves a special mention; among other things, she came to visit us several times and helped with the kids so that I could work on my research during the most critical times.

I am grateful to my lab mates Kevin, Julian, Raghu, Amr, and Eslam for their friendship and constant support. It was nice to share this experience with people who are relentless in their search for truth and the ultimate understanding of things. All my lab mates played an important role during my time here, and they were extremely valuable resources to me in one way or another. Raghu's contribution to my Ph.D. is not only restricted to the numerous technical discussions we had. He also made my passage through the lab a lot smoother by helping me with all computer-related problems. Having technical discussions with Amr, whose thinking process I admire, always helped me with my own thinking process, and working side by side with Eslam taught me a thing or two about taping out chips. I also have to thank Eslam for his *Frozen*-themed debit card - what joy it brought to the lab! I am also grateful to Colin, who helped me multiple times even though he was not a member of the lab anymore. Last but not the least, I want to thank my wife Rosario. Joining me in this project was not easy, and it required a big sacrifice from her. Not only she had to put her professional career on hold, but she undoubtedly suffered the greatest hardship throughout this process, having to raise two small children alone much of the time. My appreciation for her efforts cannot be overstated. Because I know she will read this eventually, I want to play safe and be very explicit about where she is on the list of the people I am grateful to. Sorted from "most grateful to" to "least grateful to", she is hands down the first one on that list.

I believe I am also supposed to be grateful to my two kids, Arturo and Matilde. They brought joy to my house, and they are the most important things in my life, but they also made the Ph.D. considerably harder which is why they get no credit for this.

Chapter 1, in full, has been published in the IEEE Transactions on Circuits and Systems I: Regular Papers, volume 65, number 10, pages 3125-3137, October 2018. E. Alvarez-Fontecilla, C. Venerus, I. Galton, 2018. The dissertation author is the primary investigator and author of this paper. Professor Ian Galton supervised the research which forms the basis for this paper.

Chapter 2, in full, has been published in the IEEE Transactions on Circuits and Systems I: Regular Papers. E. Alvarez-Fontecilla, A. I. Eissa, E. Helal, C. Weltin-Wu, I. Galton, 2021. The dissertation author is the primary investigator and author of this paper. Professor Ian Galton supervised the research which forms the basis for this paper.

Chapter 3, in full, has been submitted for publication to the IEEE Journal of Solid-State Circuits. E. Alvarez-Fontecilla, E. Helal, A. I. Eissa, I. Galton, 2021. The dissertation author is the primary investigator and author of this paper. Professor Ian Galton supervised the research which forms the basis for this paper.

# VITA

| 2021 | Doctor of Philosophy in Electrical Engineering (Electronic Circuits and Systems), University of California San Diego, USA |
|------|---------------------------------------------------------------------------------------------------------------------------|
| 2013 | Master of Science in Electrical Engineering, Pontificia Universidad Católica de Chile, Santiago, Chile                    |
| 2011 | Bachelor of Science in Electrical Engineering, Pontificia Universidad Católica de Chile, Santiago, Chile                  |

### ABSTRACT OF THE DISSERTATION

Digital Enhancement Techniques for Digital Fractional-N Phase-Locked Loops

by

Cristián Enrique Álvarez Fontecilla

Doctor of Philosophy in Electrical Engineering (Electronic Circuits and Systems)

University of California San Diego, 2021

Professor Ian A. Galton, Chair

Phase-locked loops (PLLs) are critical components in modern electronics communication systems, where they are used to synthesize local oscillator signals for modulation and demodulation in wireless transceivers. They are also used to clock digital-to-analog converters (DACs), analog-to-digital converters (ADCs), and digital processors.

Most PLLs incorporate either analog filters and voltage-controlled oscillators (VCOs) or digital filters and digitally-controlled oscillators (DCOs). The former are called analog PLLs

and the latter are called digital PLLs. To date, analog PLLs have the best phase error performance, but digital PLLs have the lowest circuit area and are more compatible with highly-scaled CMOS integrated circuit (IC) technology. Thus, improving the performance of digital PLLs has been the subject of intensive research for many years.

The first chapter of this dissertation presents a multi-rate dynamic element matching (MR-DEM) technique and an adaptive mismatch-noise cancellation (MNC) technique that work together to mitigate spectral breathing in digital PLLs, a problem caused by mismatches among the frequency control elements (FCEs) within the DCO. It presents a theoretical analysis of the techniques, as well as behavioral simulation results that support the analysis.

The second chapter of this dissertation presents delta-sigma ( $\Delta\Sigma$ ) frequency-to-digital converter (FDC) all-digital enhancements for FDC-based digital fractional-*N* PLLs. It describes an enhanced  $\Delta\Sigma$  FDC architecture that has relaxed timing constraints and reduced phase-frequency detector (PFD) output pulse-span compared to prior-art  $\Delta\Sigma$  FDCs. It also describes and analyses a  $\Delta\Sigma$  FDC forward gain calibration technique that reduces the complexity associated with the system's implementation and improves the phase noise performance of PLLs with high loop bandwidths.

The third chapter of this dissertation presents an integrated circuit high-performance PLL which implements the MR-DEM and MNC techniques presented in the first chapter. It demonstrates the detrimental effects of the spectral breathing phenomenon, as well as the effectiveness of the MR-DEM and MNC techniques to mitigate this problem.

#### **CHAPTER 1**

# MULTI-RATE DEM WITH MISMATCH-NOISE CANCELLATION FOR DCOS IN DIGITAL PLLS

**Abstract**—Mismatches among frequency control elements in digitally-controlled oscillators can be a significant source of phase error in digital phase-locked loops (PLLs). This paper presents a multi-rate dynamic element matching technique and an adaptive mismatchnoise cancellation (MNC) technique that work together to address this problem. The two techniques operate in back-ground during normal PLL operation, and the MNC technique has typical cold start convergence times of a few seconds.

### I. INTRODUCTION

High-performance phase-locked loops (PLLs) are critical components in modern electronic communication systems. For example, in wireless transceivers they generate radio

Manuscript received December 6, 2017; revised February 23, 2018; accepted March 20, 2018. Date of publication April 11, 2018; date of current version August 30, 2018. This work was supported by the National Science Foundation under Award 1617545. This paper was recommended by Associate Editor P. Rombouts. (Corresponding author: Enrique Alvarez-Fontecilla.)

E. Alvarez-Fontecilla and I. Galton are with the Electrical and Computer Engineering Department, University of California at San Diego, La Jolla, CA 92092-0407 USA (e-mail: cea007@ucsd.edu; galton@ucsd.edu).

C. Venerus is with Qualcomm Technologies, Inc., San Diego, CA 92121 USA (e-mail: cvenerus@qti.qualcomm.com).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TCSI.2018.2819024

frequency local oscillator signals for up-conversion and down-conversion of transmitted and received signals, the phase error of which often limits overall transceiver performance.

Most PLLs incorporate either analog filters and voltage-controlled oscillators (VCOs) or digital filters and digitally-controlled oscillators (DCOs). The former are often called *analog PLLs* and the latter are often called *digital PLLs*. To date, analog PLLs have the best phase error performance, but digital PLLs have the lowest circuit area and are more compatible with highly-scaled CMOS IC technology. Thus, reducing phase error in digital PLLs has been the subject of intensive research and development for over a decade [1-49].

Nevertheless, frequency control element (FCE) mismatches in DCOs remain a significant source of phase error in high-performance digital PLLs [39]. This problem has only been addressed in prior work via an offline calibration technique that requires several minutes to complete [16, 18]. This paper presents a multi-rate dynamic element matching (DEM) technique and an adaptive mismatch-noise cancellation (MNC) technique that work together to address the problem. Both techniques run in background during normal PLL operation, and the MNC technique typically converges in a few seconds from a cold start.

The paper describes the proposed multi-rate DEM and MNC techniques in detail. Section II provides DCO background information. Section III discusses the effects of FCE mismatches on the DCO frequency. Section IV presents an error model for FCE mismatches. Section V presents the multi-rate DEM technique. Section VI presents the MNC technique. Section VII presents behavioral simulation results that support the analysis of the paper. The proposed techniques are described in the context of an example to simplify the explanations.

#### **II. BACKGROUND INFORMATION**

A DCO is an oscillator whose frequency is controlled by one or more FCEs, each of which is controlled by a 1-bit digital sequence. For instance, each FCE in an LC-based DCO contributes to the DCO's tank a capacitance that takes on one of two values depending on the state of the FCE's input bit. Changing the FCE's input bit increases or decreases the DCO frequency by a fixed frequency step.

The instantaneous frequency of a DCO is given by a fixed offset frequency plus  $f_{tune}(t)$ , where

$$f_{\text{tune}}(t) = \sum_{i=1}^{N_{\text{FCE}}} f_i(t), \qquad (1)$$

 $N_{\text{FCE}}$  is the number of FCEs in the DCO, and  $f_i(t)$  is the contribution of the *i*th FCE to the DCO frequency. Ideally,

$$f_i(t) = \left(b_i[m_t] - \frac{1}{2}\right)\Delta_i,\tag{2}$$

where  $b_i[m]$  is the FCE's input bit value (either 0 or 1) over the *m*th clock interval,  $m_t = \lfloor f_{\text{FCE}}t \rfloor$ ,  $f_{\text{FCE}}$  is the clock-rate of the input bit, and  $\Delta_i$  is the FCE's frequency step size.<sup>2</sup>

The DCO's input sequence, d[n], represents the ideal value of  $f_{tune}(t)$  over the *n*th clock interval. For example, suppose d[n] is represented as a 16-bit two's complement code where the least significant bit (LSB) represents a DCO frequency step of  $\Delta$  (e.g.,  $\Delta = 100$  Hz). Then

$$d[n] = \left(-2^{15}d_{15}[n] + \sum_{i=0}^{14} 2^i d_i[n]\right)\Delta,$$
(3)

<sup>&</sup>lt;sup>2</sup> By definition,  $m_t$  is the largest integer less than or equal to  $f_{FCE}t$  at time t, so it is a continuous-time waveform. Hence,  $b_i[m_t]$  is a continuous-time waveform even though  $b_i[m]$  is a discrete-time sequence.

where  $d_i[n]$ , for each i = 0, 1, ..., 15, is the value of the *i*th bit of the code (either 0 or 1) over the *n*th clock interval.

Ideally,  $f_{tune}(t) = d[n_t]$ , where  $n_t = \lfloor f_{in}t \rfloor$  and  $f_{in}$  is the clock-rate of the DCO input. Equations (1)-(3) with  $f_{FCE} = f_{in}$  imply that this can be achieved with a bank of 16 FCEs, where the *i*th FCE's frequency step size is  $\Delta_i = 2^{i-1}\Delta$ ,  $b_i[n] = d_{i-1}[n]$  for i = 1, 2, ..., 15, and  $b_{16}[n] = 1 - d_{15}[n]$ .

Unfortunately, in PLL applications that require low phase noise, such as local oscillator synthesis for cellular telephone transceivers, DCOs with minimum frequency steps of tens of Hz are required, but most existing FCEs have minimum frequency steps of tens of kHz or more [50, 51]. A common solution to this problem is described below for an example case in which  $f_{tune}(t)$  needs to be controlled in steps of  $\Delta$ , yet the smallest realizable FCE frequency step size is  $\Delta_{min} = 2^8 \Delta$ . In this case, the 8 LSBs of d[n] are said to represent the *fractional part* of d[n]because they cause DCO frequency steps that are fractions of  $\Delta_{min}$ , and the 8 most significant bits (MSBs) of d[n] are said to represent the *integer part* of d[n] because they cause DCO frequency steps that are multiples of  $\Delta_{min}$ .

The idea is to have two FCE banks: an *integer FCE bank* controlled by the integer part of d[n], and a *fractional FCE bank* controlled by the output of an oversampling digital  $\Delta\Sigma$ modulator driven by the fractional part of d[n] [2]. The  $\Delta\Sigma$  modulator's highpass-shaped quantization noise is lowpass filtered by the DCO, so provided the oversampling rate is sufficiently high, it negligibly contributes to the DCO's phase error.

Fig. 1 shows a specific example in the context of an LC-based DCO, where  $p_t = \lfloor f_{\text{fast}} t \rfloor$ ,  $f_{\text{fast}} \gg f_{\text{in}}$ , and  $d_I[n_t]$  and  $d_F[n_t]$  are the integer and fractional parts of  $d[n_t]$ , respectively. The  $f_{\text{fast}}$ -

clk signal is such that  $p_t$  changes synchronously with  $n_t$ , so that  $n_t$  can be written as a function of  $p_t$ , i.e.,

$$n_t = g(p_t). \tag{4}$$

In this example  $g(p_t) = \lfloor (f_{in}/f_{fast})p_t \rfloor$ , where  $f_{fast}/f_{in}$  is an integer much greater than 1.

It follows from (3) that  $d[n_t] = d_I[n_t] + d_F[n_t]$ , where

$$d_{I}[n_{t}] = \left(-2^{15}d_{15}[n_{t}] + \sum_{i=8}^{14} 2^{i}d_{i}[n_{t}]\right)\Delta$$
(5)

and

$$d_F[n_t] = \Delta \sum_{i=0}^{7} 2^i d_i[n_t].$$
 (6)

As shown in Fig. 1,  $d_F[n_t]$  is sampled at a rate of  $f_{\text{fast}}$  by a second-order digital  $\Delta\Sigma$  modulator. The  $\Delta\Sigma$  modulator's output is a four-level sequence quantized to multiples of  $\Delta_{\min}$  and can be written as

$$y_{\Delta\Sigma}[p_t] = d_F[n_t] + e_{\Delta\Sigma}[p_t], \tag{7}$$

where  $e_{\Delta\Sigma}[p_t]$  is second-order highpass-shaped quantization noise plus any dither used within the  $\Delta\Sigma$  modulator. A thermometer encoder maps  $y_{\Delta\Sigma}[p_t]$  to a 4-bit thermometer code which drives a bank of four FCEs, each with a frequency step of  $\Delta_{\min}$ . It follows from (1), (2) and (7) that the contribution of the fractional FCE bank to the DCO frequency,  $f_F(t)$ , is

$$f_F(t) = \sum_{i=1}^{4} f_i(t) = d_F[n_t] + e_{\Delta\Sigma}[p_t].$$
(8)

The integer FCE bank is directly driven by  $d_i[n_t]$ . Specifically, the *i*th FCE, for i = 5, 6, ..., 11, has input  $b_i[n_t] = d_{i+3}[n_t]$  and frequency step size  $\Delta_i = 2^{i+3}\Delta$ , and the 12<sup>th</sup> FCE has input

 $b_{12}[n_t] = 1 - d_{15}[n_t]$  and frequency step size  $\Delta_{12} = 2^{15}\Delta$ . It follows from (1), (2) and (5) that the contribution of the integer FCE bank to the DCO frequency,  $f_I(t)$ , is

$$f_I(t) = \sum_{i=5}^{12} f_i(t) = d_I[n_t],$$
(9)

where a constant additive term has been omitted.

The contribution of the two FCE banks to the DCO frequency is  $f_{tune}(t) = f_I(t) + f_F(t)$ , so (8) and (9) imply that

$$f_{\text{tune}}(t) = d[n_t] + e_{\Delta\Sigma}[p_t].$$
(10)

Accordingly,  $e_{\Delta\Sigma}[p_t]$  causes DCO frequency error. The DCO's phase error is the integral of its frequency error, so as mentioned above, a lowpass-filtered version of  $e_{\Delta\Sigma}[p_t]$  appears as a component of the DCO's phase error. Given that  $e_{\Delta\Sigma}[p_t]$  has a highpass-shaped spectrum that peaks at  $f_{\text{fast}}/2$ , its contribution to the DCO's phase error can be made negligible relative to other sources of phase error if  $f_{\text{fast}}$  is large enough [2, 11, 50].

### **III. EFFECTS OF FCE MISMATCHES**

The FCEs in the previous example are ideal. Unfortunately, non-ideal circuit behavior causes  $f_i(t)$  to deviate from (2). For example, suppose for now that  $f_i(t)$  is modeled as ideal except for a static gain error given by  $\alpha_i$ , i.e.,

$$f_i(t) = \left(b_i[m_t] - \frac{1}{2}\right) \alpha_i \Delta_i.$$
(11)

Ideally,  $\alpha_i = 1$  for  $i = 1, 2, ..., N_{FCE}$ , but inevitable component mismatches introduced during fabrication cause  $\alpha_i$  to deviate from 1.

Repeating the analysis for the example in Fig. 1 with (11) in place of (2) gives

$$f_{\text{tune}}(t) = \alpha_F f_{\text{tune-ideal}}(t) + e_F(t) + e_I(t) + (\alpha_I - \alpha_F) d_I[n_t],$$
(12)

where  $f_{\text{tune-ideal}}(t)$  is given by the right side of (10),  $\alpha_F$  and  $\alpha_I$  are the averages of  $\alpha_i$  for i = 1, 2, 3, 4 and i = 5, 6, ..., 12, respectively,

$$e_F(t) = \sum_{i=1}^{4} (\alpha_i - \alpha_F) (b_i[p_t] - \frac{1}{2}) \Delta_{\min}$$
(13)

and

$$e_{I}(t) = \sum_{i=5}^{12} (\alpha_{i} - \alpha_{I}) (b_{i}[n_{t}] - \frac{1}{2}) \Delta_{i}.$$
(14)

Hence, the FCE static gain errors introduce a gain factor,  $\alpha_F$ , and three additive error terms to  $f_{tune}(t)$ . The  $\alpha_F$  gain factor does not significantly degrade performance in typical PLLs. In contrast, as explained next, the three additive error terms in (12) tend to cause spurious tones and increase phase error in PLLs because they are nonlinear functions of  $d[n_t]$ .

The individual bits of d[n], i.e.,  $d_i[n]$ , for each i = 0, 1, ..., 15, each depend on d[n] but are restricted to values of 0 and 1. Hence, each  $d_i[n]$  is a nonlinear function of d[n]. Nevertheless, they can be combined as in (3) to yield d[n], which implies that multiplying  $d_0[n]$ ,  $d_1[n]$ , ...,  $d_{14}[n]$ , and  $d_{15}[n]$  by  $2^0$ ,  $2^1$ , ...,  $2^{14}$ , and  $-2^{15}$ , respectively, and adding the results causes the nonlinear components from the individual bits to cancel each other. Any deviation from a set of scale factors proportional to those mentioned above prevents full cancellation of the nonlinear components. It can be verified from (5), (13) and (14) that  $e_F(t)$ ,  $e_I(t)$ , and  $(\alpha_I - \alpha_F)d_I[n_I]$  are each a function of a subset of the individual bits of  $d[n_I]$ , so they are nonlinear functions of  $d[n_I]$ .

A partial solution to this problem is to replace the thermometer encoder in Fig. 1 with a mismatch-shaping DEM encoder [52]. Doing so would cause  $e_F(t)$  to be replaced by highpass-

shaped noise that is free of nonlinear distortion and is uncorrelated with  $d[n_t]$ , so it would be suppressed by the DCO like the  $\Delta\Sigma$  quantization noise. Similarly, the integer FCE bank could be modified to accommodate a mismatch-shaping DEM encoder clocked at a rate of  $f_{in}$ , which would cause  $e_I(t)$  to be replaced by shaped noise that is free of nonlinear distortion and is uncorrelated with  $d[n_t]$ . However,  $f_{in} \ll f_{fast}$ , so less of the shaped noise would be suppressed by the DCO. Unfortunately, DEM as described above would not help prevent the last term in (12) from introducing nonlinear distortion because  $d_I[n_t]$  is a non-linear function of  $d[n_t]$ .

As demonstrated in [39], the last two terms in (12) increase the phase error in a PLL unless  $d_{I}[n_{t}]$  remains constant once the PLL is locked. In most published digital PLLs d[n]varies by much less than  $\Delta_{\min}$  when the PLL is locked, and measured results are usually presented for PLL frequencies at which  $d_I[n_t]$  does not change during the measurement interval. This renders the last two terms in (12) constant, so they do not contribute phase error. Unfortunately, this is not a viable option in practice because DCO center frequency drift caused by flicker noise, voltage and temperature variations, and pulling from external interference cause  $d[n_t]$  to vary by far more than  $\Delta_{\min}$  over time. For instance, measurement results indicate that the frequency of the DCO presented in [39] varies by about -200 kHz/°C, which corresponds to  $\sim 7\Delta_{min}$  per degree Celsius. In practice, this causes the digital PLL's phase noise to increase drastically from time to time as  $d[n_t]$  slowly drifts past integer multiples of  $\Delta_{\min}$ . This issue is sometimes called "spectral breathing" because the phase noise spectrum, as viewed on laboratory measurement equipment, appears to swell up every now and then as if it is taking deep breaths. During these "breaths" the PLL's performance is extremely degraded. Furthermore, when the PLL is used to generate phase or frequency modulated signals, such as

a GFSK signal for a Bluetooth transmitter,  $d[n_t]$  typically varies by more than  $\Delta_{\min}$ , so there are no periods between "breaths" during which the phase noise performance is good.

To address this problem, a single bank of FCEs driven by a  $\Delta\Sigma$  modulator and a mismatch-shaping DEM encoder could be used, where the  $\Delta\Sigma$  modulator oversamples  $d[n_t]$  instead of just  $d_F[n_t]$ . The DEM encoder would cause any mismatches among the FCEs to contribute shaped noise instead of nonlinear distortion, and the oversampling would ensure that most of the noise is suppressed by the DCO. Unfortunately, high oversampling ratios would be required in practice, which makes this solution impractical because of the associated high power consumption.

In the remainder of the paper, a new multi-rate DEM technique and an MNC technique that work together within a PLL to solve the problems that arise from FCE mismatches are presented. As in Fig. 1, two FCE banks are used. Both FCE banks are driven by a multi-rate DEM encoder, which ensures that the error arising from FCE mismatches is free of nonlinear distortion. In addition, the multi-rate DEM encoder avoids high power consumption because most of its digital logic is clocked at a rate of  $f_{in}$  instead of  $f_{fast}$ .<sup>3</sup> Much of the additive error is not oversampled, so instead of relying on the DCO to suppress it, the MNC technique adaptively measures the error and cancels it in real time.

<sup>&</sup>lt;sup>3</sup> Although the hardware of the proposed techniques is different from that of the solution in which  $d[n_t]$  is oversampled and a DEM encoder clocked at a high rate is used to control the FCEs, a pessimistic power consumption analysis suggests that the proposed techniques are at least five times more power-efficient.

#### IV. FCE MISMATCH MODEL

FCEs with  $\Delta_i > \Delta_{\min}$  are usually built by connecting nominally identical minimumweight FCEs in parallel. Static mismatches among these FCEs are sources of error, but other non-idealities such as the non-instantaneous frequency transitions of realizable FCEs are also sources of error. Hence, a more comprehensive model than (11) for  $f_i(t)$  is

$$f_i(t) = (b_i[m_t] - \frac{1}{2})\Delta_i + e_i(t),$$
(15)

where  $e_i(t)$  is error that models both the static mismatch and the non-ideal frequency transitions of the *i*th FCE. FCEs are designed such that frequency transitions caused by input bit changes settle within a clock period, so  $e_i(t)$  only depends on  $b_i[m_t-1]$  and  $b_i[m_t]$ . This can be modeled as

$$e_{i}(t) = \begin{cases} e_{11i}, & \text{if } b_{i}[m_{t}-1] = 1, \ b_{i}[m_{t}] = 1, \\ e_{01i}(t), & \text{if } b_{i}[m_{t}-1] = 0, \ b_{i}[m_{t}] = 1, \\ e_{00i}, & \text{if } b_{i}[m_{t}-1] = 0, \ b_{i}[m_{t}] = 0, \\ e_{10i}(t), & \text{if } b_{i}[m_{t}-1] = 1, \ b_{i}[m_{t}] = 0, \end{cases}$$
(16)

where  $e_{11i}$ ,  $e_{01i}(t)$ ,  $e_{00i}$ , and  $e_{10i}(t)$  represent the error over each clock interval corresponding to the four different possibilities of the FCE's current and prior input bit values [53].<sup>4</sup>

Fig. 2 shows example waveforms associated with (15) and (16). A consequence of the frequency transitions settling within a clock period is that when an FCE's input bit does not change between clock periods, neither does its contribution to the DCO frequency, so  $e_{00i}$  and  $e_{11i}$  are constant. In contrast,  $e_{01i}(t)$  and  $e_{10i}(t)$  are not constant because they represent deviations

<sup>&</sup>lt;sup>4</sup> The FCE model given by (15) and (16) is analogous to that of a non-return-to-zero (NRZ) 1-bit DAC. To prevent  $e_i(t)$  from depending on  $b_i[m_t-1]$ , return-to-zero (RZ) FCEs could be implemented by setting the FCEs to a signal-independent state for a fraction of each clock period, but this is not practical for PLLs because it would periodically slew the DCO frequency and thereby introduce excessive phase noise.

from the FCE's ideal instantaneous frequency transitions when its input bit changes. As shown in Fig. 2, the shape of each of these frequency transitions depends only on whether the corresponding FCE input changed from 0 to 1 or 1 to 0, and both  $e_{01i}(t)$  and  $e_{10i}(t)$  are  $1/f_{FCE}$ periodic.

Experimental results suggest, at least for the LC-based DCOs presented in [36] and [39], that the frequency transition introduced by each FCE when its input bit changes from 0 to 1 and that when the input bit changes from 1 to 0 are antisymmetric to a high degree of accuracy, i.e.,  $e_{11i} - e_{01i}(t) = -[e_{00i} - e_{10i}(t)]$ . Therefore, substituting (16) into (15), applying this observation, collecting terms and omitting constant additive terms yields

$$f_i(t) = \left(b_i[m_t] - \frac{1}{2}\right) \alpha_i(t) \Delta_i + \left(b_i[m_t - 1] - \frac{1}{2}\right) \gamma_i(t),$$
(17)

where

$$\alpha_i(t) = 1 + (e_{01i}(t) - e_{00i}) / \Delta_i \quad \text{and} \quad \gamma_i(t) = e_{11i} - e_{01i}(t).$$
(18)

Given that  $\alpha_i(t)$  and  $\gamma_i(t)$  are functions of  $e_{01i}(t)$  and  $e_{10i}(t)$ , which are  $1/f_{FCE}$ -periodic, they are also  $1/f_{FCE}$ -periodic.

#### V. MULTI-RATE DEM

A. Starting Point: Single-Rate Segmented DEM

Suppose the DCO's input sequence is given by (3), and for now suppose that  $\Delta\Sigma$  quantization is not necessary because FCEs with small-enough step sizes are available, i.e.,  $\Delta_{min} = \Delta$ . Even in this case, FCE mismatches are a problem because they cause nonlinear distortion. A conventional single-rate segmented DEM encoder can be used to prevent this problem. For example, the mismatch-shaping segmented DEM encoder shown in Fig. 3 can be used with 34 FCEs [54]. The *i*th FCE has input  $b_i[n_t] = c_i[n_t]$  and frequency step size  $\Delta_i = K_i \Delta$ , where

$$K_{2i-1} = K_{2i} = 2^{i-1} \text{ for } i = 1, 2, ..., 13, \text{ and}$$
  

$$K_i = 2^{13} \text{ for } i = 27, 28, ..., 34.$$
(19)

The DEM encoder's input sequence,  $c[n_t]$ , is obtained from the DCO input sequence as

$$c[n_t] = d[n_t] / \Delta + 2^{15} + 2^{13} - 1$$
(20)

for reasons explained in [54].

As shown in Fig. 3, the DEM encoder consists of 33 digital switching blocks (SBs), labeled  $S_{k,r}$  for k = 1, 2, ..., 16, and r = 1, 2, ..., 17, configured in a tree structure. The 13 shaded SBs are called segmenting SBs, whereas the other 20 SBs are called non-segmenting SBs. The functional details of the SBs are shown in Fig. 4. The top and bottom outputs of each segmenting SB are  $\frac{1}{2}(c_{k,1}[n_t] - 1 - s_{k,1}[n_t])$  and  $1 + s_{k,1}[n_t]$ , respectively, where  $c_{k,1}[n_t]$  is the SB input sequence, and  $s_{k,1}[n_t]$ , called a switching sequence, is 0 when  $c_{k,1}[n_t]$  is odd and  $\pm 1$  otherwise. Similarly, the top and bottom outputs of each non-segmenting SB are  $\frac{1}{2}(c_{k,r}[n_t] - s_{k,r}[n_t])$  and  $\frac{1}{2}(c_{k,r}[n_t] + s_{k,r}[n_t])$ , respectively, where  $c_{k,r}[n_t]$  is the SB input sequence and  $s_{k,r}[n_t]$  is 0 when  $c_{k,r}[n_t]$  is even and  $\pm 1$  otherwise.

Regardless of the SB type, each switching sequence is zero-mean and has a first-order highpass-shaped power spectral density (PSD) that peaks at  $f_{in}/2$ . It is generated in two's complement format by the logic shown in Fig. 4(c), wherein  $d_{k,r}[n_l]$  is generated within each SB and is well-modeled as a two-level white random sequence that takes on values of 0 and 1 with equal probability and is independent of the  $d_{k,r}[n_l]$  sequences in the other SBs.

#### B. Extension to Multi-Rate Segmented DEM

Now suppose that the smallest practical FCE frequency step size is  $\Delta_{\min} = 2^8 \Delta$ . As the lower 16 FCEs in the example above all have frequency step sizes smaller than  $\Delta_{\min}$ , the bottom 16 outputs of the DEM encoder can no longer drive FCEs directly. The multi-rate DEM architecture shown in Fig. 5 addresses this situation, where the bottom 4 FCEs make up the fractional FCE bank, the top 18 FCEs make up the integer FCE bank, and  $w_t = p_t - 1$  is a  $T_{\text{fast}}$ -delayed version of  $p_t$ , where  $T_{\text{fast}} = 1/f_{\text{fast}}$ . As in Fig. 1,  $n_t = g(p_t)$  changes synchronously with  $p_t$ .

The block labeled slow DEM encoder in Fig. 5 is a modified version of the DEM encoder in Fig. 3. Its outputs  $c_{17}[n_t]$ ,  $c_{18}[n_t]$ , ...,  $c_{34}[n_t]$  are identical to those in Fig. 3, and instead of outputs  $c_1[n_t]$ ,  $c_2[n_t]$ , ...,  $c_{16}[n_t]$  it has an output,  $x_f[n_t]$ , given by

$$x_{f}[n_{t}] = \Delta \sum_{i=1}^{16} K_{i}(c_{i}[n_{t}] - \frac{1}{2}).$$
(21)

Each  $c_i[n_t]$  takes on values of 0 and 1, so (19) and (21) imply that  $|x_f[n_t]| \le 255\Delta$  and  $x_f[n_t]$  is restricted to multiples of  $\Delta$ .

The slow DEM encoder could be implemented from the DEM encoder of Fig. 3 directly by combining  $c_1[n_t]$ ,  $c_2[n_t]$ , ...,  $c_{16}[n_t]$  as in (21), but the structure of Fig. 6 is used instead because is simpler. As implied by Fig. 4(b), the sum of the outputs of each non-segmenting SB is equal to the SB's input, so it follows from (21), Fig. 3 and Fig. 4(a) that  $x_f[n_t]$  can be computed directly from the bottom outputs of S<sub>16,1</sub>, S<sub>15,1</sub>, ..., S<sub>9,1</sub> as

$$x_{f}[n_{t}] = \Delta \sum_{k=9}^{16} 2^{16-k} s_{k,1}[n_{t}].$$
(22)

Hence, as shown in Fig. 6, S<sub>1,1</sub>, S<sub>1,2</sub>, ..., S<sub>1,8</sub> are not necessary in the slow DEM encoder.

The  $\Delta$  scale factor shown in Fig. 6 is not an actual multiplier; it just denotes that the subsequent digital logic should interpret the LSB of  $x_f[n_t]$  to represent a DCO frequency step size of  $\Delta$ .

As shown in Fig. 5,  $x_f[n_t]$  is sampled at a rate of  $f_{\text{fast}}$  by a second-order digital  $\Delta\Sigma$ modulator whose functional diagram is shown in Fig. 7. The dither sequence,  $d_{\Delta\Sigma}[p_t]$ , is generated such that it can be well-modeled as a two-level white random sequence that is independent of  $d[n_t]$  and  $x_f[n_t]$  and takes on values of 0 and  $\Delta$  with equal probability. It ensures that the  $\Delta\Sigma$  modulator's quantization noise is asymptotically independent of  $x_f[n_t]$  and  $d_{\Delta\Sigma}[p_t]$ , and has a PSD equal to that of the output of a filter with transfer function  $(1 - z^{-1})^2$  driven by white noise with a variance of  $\Delta_{\min}^2/12$  [55]. The  $\Delta\Sigma$  modulator output is quantized to values in the set  $\{-2\Delta_{\min}, -\Delta_{\min}, 0, \Delta_{\min}, 2\Delta_{\min}\}$  and is given by

$$y_{\Delta\Sigma}[p_t] = x_f[n_t] + e_{\Delta\Sigma}[p_t], \qquad (23)$$

where  $e_{\Delta\Sigma}[p_t]$  is second-order highpass-shaped quantization noise plus  $d_{\Delta\Sigma}[p_t]$ .

The block in Fig. 5 labeled fast DEM encoder is a conventional mismatch-shaping nonsegmented DEM encoder with a clock rate of  $f_{\text{fast}}$ . It is implemented as a tree of non-segmenting SBs, and it maps  $y_{\Delta\Sigma}[p_t]$  to four 1-bit sequences, each of which drives an FCE with a frequency step size of  $\Delta_{\min}$  [56, 57].

Each  $b_i[w_i]$  in Fig. 5, for i = 1, 2, 3, 4, is clocked at a rate of  $f_{\text{fast}}$  and toggles rapidly enough such that the FCE frequency transitions from the fractional FCE bank introduce highfrequency error components to the DCO's phase error. Such components are lowpass filtered by the DCO, so they are not a problem in practice provided  $f_{\text{fast}}$  is large enough. Consequently, the frequency transitions of the FCEs from the fractional FCE bank are modeled as ideal, so that  $f_i(t)$  is given by (11) for i = 1, 2, 3, 4.

It follows from the results presented in [53] and (11) that

$$f_F(t) = \alpha_F y_{\Delta\Sigma}[w_t] + e_F(t), \qquad (24)$$

where  $\alpha_F$  is the average of  $\alpha_i$  for i = 1, 2, 3, 4 and  $e_F(t)$  is a function of the errors introduced by the fractional FCE bank and the switching sequences from the fast DEM encoder. The fast DEM encoder ensures that  $e_F(t)$  is free of nonlinear distortion, uncorrelated with  $y_{\Delta\Sigma}[w_t]$ , and has a first-order highpass-shaped PSD that peaks at  $f_{\text{fast}}/2$ , so this term is not a problem in practice provided  $f_{\text{fast}}$  is large enough. Thus, substituting (23) into (24) and neglecting  $e_F(t)$  gives

$$f_F(t) = \alpha_F x_f[g(w_t)] + \alpha_F e_{\Delta\Sigma}[w_t].$$
(25)

As shown in Fig. 5, the  $c_{17}[n_t]$ ,  $c_{18}[n_t]$ , ...,  $c_{34}[n_t]$  outputs of the slow DEM encoder drive the same FCEs as those of the DEM encoder of Fig. 3. As shown in Appendix A, this implies that  $f_I(t)$  is given by

$$f_{I}(t) = \alpha_{I}(t)d[g(w_{t})] + \gamma_{I}(t)d[g(w_{t}-1)] + e_{I}(t),$$
(26)

where

$$e_{I}(t) = \Delta \sum_{k,r} \left\{ \alpha_{k,r}(t) s_{k,r}[g(w_{t})] + \gamma_{k,r}(t) s_{k,r}[g(w_{t}-1)] \right\},$$
(27)

 $\alpha_I(t)$ ,  $\gamma_I(t)$ ,  $\alpha_{k,r}(t)$  and  $\gamma_{k,r}(t)$  are  $T_{\text{fast}}$ -periodic waveforms that depend on the errors introduced by the integer FCE bank, and the summation indices indicate the summation over all *k* and *r* values corresponding to the SBs within the slow DEM encoder.

The contribution to the DCO frequency from both FCE banks is  $f_{tune}(t) = f_I(t) + f_F(t)$ , so (25) and (26) imply that

$$f_{\text{tune}}(t) = \alpha_I(t) d[g(w_t)] + \gamma_I(t) d[g(w_t - 1)] + \alpha_F e_{\Delta\Sigma}[w_t] + e_M(t),$$
(28)

where

$$e_M(t) = e_I(t) + \alpha_F x_f[g(w_t)]$$
<sup>(29)</sup>

is called *FCE mismatch error*. As shown below,  $e_M(t)$  is a linear combination of the switching sequences from the slow DEM encoder whose coefficients depend on the errors introduced by both FCE banks.

The  $\gamma_l(t)d[g(w_l - 1)]$  term in (28) is proportional to a  $T_{\text{fast}}$ -delayed version of  $d[g(w_l)]$ , so it represents a linear filtering operation. It follows from the expressions for  $\alpha_l(t)$  and  $\gamma_l(t)$  in Appendix A that this term tends to be much smaller than the desired signal component,  $\alpha_l(t)d[g(w_l)]$ , so it is not a problem in practice. The  $\alpha_F e_{\Delta\Sigma}[w_l]$  term is proportional to  $\Delta\Sigma$ quantization noise plus dither so it is free of nonlinear distortion, is uncorrelated with the other terms in (28), and has a highpass-shaped PSD. The  $e_M(t)$  term also has these properties because it is a linear combination of the switching sequences from the slow DEM encoder. The PSD of  $\alpha_F e_{\Delta\Sigma}[w_l]$  peaks at  $f_{\text{fast}}/2$ , whereas the PSD of  $e_M(t)$  peaks at  $f_{\text{in}}/2$ . Hence,  $f_{\text{fast}}$  can be increased to make the DCO phase error introduced by  $\alpha_F e_{\Delta\Sigma}[w_l]$  negligible, but this would not reduce the DCO phase error contribution from  $e_M(t)$ . Therefore,  $e_M(t)$  is the only problematic term in (28).

Substituting (22) and (27) into (29) yields

$$e_{M}(t) = \Delta \sum_{k,r} \left\{ \delta_{k,r} s_{k,r} [g(w_{t})] + \gamma_{k,r}(t) \left( s_{k,r} [g(w_{t}-1)] - s_{k,r} [g(w_{t})] \right) \right\},$$
(30)

where

$$\delta_{k,r} = \begin{cases} \alpha_{k,r}(t) + \gamma_{k,r}(t) + \alpha_F 2^{16-k}, & \text{if } k \ge 9, r = 1, \\ \alpha_{k,r}(t) + \gamma_{k,r}(t), & \text{otherwise,} \end{cases}$$
(31)

is constant for each *k* and *r*, even though neither  $\alpha_{k,r}(t)$  nor  $\gamma_{k,r}(t)$  are constant. As can be verified by substituting (18) into the expressions for  $\alpha_{k,r}(t)$  and  $\gamma_{k,r}(t)$  in Appendix A, the non-constant terms in each  $\alpha_{k,r}(t)$  are equal in magnitude but opposite in sign to the corresponding terms in  $\gamma_{k,r}(t)$ , so  $\alpha_{k,r}(t) + \gamma_{k,r}(t)$ , and hence  $\delta_{k,r}$ , are constant. Therefore, the terms proportional to  $\delta_{k,r}$  in (30) represent the DCO frequency error contribution from FCE static gain errors, whereas the terms proportional to  $\gamma_{k,r}(t)$  in (30) represent the DCO frequency error contribution from nonideal FCE frequency transitions.

### VI. ADAPTIVE FCE MISMATCH NOISE CANCELLATION

The purpose of the MNC technique is to cancel most of the DCO phase error that would otherwise be caused by  $e_M(t)$ . To do this, the sequence

$$e_{\text{MNC}}[p_t] = \Delta \sum_{k,r} \left\{ a_{k,r} s_{k,r}[n_t] + b_{k,r} \left( s_{k,r}[g(w_t)] - s_{k,r}[n_t] \right) \right\},$$
(32)

where  $a_{k,r}$  and  $b_{k,r}$  are called the MNC coefficients, is injected into the fractional path of the multirate DEM encoder. The ideal MNC coefficient values, i.e., the values of  $a_{k,r}$  and  $b_{k,r}$  for which the DCO phase error contribution of  $e_M(t)$  is minimized, are estimated with a least-mean-square (LMS)-like algorithm.

In the following, it is explained how  $e_{MNC}[p_t]$  affects the DCO's phase error, how the FCE mismatch error is measured, and how the MNC coefficients are adaptively computed from the FCE mismatch error measurement.

### A. MNC Sequence Application

Fig. 8 shows the fractional path of the multi-rate DEM encoder shown in Fig. 5 modified
to accommodate MNC. The  $e_{MNC}[p_t]$  sequence is subtracted from  $x_f[n_t]$  prior to the  $\Delta\Sigma$  modulator, and the output range of the  $\Delta\Sigma$  modulator, the range of the fast DEM encoder, and the number of FCEs driven by the fast DEM encoder are all four times those of the original system to accommodate the resulting dynamic range increase. Thus,  $f_F(t)$  is still given by (24), but now  $y_{\Delta\Sigma}[p_t]$  is given by the right side of (23) minus  $e_{MNC}[p_t]$ . Despite having the same qualitative properties as before,  $\alpha_F$  and  $e_F(t)$  in (24) are slightly different in the modified system because of the additional FCEs.

An analysis almost identical to that presented in Section V shows that  $f_{tune}(t)$  is now given by

$$f_{\text{tune}}(t) = \alpha_I(t)d[g(w_t)] + \gamma_I(t)d[g(w_t - 1)] + \alpha_F e_{\Delta\Sigma}[w_t] + e_R(t),$$
(33)

where

$$e_R(t) = e_M(t) - \alpha_F e_{\text{MNC}}[w_t]$$
(34)

is the *residual* FCE mismatch error, i.e., what is left of  $e_M(t)$  when  $e_{MNC}[p_t]$  is applied. It follows from (30), (32) and (34) that

$$e_{R}(t) = \Delta \sum_{k,r} \left\{ \delta_{k,r-\text{res}} s_{k,r}[g(w_{t})] + \gamma_{k,r-\text{res}}(t) \left( s_{k,r}[g(w_{t}-1)] - s_{k,r}[g(w_{t})] \right) \right\},$$
(35)

where  $\delta_{k,r-\text{res}}$  and  $\gamma_{k,r-\text{res}}(t)$  are defined as

$$\delta_{k,r-\text{res}} = \delta_{k,r} - \alpha_F a_{k,r} \text{ and } \gamma_{k,r-\text{res}}(t) = \gamma_{k,r}(t) - \alpha_F b_{k,r}, \tag{36}$$

respectively.

Given that  $\delta_{k,r}$  is constant, there exists an  $a_{k,r}$  that causes  $\delta_{k,r-\text{res}} = 0$ . In contrast, there is no  $b_{k,r}$  that causes  $\gamma_{k,r-\text{res}}(t)$  to vanish completely, because  $\gamma_{k,r}(t)$  is not constant. However,  $\gamma_{k,r}(t)$  is  $T_{\text{fast-}}$ 

periodic so there exists a  $b_{k,r}$  that makes the DC component of  $\gamma_{k,r-\text{res}}(t)$  zero, such that  $\gamma_{k,r-\text{res}}(t)$  is a linear combination of sinusoids with frequencies that are non-zero multiples of  $f_{\text{fast}}$  [58]. Therefore, it follows from (36) that if

$$a_{k,r} = \frac{\delta_{k,r}}{\alpha_F} \text{ and } b_{k,r} = \frac{1}{\alpha_F T_{\text{fast}}} \int_0^{T_{\text{fast}}} \gamma_{k,r}(\tau) d\tau,$$
 (37)

for each k and r, then

$$\delta_{k,r-\text{res}} = 0 \text{ and } \int_0^{T_{\text{fast}}} \gamma_{k,r-\text{res}}(\tau) d\tau = 0.$$
(38)

In the absence of FCE static mismatches,  $a_{k,r} = 0$ , and if the FCE frequency transitions are ideal,  $b_{k,r} = 0$ .

Phase error is the integral of frequency error, so the DCO phase error introduced by  $e_R(t)$  is given by

$$\theta_R(t) = \int_0^t e_R(\tau) d\tau.$$
(39)

If (38) is satisfied, then (35) and (39) imply that

$$\theta_{R}(t) = \Delta \sum_{k,r} \left( s_{k,r} [g(w_{t} - 1)] - s_{k,r} [g(w_{t})] \right) \int_{0}^{t - p_{t} T_{\text{fast}}} \gamma_{k,r-\text{res}}(u) du,$$
(40)

where  $t - p_t T_{\text{fast}} = t - [f_{\text{fast}}t]T_{\text{fast}} < T_{\text{fast}}$ . The term within the parenthesis in (40) equals zero when  $g(w_t) - g(w_t - 1) = 0$  and  $s_{k,r}[g(w_t) - 1] - s_{k,r}[g(w_t)]$  otherwise. Given that  $g(w_t) - g(w_t - 1)$  can only take on values from the set {0, 1}, then

$$s_{k,r}[g(w_t-1)] - s_{k,r}[g(w_t)] = \left(g(w_t) - g(w_t-1)\right) \left(s_{k,r}[g(w_t)-1] - s_{k,r}[g(w_t)]\right).$$
(41)

Furthermore,  $g(w_t)$  is a  $T_{\text{fast}}$ -delayed version of  $n_t$ , which increases by one unit every  $T_{\text{in}} = 1/f_{\text{in}}$ , so

 $g(w_t) - g(w_t - 1)$  is T<sub>in</sub>-periodic and is given by

$$g(w_t) - g(w_t - 1) = \sum_{k = -\infty}^{\infty} r(t - kT_{in}),$$
(42)

where r(t) = 1 for  $t \in [T_{\text{fast}}, 2T_{\text{fast}})$  and 0 otherwise. It follows from (42) that the Fourier expansion of  $g(w_t) - g(w_t - 1)$  is

$$\frac{f_{\rm in}}{f_{\rm fast}} + \sum_{m=1}^{\infty} \frac{2}{m\pi} \sin\left(m\pi \frac{f_{\rm in}}{f_{\rm fast}}\right) \cos\left(2\pi m f_{\rm in} \left[t - \frac{3}{2}T_{\rm fast}\right]\right).$$
(43)

Thus, if the conditions shown in (38) are satisfied, (40), (41) and (43) imply that  $\theta_R(t)$  would be given by second-order shaped noise multiplied by a  $T_{in}$ -periodic waveform and a DC-free  $T_{fast}$ -periodic waveform. Consequently,  $e_R(t)$  would introduce components with frequencies around  $f_{n,m} = nf_{fast} \pm mf_{in}$  to the DCO's phase error, where n = 1, 2, 3, ... and m = 0, 1, 2, .... It follows from (43) that the power of the components around frequencies  $f_{n,m}$  with m near multiples of  $f_{fast}/f_{in}$  is very low. Therefore,  $\theta_R(t)$  would not be a problem if  $f_{fast}$  is large enough because  $e_R(t)$  would only introduce high-frequency components to the DCO's phase error that would be lowpass filtered by the DCO. Simulation results also suggest that  $\theta_R(t)$  is not a problem provided the conditions shown in (38) are satisfied and  $f_{fast}$  is large enough.

#### B. FCE Mismatch Error Measurement

The ideal MNC coefficient values are estimated as part of the feedback loop in a digital fractional-*N* PLL that incorporates the DCO. This is done during the PLL's normal operation by adaptively adjusting  $a_{k,r}$  and  $b_{k,r}$  such that the conditions shown in (38) are satisfied for each *k* and *r*, thereby minimizing  $e_R(t)$ .

The purpose of a fractional-N PLL is to generate a periodic output signal,  $v_{PLL}(t)$ , with

frequency  $f_{PLL} = (N + \alpha)f_{ref}$ , where *N* is a positive integer,  $\alpha$  is a fractional value and  $f_{ref}$  is the frequency of a reference oscillator waveform,  $v_{ref}(t)$ . The general form of a digital fractional-*N* PLL without MNC is shown in Fig. 9. It consists of a phase-error-to-digital converter (PEDC), a lowpass digital loop filter (DLF), and a DCO. The PEDC's output is an  $f_{ref}$ -rate digital sequence of the form

$$p[n] = -\theta_{\text{PLL}}[n] + e_p[n], \tag{44}$$

where  $\theta_{PLL}[n]$  is an estimate of the PLL's phase error and  $e_p[n]$  is additive error that includes quantization error from the PEDC's digitization process as well as error from circuit noise and other non-ideal circuit behavior in both the PEDC and reference oscillator.

Suppose the DCO contains the multi-rate DEM structure shown in Fig. 5 modified as shown in Fig. 8 with  $f_{in} = f_{ref}$ . Typically,  $f_{fast}$ -clk is a divided-down version of  $v_{PLL}(t)$ . Given that  $f_{PLL} = (N + \alpha)f_{ref}$ ,  $f_{ref}$  and  $f_{fast}$  are incommensurate frequencies when  $\alpha \neq 0$ , so it is not possible for  $n_t$  to change synchronously with  $p_t = \lfloor f_{fast}t \rfloor$  if  $n_t = \lfloor f_{ref}t \rfloor$ . Therefore, as shown in Fig. 10, in practice the DCO input is synchronized to  $f_{fast}$ -clk so (4) is satisfied, i.e., so  $n_t$  only changes at times  $\mu_n$ , which are multiples of  $T_{fast}$ , instead of times  $nT_{ref}$ , where  $T_{ref} = 1/f_{ref}$  is the reference period. It is common practice in digital PLLs to synchronize the DLF output to the clock signal of the fractional path, so this is not a special requirement of the proposed system. A circuit to avoid metastability issues is also needed as part of the synchronization circuit shown in Fig. 10(a), but it has been omitted for simplicity [59].

A key requirement of a PLL is to suppress low-frequency DCO error, which is achieved by subjecting additive frequency error introduced by the DCO to a highpass filter that has at least one zero at DC. In the following, the impulse response of this filter is denoted as h[n], and its running sum, i.e.,  $h[0] + h[1] + \ldots + h[n]$ , is denoted as l[n].

As shown in Appendix B, p[n] can be written as

$$p[n] = p_{\text{ideal}}[n] + p_R[n], \qquad (45)$$

where  $p_{ideal}[n]$  represents the contribution to p[n] of all noise sources except FCE mismatches and  $p_R[n]$  is the contribution to p[n] from  $e_R(t)$ . Specifically,  $p_R[n]$  is given by

$$p_{R}[n] = \Delta \alpha_{F} T_{\text{fast}} \sum_{i=0}^{n-1} \sum_{k,r} \left\{ y_{k,r-a}[i] + y_{k,r-b}[i] \right\} l[n-1-i],$$
(46)

where  $y_{k,r-a}[i] + y_{k,r-b}[i]$  is proportional to the PLL's frequency error introduced by the  $s_{k,r}[n]$  sequences. As explained in Appendix B, if  $a_{k,r}$  and  $b_{k,r}$  in (32) are replaced by  $a_{k,r}[n_t]$  and  $b_{k,r}[n_t]$ , respectively, then

$$y_{k,r-a}[i] = (q_{i-1} - 3)s_{k,r}[i - 1]a_{k,r-\text{error}}[i - 1] + 3s_{k,r}[i]a_{k,r-\text{error}}[i]$$
(47)

and

$$y_{k,r-b}[i] = \left(s_{k,r}[i-1] - s_{k,r}[i]\right) b_{k,r-\text{error}}[i],$$
(48)

where  $q_{i-1}$  is the number of  $T_{\text{fast}}$  periods between times  $\mu_{i-1}$  and  $\mu_i$ , and

$$a_{k,r-\text{error}}[n] = a_{k,r}[n] - a_{k,r} \text{ and } b_{k,r-\text{error}}[n] = b_{k,r}[n] - b_{k,r}$$
 (49)

are the MNC coefficient errors at sample time *n*.

The term proportional to  $s_{k,r}[i]$  in (47) arises because the time at which the PEDC samples the PLL's phase error, which is given by  $\mu_n + 4T_{\text{fast}}$  in the design example, is not equal to the time at which the integer FCE bank's inputs are updated, i.e.,  $\mu_n + T_{\text{fast}}$ . Accordingly, the integer FCE bank's inputs are updated three  $T_{\text{fast}}$  before the PLL's phase error is sampled, which causes  $y_{k,r}$ . a[i] to depend on  $s_{k,r}[i-1]$  and also on  $s_{k,r}[i]$ .

As implied by (45)-(48), the PEDC's output has information regarding the MNC coefficient errors. The MNC coefficient estimation process described next is based on this result and on the properties of the switching sequences.

# C. MNC Coefficients Estimation

A digital fractional-*N* PLL with the multi-rate DEM encoder and MNC technique is shown in Fig. 11(a). The details of the MNC logic are shown in Fig. 11(b) and Fig. 11(c), wherein

$$t_{k,r}[n] = \sum_{i=0}^{n} s_{k,r}[i]$$
(50)

is the running sum of  $s_{k,r}[n]$ , and  $K_a$  and  $K_b$  are called the MNC gains. The MNC logic block consists of an adder and 25  $s_{k,r}[n_t]$  residue estimators.

It follows from Fig. 4 that each  $s_{k,r}[n]$  sequence is a concatenation of sequences of the form 1, 0, ..., 0, -1, 0, ..., 0 or -1, 0, ..., 0, 1, 0, ..., 0, where each 0 is present only when the input of the  $s_{k,r}[n]$  generator is zero [52]. Thus,  $|s_{k,r}[n]| \le 1$ ,  $|t_{k,r}[n]| \le 1$  and  $|s_{k,r}[n] - s_{k,r}[n-1]| \le 2$  for all n, so the multipliers in Fig. 11(c) are simple in terms of hardware.

The  $s_{k,r}[n_t]$  residue estimators are responsible for the computation of the MNC coefficients. At each sample time, the MNC coefficient errors are measured and  $a_{k,r}[n_t]$  and  $b_{k,r}[n_t]$  are updated such that they approach the values shown in (37). The measurement of the MNC coefficient errors is based on the statistical properties of the switching sequences [60].

As explained in [57] and can be verified from Fig. 4, although each  $s_{k,r}[n]$  sequence depends on the input of its corresponding SB, when it is non-zero, its sign depends on  $d_{k,r}[n]$ . Given that the  $d_{k,r}[n]$  sequences are independent of the  $d_{k,r}[n]$  sequences in the other SBs, this provides enough randomization for the  $s_{k,r}[n]$  sequences to be uncorrelated with each other. Furthermore, as the  $d_{k,r}[n]$  sequences are also independent of all electronic device noise sources in the PLL, each  $s_{k,r}[n]$  sequence is uncorrelated with all such sources as well, and it is also uncorrelated with the PEDC's quantization noise in PLLs where such noise source is uncorrelated with the PLL's phase error [29, 39]. Hence, in such cases, the  $s_{k,r}[n]$  sequences are uncorrelated with all PLL noise except the terms in p[n] arising from  $e_R(t)$ , i.e.,  $p_R[n]$ .

As explained above, the  $y_{k,r-a}[i]$  and  $y_{k,r-b}[i]$  terms in p[n] depend on the MNC coefficient errors, and such terms are proportional to functions of the  $s_{k,r}[n]$  sequences. Specifically, it can be seen from (45)-(48) that p[n] has information about an accumulated version of

$$(q_{n-2}-3)s_{k,r}[n-2]a_{k,r-\text{error}}[n-2],$$
 (51)

and that p[n] - p[n-1] has information about

$$(s_{k,r}[n-2]-s_{k,r}[n-1])b_{k,r-\text{error}}[n-1].$$
 (52)

Therefore, it follows that the accumulator inputs in Fig. 11(c), i.e.,  $-p[n]t_{k,r}[n-2]$  and  $(p[n-1]-p[n])(s_{k,r}[n-2] - s_{k,r}[n-1])$ , when non-zero, are noisy estimates of  $a_{k,r-\text{error}}[n]$  and  $b_{k,r-\text{error}}[n]$ , respectively, so they can be used to adaptively compute the ideal MNC coefficients. In practice, the top and bottom branches within each  $s_{k,r}[n_t]$  residue estimator interfere with each other in a way that makes the accumulator inputs have information about both MNC coefficient errors. However, extensive simulations run by the authors suggest that the MNC coefficient values converge to their ideal values regardless of such interferences provided the MNC gains are set properly.

It would also be possible to correlate p[n-1]-p[n] by  $s_{k,r}[n-2]$  to get an estimate of  $a_{k,r}$ error[n]. However, as  $a_{k,r}[n]$  is only updated when the accumulator input is non-zero, correlating p[n-1]-p[n] against  $s_{k,r}[n-2]$  instead of -p[n] against  $t_{k,r}[n-2]$  would significantly decrease the convergence speed of  $a_{k,r}[n]$  because normally  $s_{k,r}[n-2]$  is zero more often than  $t_{k,r}[n-2]$ . Although correlating -p[n] against  $t_{k,r}[n-2]$  effectively increases the error variance of  $a_{k,r}[n]$ , as explained next, this problem can be mitigated by reducing  $K_a$ .

As is common in most LMS-like algorithms, the choice of  $K_a$  and  $K_b$  represents a tradeoff. The larger the MNC gains, the faster the convergence, but the larger the error variance of  $a_{k,r}[n]$  and  $b_{k,r}[n]$ . Also, as the  $s_{k,r}[n_i]$  residue estimators comprise two LMS-like loops in parallel that interfere with each other,  $K_a$  and  $K_b$  each affect the convergence time and error variance of both  $a_{k,r}[n]$  and  $b_{k,r}[n]$ . Although it might be possible to develop closed-form expressions that quantify these tradeoffs, the authors currently use simulations to assist the design process and to choose the values of  $K_a$  and  $K_b$ .

#### VII. SIMULATION RESULTS

The multi-rate DEM and the MNC techniques were tested in an event-driven behavioral simulation of a modified version of the  $\Delta\Sigma$  frequency-to-digital converter based fractional-*N* PLL presented in [39, 40]. As explained in [38], *p*[*n*] is given by (44) where  $e_p[n]$  is first-order shaped quantization noise that is uncorrelated with the PLL's phase error plus error from both the PEDC and reference oscillator.

The DLF consists of two single-pole IIR stages and a proportional-integral stage. Its transfer function is

$$L(z) = K_M \left( K_P + \frac{K_I}{1 - z^{-1}} \right) \prod_{i=0}^{1} \frac{\lambda_i}{1 - (1 - \lambda_i) z^{-1}},$$
(53)

where  $K_M$ ,  $K_P$ ,  $K_I$ ,  $\lambda_0$  and  $\lambda_1$  are constant loop filter parameters. The DCO consists of an LC oscillator core with a power-of-two-weighted coarse capacitor bank, an integer FCE bank and a fractional FCE bank. The latter two are driven by the multi-rate DEM encoder shown in Fig. 5 and modified as shown in Fig. 8 with  $f_{\text{fast}} = f_{\text{PLL}}/8$  and  $\Delta_{\min} = 40$  kHz (i.e.,  $\Delta = 156.25$  Hz).

The static gain error of the *i*th FCE was modeled as an additive zero-mean Gaussian random variable with a standard deviation of 5% of  $\Delta_i$  divided by the square root of  $\Delta_i/\Delta_{min}$ , which is consistent with measurement results obtained by the authors from the PLL IC presented in [36]. The FCE frequency transitions were modeled as second-order transients that settle within one  $T_{\text{fast}}$  period. The parameters of these transients, such as the damping factor and the natural frequency, are modelled as random variables with means and standard deviations determined from transistor-level simulation results. Fig. 12 shows example frequency transients used in the simulation.

The simulated noise parameters of the DCO and the reference oscillator, as well as the PEDC internal parameters, are the same as those used in [38]. Specifically,  $f_{ref} = 26$  MHz, N = 134 and  $\alpha = 0.0003846153$ , so that  $f_{PLL} = 3.484$  GHz and  $f_{fast} = 435.5$  MHz. The DLF parameters used were  $K_M = 1.25$ ,  $K_P = 2^4$ ,  $K_I = 2^{-4}$ ,  $\lambda_0 = 2^{-3}$  and  $\lambda_1 = 2^{-2}$ , and the MNC gains were set to  $K_a = 2^{-3}$  and  $K_b = 2^{-5}$ . The simulated PLL has a bandwidth of 206 kHz and a phase margin of 63 degrees.

Fig. 13(a) shows the simulated PLL phase noise PSD with the multi-rate DEM technique disabled, i.e., with the flip-flops in both the slow and fast DEM encoders frozen. The two curves in Fig. 13(a) were obtained from two different simulations: one in which  $d_{I}[n_{t}]$  is constant and another one in which  $d_{I}[n_{t}]$  changes frequently. As mentioned in Section III, although the DCO

input sequence does not vary significantly in the short term once the PLL is locked, its moving average drifts over time such that  $d_I[n_t]$  eventually begins to change frequently, at which point it degrades the PLL's phase noise as shown in Fig. 13(a). Once the multi-rate DEM technique is enabled, whether or not  $d_I[n_t]$  changes has no significant effect on the DCO's frequency, so spectral breathing no longer occurs.

Fig. 13(b) shows the simulated PLL phase noise PSD with the multi-rate DEM technique enabled for two cases: one case with just static gain errors, and the other case with just non-ideal frequency transitions. Fig. 13(c) shows the simulated PLL phase noise PSD considering both sources of error with the multi-rate DEM technique enabled and with the MNC technique disabled and enabled. The theoretical PLL phase noise PSD for ideal FCEs, which was computed using the linearized model presented in [38], is also plotted as the dashed curves in Fig. 13 to provide a comparison baseline.

As shown in Fig. 13(c), when the MNC technique is enabled the resulting phase noise PSD matches the theoretically-predicted phase noise PSD for ideal FCEs after 13·10<sup>7</sup> reference periods (5 seconds) from a cold start. This implies a phase noise improvement of more than 20 dB at an offset frequency around 10 MHz. As the FCE mismatches are mostly determined by circuit component mismatches, they are not expected to change significantly over time. Hence, once obtained, the MNC coefficients can be stored in memory and used subsequently by the PLL, thereby avoiding future convergence time delays.

Fig. 14 shows the evolution of the MNC coefficient errors over time from the simulation used to generate the curves in Fig. 13(c). As shown in Fig. 14, some  $b_{k,r}[n]$  coefficients initially move away from their ideal values. As explained above, this happens because the top and

bottom branches of each  $s_{k,r}[n_t]$  residue estimator interfere with each other so that the error estimate at the input of each accumulator is biased by the MNC coefficient error of the opposite branch. As suggested by Fig. 14, if the MNC gains are set properly, this is not a problem in practice because this effect becomes less significant as either one or both MNC coefficients approach their ideal values.<sup>5</sup>

It follows from (47) and (48) that the terms proportional to  $a_{k,r\text{-error}}[n]$  in p[n] are  $q_n - 3$ times larger than those proportional to  $b_{k,r\text{-error}}[n]$  (e.g.,  $q_n \cong 16$  in the design example), so for  $K_a = K_b$ , the error variance of each  $b_{k,r}[n]$  is expected to be larger than that of  $a_{k,r}[n]$ . Therefore, in order to make the error variance of the  $b_{k,r}[n]$  coefficients comparable to that of the  $a_{k,r}[n]$ coefficients,  $K_b$  has to be smaller than  $K_a$ . As shown in Fig. 14, this causes the  $b_{k,r}[n]$  coefficients to converge to their ideal values at a slower rate than the  $a_{k,r}[n]$  coefficients, so the convergence speed of the MNC technique is limited by  $K_b$ . Nonetheless, it follows from Fig. 14 that the  $a_{k,r}[n]$  coefficients get close to their ideal values in less than  $10^7$  reference periods (~0.4 seconds). Hence, as the most significant sources of phase noise are the FCE static gain errors, the MNC technique allows for a considerable phase noise improvement in less than half a second.

To reduce the cold-start convergence time of the MNC technique, large MNC gains can be used initially and decreased over time [61]. Fig. 15 shows the evolution of the MNC coefficient errors over time for  $7.8 \cdot 10^7$  reference periods (3 seconds) for an example case in which  $K_a$  and  $K_b$  are initially set to  $2^{-1}$  and  $2^{-2}$ , respectively, and then divided by two at the

<sup>&</sup>lt;sup>5</sup> Furthermore, extensive simulations run by the authors in which p[n] was subjected to pessimistic nonlinearities suggest that the convergence of the MNC coefficients is barely affected by nonlinearities in the PEDC.

times indicated by the vertical dashed lines. In this case, the MNC coefficients reach the final values shown in Fig. 14 in roughly 3 seconds, and the  $a_{k,r}[n]$  coefficients get close to their ideal values in less than  $2 \cdot 10^6$  reference periods (~0.08 seconds), which is five times faster than in Fig. 14.

## APPENDIX A

It follows from Fig. 5 and (17) that

$$f_{I}(t) = \sum_{i=5}^{22} \left[ \left( b_{i}[w_{t}] - \frac{1}{2} \right) \alpha_{i}(t) \Delta_{i} + \left( b_{i}[w_{t} - 1] - \frac{1}{2} \right) \gamma_{i}(t) \right].$$
(54)

Expressions for each  $b_i[w_t] = c_{i+12}[g(w_t)]$  in terms of  $d[g(w_t)]$  and the switching sequences can be found by tracing through the tree of Fig. 6 and applying (20) and the expressions shown in Fig. 4(a) and Fig. 4(b). This leads to

$$c_i[g(w_t)] - \frac{1}{2} = m_i d[g(w_t)] / \Delta + \sum_{k,r} \kappa_{k,r,i} s_{k,r}[g(w_t)],$$
(55)

where

$$m_i = 0 \text{ for } 17 \le i \le 26 \text{ and } m_i = 2^{-16} \text{ for } 27 \le i \le 34,$$
 (56)

and each  $\kappa_{k,r,i}$  is one of 0,  $-\frac{1}{2}$ ,  $\frac{1}{2}$ ,  $-2^{-k}$  or  $2^{-k}$ . Combining (4), (19) and (54)-(56) yields (26) and (27), where  $\alpha_I(t)$  and  $\gamma_I(t)$  are the averages of  $\alpha_i(t)$  and  $(2^{-13}/\Delta)\gamma_i(t)$  for i = 15, 16, ..., 22, respectively,

$$\alpha_{k,r}(t) = \sum_{i=5}^{22} \alpha_i(t) K_{i+12} \kappa_{k,r,i+12} \text{ and } \gamma_{k,r}(t) = \sum_{i=5}^{22} \frac{\gamma_i(t)}{\Delta} \kappa_{k,r,i+12}.$$
(57)

Each  $\alpha_I(t)$ ,  $\gamma_I(t)$ ,  $\alpha_{k,r}(t)$  and  $\gamma_{k,r}(t)$  is  $T_{\text{fast}}$ -periodic, because it is a linear combination of  $\alpha_i(t)$  and  $\gamma_i(t)$ , which are  $T_{\text{fast}}$ -periodic.

## APPENDIX B

The phase error of the digital PLL shown in Fig. 9 is given by

$$\theta_{\rm PLL}(t) = \int_0^t \psi_{\rm PLL}(u) du, \tag{58}$$

where  $\psi_{PLL}(t)$  is the PLL's frequency error at time *t*. The  $\theta_{PLL}[n]$  term in (44) is a sampled version of  $\theta_{PLL}(t)$  given by

$$\theta_{\text{PLL}}[n] = \theta_{\text{PLL}}(\tau_n), \tag{59}$$

where  $\tau_n = nT_{ref} + \lambda_n$  and  $\lambda_n$  is a small implementation-dependent deviation of  $\tau_n$  from its ideal value. It follows from (44), (58) and (59) that

$$p[n] = p[0] - T_{\text{ref}} \sum_{i=1}^{n} \psi_{\text{PLL}}[i] + e_p[n],$$
(60)

where

$$\psi_{\text{PLL}}[i] = \frac{1}{T_{\text{ref}}} \int_{\tau_{i-1}}^{\tau_i} \psi_{\text{PLL}}(u) du$$
(61)

is the PLL's average frequency error over the time interval  $[\tau_{i-1}, \tau_i]$  and p[0] is the initial value of p[n]. Fig. 9 and (61) imply that  $e_R(t)$  causes a term in  $\psi_{PLL}[i]$  given by

$$\left\{e_{R}*h\right\}[i] = \sum_{j=0}^{\infty} h[j]e_{R}[i-j],$$
 (62)

where

$$e_{R}[i] = \frac{1}{T_{\text{ref}}} \int_{\tau_{i-1}}^{\tau_{i}} e_{R}(u) du$$
(63)

and h[j] is the impulse response of the highpass filtering operation imposed by the PLL on the DCO's additive frequency error as described in Section VI-B.

In the design example of this paper  $\lambda_n = 4.2T_{\text{fast}} + \frac{1}{8}T_{\text{fast}}v[n]$ , where v[n] is an integervalued sequence restricted to the set {-6, -5, ..., 5, 6}, so  $\tau_n = nT_{\text{ref}} + 4.2T_{\text{fast}} + \frac{1}{8}T_{\text{fast}}v[n]$ . As the magnitude of  $\frac{1}{8}T_{\text{fast}}v[n]$  is at most  $\frac{3}{4}T_{\text{fast}}$ , its effect is negligible. Furthermore, for the sake of simplicity,  $\tau_n$  is assumed to be given by

$$\tau_n = \mu_n + 4T_{\text{fast}},\tag{64}$$

where  $\mu_n$ , as shown in Fig. 10(b), is a multiple of  $T_{\text{fast}}$ . Given that  $0 < \mu_n - nT_{\text{ref}} \le T_{\text{fast}}$  for all nand that  $T_{\text{fast}}$  is a small fraction of  $T_{\text{ref}}$ , this approximation does not significantly affect the following results. Substituting (36) with  $a_{k,r}$  and  $b_{k,r}$  replaced by  $a_{k,r}[g(w_t)]$  and  $b_{k,r}[g(w_t)]$ , respectively, into (35), and the result of this operation and (64) into (63) yields

$$e_{R}[i] = \frac{\Delta}{T_{\text{ref}}} \sum_{k,r} \int_{\mu_{i-1}+4T_{\text{fast}}}^{\mu_{i}+4T_{\text{fast}}} \left\{ \left( \delta_{k,r} - \alpha_{F} a_{k,r}[g(w_{t})] \right) s_{k,r}[g(w_{t})] + \left( \gamma_{k,r}(t) - \alpha_{F} b_{k,r}[g(w_{t})] \right) \left( s_{k,r}[g(w_{t}-1)] - s_{k,r}[g(w_{t})] \right) \right\} dt.$$
(65)

Given that  $t \in [\mu_n, \mu_{n+1})$  implies  $g(p_t) = n - 1$ , it follows that  $g(w_t) = i - 2$  for  $t \in [\mu_{i-1} + 4T_{\text{fast}}, \mu_i + T_{\text{fast}})$  and  $g(w_t) = i - 1$  for  $t \in [\mu_i + T_{\text{fast}}, \mu_i + 4T_{\text{fast}})$ , so (65) can be written as

$$e_{R}[i] = -\Delta \frac{\alpha_{F} T_{\text{fast}}}{T_{\text{ref}}} \sum_{k,r} \left\{ y_{k,r-a}[i-1] + y_{k,r-b}[i-1] \right\},$$
(66)

where  $y_{k,r-a}[i]$  and  $y_{k,r-b}[i]$  are given by (47) and (48), respectively, and it has been assumed that  $q_i = (\mu_{i+1} - \mu_i)/T_{\text{fast}}$  is greater than 3 for all *i* (e.g.,  $q_i \cong 16$  in the design example). Substituting (66) into (62) and the result into (60), rearranging terms and considering that  $s_{k,r}[n] = 0$  for n < 0 gives (45) and (46).

# ACKNOWLEDGEMENTS

The authors are grateful to Amr Eissa for carefully reviewing this manuscript and for helpful advice.

This chapter, in full, has been published in the IEEE Transactions on Circuits and Systems I: Regular Papers, volume 65, number 10, pages 3125-3137, October 2018. E. Alvarez-Fontecilla, C. Venerus, I. Galton, 2018. The dissertation author is the primary investigator and author of this paper. Professor Ian Galton supervised the research which forms the basis for this paper.

# FIGURES



Figure 1: Conventional frequency control technique for an LC-based DCO.



Figure 2: Example waveforms related to (15) and (16) for an FCE input bit sequence of 1, 0, 1, 1, 0.



Figure 3: Segmented DEM encoder example.



Figure 4: (a) Segmenting switching block, (b) non-segmenting switching block, and (c) switching sequence generator.



Figure 5: Multi-rate DEM encoder example.



Figure 6: Slow DEM encoder example.



Figure 7: Details of the second-order digital  $\Delta\Sigma$  modulator.



Figure 8: Fractional path of the example multi-rate DEM encoder shown in Fig. 5 modified to accommodate  $e_{MNC}[p_t]$ .



Figure 9: General form of a digital fractional-N PLL.



Figure 10: (a) Synchronization circuit used at DCO input, and (b) illustration of the clock signals within the DCO,  $p_t$ , and  $n_t = g(p_t)$  for  $f_{\text{fast}} = 4.5 f_{\text{ref}}$ .



Figure 11: (a) Digital fractional-N PLL with multi-rate DEM and MNC, (b) details of the MNC logic, and (c) details of each switching sequence residue estimator.



Figure 12: Example frequency transitions normalized to  $\Delta_i/2$  versus time over  $T_{\text{fast}} = 8T_{\text{PLL}}$  for six different FCEs.



Figure 13: Simulated PLL phase noise PSD versus frequency with (a) static gain errors and non-ideal frequency transitions enabled and the multi-rate DEM technique disabled, (b) static gain errors and non-ideal frequency transitions enabled separately and the multi-rate DEM technique enabled, and (c) both sources of error enabled, the multi-rate DEM technique enabled and the MNC technique disabled and enabled.



Figure 14: MNC coefficient error evolution over time for  $K_a = 2^{-3}$  and  $K_b = 2^{-5}$ .



Figure 15: MNC coefficient error evolution over time for MNC gains that change over time. Initially  $K_a = 2^{-1}$  and  $K_b = 2^{-2}$ , and after 3.5 · 107 reference periods  $K_a = 2^{-6}$  and  $K_b = 2^{-6}$ .

#### REFERENCES

- 1. W. T. Bax and M. A. Copeland, "A GMSK Modulator Using a  $\Delta\Sigma$  Frequency Discriminator-Based Synthesizer," *IEEE J. Solid-State Circuits*, vol. 36, no. 8, pp. 1218-1227, Aug. 2001.
- R. B. Staszewski, J. L. Wallberg, S. Rezeq, C.-M. Hung, O. E. Eliezer, S. K. Vemulapalli, C. Fernando, K. Maggio, R. Staszewski, N. Barton, M.-C. Lee, P. Cruise, M. Entezari, K. Muhammad, and D. Leipold, "All-Digital TX Frequency Synthesizer and Discrete-Time Receiver for Bluetooth Radio in 130-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 39, no. 12, pp. 2278-2291, Dec. 2004.
- R. B. Staszewski, J. L. Wallberg, S. Rezeq, C.-M. Hung, O. E. Eliezer, S. K. Vemulapalli, C. Fernando, K. Maggio, R. Staszewski, N. Barton, M.-C. Lee, P. Cruise, M. Entezari, K. Muhammad, and D. Leipold, "All-Digital PLL and Transmitter for Mobile Phones," *IEEE J. Solid-State Circuits*, vol. 40, no. 12, pp. 2469-2482, Dec. 2005.
- K. Muhammad, H. Yo-Chuol, T. L. Mayhugh, C.-M. Hung, T. Jung, I. Elahi, C. Lin, I. Y. Deng, C. Fernando, J. L. Wallberg, S. K. Vemulapalli, S. Larson, T. Murphy, D. Leipold, P. Cruise, J. Jaehnig, M.-C. Lee, R. B. Staszewski, R. Staszewski, and K. Maggio, "The first fully integrated quad-band GSM/GPRS receiver in a 90-nm digital CMOS process," *IEEE J. Solid-State Circuits*, vol. 41, no. 8, pp. 1772-1783, Aug. 2006
- 5. R. Tonietto, E. Zuffetti, R. Castello, and I. Bietti, "A 3 MHz Bandwidth Low Noise RF All Digital PLL with 12ps Resolution Time-to-Digital Converter," *Eur. Solid-State Circuits Conf.*, pp. 150-153, Sept. 2006.
- 6. C. Weltin-Wu, E. Temporiti, D. Baldi, and F. Svelto, "A 3 GHz Fractional-*N* All-Digital PLL with Precise Time-to-Digital Converter Calibration and Mismatch Correction," *IEEE Int. Solid-State Circuits Conf.*, pp. 344-345, Feb. 2008.
- 7. H. H. Chang, P-Y. Wang, J.-H.C. Zhan, and H. Bing-Yu, "A Fractional Spur-Free ADPLL with Loop-Gain Calibration and Phase-Noise Cancellation for GSM/GPRS/EDGE," *IEEE Int. Solid-State Circuits Conf.*, pp. 200-201, 606, Feb. 2008.
- 8. M. A. Ferriss and M. P. Flynn, "A 14 mW Fractional-*N* PLL Modulator with a Digital Phase Detector and Frequency Switching Scheme," *IEEE J. Solid-State Circuits*, vol. 43, no. 11, pp. 2464-2471, Nov. 2008.
- 9. C. Hsu, M. Z. Straayer, and M. H. Perrott, "A Low-Noise, Wide-BW 3.6 GHz Digital  $\Delta\Sigma$ Fractional-*N* Frequency Synthesizer with a Noise-Shaping Time-to-Digital Converter and Quantization Noise Cancellation," *IEEE J. Solid-State Circuits*, vol. 43, no. 12, pp. 2776-

2786, Dec. 2008.

- 10. V. Kratyuk, P. K. Hanumolu, K. Ok, U.-K. Moon, and K. Mayaram, "A Digital PLL With a Stochastic Time-to-Digital Converter," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 56, no. 8, pp. 1612-1621, Aug. 2009.
- 11. M. Lee, M. E. Heidari, and A. A. Abidi, "A Low-Noise Wideband Digital Phase-Locked Loop Based on a Coarse-Fine Time-to-Digital Converter with Subpicosecond Resolution," *IEEE J. Solid-State Circuits*, vol. 44, no. 10, pp. 2808-2816, Oct. 2009.
- M. S.-W. Chen, D. Su, and S. Mehta, "A Calibration-Free 800 MHz Fractional-N Digital PLL With Embedded TDC," *IEEE J. Solid-State Circuits*, vol. 45, no. 12, pp. 2819-2827, Dec. 2010.
- 13. M. Zanuso, S. Levantino, C. Samori, and A. Lacaita, "A 3MHz-BW 3.6GHz Digital Fractional-*N* PLL with Sub-Gate-Delay TDC, Phase-Interpolation Divider, and Digital Mismatch Cancellation," *IEEE Int. Solid-State Circuits Conf.*, pp. 476-477, Feb. 2010.
- J. Borremans, K. Vengattaramane, V. Giannini, B. Debaillie, W. V. Thillo, and J. Craninckx, "A 86MHz-to-12GHz Digital-Intensive Phase-Modulated PLL for Software-Defined Radios, Using a 6fJ/Step TDC in 40nm digital CMOS," *IEEE J. Solid-State Circuits*, vol. 45, no. 10, pp. 2116-2129, Oct. 2010.
- 15. S. K. Lee, Y. H. Seo, Y. Suh, H. J. Park, and J. Y. Sim, "A 1 GHz ADPLL with a 1.25 ps Minimum-Resolution Sub-Exponent TDC in 0.18m CMOS," *IEEE Int. Solid-State Circuits Conf.*, pp. 482-483, Feb. 2010.
- 16. O. Eliezer, B Staszewski, J Mehta, F Jabbar, and I Bashir, "Accurate Self-Characterization of Mismatches in a Capacitor Array of a Digitally-Controlled Oscillator," *IEEE Dallas Circuits and Syst. Workshop*, pp. 17-18, Oct. 2010.
- T. Tokairin, M. Okada, M. Kitsunezuka, M. Tadashi, and M. Fukaishi, "A 2.1-to-2.8-GHz Low-Phase-Noise All-Digital Frequency Synthesizer with a Time-Windowed Timeto-Digital Converter," *IEEE J. Solid-State Circuits*, vol. 45, no. 12, pp. 2582-2590, Dec. 2010.
- 18. O. Eliezer, B. Staszewski, and S Vemulapalli, "Digitally controlled oscillator in a 65nm GSM/EDGE transceiver with built-in compensation for capacitor mismatches," *IEEE Radio Frequency Integrated Circuits Symp.*, pp. 5-7 June 2011.
- 19. K. Waheed, K, M. Sheba, R.B. Staszewski, F. Dulger, and S.D. Vamvakos, "Spurious free time-to-digital conversion in an ADPLL using short dithering sequences," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 58, no. 9, pp. 2051-2060, Sept. 2011.
- 20. E. Temporiti, C. Weltin-Wu, D. Baldi, M. Cusmai, and F. Svelto, "A 3.5 GHz Wideband ADPLL With Fractional Spur Suppression Through TDC Dithering and Feedforward

Compensation," IEEE J. Solid-State Circuits, vol. 45, no. 12, pp. 2723-2736, Dec. 2010.

- 21. K. Takinami, R. Strandberg, P.C.P. Liang, G.L.G. de Mercey, T. Wong, and M. Hassibi, "A rotary-traveling-wave-oscillator-based all-digital PLL with a 32-phase embedded phase-to-digital converter in 65nm CMOS," *IEEE Int. Solid-State Circuits Conf.*, pp. 100-102, Feb. 2011.
- 22. D. Tasca, M. Zanuso, G. Marzin, S. Levantino, C. Samori, and A. L. Lacaita, "A 2.9-to-4.0 GHz fractional-*N* digital PLL with bang-bang phase detector and 560 fs rms integrated jitter at 4.5 mW power," *IEEE J. Solid-State Circuits*, vol. 46, no. 12, pp. 2745-2758, Dec. 2011.
- 23. B. Staszewski, K. Waheed, F. Dulger, and O. Eliezer, "Spur-Free Multirate All-Digital PLL for Mobile Phones in 65 nm CMOS," *IEEE J. Solid-State Circuits, IEEE*, vol. 46, no. 12, pp. 2904-2919, Dec. 2011.
- 24. G. Marzin, S. Levantino, C. Samori, and A. Lacaita, "A 20Mb/s Phase Modulator Based on a 3.6GHz Digital PLL with -36dB EVM at 5mW Power," *IEEE Int. Solid-State Circuits Conf.*, pp. 342-343, Feb. 2012.
- 25. J. Chen, L. Rong, F. Jonsson, G. Yang, and L.-R. Zheng, "The design of all-digital polar transmitter based on ADPLL and phase synchronized  $\Delta\Sigma$  modulator," *IEEE J. Solid-State Circuits*, vol. 47, no. 5, pp. 1154-1164, May 2012.
- L. Vercesi, L. Fanori, F. De Bernardinis, A. Liscidini, and R. Castello, "A Dither-Less All Digital PLL for Cellular Transmitters," *IEEE J. Solid-State Circuits*, vol. 47, no. 8, pp. 1908-1920, Aug. 2012.
- 27. K. Euda, T. Uozumi, R. Endo, T. Nakamura, T. Heima, and H. Sato, "A Digital PLL with Two-step Closed-locking for Multi-mode/Multi-band SAW-less Transmitter," *IEEE Custom Integrated Circuits Conf.*, pp. 1-4, Sept. 2012.
- L. Li, M. P. Flynn, and M. A. Ferriss, "A 5.8GHz Digital Arbitrary Phase-setting Type II PLL in 65nm CMOS with 2.25° Resolution," *IEEE Asian Solid-State Circuits Conf.*, pp. 317-320, Nov. 2012.
- 29. C. Venerus and I. Galton, "Delta-Sigma FDC Based Fractional-*N* PLLs," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 60, no. 5, pp. 1274-1285, May 2013.
- B. Shen, G. Unruh, M. Lugthart, C.H. Lee, and M. Chambers, "An 8.5 mW, 0.07 mm<sup>2</sup> ADPLL in 28 nm CMOS with Sub-ps Resolution TDC and < 230 fs RMS Jitter," *Symp. VLSI Circuits*, pp. C192-C193, June 2013.
- 31. D.-W. Jee, Y.-H. Seo, H.-J. Park, and J.-Y. Sim, "A 2GHz fractional-*N* digital PLL with 1 b noise shaping TDC," *IEEE J. Solid-State Circuits*, vol. 47, no. 4, pp. 875-883, Apr. 2012.

- 32. C.-W. Yao and A. L.Willson, "A 2.8-3.2-GHz fractional-*N* digital PLL with ADCassisted TDC and inductively coupled fine-tuning DCO," *IEEE J. Solid-State Circuits*, vol. 48, no. 3, pp. 698-710, Mar. 2013.
- F.-W. Kuo, R. Chen, K. Yen, H.-Y. Liao, C.-P. Jou, F.-L. Hsueh, M. Babaie, and R. B. Staszewski, "A 12mW all-digital PLL based on class-F DCO for 4G phones in 28nm CMOS," *Symp. VLSI Circuits,* June 2014.
- 34. M. Talegaonkar, T. Anand, A. Elkholy, A. Elshazly, R. K. Nandwana, S. Saxena, B. Young, W. Choi, and P. K. Hanumolu, "A 4.4-5.4GHz Digital Fractional-*N* PLL Using  $\Delta\Sigma$  Frequency-to-Digital Converter," *Symp. VLSI Circuits*, June 2014.
- 35. S. Levantino, G. Marzin, and C. Samori, "An Adaptive Pre-Distortion Technique to Mitigate the DTC Nonlinearity in Digital PLLs," *IEEE J. Solid-State Circuits*, vol. 49, no. 8, pp. 1762-1772, Aug. 2014.
- 36. C. Venerus and I. Galton, "A TDC-Free Mostly-Digital FDC-PLL Frequency Synthesizer with a 2.8-3.5 GHz DCO," *IEEE J. Solid-State Circuits*, vol. 50, no. 2, pp. 450-463, Feb. 2015.
- 37. A. Elkholy, T. Anand, C. Woo-Seok, A. Elshazly, and P. Hanumolu, "A 3.7 mW Low-Noise Wide-Bandwidth 4.5 GHz Digital Fractional-*N* PLL Using Time Amplifier-Based TDC," *IEEE J. Solid-State Circuits*, vol. 50, no. 4, pp. 867-881, Apr. 2015.
- C. Weltin-Wu, E. Familier, and I. Galton, "A Linearized Model for the Design of Fractional-*N* PLLs based on Dual-Mode Ring Oscillator FDCs," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 62, no. 8, pp. 2013-2023, Aug. 2015.
- 39. C. Weltin-Wu, G. Zhao, and I. Galton, "A Highly-Digital Frequency Synthesizer Using Ring-Oscillator Frequency-to-Digital Conversion and Noise Cancellation," *IEEE International Solid-State Circuits Conf.*, pp. 1-3, Feb. 2015.
- 40. C. Weltin-Wu, G. Zhao, and I. Galton, "A 3.5 GHz Digital Fractional-*N* Frequency Synthesizer Based on Ring Oscillator Frequency-to-Digital Conversion," *IEEE J. Solid-State Circuits*, vol. 50, no. 12, pp. 2988-3002, Dec. 2015.
- 41. A. Elkholy, S. Saxena, R. K. Nandwana, A. Elshazly and P. K. Hanumolu, "A 2.0-5.5 GHz Wide Bandwidth Ring-Based Digital Fractional-*N* PLL with Extended Range Multi-Modulus Divider," *IEEE J. Solid-State Circuits*, vol. 51, no. 8, pp. 1771-1784, Aug. 2016.
- 42. W.-S. Chang and T.-C. Lee, "A 5 GHz Fractional-*N* ADC-Based Digital Phase-Locked Loops with –243.8 dB FOM," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 63, no. 11, pp. 1845-1853, Nov. 2016.

- 43. C.-R. Ho and M. Chen, "A Fractional-*N* DPLL with Calibration-Free Multi-Phase Injection-Locked TDC and Adaptive Single-Tone Spur Cancellation Scheme," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 63, no. 8, pp. 1111-1122, Aug. 2016.
- 44. Z. Xu, M. Miyahara, K. Okada, and A. Matsuzawa, "A 3.6 GHz Low-Noise Fractional-*N* Digital PLL Using SAR-ADC-Based TDC," *IEEE J. Solid-State Circuits*, vol. 51, no. 10, pp. 2345-2356, Oct. 2016.
- 45. Y.-H. Tseng, C.-W. Yeh, and S.-I. Liu, "A 2.25-2.7 GHz Area-Efficient Subharmonically Injection-Locked Fractional-*N* Frequency Synthesizer with a Fast-Converging Correlation Loop," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 64, no. 4, pp. 811-822, Apr. 2017.
- 46. C. Yao, R. Ni, C. Lau, W. Wu, K. Godbole, Y. Zuo, S. Ko, N. Kim, S. Han, I. Jo, J. Lee, J. Han, D. Kwon, C. Kim, S. Kim, S. W. Son, and T. B. Cho, "A 14-nm 0.14ps<sub>rms</sub> Fractional-*N* Digital PLL with a 0.2-ps Resolution ADC-Assisted Coarse/Fine-Conversion Chopping TDC and TDC Nonlinearity Calibration," *IEEE J. Solid-State Circuits*, vol. 52, no. 12, pp. 3446-3357, Dec. 2017.
- 47. Y. Wu, M. Shahmohammadi, Y. Chen, P. Lu and R. B. Staszewski, "A 3.5-6.8-GHz Wide-Bandwidth DTC-Assisted Fractional-*N* All-Digital PLL with a MASH  $\Delta\Sigma$ -TDC for Low In-Band Phase Noise," *IEEE J. Solid-State Circuits*, vol. 52, no. 7, pp. 1885-1903, July 2017.
- 48. M. Talegaonkar, T. Anand, A. Elkholy, A. Elshazly, R. K. Nandwana, S. Saxena, B. Young, W. Choi, and P. K. Hanumolu, "A 5 GHz Digital Fractional-*N* PLL Using a 1-bit Delta-Sigma Frequency-to-Digital Converter in 65 nm CMOS," *IEEE J. Solid-State Circuits*, vol. 52, no. 9, pp. 2306-2320, Sept. 2017.
- 49. D. Liao, H. Wang, F. F. Dai, Y. Xu, R. Berenguer and S. M. Hermoso, "An 802.11a/b/g/n Digital Fractional-N PLL with Automatic TDC Linearity Calibration for Spur Cancellation," *IEEE J. Solid-State Circuits*, vol. 52, no. 5, pp. 1210-1220, May 2017.
- 50. L. Fanori, A. Liscidini, and R. Castello, "3.3GHz DCO with a Frequency Resolution of 150Hz for All-Digital PLL," *IEEE International Solid-State Circuits Conf.*, pp. 48-49, Feb. 2010.
- 51. J. Zhuang, K. Waheed, and R.-B. Staszewski, "Design of Spur-Free ΣΔ Frequency Tuning for Digitally Controlled Oscillators," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 62, no. 1, pp. 46-50, Jan. 2015.
- 52. I. Galton, "Why Dynamic-Element-Matching DACs Work," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 57, no. 2, pp. 69-74, Mar. 2010.

- 53. J. Remple and I. Galton, "The Effects of Inter-Symbol Interference in Dynamic Element Matching DACs," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 64, no. 1, pp. 14-23, Jan. 2017.
- 54. K. L. Chan, N. Rakuljic, and I. Galton, "Segmented Dynamic Element Matching for High-Resolution Digital-to-Analog Conversion," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 55, no. 11, pp. 3383-3392, Dec. 2008.
- 55. S. Pamarti, J. Welz, and I. Galton, "Statistics of the Quantization Noise in 1-Bit Dithered Single-Quantizer Digital Delta-Sigma Modulators," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 54, no. 3, pp. 492-503, Mar. 2007.
- 56. I. Galton, "Spectral Shaping of Circuit Errors in Digital-to-Analog Converters," *IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process.*, vol. 44, no. 10, pp. 808-817, Nov. 1997.
- 57. J. Welz, I. Galton, and E. Fogleman, "Simplified Logic for First-Order and Second-Order Mismatch-Shaping Digital-to-Analog Converters," *IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process.*, vol. 48, no. 11, pp. 1014-1028, Nov. 2001.
- 58. A.V. Oppenheim, A. S. Willsky, and S.H. Nawab, "Signals & Systems," 2nd ed. Upper Saddle River, NJ: Prentice Hall, 1997.
- 59. R. Ginosar, "Metastability and Synchronizers: A Tutorial," *IEEE Design & Test of Computers*, vol. 28, no. 5, pp. 23-35, Sept.-Oct. 2011.
- 60. D. Kong and I. Galton, "Adaptive Cancellation of Static and Dynamic Mismatch Error in Continuous-Time DACs," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 65, no. 2, pp. 421-433, Feb. 2018.
- 61. W. Y. Chen and R. A. Haddad, "A Variable Step Size LMS Algorithm," *Proc.* 33<sup>rd</sup> *Midwest Symp. Circuits and Systems*, pp. 423-426, Aug. 1990.

#### **CHAPTER 2**

# DELTA-SIGMA FDC ENHANCEMENTS FOR FDC-BASED DIGITAL FRACTIONAL-N PLLS

**Abstract**— This paper describes all-digital enhancements for digital fractional-*N* phase-locked loops (PLLs) based on delta-sigma ( $\Delta\Sigma$ ) frequency-to-digital converters (FDCs). The enhancements include an improved dual-mode ring oscillator (DMRO)-based  $\Delta\Sigma$  FDC architecture and a digital background calibration technique that compensates for the  $\Delta\Sigma$  FDC's forward path gain error. The improved  $\Delta\Sigma$  FDC has significantly relaxed timing constraints and a 3× smaller phase-frequency detector output pulse-width span relative to the prior art, which make it simpler to implement and amenable to higher-frequency reference signals. The calibration technique compensates for non-ideal DMRO frequencies in the digital domain. It eliminates the need to tune the DMRO instantaneous frequencies as a function of the PLL output frequency, thereby simplifying the DMRO implementation, and it also improves the phase noise performance of PLLs with high loop bandwidths.

Manuscript received September 14, 2020; revised November 5, 2020; accepted November 21, 2020. This work was supported by the National Science Foundation under Award 1617545. This article was recommended by Associate Editor R. Rieger. (*Corresponding author:Enrique Alvarez-Fontecilla.*)

Enrique Alvarez-Fontecilla, Amr I. Eissa, Eslam Helal, and Ian Galton are with the Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, CA 92092 USA (e-mail: cea007@ucsd.edu).

Colin Weltin-Wu is with Aeonsemi, Inc., Santa Clara, CA 95134 USA.

Color versions of one or more figures in this article are available online at

https://doi.org/10.1109/TCSI.2020.3040346.

Digital Object Identifier 10.1109/TCSI.2020.3040346

#### I. INTRODUCTION

Digital fractional-*N* phase-locked loops (PLLs) based on second-order delta-sigma ( $\Delta\Sigma$ ) frequency-to-digital converters (FDCs) offer advantages of both analog and digital PLLs [1]–[12]. They have the same quantization error behavior as analog PLLs based on second-order  $\Delta\Sigma$  modulators, but they do not require large-area analog loop filters.

This paper presents all-digital enhancements for  $\Delta\Sigma$  FDCs that reduce implementation complexity and improve performance. The enhancements include a modified dual-mode ring oscillator (DMRO)-based  $\Delta\Sigma$  FDC architecture and a digital background calibration technique that compensates for  $\Delta\Sigma$  FDC forward path gain error caused by non-ideal DMRO frequencies.

The modified  $\Delta\Sigma$  FDC architecture has relaxed timing constraints and a 3× smaller phase-frequency detector (PFD) output pulse-width span compared to prior-art  $\Delta\Sigma$  FDCs [8]–[ 12]. These benefits make the new  $\Delta\Sigma$  FDC simpler to implement [13]. They also make it amenable to higher-frequency reference signals for any given PLL output frequency, which is useful because increasing the reference frequency reduces the contributions of the reference signal phase noise,  $\Delta\Sigma$  FDC quantization error, and DMRO phase noise to the PLL's output phase noise [11].

The DMRO in a DMRO-based  $\Delta\Sigma$  FDC is designed to oscillate at one of two frequencies at any given time. These frequencies, denoted as  $f_{high}$  and  $f_{low}$  in this paper, ideally have a specific relationship to the PLL output frequency,  $f_{PLL}$ . In prior art DMRO-based  $\Delta\Sigma$ FDCs,  $f_{high}$  and  $f_{low}$  are adjusted each time  $f_{PLL}$  is changed to approximate this ideal relationship, which adds complexity to the DMRO design. Furthermore, while the PLL's performance is relatively insensitive to deviations of  $f_{high}$  and  $f_{low}$  from their ideal values for low-to-moderate PLL bandwidths, this is not the case for high PLL bandwidths.

The proposed digital background calibration technique addresses these issues. Rather than dynamically adjusting  $f_{high}$  and  $f_{low}$  by controlling the DMRO's analog circuitry as a function of  $f_{PLL}$ , it dynamically adjusts digital circuitry to compensate for error that would otherwise be caused by non-ideal values of  $f_{high}$  and  $f_{low}$ . Moreover, it does so with much finer resolution than prior art  $\Delta\Sigma$  FDCs are able to adjust the DMRO to tune  $f_{high}$  and  $f_{low}$ . These benefits greatly simplify the DMRO, which can now be designed to have fixed values of  $f_{high}$ and  $f_{low}$ , and significantly reduce phase noise for high PLL bandwidths.

The remainder of the paper consists of four main sections. Section II provides an overview of prior-art fractional-*N* PLLs that incorporate  $\Delta\Sigma$  FDCs based on DMROs. Sections III and IV present the proposed  $\Delta\Sigma$  FDC enhancements described above, and Section V presents simulation results that demonstrate their performance.

#### II. $\Delta\Sigma$ FDC DIGITAL FRACTIONAL-*N* PLL OVERVIEW

## A. $\Delta\Sigma$ FDC-Based PLL

A high-level block diagram of a second-order  $\Delta\Sigma$  FDC-based fractional-*N* PLL is shown in Fig. 16. It consists of a  $\Delta\Sigma$  FDC, a digital loop controller (DLC) with quantization noise cancellation (QNC), and a digitally-controlled oscillator (DCO) [8]–[12]. The signal  $v_{ref}(t)$  is t he output of a reference oscillator with frequency  $f_{ref}$  and  $v_{PLL}(t)$  is the PLL output waveform. Ideally,  $v_{PLL}(t)$  is periodic with frequency  $f_{PLL} = (N + \alpha)f_{ref}$ , where *N* is a positive integer and  $\alpha$ has a fractional value that can range from  $-\frac{1}{2}$  to  $\frac{1}{2}$ . The  $\Delta\Sigma$  FDC generates two  $f_{ref}$ -rate digital sequences, y[n] and  $-\hat{e}_q[n]$ . Specifically,

$$y[n] = -\alpha - e_{\text{PLL}}[n] + \underbrace{e_q[n] - 2e_q[n-1] + e_q[n-2]}_{2^{\text{nd}} \text{-order shaped version of } e_q[n]},$$
(67)

where  $e_q[n]$  is the quantization error introduced by the  $\Delta\Sigma$  FDC and  $e_{PLL}[n]$  is a measure of the average frequency error of  $v_{PLL}(t)$  over the *n*th reference period. The  $\hat{e}_q[n]$  sequence is an estimate of  $e_q[n]$ . It is used to partially cancel the contribution of  $e_q[n]$  at the input of the digital loop filter (DLF) within the DLC [12]–[15]. By cancelling the quantization error prior to the loop filter, QNC allows the PLL's bandwidth to be increased without significantly degrading the PLL's phase noise.

#### B. Original $\Delta\Sigma$ FDC Architecture

A simplified block diagram of the  $\Delta\Sigma$  FDC presented in [11], hereafter referred to as the original  $\Delta\Sigma$  FDC, is shown in Fig. 17(a). It consists of a PFD with top output u(t), a multimodulus divider with output  $v_{div}(t)$ , a DMRO, a digital ring phase calculator (RPC), and a 2 –  $z^{-1}$  digital feedback block with output v[n] that controls the divider. Although not shown in Fig. 2 for simplicity, the RPC's accumulator clips to keep its output in the range  $-2 \le r[n] < 3$ . As explained in [10], this reduces the PLL's worst-case locking time, but has no effect on the PLL's locked behavior. The PFD and divider are identical to those in analog PLLs.

Each reference period, the signal encoded in the width of the u(t) pulse is accumulated by the DMRO. Then, the outputs of the DMRO, which represent a quantized version of its phase, are sampled and processed by the RPC to generate y[n] and  $-\hat{e}_q[n]$ .

The DMRO is implemented as a ring of  $N_R$  nominally identical delay cells. Ideally, its instantaneous frequency is  $f_{\text{high}}$  when u(t) is high and  $f_{\text{low}}$  when u(t) is low, where

$$A\frac{\left(f_{\text{high}} - f_{\text{low}}\right)}{f_{\text{PLL}}} = 1,$$
(68)

and A is a design parameter [11].<sup>7</sup>

Each reference period, the quantized DMRO phase,  $p_R[n]$ , is computed from the DMRO output lines. As indicated in Fig. 2(a),  $p_R[n]$  is passed through a  $1 - z^{-1}$  block, and a positive constant, M, is subtracted from the result prior to the multiplication by A and accumulation. These operations yield r[n], which is a fixed-point measure of  $-\alpha - e_{PLL}[n]$  in units of cycles per reference period. The three most significant bits (MSBs) of r[n] correspond to the integer part of r[n], whereas the remaining least significant bits (LSBs) correspond to the fractional part of r[n][11].

The operation of the divider is such that adjacent rising edges of  $v_{div}(t)$  are separated by N-v[n] PLL output periods. Ideally, v[n] would be set to 2r[n] - r[n-1], but dividers can only count integer numbers of PLL output periods and r[n] contains both integer and fractional parts. Therefore, it is necessary to instead use just the integer part of r[n], i.e., y[n], so that v[n] = 2y[n] - y[n-1] is integer-valued. Given that y[n] is a quantized version of r[n], the fractional part of r[n], i.e.,  $-\hat{e}_q[n]$ , is the negative of the corresponding quantization error. The DLC uses  $-\hat{e}_q[n]$  to perform QNC.

As proven in [11], the behavior of the system shown in Fig. 2(a) is identical to that of the second-order  $\Delta\Sigma$  modulator shown in Fig. 3(a). The phase quantization operation performed by the DMRO is denoted by  $Q_r$  and modeled as a fine quantizer of step-size  $\Delta_r = (2N_R)^{-1}$ . Its

<sup>&</sup>lt;sup>7</sup> In [11],  $2^{-J}$ , where J is an integer, is used instead of  $A^{-1}$ , but the structures shown in Fig. 2 do not restrict A to be a power of 2.

quantization error,  $e_{qr}[n]$ , corresponds to the *residual* quantization error that is left after QNC. The quantization operation that occurs at the output of the RPC is denoted as  $Q_c$  and modeled as a *coarse* quantizer with step-size  $\Delta_c = 1$ . If  $2N_R/A$  is integer-valued, then the blocks contained in the dashed contour in Fig. 3(a) are equivalent to an accumulator followed by a quantizer, Q, with unity step-size and associated error given by

$$e_q[n] = A e_{qr}[n] + \hat{e}_q[n].$$
 (69)

In this case, y[n] is given by (67) and the system's self-dithering property causes  $e_q[n]$  to have a power spectral density (PSD) equivalent to that of a zero-mean white noise sequence with variance 1/12 [16], [17].

## C. Original $\Delta\Sigma$ FDC Issues

The original  $\Delta\Sigma$  FDC suffers from two issues. One issue is tight timing constraints on both the digital part of the  $\Delta\Sigma$  FDC and the divider. The other issue is high sensitivity to nonideal DMRO frequencies for high PLL bandwidths.

Once the  $\Delta\Sigma$  FDC locks, the rising edges of  $v_{div}(t)$  succeed and precede rising and falling edges of  $v_{ref}(t)$ , respectively [10], [11]. Therefore, as implied by Fig. 17(a), after the *n*th rising edge of  $v_{div}(t)$ , the  $\Delta\Sigma$  FDC must compute y[n] and use it along with y[n-1] to form v[n], which the divider then uses to determine the (n + 1)th rising edge of  $v_{div}(t)$ . This limits the time available for the  $\Delta\Sigma$  FDC to process the u(t) pulse and compute y[n] to approximately one reference period, and requires a divider that is capable of loading the divider modulus in the middle or toward the end of the divider count [10]. These features tend to increase the power consumption, circuit area, and complexity of the divider.
As explained in [11],  $\Delta\Sigma$  FDC-based PLLs are not highly sensitive to non-ideal values of  $f_{high}$  and  $f_{low}$ , i.e., values of  $f_{high}$  and  $f_{low}$  that do not exactly satisfy (68), in much the same way that a second-order  $\Delta\Sigma$  modulator is not sensitive to deviations in the gain of its second accumulator [16]. Nevertheless, the need to adjust the DMRO in the original  $\Delta\Sigma$  FDC each time  $f_{PLL}$  changes so that  $f_{high}$  and  $f_{low}$  at least approximately satisfy (68) complicates the DMRO design. Moreover, as shown in Section IV, the accuracy with which (68) must be satisfied increases significantly with PLL bandwidth to the point that process, voltage, and temperature variations cause  $f_{high}$  and  $f_{low}$  to deviate from their ideal values enough to significantly degrade the PLL's phase noise.

### III. IMPROVED $\Delta\Sigma$ FDC

### A. Proposed $\Delta\Sigma$ FDC Architecture

The proposed  $\Delta\Sigma$  FDC is shown in Fig. 17(b).<sup>8</sup> It is similar to that shown in Fig. 17(a) except for the feedback digital block and the details of the RPC. Instead of feeding back 2y[n] -y[n-1] through the divider, 2y[n-1] is fed back directly to the input of the accumulator within the RPC, and only y[n-1] is fed back through the divider.

An argument similar to that presented in [11] shows that the resulting system's behavior is identical to that of the second-order  $\Delta\Sigma$  modulator shown in Fig. 18(b), whose behavior is identical to that of the system shown in Fig. 18(a) provided  $2N_R/A$  is integer-valued.

<sup>&</sup>lt;sup>8</sup> The sequences v[n],  $p_R[n]$  and  $d_R[n]$  and the signals u(t) and  $v_{div}(t)$  in Fig. 17(b) are not identical to those in Fig. 17(a), but they play the same roles in both  $\Delta\Sigma$  FDCs, which is why they share the same names.

A feature of the original  $\Delta\Sigma$  FDC is that once it locks, the DMRO locks to an average frequency of  $Mf_{ref}$ , which minimizes the potential for fractional spurs if M is integer-valued [10]. Specifically, given that r[n] is bounded when the  $\Delta\Sigma$  FDC is locked, the input to the accumulator within the RPC, and, hence, the M-adder output, must be zero-mean, which can only happen if the DMRO phase advances, on average, M cycles per reference period.

In the proposed  $\Delta\Sigma$  FDC, the average of the *M*-adder output is forced to zero by subtracting  $2\alpha$  from the accumulator's input, so that the average DMRO frequency is given by  $Mf_{\rm ref}$ . Reasoning similar to that presented above and (67) imply that without the  $2\alpha$  subtraction the local feedback around the accumulator would cause the output of the *M*-adder to have an average of  $-2A^{-1}\alpha$ . In this case, the DMRO would lock to  $(M - 2A^{-1}\alpha)f_{\rm ref}$ , which would increase the potential for fractional spurs.

The  $2\alpha$  subtraction slightly increases the PLL's digital complexity relative to a comparable PLL based on the original  $\Delta\Sigma$  FDC. For instance, in the PLL implementation described in Section V, the cycle counter and phase decoder's output,  $p_R[n]$ , has 10 fractional bits,  $\alpha$  has 20 fractional bits, and A = 1, so the  $2\alpha$  subtraction nearly doubles the number of fractional bits required to represent the RPC accumulator's input. Nonetheless, the number of fractional bits in the DLF input is determined by  $\alpha$  regardless of which  $\Delta\Sigma$  FDC is used, so the proposed  $\Delta\Sigma$  FDC's  $2\alpha$  subtraction only affects the RPC's accumulator. Hence, it represents only a minor increase in the PLLs overall digital complexity. Moreover, this increase in complexity is offset by the proposed  $\Delta\Sigma$  FDC's features described below.

It follows from Fig. 18(b) that for the proposed  $\Delta\Sigma$  FDC the discrete-time transfer function from the input to the second accumulator output has a pole at DC, which suggests that

the system is unstable. Although the  $2\alpha$  term injected within the RPC causes the DC component at the output of the second accumulator to be zero, noise present at this node can cause the magnitude of the accumulator output to grow without bound. However, the second accumulation shown in Fig. 18(b) is performed by the DMRO, so this is not an issue in practice because the DMRO behaves as an accumulator with infinite output range [10]. Specifically, provided the cycle counter within the RPC does not roll-over more than once per reference period, which can be ensured by design, then the  $1 - z^{-1}$  block within the RPC can unwrap the sampled DMRO phase and retrieve the information encoded in it, thereby allowing the magnitude of the second accumulator's output in Fig. 18(b) to be arbitrarily large.

While the DC pole issue is not a problem in the modified  $\Delta\Sigma$  FDC as explained above, it would present practical issues if corresponding modifications were applied to the charge pump (CP)-based  $\Delta\Sigma$  FDC described in [8], [9], and [12]. In CP-based  $\Delta\Sigma$  FDCs, the CP performs integration in place of the DMRO, yet charge pumps do not offer the convenient rollover feature inherent to DMROs.

### B. Proposed $\Delta\Sigma$ FDC Features

Relaxed Timing Constraints: Fig. 19 shows example timing diagrams for the original and proposed  $\Delta\Sigma$  FDCs, where the time sequences  $t_n$  and  $\tau_n$ , for n = 0, 1, 2, ..., are the times of the *n*th rising edges of  $v_{ref}(t)$  and  $v_{div}(t)$ , respectively, and the *n*th divider modulus is the number of PLL output periods between  $\tau_{n-1}$  and  $\tau_n$ . In this example, the DMRO phase is sampled at times  $\gamma_n = t_n + T_{ref}/2$ , where  $T_{ref} = 1/f_{ref}$  is the reference period, and the *n*th divider modulus can be loaded at time  $t_n$  at the latest. In the original  $\Delta\Sigma$  FDC, the *n*th divider modulus is given by N - (2y[n-1]-y[n-2]), but as illustrated in Fig. 19(a), y[n-1] cannot be computed before the DMRO phase is sampled at  $\gamma_{n-1} > \tau_{n-1}$ . It follows that the *n*th divider modulus can only be loaded once y[n-1] is ready *around the middle or near the end* of the count, which increases the divider's complexity. For example, the divider in [10] required significant additional logic to meet this requirement compared to the original version of the divider presented in [18]. Furthermore, as the divider modulus must be updated before  $t_n$ , the amount of time available for the  $\Delta\Sigma$  FDC to compute y[n-1] is limited to  $T_{ref}/2$ .

As illustrated in Fig. 19(b), the proposed  $\Delta\Sigma$  FDC has much more relaxed timing constraints. In this case, the *n*th divider modulus is given by N-y[n-2]. By the time of the (*n* – 1)th rising edge of  $v_{div}(t)$ , the  $\Delta\Sigma$  FDC has already had a duration of more than  $T_{ref}/2$  to compute y[n-2], so the next count can start with a known divider modulus. Alternatively, the computation of y[n-2] can take up to  $T_{ref}$ , and the divider modulus can be updated *near the beginning* of the current count. In either case, compared to the original  $\Delta\Sigma$  FDC, the proposed  $\Delta\Sigma$  FDC allows for simpler divider topologies to be used and imposes looser digital timing constraints on the  $\Delta\Sigma$  FDC.

Reduced PFD Output Span: As shown in [11], for the original  $\Delta\Sigma$  FDC,  $e_{PLL}[n]$  in (67) is given by

$$e_{\text{PLL}}[n] = \psi_{\text{PLL}}[n] - (N + \alpha)\psi_{\text{ref}}[n] - A(\psi_{\text{DMRO}}[n] - \psi_{\text{DMRO}}[n-1]),$$
(70)

the  $e_q[n]$  sequence is bounded by

$$-1 < e_q[n] \le 0,\tag{71}$$

and the width of u(t) is given by

$$\tau_n - t_n = T_{\bar{u}} + \left(-y[n-1] - \psi_{\text{PLL}}[n] + \left(N + \alpha\right)\psi_{\text{ref}}[n] - A\psi_{\text{DMRO}}[n-1] - e_q[n-1] + e_q[n-2] - \alpha\right)T_{\text{PLL}},$$
(72)

where  $\psi_{PLL}[n]$ ,  $\psi_{ref}[n]$  and  $\psi_{DMRO}[n]$  are the phase noise changes per reference period of  $v_{PLL}(t)$ ,  $v_{ref}(t)$  and the DMRO, respectively, and

$$T_{\bar{u}} = \frac{M - T_{\text{ref}} f_{\text{low}}}{f_{\text{high}} - f_{\text{low}}}$$
(73)

is the average width of the u(t) pulse.

Suppose  $b_{PLL}$  and  $b_{DMRO}$  are the maximum magnitudes of  $e_{PLL}[n]$  and  $\psi_{DMRO}[n]$ , respectively, so

$$|e_{\text{PLL}}[n]| < b_{\text{PLL}} \text{ and } |\psi_{\text{DMRO}}[n]| < b_{\text{DMRO}}$$
(74)

for all *n*. Then, it follows from (67), (70)-(72) and (74) that the maximum span of u(t),  $\Delta T_u$ , which is defined as

$$\Delta T_u = 2 \max_n \left| \tau_n - t_n - T_{\bar{u}} \right|,\tag{75}$$

satisfies

$$\Delta T_u < 2 \left( 3 + 2b_{\text{PLL}} + Ab_{\text{DMRO}} \right) T_{\text{PLL}}.$$
(76)

An analysis similar to that presented in [11] for the proposed  $\Delta\Sigma$  FDC yields (70), (71), and the following expression for the width of the *u*(*t*) pulse during the *n*th reference period:

$$\tau_{n} - t_{n} = T_{\bar{u}} + \left( y[n-1] - \psi_{\text{PLL}}[n] + (N+\alpha) \psi_{\text{ref}}[n] - A\psi_{\text{DMRO}}[n-1] - e_{q}[n-1] + e_{q}[n-2] + \alpha \right) T_{\text{PLL}},$$
(77)

where  $T_{\bar{u}}$  is also given by (73). Hence, (67), (70), (71), (74), (75) and (77) imply that, for the proposed  $\Delta\Sigma$  FDC,  $\Delta T_u$  satisfies

$$\Delta T_u < 2 \left( 1 + 2b_{\text{PLL}} + Ab_{\text{DMRO}} \right) T_{\text{PLL}}.$$
(78)

In practice,  $b_{PLL}$ ,  $b_{DMRO} \ll 1$ , so (76) and (78) imply that  $\Delta T_u$  for the proposed  $\Delta \Sigma$  FDC is approximately a third of that of the original  $\Delta \Sigma$  FDC.

A smaller  $\Delta T_u$  allows for a larger minimum difference between the phases of  $v_{ref}(t)$  and  $v_{div}(t)$ , so it is beneficial as it mitigates spurs generated as a consequence of variations in the PFD supply voltage when  $v_{ref}(t)$  and  $v_{div}(t)$  are close in phase [19]. Additionally, reducing  $\Delta T_u$  mitigates spurs from non-ideal DMRO behavior by increasing the time available for the DMRO's frequency transients to die out each reference period [10].

*Higher-Frequency Reference Signal:* The relaxed timing constraints and smaller  $\Delta T_u$  of the proposed  $\Delta\Sigma$  FDC allows for the use of higher-frequency reference signals, which lowers the contribution to the PLL's phase noise from all noise sources within the  $\Delta\Sigma$  FDC. As in conventional fractional-*N* PLLs, the contribution of the reference signal to the PLL output phase noise PSD,  $S_{PLL}(f)$ , is proportional to  $(N + \alpha)^2$  [8], [11]. Equations (67), (70) and Fig. 1 imply that the  $\Delta\Sigma$  FDC quantization error and the DMRO phase noise appear first-order shaped at the DLF input, so their contribution to  $S_{PLL}(f)$  is proportional to  $\sin^2(\pi T_{ref}f)$ . Additionally, the PSD of the quantization error is proportional to  $T_{ref}$  [11]. Therefore, increasing  $f_{ref}$  by a factor of xfor a given  $f_{PLL}$  with all other things being the same reduces the contributions to the PLL's phase noise from the reference signal,  $\Delta\Sigma$  FDC quantization error, and DMRO by  $20\log(x)$ ,  $30\log(x)$ and  $20\log(x)$ , respectively.

### IV. DIGITAL GAIN CALIBRATION TECHNIQUE

# A. Effects of $\Delta\Sigma$ FDC Forward Path Gain Error

As explained in Section III, the behavior of the system shown in Fig. 17(b) is identical to that of a second-order  $\Delta\Sigma$  modulator provided (68) holds and  $2N_R/A$  is integer-valued. However, in practice

$$A\frac{f_{\text{high}} - f_{\text{low}}}{f_{\text{PLL}}} = \delta^{-1},\tag{79}$$

where the deviation of the factor  $\delta$  from its ideal value of 1 is the  $\Delta\Sigma$  FDC's forward path gain error. This error degrades the system's self-dithering property [16], [17], and, as shown below, it reduces the extent to which QNC cancels the error introduced by the  $\Delta\Sigma$  FDC's coarse quantization operation.

The analysis presented in [11] can be modified with (79) instead of (68) for the proposed  $\Delta\Sigma$  FDC, which yields the behavioral model of the  $\Delta\Sigma$  FDC shown in Fig. 20(a). The model is similar to that shown in Fig. 18(b), except that  $e_{PLL}[n]$  is given by (70) with  $\delta A$  instead of A, and the gain of the second accumulator is  $(\delta A)^{-1}$  instead of  $A^{-1}$ . An analysis similar to that in [11] can also be performed to obtain a linearized model of the  $\Delta\Sigma$  FDC PLL shown in Fig. 16 with the proposed  $\Delta\Sigma$  FDC and (79) instead of (68). The resulting model is shown in Fig. 20(b), where  $\theta_{ref}(t)$ ,  $\theta_{DMRO}(t)$ ,  $\theta_{DCO}(t)$  and  $\theta_{PLL}(t)$  are the phase error waveforms of the reference signal, DMRO, DCO and PLL output, respectively,  ${}^{9}L(z)$  is the DLF's transfer function,  $K_{DCO}$  is the

<sup>&</sup>lt;sup>9</sup> Reasoning similar to that presented in [8] can be applied to the linearized model shown in Fig. 20(b) to obtain expressions for the PLL output's phase noise components that depend on  $\theta_{ref}(t)$ ,  $\theta_{DMRO}(t)$ ,  $\theta_{DCO}(t)$ ,  $e_{qr}[n]$  and  $\hat{e}_{q}[n]$ .

DCO gain (i.e., the amount in Hz by which the DCO frequency changes when the DCO input changes by unity) and

$$H(z) = 1 - \left(1 - \delta^{-1}\right) z^{-2}.$$
(80)

It follows from Fig. 5(b) that the discrete-time transfer functions from  $e_{qr}[n]$  and  $\hat{e}_q[n]$  to the input of the DLF, p[n], are given by

$$A\frac{\left(1-z^{-1}\right)}{H(z)}\frac{1}{1+T(z)} \text{ and } \left(1-\delta^{-1}\right)z^{-2}\frac{\left(1-z^{-1}\right)}{H(z)}\frac{1}{1+T(z)},$$
(81)

respectively, where

$$T(z) = \delta^{-1} K_{\text{DCO}} T_{\text{ref}} \frac{z^{-2} L(z)}{(1 - z^{-1}) H(z)}$$
(82)

is the discrete-time loop gain of the PLL. The right-most expression in (81) implies that if  $\delta = 1$ , then p[n] does not depend on  $\hat{e}_q[n]$ , but if  $\delta \neq 1$ , then  $\hat{e}_q[n]$  leaks into the DLF input. As the power of  $\hat{e}_q[n]$  is much larger than that of  $e_{qr}[n]$  in practice, this can be problematic, particularly for high PLL bandwidths. For instance, in the DMRO-based PLL presented in [10], A = 1 and  $N_R = 13$ , so  $\Delta_r = 1/26$  and the power of  $\hat{e}_q[n]$  is approximately 28 dB larger than that of  $e_{qr}[n]$  (recall that  $\Delta_c = 1$ ). In this case, (81) with A = 1 implies that a  $\Delta\Sigma$  FDC forward path gain error corresponding to  $\delta^{-1} = 1 \pm 0.08$  would introduce an additional error component that depends on  $\hat{e}_q[n]$  with approximately double the power of the component that depends on  $e_{qr}[n]$ . This would significantly increase the PLL output phase noise PSD at offset frequencies where the  $\Delta\Sigma$  FDC quantization error contribution dominates those of the other noise sources.

### B. Proposed Digital Gain Calibration Technique

The proposed digital gain calibration technique is a modification of the  $\Delta\Sigma$  FDC's RPC, the details of which are shown in Fig. 6, where sgn(*x*) = 1 if  $x \ge 0$  and -1 otherwise. To minimize clutter, Fig. 21 only shows a portion of the RPC. The modifications that implement the gain calibration technique are contained entirely within the dashed contour shown in the figure, and except for these modifications the  $\Delta\Sigma$  FDC is identical to that shown in Fig. 17(b).

The gain calibration technique consists of a signed least-mean square (LMS)-like loop with gain *K* and output  $g_n$ , which digitally compensates for forward path gain error caused by  $\delta \neq 1$ . It is based on the following two results that can be derived from an analysis similar to that presented in [11]. The first result is that  $d_R[n]$  in Fig. 17(b) can be multiplied by a constant factor  $g_n$  to compensate for non-ideal DMRO frequencies. In the presence of this factor, the transfer function from  $\hat{e}_q[n]$  to p[n] is given by

$$\left(1 - g_n \delta^{-1}\right) z^{-2} \frac{\left(1 - z^{-1}\right)}{H_g(z)} \frac{1}{1 + T_g(z)},\tag{83}$$

where  $H_g(z)$  is given by (80) with  $\delta^{-1}$  replaced by  $g_n \delta^{-1}$  and  $T_g(z)$  is given by (82) with  $H_g(z)$ and  $g_n \delta^{-1}$  instead of H(z) and  $\delta^{-1}$ , respectively. It follows from (83) that  $g_n = \delta$  makes the contribution to p[n] from  $\hat{e}_q[n]$  equal to zero. The second result is that  $g_n(d_R[n] - d_R[n-1])$  equals  $-v[n-1] - \alpha$  plus zero-mean error when  $g_n$  is equal to its ideal value of  $\delta$ , i.e.,  $\delta(d_R[n] - d_R[n-1]) = -v[n-1] - \alpha$  plus zero-mean error.

These observations suggest that, provided it is stable, the gain calibration feedback loop ramps  $g_n$  up or down until it reaches the point where the input to the accumulator with gain K is zero-mean noise. Fig. 21 implies that this happens when  $g_n(d_R[n] - d_R[n-1]) + v[n-1] + \alpha$  is

uncorrelated with  $v[n-1] + \alpha$ . Therefore, to the extent that the error component in  $\delta(d_R[n] - d_R[n - 1])$  is uncorrelated with  $v[n-1] + \alpha$ , the system converges to the ideal value of  $g_n = \delta$ .

In addition to preventing  $\hat{e}_q[n]$  from leaking into the PLL loop, the proposed calibration technique also allows for the use of DMRO topologies with coarse frequency tuning or no tuning at all. This not only simplifies the design and implementation of the DMRO, but also simplifies the system as it renders feedback loops that tune  $f_{high}$  and  $f_{low}$  as a function of  $f_{PLL}$ unnecessary.

The proposed calibration technique somewhat increases the digital complexity of the  $\Delta\Sigma$  FDC, but typically does not add significantly to the PLL's overall power or area consumption. For example, in the PLL implementation described in Section V, both  $d_R[n]$  and  $g_n$  have 10 fractional bits, so 20 fractional bits are required to represent  $g_n d_R[n]$ . Given that  $\alpha$  also has 20 fractional bits, the gain calibration technique negligibly increases the number of fractional bits required to represent the RPC accumulator's input. Therefore, as the calibration technique's digital LMS loop is relatively simple, the  $f_{ref}$ -rate digital multiplier prior to the RPC's accumulator represents most of the calibration technique's added complexity.

# C. Convergence Analysis

Fig. 22 shows the block diagram of Fig. 5(a) modified to include the gain calibration technique, where  $\varepsilon_n$  is the error in  $g_n$  at sample time n, which is defined as

$$\varepsilon_n = \delta^{-1} g_n - 1. \tag{84}$$

For any fixed value of  $g_n$  and neglecting  $e_{qr}[n]$ , Fig. 22 implies that a[n] is equal to  $(1 + \varepsilon_n)e[n]$ , because the two  $1 - z^{-1}$  blocks cancel the two accumulators in the path between e[n] and a[n]. The gain calibration loop adds  $v[n-1] + \alpha$ , which is an estimate of -e[n], to a[n], and

multiplies the result by the sign of  $v[n-1] + \alpha$  to obtain a measure of  $\varepsilon_n$ , b[n], which is approximately equal to  $-\varepsilon_n |e[n]|$ .

More precisely, Fig. 22 and (84) imply that e[n] is given by

$$e[n] = -v[n-1] - \alpha - e_{\text{PLL}}[n], \qquad (85)$$

and that a[n] can be written as

$$a[n] = (1 + \varepsilon_n)e[n] + (\varepsilon_n - \varepsilon_{n-1})\sum_{i=0}^{n-1}e[i] + a_e[n],$$
(86)

where  $a_e[n]$  is the contribution of  $e_{qr}[n]$  to a[n]. Substituting (85) into (86), adding  $v[n-1] + \alpha$  to the result, and then multiplying the resulting expression by  $sgn(v[n-1] + \alpha)$  yields

$$b[n] = -\varepsilon_n \left| v[n-1] + \alpha \right| + b_e[n], \tag{87}$$

where  $b_e[n]$  is error that arises from the error in the estimate of e[n], the contribution of  $e_{qr}[n]$ , and  $g_n$  not being constant.

Fig. 22 together with (84) and (87) further imply that

$$\varepsilon_{n+1} = \left(1 - \delta^{-1} K \left| v[n-1] + \alpha \right| \right) \varepsilon_n + \delta^{-1} K b_e[n], \tag{88}$$

from which it follows that

$$\bar{\varepsilon}_{n+1} = \left(1 - \delta^{-1} K \left| v[n-1] + \alpha \right| \right) \bar{\varepsilon}_n + \delta^{-1} K \bar{b}_e[n], \tag{89}$$

where  $\overline{\varepsilon}_n$  and  $\overline{b}_e[n]$  are the expected values of  $\varepsilon_n$  and  $b_e[n]$ , respectively, conditioned to the sequence v[n-1].

When  $\delta \neq 1$ , the self-dithering property of the  $\Delta \Sigma$  FDC is not perfect, so  $e_{qr}[n]$  can be correlated with sgn $(v[n-1] + \alpha)$ . Furthermore, it follows from Fig. 16 that

$$p[n] = -e_q[n] + e_q[n-1] + \sum_{i=0}^n y[i] + \alpha$$
(90)

so Fig. 20(b) and v[n-1] = y[n-2] imply that the term  $\psi_{PLL}[n] = \theta_{PLL}(\tau_n) - \theta_{PLL}(\tau_{n-1})$  in  $e_{PLL}[n]$ , which depends on a low-pass filtered version of p[n], can also be correlated with  $sgn(v[n-1] + \alpha)$ . As  $b_e[n]$  depends on both  $e_{qr}[n]$  and  $e_{PLL}[n]$ , it follows from these observations that  $\overline{D}_e[n]$  in (89) is not zero, so  $b_e[n]$  biases the LMS loop and causes  $g_n$  to converge to a value that is slightly different than  $\delta$ . However, numerous simulations run by the authors suggest that the magnitude of this bias is sufficiently small that  $\overline{D}_e[n]$  can be neglected in the remainder of the analysis. Hence, (89) reduces to

$$\bar{\varepsilon}_{n+1} = \left(1 - \delta^{-1} K \left| v[n-1] + \alpha \right| \right) \bar{\varepsilon}_n.$$
(91)

The recursive application of (91) to itself yields

$$\bar{\varepsilon}_{n+1} = \prod_{i=0}^{n} \left( 1 - \delta^{-1} K \left| \nu[i-1] + \alpha \right| \right) \bar{\varepsilon}_{0}, \tag{92}$$

which implies that, on average,  $\varepsilon_{n+1}$  tends to zero provided K is chosen such that

$$\lim_{n \to \infty} \prod_{i=0}^{n} \left( 1 - \delta^{-1} K \left| v[i-1] + \alpha \right| \right) = 0.$$
(93)

As  $|v[n-1] + \alpha|$  is bounded and is regularly non-zero, (93) is easy to satisfy in practice.

D. Gain Calibration Technique for CP-Based  $\Delta\Sigma$  FDCs

The digital gain calibration technique shown in Fig. 21 can be modified to apply to the CP-based  $\Delta\Sigma$  FDC shown in Fig. 23(a) [8], [9], [12]. The modified version of the digital gain calibration technique is shown in Fig. 23(b). Its implementation details are almost identical to those in Fig. 21, except for an extra  $1 - z^{-1}$  block. In the CP-based  $\Delta\Sigma$  FDC, the CP and

subsequent analog-to-digital converter (ADC) play the same role as the DMRO in the DMRObased  $\Delta\Sigma$  FDC. The DMRO-based  $\Delta\Sigma$  FDC already has a  $1 - z^{-1}$  block following the DMRO, which is needed as part of the circuitry that makes it possible to read out the DMRO's phase error [10], but this block is not necessary in the CP-based  $\Delta\Sigma$  FDC. The additional  $1 - z^{-1}$  block in Fig. 23(b) compensates for the absence of a  $1 - z^{-1}$  block at the output of the ADC in the CPbased  $\Delta\Sigma$  FDC architecture.

### V. SIMULATION RESULTS

This section presents results from bit-exact, event-driven, behavioral C code simulations of the fractional-*N* PLL of Fig. 16 with the proposed  $\Delta\Sigma$  FDC and digital gain calibration technique. All digital operations were simulated with fixed-point arithmetic. The PLL's DLF consists of a loop gain multiplier with gain  $K_M$ , two single-pole IIR stages with poles at  $\lambda_0$  and  $\lambda_1$ , and a proportional-integral stage with proportional path gain  $K_P$  and integral path gain  $K_I$ . Its transfer function is given by

$$L(z) = K_M \left( K_P + \frac{K_I}{1 - z^{-1}} \right) \prod_{i=0}^{1} \frac{\lambda_i}{1 - (1 - \lambda_i) z^{-1}}.$$
(94)

The parameters used for the simulations are listed in Table 1.

Fig. 24 shows the simulated PLL phase noise PSD with and without gain calibration enabled for four different combinations of PLL bandwidth and  $\Delta\Sigma$  FDC forward path gain. For each case it also shows the theoretical PLL phase noise PSD with  $g_n = \delta$  as a black dashed curve, and the theoretical combined contribution to the PLL phase noise from  $e_{qr}[n]$  and  $\hat{e}_q[n]$  as colored dashed curves.<sup>10</sup> As demonstrated in the figure, the higher the PLL bandwidth, the more sensitive the PLL's phase noise is to the  $\Delta\Sigma$  FDC quantization error, which becomes more dominant when less filtering is applied to p[n]. Consequently, deviations of  $\delta^{-1}$  from its ideal value of 1 cause a significant degradation of the PLL phase noise PSD for high PLL bandwidth settings.

Given that  $N_R = 31$ , the power of  $\hat{e}_q[n]$  is approximately 36 dB larger than that of  $e_{qr}[n]$ . Therefore, in the absence of gain correction, it follows from (81) with A = 1 that the power of the additional quantization error component seen by the PLL loop is approximately 22 dB and 16 dB higher than that of the  $e_{qr}[n]$  component for  $\delta^{-1} = 0.8$  and  $\delta^{-1} = 1.1$ , respectively. This is supported by the simulation results shown in Fig. 24(a) and Fig. 24(c), where the spot phase noise degradation at a 3 MHz offset frequency is approximately 20 dB and 15 dB, respectively.

Fig. 25 shows  $\varepsilon_n$  versus time for several values of *K* with a PLL bandwidth of 1 MHz and  $\delta = 0.8$ .<sup>11</sup> Equation (92) with  $\varepsilon_0$  in place of  $\overline{\varepsilon_0}$ , i.e.,

$$\prod_{i=0}^{n} \left( 1 - \delta^{-1} K \left| v[i-1] + \alpha \right| \right) \varepsilon_{0}, \tag{95}$$

is also plotted (as a dashed curve) for each value of *K* to provide a comparison baseline and show that the evolution of  $g_n$  follows the trend predicted by the analysis in Section IV-C. The simulated PLL phase noise PSD after  $g_n$  converged was nearly identical to that shown in Fig. 24(a) with gain calibration enabled. In the  $K = 2^{-2}$  case, however, the total integrated jitter increased slightly to 688 fs<sub>rms</sub> because of the relatively large variance of  $\varepsilon_n$ . As shown in the

<sup>&</sup>lt;sup>10</sup> The curves corresponding  $e_{qr}[n]$  and  $\hat{e}_q[n]$  were generated under the assumption that both sequences are white. However, the sequences are not necessarily white if  $\delta \neq 1$ , which is why the simulated curves in Fig. 24 deviate somewhat from their respective theoretical predictions, particularly in the  $\delta^{-1} = 0.8$  case.

<sup>&</sup>lt;sup>11</sup> The resolution of the gain calibration technique's accumulator was limited to 1 integer bit and 36 fractional bits, but its output,  $g_n$ , was truncated to have only 10 fractional bits to reduce hardware complexity.

figure's inset,  $\varepsilon_n$  crosses the -0.01 mark for the first time in about 50 reference periods (1.9  $\mu$ s) and 400 reference periods (15.4  $\mu$ s) for  $K = 2^{-2}$  and  $K = 2^{-6}$ , respectively, and the absolute value of the mean error after convergence is lower than 0.043% of  $\delta$  in all cases.

As in most LMS-like algorithms, the choice of K represents a tradeoff between convergence speed and the error variance of  $g_n$  [20], [21]. Although it might be possible to derive a closed-form expression that quantifies this tradeoff for the proposed calibration technique, the authors currently choose the value of K based on simulation results.

Fig. 26 shows normalized histograms of  $\tau_n - t_n - T_{\bar{u}}$ , where  $T_{\bar{u}}$  was calculated using (73), for both the original  $\Delta\Sigma$  FDC and the proposed  $\Delta\Sigma$  FDC. The u(t) pulse widths were measured for over one million reference periods after the gain calibration had converged with  $K=2^{-9}$ . The 1 MHz-bandwidth set of parameters with  $\delta = 0.8$  was used for the simulations in Fig. 26(a) and Fig. 26(b), whereas the same parameters except for  $\alpha$ , which was set to 0.401008987426758, were used for the simulations in Fig. 26(c) and Fig. 26(d).

Although (76) and (78) do not show an explicit dependence of the u(t) pulse-width span on  $\alpha$  in either  $\Delta\Sigma$  FDC version, it follows from (67) and Fig. 17 that different  $\Delta\Sigma$  FDC output levels are exercised for different values of  $\alpha$ . Accordingly, different values of  $\alpha$  cause  $e_q[n]$  to take different values with higher probability than others, which affects the histogram shapes shown in Fig. 26 but not the maximum u(t) pulse-width.

As explained in Section III-B, the span of the u(t) pulse-width in the proposed  $\Delta\Sigma$  FDC is approximately three times smaller than that of the u(t) pulse-width in the original  $\Delta\Sigma$  FDC, which is supported by the simulation results shown in Fig. 26. As shown in Fig. 26(a), the span of u(t) in the original  $\Delta\Sigma$  FDC goes from  $-3T_{PLL}$  to  $3T_{PLL}$ , although it reaches the extremes

values only rarely. As demonstrated by the results in Fig. 26(c), the span is about  $5T_{PLL}$  for a larger value of  $\alpha$ , and the extreme values in this case (i.e., around  $-2.4T_{PLL}$  and  $2.6T_{PLL}$ ) are reached with higher probability compared to Fig. 26(a). Conversely, as shown in Fig. 26(b) and Fig. 26(d), the proposed architecture's u(t) pulse-width span does not vary significantly with  $\alpha$ , and as suggested by (78), it is approximately limited to  $2T_{PLL}$ .

### VI. CONCLUSION

This paper presents all-digital enhancements of digital fractional-*N* PLLs based on  $\Delta\Sigma$  FDCs. The enhancements comprise an improved DMRO-based  $\Delta\Sigma$  FDC architecture and a digital background gain calibration technique. The former reduces the span of the PFD output pulse-width and significantly relaxes the timing constraints imposed on the  $\Delta\Sigma$  FDC's digital portion and divider, which makes the system amenable to simpler divider topologies and higher-frequency reference signals. The latter compensates for non-ideal DMRO frequencies in the digital domain, thereby facilitating the use of simple DMRO topologies with fixed values of  $f_{\text{high}}$  and  $f_{\text{low}}$ , and improving the phase noise performance of high-bandwidth PLLs.

### ACKNOWLEDGEMENTS

This chapter, in full, has been published in the IEEE Transactions on Circuits and Systems I: Regular Papers. E. Alvarez-Fontecilla, A. I. Eissa, E. Helal, C. Weltin-Wu, I. Galton, 2021. The dissertation author is the primary investigator and author of this paper. Professor Ian Galton supervised the research which forms the basis for this paper.

# FIGURES



Figure 16: High-level block diagram of a second-order  $\Delta\Sigma$  FDC-based digital fractional-*N* PLL with quantization noise cancellation (QNC).



Figure 17: (a) Simplified block diagram of the DMRO-based  $\Delta\Sigma$  FDC described in [11], and (b) simplified block diagram of the proposed  $\Delta\Sigma$  FDC.



Figure 18: Signal processing equivalents of the  $\Delta\Sigma$  FDCs shown in Fig. 17(a) and Fig. 17(b) when they are locked.

(b)

 $\overline{z}^{-1}$ 



Figure 19: Example timing diagram of (a) the original  $\Delta\Sigma$  FDC and (b) the proposed  $\Delta\Sigma$  FDC.



Figure 20: (a) Behavioral model of the proposed  $\Delta\Sigma$  FDC with (79), where  $Q_r$  and  $Q_c$  are replaced by the additive error sources  $e_{qr}[n]$  and  $\hat{e}_q[n]$ , respectively, and (b) linearized model of the  $\Delta\Sigma$  FDC-based PLL shown in Fig. 16 with the proposed  $\Delta\Sigma$  FDC and (79).



Figure 21: Digital gain calibration technique shown in the context of the proposed DMRObased  $\Delta\Sigma$  FDC architecture.



Figure 22: Rearranged version of the behavioral model shown in Fig. 20(a) modified to accommodate the proposed gain calibration technique.



Figure 23: (a) Simplified block diagram of CP-based  $\Delta\Sigma$  FDC, and (b) digital gain calibration technique shown in the context of the CP-based  $\Delta\Sigma$  FDC architecture.



Figure 24: Simulated PLL phase noise PSD with and without gain calibration enabled for four combinations of PLL bandwidth (BW) and  $\delta^{-1}$ . The black and colored dashed curves correspond to the theoretical phase noise PSD for  $g_n = \delta$  and the combined contribution to the PLL phase noise from  $e_{qr}[n]$  and  $\hat{e}_q[n]$ , respectively.



Figure 25: Sequence  $\varepsilon_n$  as a function of *n* for several values of *K*, where  $\varepsilon_n$  calculated via (95) is plotted as a dashed curve for each case.



Figure 26: Normalized histogram of u(t) pulse width minus  $T_{\bar{u}}$  calculated via (73) for the original FDC [(a) and (c)] and the proposed FDC [(b) and (d)], where  $\alpha = 0.001008987426758$  for (a) and (b) and  $\alpha = 0.401008987426758$  for (c) and (d).

# TABLES

| Design Parameters |                                                   | Value                                    |                  |
|-------------------|---------------------------------------------------|------------------------------------------|------------------|
| Reference         | Frequency, $f_{\rm ref}$                          | 26 MHz                                   |                  |
| Source            | Phase noise (white)                               | -155 dBc/Hz                              |                  |
| DCO               | DCO gain, $K_{\rm DCO}$                           | 40 kHz                                   |                  |
|                   | Phase noise <sup>(1)</sup>                        | -130, -125 and -150 dBc/Hz               |                  |
| DMRO              | Number of stages, $N_R$                           | 31                                       |                  |
|                   |                                                   | 150 MHz, 3.0 GHz (nominal)               |                  |
|                   | Frequencies, $f_{\text{low}}$ , $f_{\text{high}}$ | 225 MHz, 2.5 GHz ( $\delta^{-1} = 0.8$ ) |                  |
|                   |                                                   | 142 MHz, 3.3 GHz ( $\delta^{-1} = 1.1$ ) |                  |
|                   | Phase noise <sup>(1)</sup>                        | -99, -109 and -150 dBc/Hz                |                  |
| FDC               | Fixed count, M                                    | 40                                       |                  |
|                   | Α                                                 | 1                                        |                  |
| DLF               | Loop gain multiplier, $K_M$                       | 1.5                                      |                  |
|                   | Proportional gain, $K_P$                          | $2^{6}$                                  | $2^{5}$          |
|                   | Integral gain, $K_I$                              | $2^{-2}$                                 | $2^{-4}$         |
|                   | IIR poles, $\lambda_0$ , $\lambda_1$              | $2^{-1}, 2^0$                            | $2^{-2}, 2^{-1}$ |
| PLL<br>Settings   | Integer multiplier, N                             | 110                                      |                  |
|                   | Fractional multiplier, $\alpha$                   | 0.001008987426758                        |                  |
|                   | Output frequency, <i>f</i> <sub>PLL</sub>         | 2.86 GHz                                 |                  |
|                   | Loop bandwidth                                    | 1 MHz                                    | 500 kHz          |

Table 1: Parameters used for the simulation.

<sup>1</sup>  $1/f^3$ ,  $1/f^2$  and white phase noise components at 1 MHz offset.

# REFERENCES

- 1. R. D. Beards and M. A. Copeland, "An Oversampling Delta-Sigma Frequency Discriminator," *IEEE Trans. Circuits Syst. II. Analog Digit. Signal Process.*, vol. 41, no. 1, pp. 26–32, Jan. 1994.
- 2. I. Galton and G. Zimmerman, "Combined RF Phase Extraction and Digitization," *IEEE Int. Symp. Circuits Syst. (ISCAS)*, vol. 2, pp. 1104–1107, 1993.
- 3. I. Galton, "Higher-Order Delta-Sigma Frequency-to-Digital Conversion," *Proc. IEEE Int. Symp. Circuits Syst. (ISCAS)*, vol. 5, pp. 441–444, 1994.
- 4. I. Galton, "Analog-Input Digital Phase-Locked Loops for Precise Frequency and Phase Demodulation," *IEEE Trans. Circuits Syst. II. Analog Digit. Signal Process.*, vol. 42, no. 10, pp. 621–630, Oct. 1995.
- 5. M. Hovin, A. Olsen, T. S. Lande and C. Toumazou, "Delta-Sigma Modulators Using Frequency-Modulated Intermediate Values," *IEEE J. Solid-State Circuits*, vol. 32, no. 1, pp. 13–22, Jan. 1997.
- 6. W. T. Bax and M. A. Copeland, "A GMSK Modulator Using a  $\Delta\Sigma$  Frequency Discriminator-Based Synthesizer," *IEEE J. Solid-State Circuits*, vol. 36, no. 8, pp. 1218–1227, Aug. 2001.
- J. Fang, N. Filiol, T. Riley and M. Copeland, "A Second Order ΔΣ Frequency Discriminator with Fractional-N Divider and Multi-Bit Quantizer," *IEEE Int. Symp. Circuits Syst. (ISCAS)*, pp. 1689–1692, 2007.
- 8. C. Venerus and I. Galton, "Delta-Sigma FDC Based Fractional-*N* PLLs," *IEEE Trans. Circuits Syst. I. Reg. Papers*, vol. 60, no. 5, pp. 1274–1285, May 2013.
- 9. C. Venerus and I. Galton, "A TDC-Free Mostly-Digital FDC-PLL Frequency Synthesizer With a 2.8-3.5 GHz DCO," *IEEE J. Solid-State Circuits*, vol. 50, no. 2, pp. 450–463, Feb. 2015.
- C. Weltin-Wu, G. Zhao, and I. Galton, "A 3.5 GHz Digital Fractional-N PLL Frequency Synthesizer Based on Ring Oscillator Frequency-to-Digital Conversion," *IEEE J. Solid-State Circuits*, vol. 50, no. 12, pp. 2988–3002, Dec. 2015.
- 11. C. Weltin-Wu, E. Familier, and I. Galton, "A Linearized Model for the Design of Fractional-*N* Digital PLLs Based on Dual-Mode Ring Oscillator FDCs," *IEEE Trans. Circuits Syst. I. Reg. Papers*, vol. 62, no. 8, pp. 2013–2023, Aug. 2015.
- 12. C. Venerus and I. Galton, "Quantization Noise Cancellation for FDC-Based Fractional-N PLLs," *IEEE Trans. Circuits Syst. II. Exp. Briefs*, vol. 62, no. 12, pp. 1119–1123,

Dec. 2015.

- 13. E. Helal, E. Alvarez-Fontecilla, A. I. Eissa, and I. Galton, "A Time Amplifier Assisted Frequency-to-Digital Converter Based Figital Fractional-*N* PLL," *IEEE J. Solid-State Circuits*.
- S. Pamarti, L. Jansson and I. Galton "A Wideband 2.4-GHz Delta-Sigma Fractional-N PLL with 1-Mb/s In-Loop Modulation," *IEEE J. Solid-State Circuits*, vol. 39, no. 1, pp. 49–62, Jan. 2004.
- 15. E. Temporiti, G. Albasini, I. Bietti, R. Castello, and M. Colombo, "A 700-kHz bandwidth  $\Sigma\Delta$  fractional synthesizer with spurs compensation and linearization techniques for WCDMA applications," *IEEE J. Solid-State Circuits*, vol. 39, no. 9, pp. 1446–1454, Sep. 2004.
- 16. R. Schreier and G. C. Temes, *Understanding Delta-Sigma Data Converters*. Hoboken, NJ, USA: Wiley, 2005.
- 17. I. Galton, "Granular quantization noise in a class of delta-sigma modulators," *IEEE Trans. Inf. Theory*, vol. 40, no. 3, pp. 848–859, May 1994.
- C. S. Vaucher, I. Ferencic, M. Locher, S. Sedvallson, U. Voegeli, and Z. Wang, "A Family of Low-Power Truly Modular Programmable Dividers in Standard 0.35-μm CMOS Technology," *IEEE J. Solid-State Circuits*, vol. 35, no. 7, pp. 1039–1045, Jul. 2000.
- 19. K. J. Wang, A. Swaminathan, and I. Galton, "Spurious Tone Suppression Techniques Applied to a Wide-Bandwidth 2.4 GHz Fractional-*N* PLL," *IEEE J. Solid-State Circuits*, vol. 43, no. 12, pp. 2787–2797, Dec. 2008.
- 20. A. H. Sayed, *Fundamentals of Adaptive Filtering*. Hoboken, NJ, USA: Wiley, 2003.
- 21. A. H. Sayed, Adaptive Filters. Hoboken, NJ, USA: Wiley, 2008.

# **CHAPTER 3**

# SPECTRAL BREATHING AND ITS MITIGATION IN DIGITAL FRACTIONAL-N PLLS

**Abstract**— Although digital phase-locked loops (PLLs) offer several advantages over their analog counterparts, they suffer from a major disadvantage that is rarely mentioned in published papers. The disadvantage, known as spectral breathing, is caused by component mismatches among the frequency control elements within a PLL's digitally-controlled oscillator (DCO). The mismatches introduce DCO frequency modulation nonlinearity which fluctuates and, therefore, causes erratic variations in the PLL's measured phase noise spectrum as the DCO's center frequency drifts. The phenomenon is called spectral breathing because the measured phase noise spectrum tends to slowly swell and contract over time as if taking breaths of air. During these breaths, the PLL's phase noise often becomes severely degraded. This paper presents an experimental demonstration of the spectral breathing phenomenon and its solution in a digital fractional-*N* PLL. The demonstrated solution is a multi-rate dynamic element matching technique and a mismatch-noise cancellation technique that together eliminate spectral breathing.

# I. INTRODUCTION

Digital fractional-*N* phase-locked loops (PLLs) offer several advantages over analog PLLs such as lower loop filter circuit area and compatibility with highly-scaled CMOS IC

technology [1]–[14]. However, unlike their analog counterparts, digital PLLs suffer from a problem called *spectral breathing* which can significantly degrade phase noise performance.

Spectral breathing is the result of nonlinear frequency modulation caused by inevitable mismatches among the frequency control elements (FCEs) in a PLL's digitally-controlled oscillator (DCO) [15]. Flicker noise inevitably causes the DCO's center frequency to drift over time, so different FCEs are exercised as the PLL adjusts the DCO's input sequence to compensate for the drift. This causes the DCO's frequency modulation nonlinearity and, therefore, the PLL's measured phase noise spectrum to swell up from time to time as if taking breaths of air.

The effect is particularly significant in the large percentage of digital PLLs that use a digital delta-sigma ( $\Delta\Sigma$ ) modulator to oversample the fractional part of the DCO input. In such PLLs, the  $\Delta\Sigma$  modulator drives a bank of FCEs called the *fractional FCE bank* and the integer part of the DCO input drives a separate bank of FCEs called the *integer FCE bank*. The fractional FCE bank's FCEs are exercised many times per reference period because they are driven by the oversampling  $\Delta\Sigma$  modulator. In contrast, the integer FCE bank's FCEs are exercised at most once per reference period because they are driven directly by the integer part of the DCO's input sequence. Accordingly, mismatches within the integer FCE bank is spread over a much larger frequency range and, thus, contributes much less to the DCO's phase noise than error from FCE mismatches within the integer FCE bank.

Perhaps because the problem is not visible when measurements are restricted to time periods during which the integer FCE bank input bits do not change, it is seldom mentioned in the published literature. Nevertheless, in practice the integer FCE bank input bits do change, at least over time periods of several seconds because of the DCO's center frequency drift, so the problem is significant in applications.

To the knowledge of the authors, the phenomenon was first reported and explained, although not yet named, in [2]. As the PLL presented in [2] uses the DCO frequency control method described above which was already well-known and widely-used at the time, the authors of [2] were surprised to find that FCE mismatches caused the PLL's measured phase noise spectrum to significantly vary over time.

Apart from applying dynamic element matching (DEM) clocked at the  $\Delta\Sigma$  modulator output rate to all the FCEs, which would be prohibitively power-hungry among other issues, only three solutions have been proposed to date that mitigate spectral breathing [15]. An offline calibration technique is proposed in [16] and [17] to compensate for FCE mismatches, but the technique requires several minutes to complete. A digital integer-boundary avoider circuit prior to the DCO is proposed in [3], yet the technique is only effective in cases where the integer FCE bank input bits change relatively infrequently. To the authors' knowledge, the only published comprehensive solution is the combination of a multi-rate DEM (MR-DEM) technique and a mismatch-noise cancelation (MNC) technique [15].

This paper presents a fractional-*N* digital PLL IC that incorporates the MR-DEM and MNC techniques to avoid spectral breathing. In addition to providing the first experimental demonstration of the techniques proposed in [15], it presents implementation details, refinements, and practical observations that are not presented in [15].

83

#### II. BACKGROUND INFORMATION

# A. Conventional DCO Frequency Control Technique

Most fractional-*N* digital PLLs have the general structure shown in Fig. 27(a), which consists of a phase-error-to-digital converter (PEDC), a digital loop filter (DLF), and a DCO. The PLL's input,  $v_{ref}(t)$ , is generated by a reference oscillator not shown in the figure and the PLL's output,  $v_{PLL}(t)$ , ideally is periodic with frequency  $f_{PLL} = (N + \alpha)f_{ref}$ , where  $f_{ref}$  is the reference oscillator frequency, *N* is a positive integer, and  $\alpha$  is a fractional value. The PEDC generates an  $f_{ref}$ -rate sequence of the form  $-\theta_{PLL}[n] + e_p[n]$ , where  $\theta_{PLL}[n]$  is the PLL's phase error and  $e_p[n]$  represents the combined effect of all other errors. The PEDC output is filtered by the DLF, and the DLF output, d[n], controls the DCO frequency.

The DCO's instantaneous frequency,  $f_{DCO}(t)$ , can be written as a fixed offset frequency plus  $f_{tune}(t)$ , where  $f_{tune}(t)$  depends on the states of the DCO's FCEs. Ideally,  $f_{tune}(t) = d[n_i]$ , where  $n_t = n$  over the *n*th period of the  $f_{ref}$ -rate clock that updates d[n], so  $n_t$  is a continuous function of time that takes on values n = 0, 1, 2, 3, ... as *t* increases. If d[n] is represented as a *b*-bit two's complement code sequence and its least-significant bit (LSB) represents a frequency step of  $\Delta$ , then,  $d[n_t] = (-2^{b-1}d_{b-1}[n_t] + 2^{b-2}d_{b-2}[n_t] + ... + 2^0d_0[n_t])\Delta$ , where  $d_i[n_t]$  is the *i*th bit of the  $d[n_t]$ code for i = 0, 1, ..., b - 1. In principle, the DCO could be implemented with *b* FCEs, where the *i*th FCE increases or decreases  $f_{DCO}(t)$  by  $2^i\Delta$  whenever the FCE's input bit changes from 0 to 1 or 1 to 0, respectively. In this case, the input to the *i*th FCE would be  $d_i[n_t]$  for i = 0, 1, ..., b - 2, and  $1 - d_i[n_t]$  for i = b - 1. Unfortunately, in most PLLs this would require FCEs with impractically small frequency steps. Fig. 27(b) shows an example configuration of a DCO control technique that is widely used to circumvent this problem [1]. In this example, the minimum practical FCE frequency step is  $\Delta_{\min} = 2^8 \Delta$ , and d[n] is decomposed into an integer part,  $d_I[n]$ , that takes on values that are multiples of  $\Delta_{\min}$ , and a fractional part,  $d_F[n]$ , that takes on values in the range {0,  $\Delta$ , 2 $\Delta$ , ...,  $\Delta_{\min} - \Delta$ }. The  $d_I[n]$  sequence drives an *integer FCE bank* directly. The  $d_F[n]$  sequence is oversampled at a rate of  $f_{\text{fast}} > f_{\text{ref}}$  and re-quantized by a digital  $\Delta\Sigma$  modulator. The  $\Delta\Sigma$  modulator output is converted to a thermometer code that drives a *fractional FCE bank* which consists of four FCEs, each with a frequency step of  $\Delta_{\min}$ . Thus,  $f_{\text{tune}}(t)$  is equal to  $d[n_t]$  plus  $f_{\text{fast}}$ -rate highpass-shaped  $\Delta\Sigma$  modulator quantization error that is lowpass filtered by the DCO.

Ideally, the contribution to  $f_{DCO}(t)$  from the *i*th FCE instantaneously increases or decreases by  $\Delta_i$  when the FCE's input bit changes from 0 to 1 or 1 to 0, respectively, where  $\Delta_i$ is the FCE's nominal frequency step. In practice, non-ideal circuit behavior causes the FCE's frequency transitions to be non-instantaneous, and component mismatches cause its frequency step to deviate somewhat from  $\Delta_i$  as illustrated in Fig. 27(c). These nonidealities introduce input-code-dependent DCO frequency modulation nonlinearity. As illustrated in Fig. 27(c), this causes the PLL's phase noise spectrum to vary over time as the DCO's center frequency, and, hence,  $d_i[n]$ , drift [15].

### B. MR-DEM and MNC Techniques

Fig. 28(a) shows a high-level diagram of a digital fractional-*N* PLL that includes the MR-DEM and MNC techniques. The system is similar to that shown in Fig. 27(a), except for the addition of two digital blocks: an MR-DEM encoder that is built into the DCO and an MNC logic block.

The MR-DEM encoder is based on the same principle as mismatch-shaping DEM encoders [18]. The idea is to shuffle the error introduced by FCE mismatches, so that  $f_{tune}(t)$  equals a scaled version of  $d[n_t]$  plus additive highpass-shaped error that depends on pseudorandom digital sequences that are known to the system because they are generated within the MR-DEM encoder. This additive error has both  $f_{ref}$ -rate and  $f_{fast}$ -rate components. Most of the  $f_{ref}$ -rate error component in  $f_{tune}(t)$  is canceled by the MNC technique, which applies a leastmean-square (LMS)-like algorithm to compute digital coefficients with which it forms a correction sequence,  $e_{MNC}[p]$ , that is injected into the MR-DEM encoder. The  $f_{fast}$ -rate error component is not canceled by the MNC technique, but its high sample-rate in conjunction with its highpass spectral shape ensures that most of its contribution to the DCO's phase noise gets suppressed by the DCO's first-order lowpass frequency-modulation-to-phase-noise transfer function.

Fig. 28(b) shows the details of the MR-DEM technique for an example case where d[n] has 16 bits. It consists of an MR-DEM encoder that comprises a slow DEM encoder, a second-order digital  $\Delta\Sigma$  modulator, and a fast DEM encoder. The slow DEM encoder is a modified version of a conventional segmented DEM encoder, the details of which are described shortly, and the fast DEM encoder is a conventional non-segmented DEM encoder. The slow DEM encoder. The slow DEM encoder is clocked by the  $f_{ref}$ -rate clock signal,  $clk_{ref}(t)$ , whereas the MR-DEM encoder's fractional path, which consists of the digital  $\Delta\Sigma$  modulator and the fast DEM encoder, is clocked by the  $f_{fast}$ -rate clock signal,  $clk_{fast}(t)$ .

The signal processing details of the slow DEM encoder are shown in Fig. 29, where the  $\Delta$  multiplication prior to the  $x_f[n]$  output denotes that the LSB of  $x_f[n]$  represents a frequency

step of  $\Delta$ . The slow DEM encoder consists of 25 digital switching blocks (SBs), labeled  $S_{k,r}$  for k = 1, 2, ..., 16 and r = 1, 2, ..., 17. The shaded SBs are called segmenting SBs, whereas the remaining SBs are called non-segmenting SBs. The functional details of each SB type are also shown in Fig. 29, where  $c_{k,r}[n]$  is the input sequence of  $S_{k,r}$ . The outputs of each segmenting SB are  $\frac{1}{2}(c_{k,1}[n] - 1 - s_{k,1}[n])$  and  $1 + s_{k,1}[n]$ , where  $s_{k,1}[n]$ , called a switching sequence, is 0 when  $c_{k,1}[n]$  is odd and  $\pm 1$  otherwise. The outputs of each non-segmenting SB are  $\frac{1}{2}(c_{k,r}[n] - s_{k,r}[n])$ , where  $s_{k,r}[n]$  is 0 when  $c_{k,r}[n]$  is even and  $\pm 1$  otherwise. Each switching sequence is zero-mean and has a highpass-shaped power spectral density that peaks at  $f_{ref}/2$ .

As explained in [15], in the absence of the MNC technique the MR-DEM encoder would cause

$$f_{\text{tune}}(t) \cong d[n_t] + \Delta \sum_{k,r} \delta_{k,r} s_{k,r}[n_t] + p_{\text{fast}}(t) \Delta \sum_{k,r} \gamma_{k,r}(t) \Big( s_{k,r}[n_t-1] - s_{k,r}[n_t] \Big), \tag{96}$$

where  $\delta_{k,r}$  and  $\gamma_{k,r}(t)$  are constant and  $1/f_{\text{fast}}$ -periodic waveforms, respectively, that depend only on FCE errors, and  $p_{\text{fast}}(t)$  is a series of unit-amplitude,  $1/f_{\text{fast}}$ -width pulses that go high whenever  $n_t$  changes.<sup>12</sup> As implied by (96), the MR-DEM technique causes  $f_{\text{tune}}(t)$  to be a linear function of  $d[n_t]$  at the expense of introducing two additive error terms. One of the error terms is caused by FCE static gain errors, i.e., mismatch-induced errors in the FCEs' frequency step-sizes. The other error term is inter-symbol interference (ISI) that results from non-instantaneous rise and fall frequency transients of individual FCEs [15].

The details of the MNC logic are shown in Fig. 30(a). It consists of 25  $s_{k,r}[n]$  residue estimators that each compute a correction sequence corresponding to one of the slow DEM

<sup>&</sup>lt;sup>12</sup> The summation indices in (96) indicate the summation over all k and r values corresponding to the SBs within the slow DEM encoder.

encoder's switching sequences, and an adder that combines these sequences to form  $e_{MNC}[p]$ . As shown in Fig. 30(b), each  $s_{k,r}[n]$  residue estimator comprises two branches, one to compute the correction sequence associated with FCE static gain error, and another to compute the correction sequence associated with ISI error, hereafter referred to as the static-error and ISIerror correction branches, respectively.

As indicated in Fig. 30, the MNC logic block's output is

$$e_{\text{MNC}}[p] = \Delta \sum_{k,r} a_{k,r}[n] s_{k,r}[n] + p_{\text{fast}}(t) \Delta \sum_{k,r} b_{k,r}[n] \Big( s_{k,r}[n-1] - s_{k,r}[n] \Big),$$
(97)

where  $a_{k,r}[n]$  and  $b_{k,r}[n]$  are measures called MNC coefficients that correspond to  $\delta_{k,r}$  and  $\gamma_{k,r}(t)$ , respectively. As explained in [15], the MNC coefficients converge such that  $e_{MNC}[p]$  is a sampled measure of the additive error terms in (96). As illustrated conceptually in Fig. 30(c), when injected into the DCO as shown in Fig. 28(b),  $e_{MNC}[p]$  largely prevents these terms from contributing to the DCO's phase noise.

### III. IMPLEMENTATION DETAILS

The MR-DEM and MNC techniques were implemented as modifications to the 6.5 GHz digital fractional-*N* PLL presented in [4]. As the underlying PLL is explained in detail in [4], only the additional implementation details relevant to the MR-DEM and MNC techniques are presented here.

# A. MR-DEM Encoder

The high-level details of the implemented MR-DEM encoder are identical to those shown in Fig. 28(b). The second-order digital  $\Delta\Sigma$  modulator is implemented as an error

feedback structure to reduce its hardware complexity [19]. The fast DEM encoder is implemented as a 4-layer tree of non-segmenting SBs with first-order highpass-shaped switching sequences [20], [21]. The slow DEM encoder is implemented as shown in Fig. 31. The SBs within the slow and fast DEM encoders are identical to the adder-free SBs described in [22], except for a modification made in the slow DEM encoder's SBs for them to accommodate both white and first-order highpass-shaped switching sequences.

It follows from Fig. 29 that the slow DEM encoder bottom output,  $x_f[n]$ , is given by

$$\Delta \sum_{k=9}^{16} 2^{16-k} s_{k,1}[n], \tag{98}$$

which can be computed by combining the bottom outputs of  $S_{16,1}$ ,  $S_{15,1}$ , ... and  $S_{9,1}$  as indicated in Fig. 29. However, in this work  $x_f[n]$  is instead computed without using adders to reduce the block's hardware complexity. As explained shortly, the LSB,  $c_{8,1-LSB}[n]$ , of the input to  $S_{8,1}$ ,  $c_{8,1}[n]$ , corresponds to a quantized version of  $d_F[n]$ , and its quantization error is proportional to (98). The proposed slow DEM encoder architecture takes advantage of this property to generate  $x_f[n]$  by simply combining the bits of the  $d_F[n]$  and  $c_{8,1-LSB}[n]$  sequences as indicated in Fig. 31.

Fig. 31 implies that

$$x_{f}[n] = -2^{8} \left( 1 - c_{8,1-\text{LSB}}[n] \right) \Delta + d_{F}[n], \tag{99}$$

and the SB signal processing operations shown in Fig. 29 imply that

$$c_{8,1}[n] = \frac{1}{2^8} \left( c[n] + 1 - 2^8 - \sum_{k=9}^{16} 2^{16-k} s_{k,1}[n] \right) \Delta.$$
(100)
As explained in [22], the segmenting SBs use negative-extra-LSB encoding, so  $c_{8,1}$ -LSB[n] has a negative weight. This with the segmenting SB details presented in [22] implies that  $c_{8,1}$ -LSB[n] is given by minus the right side of (100) with c[n] replaced by  $d_F[n]/\Delta$ , i.e.,

$$c_{8,1-\text{LSB}}[n] = -\frac{1}{2^8} \left( \frac{d_F[n]}{\Delta} + 1 - 2^8 - \sum_{k=9}^{16} 2^{16-k} s_{k,1}[n] \right) \Delta.$$
(101)

Substituting (101) into (99) yields (98) plus a -1 offset. This offset is not a problem in practice because it is suppressed by a zero-frequency zero in the PLL's transfer function.

### B. MNC Logic

The MNC logic implementation details are shown in Fig. 32. Each variable is represented in fixed-point, two's complement format, and its number of bits is specified via the notation  $\{b_I, b_F\}$ , where  $b_I$  and  $b_F$  are the numbers of integer and fractional bits, respectively. The braces and  $b_F$  are omitted in cases where  $b_F = 0$ .

Each  $s_{k,r}[n]$  residue estimator computes two sequences,  $e_{k,r-\text{stat}}[n]$  and  $e_{k,r-\text{ISI}}[n]$ . The  $e_{k,r-\text{stat}}[n]$  sequences are combined to form  $e_{\text{MNC}-\text{stat}}[n]$ , which corresponds to the first summation in (97), i.e., the part of  $e_{\text{MNC}}[p]$  associated with FCE static gain errors. The  $e_{k,r-\text{ISI}}[n]$  sequences are combined and then multiplied by the unit-amplitude pulse sequence,  $p_{\text{fast}}(t)$ , to form  $e_{\text{MNC}-1}[p]$ , which corresponds to the second summation in (97), i.e., the part of  $e_{\text{MNC}}[p]$  associated with FCE ISI errors. Although not shown in Fig. 32,  $e_{\text{MNC}-\text{stat}}[n]$  and  $e_{\text{MNC}-1}[p]$  are combined at the  $\Delta\Sigma$  modulator's input in Fig. 2(b) to form  $e_{\text{MNC}}[p]$ .

As explained in [21], each  $s_{k,r}[n]$  is a concatenation of sequences of the form 1, 0, ..., 0, -1, 0, ..., 0 or -1, 0, ..., 0, 1, 0, ..., 0. Thus,  $|s_{k,r}[n]| \le 1$  and  $|s_{k,r}[n] - s_{k,r}[n-1]| \le 2$  for all *n*, and the running sum of  $s_{k,r}[n]$  never exceeds 1 nor -1. However, after startup, the finite state machines that generate the switching sequences do not necessarily start at the beginning of their respective cycles, which could cause the magnitudes of the running sums of some switching sequences to exceed 1. This issue is avoided by the inclusion of a  $\pm 1$  clipper within each  $s_{k,r}[n]$  accumulator as shown in Fig. 32.

Each MNC coefficient within each  $s_{k,r}[n]$  residue estimator is the output of a 29-bit clipping accumulator. The 20 LSBs of the coefficients are dropped prior to their respective multiplication by  $s_{k,r}[n]$  and  $s_{k,r}[n-1] - s_{k,r}[n]$ , which reduces power consumption at the expense of reducing the accuracy with which  $e_{MNC}[p]$  cancels the additive error terms in (96). The number of bits to drop was determined with the aid of simulations performed by the authors using a bit-accurate, event-driven, C-language, custom PLL simulator such that the contribution to the PLL's phase noise from the residual error that is left after MNC is applied is negligible.

The  $K_a$  and  $K_b$  gains are restricted to powers of 2 so the implementation of their respective multipliers only involves bus-shifting. Consequently, as shown in Fig. 32, the MNC logic requires no actual digital multipliers.

## C. DCO FCE Banks

The DCO consists of a single-turn center-tapped inductor, a cross-coupled NMOS pair, a tail resonant tank [23], a triode-MOS tail source, and integer and fractional FCE banks. The implemented FCEs are of the type presented in [2], and the minimum-size FCE has an equivalent frequency step of  $\Delta_{min} = 160$  kHz at 6.5 GHz. The integer FCE bank comprises eight  $32 \times \Delta_{min}$  FCEs and five pairs of  $16 \times$ ,  $8 \times$ ,  $4 \times$ ,  $2 \times$  and  $1 \times \Delta_{min}$  FCEs, whereas the fractional FCE bank comprises sixteen  $\Delta_{min}$  FCEs. All FCEs are implemented by connecting one or more  $\Delta_{min}$ FCEs in parallel. Both FCE banks are laid out as illustrated in Fig. 33, where the FCEs are indexed according to their respective control bits. Each FCE is driven by a flip-flop clocked by  $clk_{fast}(t)$ , followed by a buffer. The size of each buffer is scaled according to the number of parallel  $\Delta_{min}$  FCEs it drives.

As shown in Fig. 33, the ten largest FCEs of the integer FCE bank, FCEs 25 to 34, are split into two halves each, which are laid out in a common-centroid fashion to avoid the FCE mismatches from being exacerbated. As explained in Section II-B, the MNC technique cancels much of the error that arises from FCE mismatches, but the larger the FCE mismatches, the larger the required dynamic range of  $e_{MNC}[p]$  and the larger the resulting output dynamic range of the MR-DEM encoder's fractional path. A larger fractional path output dynamic range is undesirable in practice as it increases the MR-DEM encoder's power consumption, as well as the number of control lines that need to be routed from the DCO control to the FCE banks' drivers. Therefore, care was taken with the layout to minimize FCE mismatches.

### IV. MEASUREMENT RESULTS

The prototype IC consists of the digital fractional-*N* PLL as well as a serial peripheral interface (SPI) and additional circuitry used for testing. The IC was fabricated in the Global-Foundries 22 nm CMOS 22FDX technology.

The IC's place-and-route (PNR) digital block has a 0.8 V power supply and is clocked by an  $f_{\text{fast}}$ -rate clock, where  $f_{\text{fast}} = f_{\text{PLI}}/8$ . However, most of the PNR digital block's sub-blocks are clocked by  $f_{\text{ref}} = 80$  MHz clocks that are derived from the  $f_{\text{fast}}$ -rate clock. The maximum value of  $f_{\text{fast}}$  is 830 MHz, although the PNR digital block was designed to run at a clock-rate as high as 1 GHz to provide design margin. The PNR digital block's area is 0.0482 mm<sup>2</sup>, half of which corresponds to the circuitry associated with the MR-DEM and MNC techniques.

Fig. 34 shows the measured PLL phase noise at  $f_{PLL} = 6.56$  GHz for a PLL bandwidth of 900 kHz under various conditions. The signal source analyzer's averaging option was set to 32 when taking each measurement result shown in the figure. In Fig. 34(a), the MR-DEM technique is disabled, i.e., it is configured to operate as a conventional DCO control technique,<sup>13</sup> the MNC technique is disabled, and the measurement was taken over a time period wherein  $d_I[n]$  did not vary. Thus, the phase noise profile shown in Fig. 34(a) corresponds to a standard phase noise profile, as reported in most published papers that report digital PLL results. Fig. 34(b) shows the measured PLL phase noise for the same conditions of Fig. 34(a), except that the measurement time duration was increased and the instrument's persistence option was enabled. The PLL was left running for 2 hours for the measurement, and d[n] was regularly monitored through the SPI to check for integer-boundary crossings. As shown in Fig. 34(b), the PLL phase noise varies significantly over time, such that the spot phase noises at 20 kHz and 1 MHz offset frequencies vary by 9 dB and 11 dB, respectively. The *d*[*n*] sequence crossed three integer boundaries several times during the measurement. Additional measurements performed by the authors indicate that phase noise profiles almost identical to that shown in Fig. 34(b) can be easily obtained in a few minutes, or even in a few seconds if less averaging is used.

Fig. 34(c) shows the measured PLL phase noise for the same conditions of Fig. 34(b), except with the MR-DEM and MNC techniques both enabled. In this case, d[n] crossed two

<sup>&</sup>lt;sup>13</sup> Specifically, the slow DEM encoder's randomization is disabled, in which case the slow DEM encoder behaves as a conventional encoder. The digital  $\Delta\Sigma$  modulator and fast DEM encoder were enabled for all the measurements reported in this paper.

integer boundaries several times during the measurement. As demonstrated in Fig. 34(c), the PLL phase noise varies much less with the proposed techniques enabled. Even at offset frequencies lower than 100 kHz, where the phase noise profile is expected to vary somewhat because of the algorithm used by the instrument to compute the spectrum, the variations in Fig. 34(c) are considerably less significant than those in Fig. 34(b) (e.g., less than 2 dB versus more than 9 dB at a 20 kHz offset frequency). As explained below, the ISI-error correction branch of the MNC technique does not perfectly cancel the ISI component of the error at the PLL output. This is reflected in Fig. 34(c) as a slight phase noise increase at offset frequencies above 10 MHz.

Fig. 34(d) shows the measured PLL phase noise for the same conditions of Fig. 34(c), except that the MR-DEM technique was disabled and the MNC technique's coefficients were frozen after convergence but prior to the measurement. In this case, d[n] crossed two integer boundaries several times during the measurement. The phase noise profile shown in Fig. 8(d) is similar to that shown in Fig. 34(c), except for slightly larger variations at offset frequencies below 200 kHz and at a 1 MHz offset frequency, and the phase noise at offset frequencies above 10 MHz which does not exhibit the excess noise shown in Fig. 34(c). The increase in phase noise variation at low offset frequencies in Fig. 34(d) compared to Fig. 34(c) happens because MNC coefficient noise gets sampled, and, hence, locked in when the MNC coefficients are frozen. Hence, in this configuration, the PLL's performance is expected to be similar to that of the PLL without the MR-DEM and MNC techniques enabled but with considerably smaller FCE mismatches.

Fig. 35 shows the measured PLL phase noise at  $f_{PLL} = 6.56$  GHz with the signal source analyzer's averaging option set to 32 and different combinations of the proposed techniques enabled and disabled. The PLL bandwidth was set to 900 kHz in each case. Fig. 35(a) shows the effect on the PLL phase noise of the MR-DEM technique in the absence of MNC compared to the baseline case of Fig. 34(a). As shown in the figure, the spot phase noise increases by up to 11.5 dB compared to the case of Fig. 34(a), whereas the total integrated jitter from 10 kHz to 80 MHz,  $\sigma_{TJ}$ , increases from 176 fs<sub>rms</sub> to 190 fs<sub>rms</sub>. Fig. 35(b) shows the measured PLL phase noise with the MR-DEM technique and only the static-error correction branch of the MNC technique enabled. The spot phase noise around a 20 MHz offset frequency decreases by 8 dB, which corresponds to most of the noise introduced by the MR-DEM technique. Fig. 35(c) shows the measured PLL phase noise with the MR-DEM technique and both branches of the MNC technique enabled. As shown in Fig. 35(c), the improvement after applying ISI error correction results in a 1.2 dB spot phase noise reduction around a 20 MHz offset frequency, and the resulting phase noise profile still shows some residual error, although its effect on  $\sigma_{TJ}$  is almost negligible.

As explained in [15], each  $b_{k,r}[n]$  coefficient in (97) converges to a value proportional to the average over  $1/f_{\text{fast}}$  of its respective  $\gamma_{k,r}(t)$  coefficient in (96). Consequently, the ISI-error correction branch of the MNC technique does not perfectly cancel the third term from the right side of (96). Nonetheless, as demonstrated in Fig. 35, this is not a problem in practice because the power of the ISI-error component introduced by the MR-DEM technique is much smaller than that of the static-error component. Additional measurements taken by the authors for the same conditions used to generate Fig. 34(d), except with the MNC technique's ISI-error correction branch disabled, yielded phase noise profiles virtually identical to that shown in Fig. 34(d). This suggests that, at least in the case of the presented prototype IC, the ISI-error correction branch of the MNC technique could be omitted to save both power and area without significantly affecting the effectiveness of the MR-DEM and MNC techniques to mitigate spectral breathing.

Although originally intended to work with highpass-shaped switching sequences, the MNC technique also works with white switching sequences. Fig. 36 shows measured PLL phase noise profiles similar to those shown in Fig. 35(a) and Fig. 35(c) but for white switching sequences within the slow DEM encoder. As shown in Fig. 36(a), the PLL phase noise is severely degraded when enabling MR-DEM in this case. The spot phase noise increases by up to 12.6 dB, and  $\sigma_{TJ}$  increases from 176 to 425 fs<sub>rms</sub>. Nonetheless, as shown in Fig 36(b), the MNC technique cancels most of the noise introduced by the MR-DEM technique, except for a small portion for the reasons explained above. As in the case with highpass-shaped switching sequences, most of the noise in Fig. 36(a) gets cancelled when enabling the MNC technique's static-error correction branch; the ISI-error correction branch accounts for less than a 1 dB reduction.

Table 2 summarizes the increase of the PNR digital block's power consumption for different combinations of the MR-DEM and MNC techniques enabled and disabled. As shown in the table, the proposed techniques increase the power consumption by up to 1.64 mW and 1.4 mW when using highpass-shaped and white switching sequences, respectively. In both cases, the most significant contributor is the ISI-error correction branch of the MNC technique.

The proposed techniques can be used without significantly increasing the PLL's power consumption by freezing the MNC coefficients after convergence and disabling the MR-DEM technique, in which case the PLL's power consumption only increases by 250  $\mu$ W. However, as demonstrated in Fig. 34(c) and Fig. 34(d), the effectiveness of the techniques to mitigate spectral breathing is slightly reduced is in this case.

Additional measurements taken by the authors suggest that the proposed techniques have no effect on the PLL's fractional spur performance. In contrast, the reference spur power increases by 10 dB (from -80 dBc to -70 dBc) as a result of enabling MR-DEM, but it does not increase when MNC is enabled with its coefficients frozen and MR-DEM is disabled.

Although the reference spur is expected to increase when enabling MR-DEM due to coupling from the DCO control lines to the DCO outputs, the authors believe that the reported increase of 10 dB is exacerbated by a layout issue in the DCO, which was not caught prior to fabrication because of a post-layout extraction tool flaw. As explained in [4], this issue caused the DCO's quality factor to be significantly lower than expected even after FIB surgery. Consequently, the DCO outputs swing is extremely low, even when raising the DCO tank's supply voltage to 0.9 V, and the DCO is highly sensitive to interference from other circuit blocks.

#### V. CONCLUSION

This paper presents the first experimental demonstration of the MR-DEM and MNC techniques described in [15], which mitigate the spectral breathing problem in digital PLLs that

results from non-ideal FCE behavior. Additionally, implementation details as well as practical observations that complement the techniques' descriptions in [15] are presented.

The MR-DEM technique linearizes the DCO input-output characteristics at the expense of additive highpass-shaped error which depends on known digital sequences. This error has a component that arises from FCE static gain errors, and another component that arises from ISI, both of which are cancelled by the MNC technique. By freezing the MNC coefficients after convergence and disabling MR-DEM, the presented techniques significantly mitigate the spectral breathing problem while only negligibly increasing the PLL's total power consumption.

Measurement results suggest that for this particular DCO design, the power of the ISI component of the DCO error is much less significant than that of the component that arises from FCE static gain errors. Furthermore, the results suggest that white switching sequences can be used in the MR-DEM technique's slow DEM encoder to reduce digital logic complexity, thereby reducing both power and area consumption.

#### ACKNOWLEDGEMENTS

This chapter, in full, has been submitted for publication to the IEEE Journal of Solid-State Circuits. E. Alvarez-Fontecilla, E. Helal, A. I. Eissa, I. Galton, 2021. The dissertation author is the primary investigator and author of this paper. Professor Ian Galton supervised the research which forms the basis for this paper.

## FIGURES



Figure 27: (a) General form of a digital fractional-*N* PLL (b) conventional DCO control technique, and (c) illustration of the effects of non-ideal FCE behavior on the PLL's phase noise spectrum.



Figure 28: (a) Block diagram of a digital fractional-*N* PLL with the MR-DEM and MNC techniques, and (b) MR-DEM technique details.



Figure 29: Slow DEM encoder signal processing.



Figure 30: (a) MNC logic signal processing, (b)  $s_{k,r}[n]$  residue estimator signal processing, and (c) illustration of the MNC technique operation where  $p_t = p$  over the *p*th period of clk<sub>fast</sub>(*t*).



Figure 31: Adder-free slow DEM encoder implementation.



Figure 32: MNC logic bit-level implementation.

|                         |                       | [              |     |              |              |                                          |
|-------------------------|-----------------------|----------------|-----|--------------|--------------|------------------------------------------|
| FCE 27 (1/2)            |                       | ▶              |     | FCE 34 (1/2) |              |                                          |
| FCE 28 (1/2)            |                       | <b>⊢</b> ►     |     | FCE 33 (1/2) |              |                                          |
| FCE 29 (1/2)            |                       | <b>→</b>       |     | FCE 32 (1/2) |              |                                          |
| FCE 30 (1/2)            |                       | Integer        |     | FCE 31 (1/2) |              |                                          |
| FCE 8 FCE 25 (1/2)      | <del>`</del> _`       | Fractional 2   |     | FCE 26 (1/2) | FCE 7        |                                          |
| 18 20 FCE 22 FCE 26 (½) | <b>▲</b>              | FCE Bank       |     | FCE 25 (1/2) | FCE 21 19 17 |                                          |
| FCE 31 (1/2)            | 15 16                 | Drivers 2      | 1 2 | FCE 30 (1/2) |              |                                          |
| FCE 32 (1/2)            | 14 13                 | <del>2</del> ► | 4 3 | FCE 29 (1/2) |              |                                          |
| FCE 33 (1/2)            | 11 12                 | <del>3</del> ► | 5 6 | FCE 28 (1/2) |              |                                          |
| FCE 34 (1/2)            | 10 9                  | <del>2</del> ► | 8 7 | FCE 27 (1/2) |              |                                          |
|                         | <sup>3</sup>          | ^ ľ            |     |              |              |                                          |
|                         | Minimum-<br>sized FCE | <b></b>        |     |              | 34           | $b_{1-34}[p]$<br>clk <sub>fast</sub> (t) |

Figure 33: Integer and fractional FCE banks layout.



Figure 34: Measured PLL phase noise at  $f_{PLL} = 6.56$  GHz for (a) conventional DCO control technique with constant  $d_I[n]$ , (b) conventional DCO control technique and persistence enabled for 2 hrs. (c) MR-DEM and MNC enabled and persistence enabled for 2 hrs., and (d) MR-DEM disabled, MNC enabled with coefficients frozen after convergence, and persistence enabled for 2 hrs.



Figure 35: Measured PLL phase noise at  $f_{PLL} = 6.56$  GHz for (a) MR-DEM enabled and MNC disabled, (b) MR-DEM and MNC (stat. branch only) enabled, and (c) MR-DEM and MNC (both stat. and ISI branches) enabled



Figure 36: Measured PLL phase noise at  $f_{PLL} = 6.56$  GHz with MR-DEM enabled for (a) MNC disabled, and (b) MNC (both stat. and ISI branches) enabled.

## TABLES

|                                                        | MNC                |                     | Power increase (mW) <sup>(1)</sup> |             |  |  |  |  |  |
|--------------------------------------------------------|--------------------|---------------------|------------------------------------|-------------|--|--|--|--|--|
| MR-DEM                                                 | Staterror          | ISI-error           | MNC coeffs.                        | MNC coeffs. |  |  |  |  |  |
|                                                        | branch             | branch              | update active                      | frozen      |  |  |  |  |  |
| Disabled                                               | Enabled            | Enabled             | -                                  | 0.25        |  |  |  |  |  |
| First-Order Highpass-Shaped <i>s<sub>k,r</sub>[n</i> ] |                    |                     |                                    |             |  |  |  |  |  |
|                                                        | Disabled           | Disabled            | 0.52                               | 0.52        |  |  |  |  |  |
| Enabled                                                | Enabled            | Disabled            | 0.92                               | 0.57        |  |  |  |  |  |
|                                                        | Enabled            | Enabled             | 1.64                               | 0.86        |  |  |  |  |  |
| White $s_{k,r}[n]$                                     |                    |                     |                                    |             |  |  |  |  |  |
|                                                        | Disabled           | Disabled            | 0.43                               | 0.43        |  |  |  |  |  |
| Enabled                                                | Enabled            | Disabled            | 0.83                               | 0.47        |  |  |  |  |  |
|                                                        | Enabled            | Enabled             | 1.40                               | 0.70        |  |  |  |  |  |
| Enabled                                                | Enabled<br>Enabled | Disabled<br>Enabled | 0.83                               | 0.47        |  |  |  |  |  |

# Table 2: PNR digital block power consumption.

<sup>1</sup> With respect to the case where MR-DEM and MNC are disabled.

## REFERENCES

- R. B. Staszewski, K. Muhammad, D. Leipold, C.-M. Hung, Y.-C. Ho, J. L. Wallberg, C. Fernando, K. Maggio, R. Staszewski, T. Jung, J. Koh, S. John, I. Y. Deng, V. Sarda, O. Moreira-Tamayo, V. Mayega, R. Katz, O. Friedman, O. E. Eliezer, E. de-Obaldia, and P. T. Balsara, "All-Digital TX Frequency Synthesizer and Discrete-Time Receiver for Bluetooth Radio in 130-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 39, no. 12, pp. 2278-2291, Dec. 2004.
- 2. C. Venerus and I. Galton, "A TDC-Free Mostly-Digital FDC-PLL Frequency Synthesizer With a 2.8-3.5 GHz DCO," *IEEE J. Solid-State Circuits*, vol. 50, no. 2, pp. 450–463, Feb. 2015.
- 3. C. Weltin-Wu, G. Zhao, I. Galton, "A 3.5 GHz Digital Fractional-*N* Frequency Synthesizer Based on Ring Oscillator Frequency-to-Digital Conversion," *IEEE J. Solid-State Circuits*, vol. 50, no. 12, pp. 2988–3002, Dec. 2015.
- 4. E. Helal, E. Alvarez-Fontecilla, A. I. Eissa, and I. Galton, "A Time Amplifier Assisted Frequency-to-Digital Converter Based Digital Fractional-*N* PLL," *IEEE J. Solid-State Circuits*, early access, Feb. 2, 2021, doi: <u>10.1109/JSSC.2020.3048650</u>.
- 5. H. Liu, D. Tang, Z. Sun, W. Deng, H. C. Ngo and K. Okada, "A Sub-mW Fractional-N ADPLL With FOM of -246 dB for IoT Applications," *IEEE J. Solid-State Circuits*, vol. 53, no. 12, pp. 3540–3552, Dec. 2018.
- 6. C. Li, M. Yuan, C. Liao, Y. Lin, C. Chang and R. B. Staszewski, "All-Digital PLL for Bluetooth Low Energy Using 32.768-kHz Reference Clock and ≤0.45-V Supply," *IEEE J. Solid-State Circuits*, vol. 53, no. 12, pp. 3660–3671, Dec. 2018.
- 7. A. Elkholy, S. Saxena, G. Shu, A. Elshazly and P. K. Hanumolu, "Low-Jitter Multi-Output All-Digital Clock Generator Using DTC-Based Open Loop Fractional Dividers," *IEEE J. Solid-State Circuits*, vol. 53, no. 6, pp. 1806–1817, June 2018.
- C. Yao, R. Ni, C. Lau, W. Wu, K. Godbole, Y. Zuo, S. Ko, N. Kim, S. Han, I. Jo, J. Lee, J. Han, D. Kwon, C. Kim, S. Kim, S. W. Son, and T. B. Cho, "A 14-nm 0.14-psrms Fractional-N Digital PLL With a 0.2-ps Resolution ADC-Assisted Coarse/Fine-Conversion Chopping TDC and TDC Nonlinearity Calibration," *IEEE J. Solid-State Circuits*, vol. 52, no. 12, pp. 3446–3457, Dec. 2017.
- 9. M. Talegaonkar, T. Anand, A. Elkholy, A. Elshazly, R. K. Nandwana, S. Saxena, B. Young, W. Choi, and P. K. Hanumolu, "A 5GHz Digital Fractional-*N* PLL Using a 1-bit Delta–Sigma Frequency-to-Digital Converter in 65 nm CMOS," *IEEE J. Solid-State Circuits*, vol. 52, no. 9, pp. 2306–2320, Sept. 2017.

- Y. Wu, M. Shahmohammadi, Y. Chen, P. Lu and R. B. Staszewski, "A 3.5–6.8-GHz Wide-Bandwidth DTC-Assisted Fractional-*N* All-Digital PLL With a MASH ΔΣ-TDC for Low In-Band Phase Noise," *IEEE J. Solid-State Circuits*, vol. 52, no. 7, pp. 1885– 1903, July 2017.
- D. Liao, H. Wang, F. F. Dai, Y. Xu, R. Berenguer and S. M. Hermoso, "An 802.11a/b/g/n Digital Fractional-N PLL With Automatic TDC Linearity Calibration for Spur Cancellation," *IEEE J. Solid-State Circuits*, vol. 52, no. 5, pp. 1210–1220, May 2017.
- D. Pfaff, R. Abbott, X. Wang, S. Moazzeni, R. Mason and R. R. smith, "A 14-GHz Bang-Bang Digital PLL With Sub-150-fs Integrated Jitter for Wireline Applications in 7-nm FinFET CMOS," *IEEE J. Solid-State Circuits*, vol. 55, no. 3, pp. 580–591, March 2020.
- A. Santiccioli, M. Mercandelli, A. L. Lacaita, C. Samori and S. Levantino, "A 1.6-to-3.0-GHz Fractional-N MDLL With a Digital-to-Time Converter Range-Reduction Technique Achieving 397-fs Jitter at 2.5-mW Power," *IEEE J. Solid-State Circuits*, vol. 54, no. 11, pp. 3149–3160, Nov. 2019.
- H. Liu, Z. Sun, H. Huang, W. Deng, T. Siriburanon, J. Pang, Y. Wang, R. Wu, T. Someya, A. Shirane, and K. Okada, "A 265-μW Fractional-N Digital PLL With Seamless Automatic Switching Sub-Sampling/Sampling Feedback Path and Duty-Cycled Frequency-Locked Loop in 65-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 54, no. 12, pp. 3478–3492, Dec. 2019.
- 15. E. Alvarez-Fontecilla, C. Venerus and I. Galton, "Multi-Rate DEM With Mismatch-Noise Cancellation for DCOs in Digital PLLs," *IEEE Trans. Circuits and Syst. I. Reg. Papers*, vol. 65, no. 10, pp. 3125–3137, Oct. 2018.
- 16. O. Eliezer, B. Staszewski, J Mehta, F. Jabbar, I. Bashir, "Accurate Self-Characterization of Mismatches in a Capacitor Array of a Digitally-Controlled Oscillator," *IEEE Dallas Circuits and Syst. Workshop*, pp. 17-18, Oct. 2010.
- 17. O. Eliezer, B. Staszewski, S Vemulapalli, "Digitally controlled oscillator in a 65nm GSM/EDGE transceiver with built-in compensation for capacitor mismatches," *IEEE Radio Frequency Integrated Circuits Symp.*, pp. 5-7 June 2011.
- 18. I. Galton, "Why Dynamic-Element-Matching DACs Work," *IEEE Trans. Circuits Syst. II, Exp. Briefs*, vol. 57, no. 2, pp. 69-74, Mar. 2010.
- P. Kiss, J. Arias, D. Li and V. Boccuzzi, "Stable High-Order Delta-Sigma Digital-to-Analog Converters," *IEEE Trans. Circuits and Syst. I. Reg. Papers*, vol. 51, no. 1, pp. 200–205, Jan. 2004

- 20. I. Galton, "Spectral shaping of circuit errors in digital-to-analog converters," *IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process.*, vol. 44, no. 10, pp. 808–817, Oct. 1997.
- 21. J. Welz, I. Galton, and E. Fogleman, "Simplified logic for first-order and second-order mismatch-shaping digital-to-analog converters," *IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process.*, vol. 48, no. 11, pp. 1014–1028, Nov. 2001.
- 22. C. Venerus, J. Remple, and I. Galton, "Simplified Logic for Tree-Structure Segmented DEM Encoders," *IEEE Trans. Circuits and Syst. II. Exp. Briefs*, vol. 63, no. 11, pp. 1029–1033, Nov. 2016.
- 23. E. Hegazi, H. Sjoland and A. A. Abidi, "A Filtering Technique to Lower LC Oscillator Phase Noise," *IEEE J. Solid-State Circuits*, vol. 36, no. 12, pp. 1921-1930, Dec. 2001.