# **A ROM-less DDFS Based on a Parabolic Polynomial Interpolation Method with an Offset**

Chua-Chin Wang · Chia-Hao Hsu · Chia-Chuan Lee · Jian-Ming Huang

Received: 6 November 2009 / Revised: 6 May 2010 / Accepted: 6 May 2010 © Springer Science+Business Media, LLC 2010

**Abstract** A novel direct digital frequency synthesizer (DDFS) based on a parabolic polynomial with an offset is proposed in this paper. A 16-segment parabolic polynomial interpolation is adopted to replace the traditional ROM-based phase-to-amplitude conversion methods. Besides, the proposed parabolic polynomial interpolation is realized in a multiplier-less structure such that the speed can be significantly improved. This work is manufactured by a standard 0.13  $\mu$ m CMOS cell-based technology. The maximum clock rate is 161 MHz, the core area is 0.33 mm<sup>2</sup>, and the spurious free dynamic range (SDRF) is 117 dBc by physical measurements on silicon.

**Keywords** Direct digital frequency synthesizer (DDFS) • Interpolation • Spurious free dynamic range (SFDR)

# **1** Introduction

The frequency synthesizer is an important component of communication systems. Conventionally, phase-lock loops (PLLs) [1, 2] are usually adopted to synthesize sinusoid waves. However, the PLL-based frequency synthesizers can not provide fast frequency switching and high spectral purity sine outputs at the same time [3]. This deficiency makes PLLs inadequate for modern wireless communication systems demanding fast

Published online: 27 May 2010

frequency switching. The direct digital frequency synthesizer (DDFS) has been considered as a better alternative rather than PLL-based frequency synthesizers. The reason is that DDFS can provide fast frequency switching and excellent spectral purity.

A conventional DDFS architecture is shown in Fig. 1. The digital phase is converted into samples of sine amplitude by a ROM look-up table. Then, the sample values are converted into analog signals by a Digital-to-Analog Converter (DAC). However, this architecture has an intrinsic difficulty: it demands a very large ROM as the storage of the sinusoid amplitudes. Thus, it suffers from the inherent drawbacks of large power dissipation, large chip area, and slow speed. Although the ROM size can be significantly reduced by truncating the output of the phase accumulator, the added spurious noise will degrade the spectral purity.

On the other hand, many researches have been reported on the designs of ROM-less DDFS, e.g., [4-8]. The ROM-less DDFSs utilized different algorithms instead of ROM tables to realize the phase-to-sine mapper. Many prior ROM-less DDFS works demanded complicated polynomials to carry out the phase-tosine mapper. However, any method based on highorder polynomials will be hard to meet the high speed requirement of modern wireless communication applications due to the intrinsic massive computation complexity. Considering the feasibility of hardware realization, any polynomial whose order is larger than three may be inefficient to be implemented. By contrast, lately reported DDFSs based on 2nd-order polynomials can not achieve high performance without sacrificing the speed. This paper proposes a DDFS based on a 2nd-order parabolic polynomial with an offset to resolve all of the mentioned difficulties. The

C.-C. Wang (⊠) · C.-H. Hsu · C.-C. Lee · J.-M. Huang Department of Electrical Engineering, National Sun Yat-Sen University, 70 Lian-Hai Rd., Kaohsiung, Taiwan e-mail: ccwang@ee.nsysu.edu.tw



Figure 1 The conventional DDFS structure.

proposed DDFS can simultaneously achieve an exceptional spectral purity and a satisfactory speed.

#### **2** The Proposed DDFS Architecture

### 2.1 Prior Parabolic Polynomial Interpolation Methods

To implement the phase-to-sine mapping function by a 2nd-order polynomial, the parabolic polynomial could be the most convenient choice. The first quadrant of the cosine signal ( $\cos \theta$ ,  $0 \le \theta < \pi/2$ ) can be divided into  $M = 2^m$  segments, and each segment can be approximated by the parabolic polynomial, which is expressed as follows:

$$y(x) = a_i x^2 + c_i, \quad i = 1 \sim M.$$
 (1)

where the parameter  $a_i$  and  $c_i$  can be derived by the least square method. Although the SFDR of this DDFS



Figure 2 The error between the ideal cosine function and the QLIP method (M = 4).



Figure 3 The error between the ideal cosine function and the proposed method (M = 4).

will be increased as the increase of M, large M will lead to divergence when performing the least square method.

Ashrafi and Adhami [4] proposed a quasi-linear interpolation method (QLIP) to improve the performance of the DDFS based on the parabolic polynomial. It was based on the observation that a cosine signal is close to a parabola at the vicinity of  $\theta = 0$ . Meanwhile, it is more like a straight line around  $\theta = \pi/2$ . The QLIP method can be expressed as follows:

$$y(x) = \begin{cases} a_i x^2 + c_i, & 1 \le i \le 3M/4 \\ a_i x + c_i, & 3M/4 + 1 < i \le M. \end{cases}$$
(2)

According to [4], the QLIP method attained a 6 dBc improvement over the parabolic polynomial method.

Both the QLIP and the parabolic polynomial method have the same difficulty in fitting the curvature of the sinusoid. The curvature of a given curve y = f(x) can be expressed as follows:

$$k(x) = \frac{y''}{(1+y'^2)^{3/2}},$$
(3)

**Table 1** The SFDR of the QLIP method and the proposed method under different *M*.

|        | QLIP method [4] | Proposed method |  |  |  |
|--------|-----------------|-----------------|--|--|--|
| M = 4  | 65 dBc          | 81.3 dBc        |  |  |  |
| M = 8  | 78 dBc          | 99 dBc          |  |  |  |
| M = 16 | 90 dBc          | 117.3 dBc       |  |  |  |



Figure 4 The block diagram of the proposed DDFS.

where k(x) is the curvature at x. If the parabolic polynomial is used, its 1st-order derivative  $(y' = 2a_ix)$  is proportional to the 2nd-derivative  $(y'' = 2a_i)$ . Thus, if

only one parameter  $a_i$  is used to determine both the slope and the curvature of the curve, the fitting will be hard.



Figure 5 Post-layout simulation result without coupled to a DAC.



Figure 6 The die photo of the proposed DDFS.

# 2.2 Proposed Parabolic Polynomial Interpolation

Therefore, to enhance the adjustability of the parabolic polynomial, we introduce a 1st-order term, which is called the "offset", into Eq. 1,

$$y(x) = a_i(x + x_i)^2 + c_i, \quad i = 1 \sim M$$
 (4)

where  $x_i$  is the displacement factor, i.e., "offset", for segment *i*. The coefficient " $x_i$ " is derived by the variation between a ideal sine wave and the generated sine wave. When Eq. 3 is used to carry out the phase-to-sine mapping function, there are two parameters that can be used to adjust the characteristics of the curve in each segment.

#### 2.3 Iterative Derivation of Offsets

We start with an initial value 0 for factor  $x_i$ , and then use the least square method to derive  $a_i$  and  $c_i$ , respectively. Equation 4 can be expanded to M segments and re-formulated to be Eq. 5. After multiplying the transpose of the first matrix at the righthand side of Eq. 5, we attain Eq. 6. Thus, the coefficients  $a_i$  and  $c_i$  can be derived, which is expressed as Eq. 7. Finally, the approximated sine wave amplitude in Eq. 5 can be rewritten as Eq. 8.

According to Eq. 8, we can derive the coefficient  $x_i$  to find the least square error. Then, back-substitute  $x_i$  into Eq. 7 such that  $a_i$  and  $c_i$  are derived, respectively.

# 2.4 Performance Evaluation

To evaluate the performance of the proposed parabolic polynomial, we carry out different DDFSs based on these three methods, Eqs. 1, 2, ours in Eq. 4, and compare their performance. The coefficients of these DDFSs are derived by the least square method of MATLAB, where the output of these DDFSs are not quantized. Figures 2 and 3 show the computed error of the QLIP method and the proposed method, where the maximum error is  $3.77 \times 10^{-3}$  and  $4.8 \times 10^{-4}$ , respectively. It is obvious that the proposed method is superior in terms of accuracy.

Then, we try to distinguish our method from QLIP in terms of SFDR. Table 1 shows the SFDR comparison between the QLIP method and the proposed method with different numbers of segments. The SFDR of the proposed method for M = 16 has almost 27 dBc improvement over that reported in [4]. Regarding the computation complexity, Eq. 4 needs only one more addition/subtraction than Eq. 2. However, this overhead is acceptable compared to the significant improvement of the spectral purity. Since 100 dBc SFDR meets the requirement of many modern wireless communication applications, we adopt the 16-segment parabolic polynomial with an offset method to realize the proposed DDFS.

$$\begin{bmatrix} y_1 \\ y_2 \\ y_2 \\ \vdots \\ y_M \end{bmatrix} = \begin{bmatrix} (x+x_1)^2 & 1 \\ (x+x_2)^2 & 1 \\ (x+x_3)^2 & 1 \\ \vdots & \vdots \\ (x+x_M)^2 & 1 \end{bmatrix} \begin{bmatrix} a_i \end{bmatrix}$$
(5)









#### Table 2 Specifications of the proposed DDFS.

| Technology                   | 0.13 µm CMOS process               |  |  |  |  |  |
|------------------------------|------------------------------------|--|--|--|--|--|
| Power supply                 | 1.2 V                              |  |  |  |  |  |
| Frequency control word (FCW) | 32 bits                            |  |  |  |  |  |
| Phase word                   | 20 bits                            |  |  |  |  |  |
| Frequency tuning range       | 0.053 Hz                           |  |  |  |  |  |
| SFDR                         | 117.3 dBc                          |  |  |  |  |  |
| Output resolution            | 24 bits                            |  |  |  |  |  |
| Max. clock rate              | 161 MHz                            |  |  |  |  |  |
| Gate count                   | 17,767                             |  |  |  |  |  |
| Power dissipation            | 3.56 mW @ 161 MHz (core)           |  |  |  |  |  |
|                              | 69.83 mW @ 161 MHz (PAD)           |  |  |  |  |  |
| Area                         | 2.015 mm <sup>2</sup> (whole chip) |  |  |  |  |  |
|                              | $0.33 \text{ mm}^2$ (core area)    |  |  |  |  |  |

#### 2.5 Hardware Implementation

Considering the hardware implementation feasibility of the DDFS based by the proposed parabolic polynomial, we must choose a suitable length of the phase word. The reason is that the phase word-length determines the performance and the complexity of the phase-tosine mapper. According to [9], the worst case SFDR for a DDFS with a perfect phase-to-sine mapper (unquantized output) can be expressed as Eq. 9:

$$SFDR < 20 \cdot \log 2^W \tag{9}$$

where W is the phase word-length. Since the simulation result shows the SFDR of the proposed methods for M = 16 is 117.3 dBc, the phase word-length is chosen to be 20 bits.

Given the phase word-length of 20 bits, the hardware complexity is certainly increased to slow down the operation speed. To enhance the speed, the multiplication in Eq. 4 should be removed to reduce the complexity. In our implementation, coefficient  $a_i$  is manipulated as a 15-bit binary sequence, and therefore the multiplication can be replaced by the shift-and-add operation (Ex:  $0.75 \cdot x = 2^{-1} \cdot x + 2^{-2} \cdot x$ ). Figure 4 shows the entire hardware block diagram of the proposed DDFS. The frequency control word (FCW) is 32 bits to obtain a fine frequency tuning range. The output of the squarer is shifted and selected by multiplexers. The symbol  $a_{i-j}$  in Fig. 4 represents the *j*-th bit of the sequence for the *i*-th segment. The summation of the total 16 multiplexer's outputs is realized by a 4-level adder tree. To reduce the latency caused by the adder tree, the summation is implemented by a 4-stage pipeline structure.

## **3 Implementation and Measurement**

The proposed DDFS prototype is carried out by a standard 0.13  $\mu$ m CMOS technology to verify the performance. All of the PVT (pressure, voltage, and temperature): [0°C, +100°C], VDD ± 10%, and (SS, SF, TT, FS, FF) models, are simulated. The frequency transition from 62.5 MHz to 3.91 MHz sine output given the worst condition (SS Model, 0°C,  $f_{clock} = 227$  MHz) is shown in Fig. 5, where  $f_{clock}$  means the operation frequency. The die photo of the proposed DDFS is shown in Fig. 6.

The proposed prototype is verified by the Agilent 93000 SOC test system, which shows that the measured data exactly match the expected results. Figure 7 shows the measurement result. The output values (ALLOUT-PUT) match the simulation results (DDFSOUT11). Figure 8 exhibits the SHMOO diagram, which shows that the maximum clock range at 1.2 V power supply is 161 MHz.

Figure 9 shows the spurious performance of the proposed method is as high as 117.2 dBc. The specifications of the proposed prototype is summarized in Table 2. Table 3 presents the comparison between the proposed DDFS and several recent ROM-less DDFS designs. The comparison shows that the proposed DDFS has the largest SFDR while maintaining the satisfactory speed and energy efficiency. The waveform of the synthesized sine wave derived from the measured samples by the Agilent 16702B logic analyzer is shown in Fig. 10.

| <b>Table 3</b> Comparison of DDFS designs.         *The DDFS design was implemented by FPGA (Field Programmable Gate Array). |                                 | [6]   | [7]   | [10]  | [11] | [12]  | [13]     | [14] | Ours  |
|------------------------------------------------------------------------------------------------------------------------------|---------------------------------|-------|-------|-------|------|-------|----------|------|-------|
|                                                                                                                              | Process (µm)                    | 0.5   | 0.25  | 0.35  | 0.35 | *     | *        | *    | 0.13  |
|                                                                                                                              | Туре                            | sine  | quad  | sine  | quad | quad  | quad     | quad | sine  |
|                                                                                                                              | SFDR (dBc)                      | 91.51 | 80    | 35    | 80   | 95.11 | 50.89    | 69.3 | 117.3 |
|                                                                                                                              | Phase accumulator bits          | 16    | 24    | 8     | 18   | 12    | 15       | 7    | 20    |
|                                                                                                                              | Output resolution (bits)        | 14    | 12    | 8     | 16   | 10    | 14       | n/a  | 24    |
|                                                                                                                              | Power (mW/MHz)                  | 0.56  | 0.127 | 410   | 0.81 | 2.18  | 1,044.35 | 2.4  | 0.35  |
|                                                                                                                              | Max. clock (MHz)                | 106   | 600   | 2,000 | 100  | 221   | 11.25    | 201  | 161   |
|                                                                                                                              | Area (core) mm <sup>2</sup>     | 2.489 | n/a   | n/a   | n/a  | n/a   | n/a      | n/a  | 0.33  |
|                                                                                                                              | Area (with pad) mm <sup>2</sup> | n/a   | 0.09  | 3.99  | n/a  | n/a   | n/a      | n/a  | 2.015 |
|                                                                                                                              | Year                            | 2005  | 2005  | 2005  | 2006 | 2007  | 2008     | 2008 | 2009  |

**Figure 10** Waveform samples of the synthesized sine wave.



# **4** Conclusion

A parabolic polynomial with an offset to realize the phase-to-sine mapping function of a DDFS is presented in this paper. The proposed method can achieve a significant improvement in SFDR compared to the QLIP method [4]. A 16-segment parabolic polynomial is employed to implement a ROM-less DDFS. The logic operation of the proposed DDFS is manipulated to achieve a multiplier-less design. The pipeline design is adopted to further reduce the latency in signal processing. The proposed DDFS achieves a SFDR of 117 dBc, and the maximum frequency of the synthesis sine wave is 161 MHz.

Acknowledgements This investigation is partially supported by National Science Council under grant NSC 96-2628-E-110-019-MY3, and National Health Research Institutes under grant NHRI-EX99-9732EI. It is also partially supported by Ministry of Economic Affairs, Taiwan, under grant 98-EC-17-A-01-S1-104, 98-EC-17-A-02-S2-0017, 98-EC-17-A-19-S1-133 and 98-EC-17-A-07-S2-0010. The authors would like to express their deepest gratefulness to Chip Implementation Center of National Applied Research Laboratories, Taiwan, for their thoughtful chip fabrication service.

# References

- 1. Dosho, S., Yanagisawa, N., & Toyama, M. (2004). A design of compact PLL with adaptive active loop filter circuit. *IEICE Transactions on Electronics*, *E87-C*(6), 949–955.
- Dosho, S., Yanagisawa, N., Sogawa, K., Yamada, Y., & Morie, T. (2007). An ultra-wide range digitally adaptive con-

trol phase locked loop with new 3-phase switched capacitor loop filter. *IEICE Transactions on Electronics*, 106(206), 93–98.

- Kesoulis, M., Soudris, D., Koukourlis, C., & Thanailakis, A. (2007). Systematic methodology for designing low power direct digital frequency synthesizers. *IET Circuits, Devices & Systems*, 1(4), 293–304.
- Ashrafi, A., & Adhami, R. (2005). A direct digital frequency synthesizer utilizing quasi-linear interpolation method. In *Proc. of 37th IEEE southeastern symposium on system theory* (pp. 144–148).
- Ashrafi, A., & Adhami, R. (2006). An optimized direct digital frequency synthesizer based on even fourth order polynomial interpolation. In *Proc. of 38th IEEE southeastern symposium* on system theory (pp. 109–113).
- Jafari, H., Ayatollahi, A., & Mirzakuchaki, S. (2005). A low power, high SFDR, ROM-less direct digital frequency synthesize. In 2005 IEEE conf. on electron devices and solid-state circuits (pp. 50–54).
- De Caro, D., & Strollo, A. G. M. (2005). High-performance direct digital frequency synthesizers in 0.25 μm CMOS using dual-slope approximation. *IEEE Journal of Solid-State Circuits*, 40(11), 2220–2227.
- Song, Y., & Kim, B. (2003). A 250 MHz direct digital frequency synthesizer with delta-sigma noise shaping. In 2003 *IEEE int. solid-state circuits conf. (ISSCC 2003)* (Vol. 1, pp. 472–509).
- Nicholas, H. T., & Samueli, H. (1987). An analysis of the output spectrum of direct digital frequency synthesizers in the presence of phase-accumulator truncation. In *41st annual frequency control symposium* (pp. 495–502).
- Yu, X., Dai, F. F., Shi, Y., & Zhu, R. (2005). 2 GHz 8bit CMOS ROM-less direct digital frequency synthesizer. In *IEEE inter. symp. on circuits and systems* (Vol. 5, pp. 4397– 4400).
- 11. Yi, S. C., Lee, K. T., Chen, J. J., & Lin, C. H. (2006). A lowpower efficient direct digital frequency synthesizer based on new two-level lookup table. In 2006 Canadian conference on electrical and computer engineering (pp. 963–966).

- Kesoulis, M., Soudris, D., Koukourlis, C., & Thanailakis, A. (2007). Systematic methodology for designing low power direct digital frequency synthesizers. *IET Circuits, Devices & Systems*, 1(4), 293–304.
- Jeng, S. S., Lin, H. C., & Wu, C. Y. (2008). DDFS design using the equi-section division method for SDR transceiver. In *IEEE 19th international symposium on personal, indoor* and mobile radio communications (pp. 1–5).
- Li, X., Lai, L., Lei, A., & Lai, Z. (2008). A memory-reduced direct digital frequency synthesizer for OFDM receiver systems. *IEEE Transactions on Consumer Electronics*, 54(4), 1564–1568.

Prof. Wang has served as the program committee member in many international conferences. He was Chair of IEEE Circuits and Systems Society (CASS) for 2007-2008, Tainan Chapter. He was also the founding Chair of IEEE Solid-State Circuits Society (SSCS), Tainan Chapter for 2007-2008, and the founding Consultant of IEEE NSYSU Student Branch. He is also a member of the IEEE CASS Multimedia Systems & Applications (MSA), VLSI Systems and Applications (VSA), Nanoelectronics and Giga-scale Systems (NG), and Biomedical Circuits and Systems (BioCAS) Technical Committees. He is a senior member of IEEE since 2004. In 2007, he was elected to be Chair-elect of IEEE CASS Nanoelectronics and Giga-scale Systems (NG) Technical Committee to serve a 2-year term of 2008–2009. Since 2010, he has been invited to be Associate Editors of IEEE Trans. on TCAS-I and TCAS-II. Currently, he is also serving as Associate Editor of IEICE Transactions on Electronics, and Journal of Signal Processing.

Dr. Wang is the General Chair of 2007 VLSI/CAD Symposium. He is General Co-Chair of 2010 IEEE Inter. Symp. On Next-generation Electronics (2010 ISNE). He is General Chair of 2011 IEEE Inter. Conf. On IC Design and Technology (2011 ICICDT).



Chua-Chin Wang received the Ph.D. degree in electrical engineering from SUNY (State University of New York) at Stony Brook, USA, in 1992. He then joined the Department of Electrical Engineering, National Sun Yat-Sen University, Taiwan, and became a full professor since 1998. He founded SOC group in Department of Electrical Engineering, National Sun Yat-Sen University in 2005. He is currently Chairman of this department. Dr. Wang's research interests include memory and logic circuit design, communication circuit design, neural networks, and interfacing I/O circuits. Particularly, he applies most of his research results on biomedical, memories, consumer electronics, and wireless communication applications, such as implantable ASIC/SOC, DVB-T/H and NTSC TV circuits, low power memory, high speed digital logic, etc. He has won the Outstanding Youth Engineer Award of Chinese Engineer Association in 1999, and NSC Research Award from 1994 to 1999. In 2000, he co-funded Asuka Semiconductor Inc., which is an IC design house located in renowned Hsinchu Scientific Park, Taiwan, and became Executive Secretary in 2005. In 2005, he was awarded with Best Inventor Award in National Sun Yat-Sen University, Taiwan. In 2006, he won "Distinguished Engineering Professor" Award of Chinese Institute of Engineers and "Distinguished Engineer" Award of Chinese Institute of Electrical Engineering in the same year. He also won Distinguished Electrical Engineering Professor Award of Chinese Institute of Electrical Engineers in 2007. In 2008, he won Outstanding Paper Award of 2008 IEEE Inter. Conf. of Consumer Electronics. In 2009, he again won Best Inventor Award.



**Chia-Hao Hsu** was born in Taiwan in 1981. He received the B.S. and M.S. degree in electronic engineering from Southern Taiwan University in 2005 and 2007, respectively. He is currently working toward the Ph.D. in the Department of Electrical Engineering National Sun Yat-Sen University, Kaohsiung, Taiwan, R.O.C. His recent research interests include VLSI design and Mixed Signal integrated Circuit Design.

# J Sign Process Syst



**Chia-Chuan Lee** was born in Taiwan in 1983. He received the B.S. degree in electrical engineering from National University of Kaohsiung, Taipei, Taiwan, R.O.C., in 2005, and the M.S. degree in institute of communications engineering from National Sun Yat-Sen University, Kaohsiung, Taiwan, R.O.C., His research interests include VLSI design.



**Jian-Ming Huang** was born in Taiwan in 1980. He received the B.S. degree, the M.S. and Ph.D. degree in Department of electrical engineering in National Sun Yat-Sen University, Taipei, Taiwan, R.O.C., in 2002, 2004 and 2007, respectively. His recent research interests include VLSI design and interfacing I/O circuits.