# 2.5 GHz Data Rate 2×VDD Digital Output Buffer Design Realized by 16-nm FinFET CMOS

Chua-Chin Wang<sup>†</sup>, *Senior Member, IEEE/SPIE*, and Shao-Wei Lu Department of Electrical Engineering National Sun Yat-Sen University Kaohsiung, Taiwan 80424 Email: ccwang@ee.nsysu.edu.tw

Abstract—A 2×VDD output buffer equipped with SR (slew rate) self-adjustment mechanism driven by a PVT (process, voltage, temperature) detector is proposed in this investigation. Notably, the proposed buffer design is realized by 16-nm FinFET CMOS technology, where specical design constraints required by FinFET must be taken into consideration. In other words, design trade-off will be discussed and highlight. To enhance the output SR, awlays-on driving transistors in Output Stage must be realized with low Vth devices to boost the output current. For FinFET devices, The gate drives of these driving transistors must be stablized to prevent any possible noise interference. Nonoverlapping signaling control is directly realized in transistor level instead of conventional gate level designs such that the the speed is fastened. According to the all-PVT-corner simulations, the worst data rate is 2.5/2.5 GHz with 20 pF loading when the supply voltage is 0.8/1.6 V, respectively. The  $\Delta$ SR improvement is at least 10%, when the proposed SR self-adjustment mechanism is activated.

**Keywords**— FinFET, output buffer, PVT detection, mixed-voltage tolerant, slew rate self-adjustment

## I. INTRODUCTION

Ever since the first report for FinFET in 1999, this novel technology has been considered a major challenger of planar or bulk CMOS when the technology evolved into < 20 nm. The main principle behind FinFET is a thin body, around 10 nm or less, such that the gate capacitance is closer to the channel. Thus, there is no leakage path close the gate. FinFET devices attains numerous advantages over planar (bulk) CMOS, including higher drive current for a given transistor aspect, higher speed, lower leakage, lower power consumption, no random dopant fluctuation, hence better mobility and scaling feature for technology node better than 20 nm. However, many end products or systems still need chips fabricated using legacy processes using higher voltage supplies, e.g., 1.8 V or 3.3 V. Therefore, mixed-voltage I/O buffer for FinFET devices is considered as a solution to carry out data exchange with chips fabricated by different processes [1], [2], [3]. Otherwise, area-consuming and power-hungry voltage level translaters are needed.

Besides harsh design constraints of FinFETs, the slew rate is another major issue for digital transmission between FinFET and legacy technologies. The SR variation to violate interfacing standars is mainly caused by various PVT senarios aside from the significant leakage issue in advanced technologies. Prior researchers reported many

<sup>†</sup>C.-C. Wang, and S.-W. Lu are with Department of Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan 80424.



Fig. 1. Proposed 2×VDD digital output buffer using FinFET.

different PVT detection methods for mixed-voltage I/O buffers to adjust the SR correspondingly, e.g., [4], [5]. The major problem is that most of these reports detected only 3 process corners, namely TT, FF, and SS. Besides, Many of these reported methods may cause long settling time and missing code resulting poor SR since these methods need multiple clock cycles to determine what the current scenario is [5]. As for digital-based process detectors, they were reported to provide a robust solution for the detection and fasten data rate with the penalty of power and area [6]. On top of these problems, none of the prior solutions were focused on the mixed-voltage digital buffers for FinFET CMOS.

# II. 2×VDD OUTPUT BUFFER USING FINFET Devices

The proposed digital buffer mainly comprises Floating N-well circuit, Level Shifter (LS), UP and DOWN Gate Drivers, PVT Detector, External VDD Detector, Timing Shifters, Digital Coder, and Output Stage comprising driving transistors, as shown in Fig. 1. CLK is the system clock, while  $V_{PAD}$  denotes output port loaded with off-chip discretes. Dout is the digital signal given by the internal logic circuit or core to be transmitted outward. The operation of the proposed design is briefed as follows.

- PVT detector is coupled with CLK to generate 4 digital signals, namely PS, PF, NS, and NF. Notably, 4 temperature-insensitive reference biases, i.e., V<sub>ref1</sub>, V<sub>ref2</sub>, V<sub>ref3</sub>, V<sub>ref4</sub>, are provided by a conventioal bandgap bias geneartor (not shown).
- 2). External VDD Detector decides what voltage level of the external device is so as to generate the VD to following Level Shifters (LS1, and LS2).
- 3). LS1 generates 2 signals, DH and DL, to trigger Timing Shifter 1 and 2, respectively. Two corre-

<sup>978-1-7281-2062-1/19/\$31.00 © 2019</sup> IEEE



Fig. 2. Gate drive of  $MP_2$  (a) without; (b) with  $C_{VD}$ 

sponding timing signals, D1 and D2, are then delivered to Digital Coder to carry out the generationg of timing control signals for Output Stage.

 Digital Coder is the brain of the entire design, which is digital encoder to realize a function table between inputs, including PS, PF, NS, NF, OEH, OE, D1, D2, and outputs, including V<sub>gp1</sub>, V<sub>gp2</sub>, V<sub>gp3</sub>, V<sub>gn1</sub>, V<sub>gn2</sub>, V<sub>gn3</sub>.

Apparently, many blocks in Fig. 1 can be realized by existing circuits, e.g., UP and DOWN Gate Drivers, Digital Coder, and Floating N-well circuit, such that they will not be addressed due to the page limit required by the conference. By contrast, the details of driving transtors, Timing Shifter, and PVT detectors, will be disclosed.

## A. Selection of driving transistors

Referring to Fig. 1 again, a stacked MOS string is composed of  $MP_{1a}$ ,  $MP_2$ ,  $MN_2$ , and  $MN_{1a}$  to spread the voltage overstress over the transistors on the same path when the external VDD is over the internal VDD, namely  $2 \times VDD$ .  $MP_{1b}$ ,  $MP_{1c}$  are driving transistors to be turned on if necessary. That is, certain PVT corners are detected such that we need to increase the driving current as well as the rising edge's SR. By contrast,  $MN_{1b}$ ,  $MN_{1c}$  are those corresponding driving transistors for sinking more current to increase the falling edge's SR.

As mentioned earlier, FinFET device is physically different from planar devices. Particularly, the current is proportional to the number of FINs instead of arbitray channel width. Thus, it is found out that those devices in the stacked structure are better always on to prevent from severe fluctuation of output current magnitude. We propose to use FinFET with low Vth for those devices in this always-on path. In contrast, those auxiliary transistors are realized with high Vth devices to reduce the possible leakage when they are not activated.

Another long ignored issue is the stability of gate drives of those awlays-on devices.  $MN_2$  is kept on by Internal VDD, which is assumed to be highly stable.  $MP_{1a}$  and  $MN_{1a}$  are driven by UP Gate Drivers, respectively, which depends on the outcome of PVT detections. The only one left is  $MP_2$ . The gate drive of this transistor depends on VD, which is the output of External VDD Detector. It highly depends on the stability of this sub-circuit. Thus, we tend to add a capacitor,  $C_{VD}$  to reject the noise coupled from External VDD Detector. With reference to Fig. 2 (a) and (b), the stability of  $MP_2$  gate drive is significantly enhanced.

# B. Timing Shifter

The reliability and robustness of the proposed design not only reply on the timing generation of those gate drives for PMOS and NMOS driving transistors, the delay on the cirtical path is also severely affected by the timing control. Since the external VDD could be either the same of internal VDD or 2 times higher, two Timing Shiferts are needed to generate nonoverlapping signals. One is for the voltage range from GND to internal VDD, while the other one is for VD to External VDD, where VD is the reference voltage genetared by External VDD Detector. Referring to Fig. 3, it is the mentioned Timing Shifters. The feature of this circuit is that it is fully designed in transistor level, not traditional gate level design. Fig. 4 demonstrates the delay reduction contributed by the proposed design. 44 ps delay reduction between the top stripe (transistor level) and the bottom strip (gate level) is found, which is almost 25% faster.



Fig. 3. Schematic of Timing Shifter



Fig. 4. Delay reduction by noneoverlapping signals

#### **III. SIMULATION AND VERIFICATION**

The proposed digital buffer is realized by TSMC 16 nm CMOS LOGIC Fin FET Compact (Shrink) LL ELK Cu 1P13M process. Fig. 5 shows the layout of an NMOS with multiple FINs. Referring to Fig. 6 (a) and (b), where the former is all-PVT-corner simulation outcome without PVT detection at 0.8 V power supply, where the SR is not adjusted at all. By contrast, the latter is the SR selfadjusted given that PVT detection is activated. As for the scenarios when external VDD = 1.6 V, SRs without and with PVT detection are demonstrated in Fig. 7 (a) and (b), respectively. The input load = 1 pF and the output load = 20 pF are used in all simulations. The all-PVT-corner simulations cover all the scenarios at the 75 corners composed of 5 process corners, 5 temperatures in the range of [0, 100]°C, and 3 supply voltages from VDD $\times$ 0.9 to VDD $\times$ 1.1. The increase of the slew rate for External VDD = 0.8/1.6V is 18.0%(rising)/23.3\%(falling)



Fig. 5. Layout view of NMOS with multiple FINs



Fig. 6. Simulations given  $VDD_{ext}$ =0.8V (a) without ; (b) with PVT detection

and 10.0%(rising)/15.8%(falling), respectively, with and without the proposed PVT detection and self-adjustment are estimated. Regarding the maximum data rate, it is found to be 4.5/5.8 GHz given VDD=0.8/1.6V, respectively, with the activated proposed PVT detection and SR auto-adjustment. The slew rate is 18.0/19.0 V/ns given VDD<sub>ext</sub>=0.8/1.6 V (worst case) and data rate = 2.5 GHz. Both the data rate and the slew rate are highly above the requirements of DDR4 specifications.

Table I summarizes the performance comparison with several existing works. Apparently, the proposed design provides an over 2.5 GHz all-corner-detected solution for  $2 \times VDD$  data transmission with the best SR.

TABLE I PERFORMANCE COMPARISON OF OUTPUT BUFFERS

|                        | [7]<br>ESSCIRC | [8]<br>ICICDT | [9]<br>TCAS-2 | [10]<br>APCCAS | This      |
|------------------------|----------------|---------------|---------------|----------------|-----------|
|                        | 2013           | 2016          | 2017          | 2018           | work      |
| CMOS (nm)              | 28             | 28            | 40            | 40             | 16        |
| Verification           | meas.          | simu.         | meas.         | simu.          | simu.     |
| VDD <sub>int</sub> (V) | 1.8            | 1.05          | 0.9           | 0.9            | 0.8       |
| $VDD_{ext}(V)$         | 3.3-1.8        | 1.8/1.05      | 1.8/0.9       | 1.8/0.9        | 1.6/0.8   |
| Data (GHz)             | 0.2            | 0.8           | 0.5           | 1.0            | 2.5       |
| SR (V/ns)              | N/A            | 3.9-4.9       | 1.54          | 6.0-6.5        | 18.0-19.1 |
| Loading (pF)           | N/A            | 20            | 20            | 20             | 20        |
| Power (mW)             | 0.09           | N/A           | 27            | 34.8           | 28        |
| @GHz                   | @ static       |               | 0.5           | 0.5            | 0.5       |

## **IV. CONCLUSION**

Aside from PVT detection, the proposed  $2 \times VDD$ buffer featured with stabilizing gate drives of the driving transistors and transistor-level nonoverlapping signaling control is demonstrated. The SR improvement is estimated to be at least over 10% regardless in either voltage mode of data transmission mode. The most important of all is that this investigation demonstrates the wolrd first FinFET



Fig. 7. Simulations given  $\mbox{VDD}_{ext}\mbox{=}1.6\mbox{V}$  (a) without ; (b) with PVT detection

mixed-voltage output buffer design to achieve 2.5 GHz data rate.

### ACKNOWLEDGMENT

This proposed design was partially supported by Ministry of Science and Technology (MOST), Taiwan, under grant MOST 108-2218-E-110-002- and 107-2218-E-110-016-. The authors would like to express the appreciation to TSRI (Taiwan Semicoductor Research Intitute) in NARL (Nation Applied Research Laboratories), Taiwan, for the assistance of EDA tool support.

#### References

- S. Jain, T. Kukal, and J. Ahuja, Using DSP techniques to model analog IO buffers for serial links, in *International Conference on Signal Processing and Communication (ICSC)*, pp. 291-295, March 2015.
- [2] H. A. Ravi, M. Goel, and P. Bhilawadi. Circuit to reduce gate induced drain leakage in CMOS output buffers, in 2014 22nd International Conference on Very Large Scale Integration (VLSI-SoC), pp. 1-5, Oct. 2014.
- [3] H. Zhu, and V. Kursun, Novel low-leakage and high-speed triplethreshold-voltage buffers with skewed inputs and outputs, *IEEE Transactions on Circuits and Systems I: Regular Papers (TCAS-I)*, pp. 2013-2021, Mar. 2014.
- [4] Y. Lin, X. Zou, Z. Zheng, W. Huo, X. Chen, and W. Kang, "High-speed, low switching noise and load adaptive output buffer," in *International Symposium on Integrated Circuits 2009, (ISCI 2009)*, pp. 280-282, Dec. 2009.
- [5] T. Qian, L. Chen, X. Li, H. Sun, and J. Ni, 1.25Gbps programmable FPGA I/O Buffer with multi-standard support, in *Proc. 2018 IEEE 3rd International Conference on Integrated Circuits and Microsystems (ICICM)*, pp.362-365, Nov. 2018.
- [6] C.-C. Wang, C.-L. Chen, H.-Y. Tseng, H.-H. Hou, and C.-Y. Juan, "A 800 Mbps and 12.37 ps jitter bidirectional mixed-voltage I/O buffer with dual-path gate-tracking circuit," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 60, no. 1, pp. 116-124, Jan. 2013.
- [7] V. Kumar, and M. Rizvi, Power sequence free 400Mbps 90μW 6000μm<sup>2</sup> 1.8V3.3V stress tolerant I/O buffer in 28nm CMOS, in *Proc. European Solid-State Circuits Conference (ESSCIRC)*, pp. 37-40, Oct 2013.
- [8] T.-Y. Tsai, Y.-Y. Chou, and C.-C. Wang, A method of leakage reduction and slew-rate adjustment in 2ŒVDD output buffer for 28 nm CMOS technology and above, in *Proc. IEEE International Conference on IC Design and Technology (ICICDT)*, pp. 1-4, Aug. 2016.
- [9] C.-C. Wang, Z.-Y. Hou and K.-W. Ruan, 2 Œ VDD 40-nm CMOS output buffer with slew rate self-adjustment using leakage compensation, *IEEE Transactions on Circuits and Systems II: Express Briefs*, vol. 64, no. 7, pp. 812-816, July 2017.
- [10] C.-C. Wang, Z.-Y. Hou, S.-W. Huang, 40-nm 2ŒVDD digital output buffer design with DDR4-compliant slew rate, in *Proc. 2018 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS)*, pp. 279-282, Oct. 2018.