

Contents lists available at [ScienceDirect](https://www.elsevier.com/locate/vlsi)

Integration, the VLSI Journal



journal homepage: [www.elsevier.com/locate/vlsi](https://www.elsevier.com/locate/vlsi)

# A 15.13 mW 3.2 GHz 8-bit carry look-ahead adder using single-phase all-N-transistor logic

Chua-Chin Wang [∗](#page-0-0) , L S S Pavan Kumar Chodisetti, Durga Srikanth Kamarajugadda, Oliver Lexter July Alvarez Jose, Pradyumna Vellanki

*Department of Electrical Engineering, National Sun Yat-sen University, No. 70 Lien-hai Road, Gushan District, Kaohsiung City, 80424, Taiwan*



## **1. Introduction**

The increasing demand for battery-powered portable electronic systems, such as mobile phones, tablets, laptops, etc., raises concerns about speed and power usage [[1](#page-7-0)]. Adders, as one of mentioned application essential building components, play a critical role in arithmetic circuit efficiency. Since addition is the most used operation in arithmetic circuits, research for effective adder architectures has always been a hot topic [\[2\]](#page-7-1). Static and dynamic logics are the two main techniques in adder circuit design. The integrated circuit architecture of a static adder is more complex and requires a significant amount of power and area. Dynamic CMOS gates are considered to be more effective due to the smaller parasitic capacitance and absence of output glitching [[3](#page-7-2)]. The increase in power dissipation during higher switching activity is the primary issue of dynamic adders.

The overall power consumption of digital circuits is mainly divided into two types: dynamic and static. Dynamic power dissipation occurs when the device is in active mode, whereas leakage dominates static power consumption. All-N Transistors (ANT) logic is a dynamic circuit that consists of stacked series of NMOS (N-block) for logic opera-tions [[4](#page-7-3)]. The NMOS in the N-block requires lower  $V_{th}$  compared to PMOS devices, resulting in reduced power consumption and a smaller area. However, as the number of stacked NMOS devices in the N-block increases, prior ANT circuit's operation speed decrease with higher delay [[5](#page-7-4)]. The charge and discharge in another ANL circuit are easily affected by the large value of gate capacitance, which leads to glitch problems [\[6\]](#page-7-5).

This research presents a single-phase ANT logic architecture that obtains a low PDP (power delay product) solution. The topology does not have an internal loop that primarily affects prior ANT power consumption and delay. Its functionality is justified through a prototype fabricated using 40-nm CMOS technology.

The prior ANT logic design is briefly described in Section [2.1](#page-0-1). The proposed Single-Phase ANT logic is addressed in Section [2.2.](#page-1-0) Singlephase ANT logic-based 8-bit CLA is thoroughly described in Section [2.3](#page-1-1). The measurement results are demonstrated in Section [3](#page-3-0) and conclusion is presented in Section [4](#page-4-0).

## **2. Single-phase ANT logic**

### *2.1. Prior ANT logic design*

<span id="page-0-1"></span>The prior ANT topology has an internal feedback loop, as illustrated in [Fig.](#page-1-2) [1.](#page-1-2) Prior ANT logic enters the evaluation state when the clock input is "1", producing a logical output based on the N-block  $[4,7]$  $[4,7]$  $[4,7]$  $[4,7]$  $[4,7]$ . When the input clk = "0", ANT logic enters the precharge state, generating the same output as the previous state. The following issues that may affect the prior ANT logic functionality are listed.

- 1. During the logic evaluation, the loop formed by  $MN_{103}$  and  $MP<sub>103</sub>$  may cause hysteresis and additional delay.
- 2. The big loading of the clk signal that drives three transistors in a single ANT may cause hold and setup time hazards.
- 3. Lastly, a total of 7 transistors in addition to the N-block consumes a large area.

<span id="page-0-0"></span>Corresponding author. *E-mail address:* [ccwang@ee.nsysu.edu.tw](mailto:ccwang@ee.nsysu.edu.tw) (C.-C. Wang).

<https://doi.org/10.1016/j.vlsi.2024.102234>

Available online 3 July 2024 Received 2 February 2024; Received in revised form 14 May 2024; Accepted 17 June 2024

0167-9260/© 2024 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.



**Fig. 1.** Prior ANT Logic.

#### <span id="page-1-2"></span>**Table 1**

<span id="page-1-3"></span>

To minimize hysteresis and improve the efficiency of this previous ANT design, the internal loop shall be removed. This will also improve circuit evaluation speed and reduce the total number of transistors in the auxiliary circuit. In addition, decreasing the voltage across  $V_A$ and switching off the output block of the ANT will reduce its power consumption even more.

#### *2.2. Single-phase ANT logic*

<span id="page-1-0"></span>[Fig.](#page-2-0) [2](#page-2-0)(a) shows the topology of the proposed single-phase ANT. Unlike the prior ANT, the absence of the internal loop will fasten the discharge and charge activity of the inverter formed by  $MP<sub>202</sub>$  and  $MN_{202}$ . Furthermore, an RC delay produced by  $MN_{201}$  will reduce potential errors at the gate drives of  $MP<sub>202</sub>$  and  $M<sub>202</sub>$ . The voltage across node  $V_A$  achieves  $V_{DD}-V_{th}$ , which provides a small voltage swing to the inverter's input. It will lessen the power dissipation of the proposed ANT.

The output waveform for the proposed ANT's operation is shown in [Fig.](#page-2-0) [2\(](#page-2-0)b). The operation of the ANT is divided into two stages; precharge and evaluation.

- 1. **Precharge:** When  $clk = 0$ , the ANT goes to the precharge. In this stage,  $MP<sub>201</sub>$  is on,  $MN<sub>203</sub>$  is off, while  $MN<sub>201</sub>$  will always be on due to its gate drive equal to  $V_{DD}$ . MN<sub>202</sub> is on and MP<sub>202</sub> is off, since the voltage across node  $V_A$  is  $V_{DD}-V_{th}$ .  $V_Y$  produced an output based on the previous state.
- 2. **Evaluation:** The ANT logic enters the evaluation stage when clk  $= 1$ . The evaluation stage has four conditions depending on the output  $V_y$ 's previous state and the "on" and "off" state of the N-block. All the conditions are explained as follows.
	- **Condition 1:** Node  $V_A$  is discharged via N-block to gnd, when N-block is on. In this condition,  $MN_{202}$  is off, and  $MP<sub>202</sub>$  is on.  $V<sub>Y</sub>$  is then charged to  $V<sub>DD</sub>$ .
	- **Condition 2:** When N-block is off, and the voltage across node  $V_A$  is  $V_{DD}-V_{th}$ , MP<sub>202</sub>, and MN<sub>202</sub> are off and on, respectively, to pull  $V_Y$  to gnd. The voltage at node  $V_A$  is discharged to gnd through  $MN<sub>203</sub>$  and N-block, once if the N-block is turned on. At this point,  $MN<sub>202</sub>$  will be switched off, and MP<sub>202</sub> will be on, pulling  $V_Y$  up to  $V_{DD}$ .
	- **Condition 3:** This condition occurs when the voltage at node  $V_A$  is  $V_{DD} - V_{th}$  and N-block is off. Due to the state of  $MP<sub>202</sub>$  and  $MN<sub>202</sub>$ , which are off and on, respectively, output  $V_Y$  will be discharged from  $V_{DD}$  to gnd.
	- **Condition 4:** Node  $V_A$  turns on  $MN_{202}$  and turns off  $MP_{202}$ when N-block is off, pulling output  $V_Y$  to the gnd.

[Table](#page-1-3) [1](#page-1-3) summarizes the four conditions for single-phase ANT logic in the evaluation phase.

### *2.3. Single-phase ANT logic-based 8-bit CLA*

<span id="page-1-1"></span>[Fig.](#page-2-1) [3](#page-2-1) illustrates the 8-bit CLA block design based on the proposed single-phase ANT logic. Inputs A[0-7] and B[0-7] are coupled to 8-bit  $G/P$  generator. A 1-bit generation  $(G_i)$  and propagation  $(P_i)$  circuit are shown in [Figs.](#page-2-2) [4](#page-2-2) and [5](#page-2-3), respectively. Eq. ([1](#page-1-4)) defines the equation for  $G_i$ and  $P_i$  [\[8,](#page-7-7)[9\]](#page-7-8).

<span id="page-1-4"></span>
$$
P_i = A_i \oplus B_i, \qquad G_i = A_i B_i \tag{1}
$$

[Fig.](#page-3-1) [6](#page-3-1) shows the carry generation circuit using single-phase ANT logic in which input is coupled to the output of the generation circuit. Its output signal  $(C_i)$  is characterized by Eq. [\(2\)](#page-1-5) [[8](#page-7-7),[9](#page-7-8)].

<span id="page-1-5"></span>
$$
C_i = G_i + P_i G_{i-1} + \dots + P_i P_{i-1} \dots P_0 C_{in}
$$
 (2)

[Fig.](#page-3-2) [7](#page-3-2) illustrates the sum generation circuit, whose output signal  $(S_i)$  is represented by Eq. [\(3\)](#page-1-6) [[8](#page-7-7)[,9\]](#page-7-8). The generation and propagation circuits' output is connected to the sum generation's input. The output of the sum generation circuit is connected to 3 stages of tapered buffers, as shown in [Fig.](#page-2-1) [3.](#page-2-1)

<span id="page-1-6"></span>
$$
S_i = P_i \oplus C_{i-1} \tag{3}
$$

The demand for an increasing number of devices in an IC for higher operating speeds and functionality leads to circuit optimization issue, such as propagation delay, power consumption, and area [[10\]](#page-7-9). These factors are mainly affected by the transistor size. Increasing the size of the transistor improves the propagation delay but also results in additional power consumption due to the increase in the gate capacitance. The transistor sizes in the proposed single-phase ANT logic are properly



**Fig. 2.** Single-phase ANT logic (a) proposed schematic and; (b) output waveforms.

<span id="page-2-0"></span>



<span id="page-2-1"></span>

<span id="page-2-2"></span>**Fig. 4.** 1-bit generation circuit.



<span id="page-2-3"></span>**Fig. 5.** 1-bit propagation circuit.



**Fig. 6.** 8-bit carry generation circuit.

<span id="page-3-1"></span>

**Fig. 7.** 1-bit sum generation circuit.

<span id="page-3-2"></span>tuned by equalizing the rise time and the fall time of the output signals. Referring to [Fig.](#page-2-0) [2,](#page-2-0)  $MP<sub>201</sub>$  and  $MP<sub>202</sub>$  must have a higher value of resistance to match with  $MN_{201}$  and  $MN_{202}$ , with a 2:1 aspect ratio to achieve an equal magnitude of currents [[11\]](#page-7-10).

Referring to [Fig.](#page-2-2) [4,](#page-2-2) the transistors in the N-block are connected in series. The width of  $MN_{305}$  must be twice as large as  $MN_{304}$ . If the transistor in the N-block is connected in parallel, as shown in [Fig.](#page-2-3) [5](#page-2-3), the width of the transistors must be equal (namely,  $MN_{404} = MN_{406}$ ,  $MN_{405}$  =  $MN_{407}$ ). [Table](#page-3-3) [2](#page-3-3) summarizes the overall transistor sizes of the proposed ANT.

#### **3. Realization and measurement**

<span id="page-3-0"></span>The 8-bit CLA based on the single-phase ANT logic is implemented using TSMC 40-nm CMOS process. Shown in [Fig.](#page-4-1) [8](#page-4-1) is the die photo of

<span id="page-3-3"></span>**Table 2** AND/XOR ANT Logic block sizing.

| Fig. 4, Fig. 5 transistors | W/L ratio  |
|----------------------------|------------|
| $MP_{301}$ , $MP_{401}$    | 75/2, 75/2 |
| $MP_{302}$ , $MP_{402}$    | 75/2, 75/2 |
| $MN_{301}$ , $MN_{401}$    | 25/1, 25/1 |
| $MN_{302}$ , $MN_{402}$    | 75/4, 75/4 |
| $MN_{303}$ , $MN_{403}$    | 75/2, 75/2 |
| $MN_{304}$ , $MN_{305}$    | 5/1, 10/1  |
| $MN_{404}$ , $MN_{405}$    | 5/1, 10/1  |
| $MN_{406}$ , $MN_{407}$    | 5/1, 10/1  |



<span id="page-3-4"></span>**Table 3**





the 8-bit CLA with 797.565 × 804.395 μm<sup>2</sup> and 154.776 × 179.165 μm<sup>2</sup> chip area and core area, respectively.

[Fig.](#page-4-2) [9](#page-4-2) shows the measurement set-up for this research. The clk, A0 – A7, and B0 – B7 inputs of the CLA were generated by the Agilent 82 150 pattern generator. The 0.9 V supply is provided by Agilent N6761A power supply. Lastly, the output waveform of our chip is observed using high-frequency Keysight DSAV134 and MXR254A oscilloscopes.

[Tables](#page-3-4) [3](#page-3-4) (Test 1) and [4](#page-4-3) (Test 2) tabulates the input pattern and equivalent outputs of the 8-bit CLA at  $clk = 3.2$  GHz. [Fig.](#page-5-0) [10](#page-5-0)(a) and (b) show the corresponding measurement waveforms for Test I, and [Fig.](#page-5-1) [11](#page-5-1)(a) and (b) for Test II. Both measurement results produced a logic level the same as the equivalent output from [Tables](#page-3-4) [3](#page-3-4) and [4](#page-4-3), which demonstrate the performance and functionality of the proposed single-phase ANT-based CLA.



**Fig. 8.** Die photo and layout of the proposed single-phase ANT-based CLA.

<span id="page-4-1"></span>

**Fig. 9.** Measurement set-up (TSRI Tainan Lab.).

<span id="page-4-3"></span><span id="page-4-2"></span>



[Fig.](#page-6-0) [12](#page-6-0) shows the histogram and eye diagram measurement of the proposed single-phase ANT logic-based CLA. The eye diagram and histogram have a height and width of 213 mV and 8.76 ns, respectively.

[Table](#page-5-2) [5](#page-5-2) summarizes recent studies on CLA adders. Our 8- bit CLA operates at a clock frequency of 3.2 GHz (maximum) at the highest value of  $C_{load} = 20$  pF. Notably, this study provides the lowest measured normalized PDP, indicating less energy lost per switching events than all the prior works as shown in [Fig.](#page-6-1) [13.](#page-6-1)

### **4. Conclusion**

<span id="page-4-0"></span>This study is focused on single-phase ANT logic-based 8-bit CLA implemented in TSMC 40-nm CMOS technology. The single-phase ANT design does not have an internal loop reducing its power consumption



**Fig. 10.** Test 1 (a) S0 ∼ S3 and (b) S4 ∼ S7 at 3.2 GHz operating frequency.

<span id="page-5-0"></span>

**Fig. 11.** Test 2 (a) S0 ∼ S3 and (b) S4 ∼ S7 at 3.2 GHz operating frequency.

<span id="page-5-2"></span><span id="page-5-1"></span>**Table 5**





Note:

<span id="page-5-3"></span><sup>a</sup> Normalized area =  $\frac{\text{Core Area (mm}^2)}{\text{Technology (mm)} \times \text{Technology (nm)} \times 10^6}$ Core Area (mm<sup>2</sup>)<br>
(nm) × Technology (nm) × 10<sup>6</sup>

<span id="page-5-4"></span>b Nor. power =  $\frac{P (mW)}{\text{Freq } (MHz) \times C_{load}(pF) \times V_{dd}^2(V)}$ 



Fig. 12. 8-bit CLA jitter and histogram measurement.

<span id="page-6-0"></span>

**Fig. 13.** CLA designs technology roadmap.

<span id="page-6-1"></span>and improving the performance of the prior ANT. It is one of the two CLA designs that is physically fabricated at the highest  $C_{load} = 20$  pF. It has a maximum operating frequency of 3.2 GHz. Notably, this work has the lowest PDP, which makes it very energy efficient.

#### **Data availability**

Data will be made available on request.

## **Acknowledgements**

**Declaration of competing interest**

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

This work has been partially funded by the National Science and Technology Council, Taiwan, subject to grants NSTC 112-2221-E-110- 063-MY3 and NSTC 111-2623-E-110-002-. The authors would like to articulate their sincere admiration to Taiwan Semiconductor Research Institute (TSRI) for the support they provided in chip fabrication.

#### *C.-C. Wang et al.*

#### **References**

- <span id="page-7-0"></span>[1] [A.K. Yadav, B.P. Shrivatava, A.K. Dadoriya, Low power high speed 1-bit full](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb1) [adder circuit design at 45 nm CMOS technology, in: Proc. 2017 Int. Conf. Recent](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb1) [Innovations Signal Process. Embedded Syst., RISE, 2017, pp. 427–432.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb1)
- <span id="page-7-1"></span>[2] [A. Raghunandan, D.R. Shilpa, Design of high-speed hybrid full adders using](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb2) [FinFET 18 nm technology, in: Proc. 2019 4th Int. Conf. Recent Trends Electron.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb2) [Inf. Commun. Technol., RTEICT, 2019, pp. 410–415.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb2)
- <span id="page-7-2"></span>[3] [E. Macii, M. Poncino, Power consumption of static and dynamic CMOS circuits:](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb3) [A comparative study, in: Proc. 2nd Int. Conf. ASIC, 1996, pp. 425–427.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb3)
- <span id="page-7-3"></span>[4] [T.-J. Lee, W.-S. Yang, C.-C. Wang, A 20 GHz 8-bit all-n-transistor logic cla using](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb4) [16-nm FinFET technology, in: Proc. 2021 IEEE Asia Pacific Conf. Circuit Syst.,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb4) [APCCAS, 2021, pp. 33–36.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb4)
- <span id="page-7-4"></span>[5] [M. Afghahi, A robust single phase clocking for low power, high-speed VLSI](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb5) [applications, IEEE J. Solid-State Circuits 31 \(2\) \(1996\) 247–254.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb5)
- <span id="page-7-5"></span>[6] [M. Kargar, M.B. Ghaznavi-Ghoushchi, A high performance, race eliminated,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb6) [two phase nonoverlapping clocked all-n-logic for both strong and subthreshold](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb6) [designs, in: Proc. the 16th CSI Int. Symp. Comput. Architecture Digital Syst.,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb6) [CADS 2012, 2012, pp. 87–92.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb6)
- <span id="page-7-6"></span>[7] C.-C. Wang, O.L.J.A. Jose, W.-S. Yang, R.G.B. Sangalang, L.K.S. Tolentino, T.-J. Lee, A 16-nm FinFET 28.8-mw 800-MHz 8-bit all-n-transistor logic carry lookahead adder, Circuits Syst. Signal Process. (2022) [http://dx.doi.org/10.1007/](http://dx.doi.org/10.1007/s00034-022-02212-2) [s00034-022-02212-2.](http://dx.doi.org/10.1007/s00034-022-02212-2)
- <span id="page-7-7"></span>[8] [C.-C. Wang, K.-C. Tsai, VLSI design of a 1.0 GHz 0.6-](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb8)μm 8-bit CLA using PLA[styled all-N-transistor logic, in: Proc. 1998 IEEE Int. Symp. Circuits Syst., ISCAS,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb8) [Vol. 2, 1998, pp. 236–239.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb8)
- <span id="page-7-8"></span>[9] [D.S. Kamarajugadda, O.L.J.A. Jose, L.-J. Yang, B. Esakki, S. Sampath, C.-C.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb9) [Wang, A low-energy 8-bit CLA realized by single-phase ANT logic, in: Proc.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb9) [2022 International Conference on IC Design and Technology, ICICDT, Vietnam,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb9) [2022, pp. 28–31.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb9)
- <span id="page-7-9"></span>[10] [P. Gupta, S. Gourishetty, H. Mandadapu, Z. Abbas, Pvt variations aware robust](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb10) [transistor sizing for power-delay optimal CMOS digital circuit design, in: Proc.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb10) [2019 IEEE International Symposium on Circuits and Systems, ISCAS, Japan,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb10) [2019, pp. 1–5.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb10)
- <span id="page-7-10"></span>[11] [R.J. Baker, CMOS Circuit Design, Layout, and Simulation, Wiley- IEEE Press,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb11) [2010.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb11)
- <span id="page-7-11"></span>[12] [H. Naseri, S. Timarchi, Low-power and fast full adder by exploring new xor](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb12) [and xnor gates, IEEE Trans. Very Large Scale Integr. \(VLSI\) Syst. 26 \(8\) \(2018\)](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb12) [1481–1493.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb12)
- <span id="page-7-12"></span>[13] [W. Al-Akel, K. Abugharbieh, A. Hasan, H.W. Marar, A power efficient 500 MHz](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb13) [adder, in: Proc. 2019 SoutheastCon, 2019, pp. 1–6.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb13)
- <span id="page-7-13"></span>[14] [M. Hasan, M.J. Hossein, M. Hossain, H.U. Zaman, S. Islam, Design of a scalable](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb14) [low-power 1-bit hybrid full adder for fast computation, IEEE Trans. Circuits Syst.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb14) [II: Express Briefs 67 \(8\) \(2020\) 1464–1468.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb14)
- <span id="page-7-14"></span>[15] [S. Vidhyadharan, S.S. Dan, An efficient ultra-low-power and superior perfor](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb15)[mance design of ternary half adder using CNFET and gate-overlap TFET devices,](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb15) [IEEE Trans. Nanotechnol. 20 \(2021\) 365–376.](http://refhub.elsevier.com/S0167-9260(24)00098-1/sb15)