# An SRAM Design Using Dual Threshold Voltage Transistors and Low-Power Quenchers

Chua-Chin Wang, Po-Ming Lee, and Kuo-Long Chen

Abstract-Static random access memories (SRAM) are widely used in computer systems and many portable devices. In this paper, we propose an SRAM cell with dual threshold voltage transistors. Low threshold voltage transistors are mainly used in driving bitlines while high threshold voltage transistors are used in latching data voltages. The advantages of dual threshold voltage transistors can be used to reduce the access time and maintain data retention at the same time. Also, the unwanted oscillation of the output bitlines of memories caused by large currents in bitlines is reduced by adding two back-to-back quenchers. The proposed quenchers not only prevent oscillation, but also reduce the idle power consumption when the memory cells are not activated by wordline signals. Meanwhile, a large noise margin is provided such that the gain of the sense amplifier will not be reduced to avoid the oscillation. Hence, high-speed and low-power readout operations of the SRAMs are feasible.

Index Terms—CMOS, dual threshold voltage, quenchers, SRAM.

#### I. INTRODUCTION

S EMICONDUCTOR memories, particularly SRAMs, are widely used in electronic systems [1]–[3]. Many efforts have been made to improve the efficiency of the SRAM, e.g., Itoh et al. [4] have proposed an SRAM architecture using multi- $V_{\rm T}$  transistors. However, Itoh's results were obtained mainly from simulations rather than real chip measurement. Ohhata et al. [5] and Horiuchi et al. [6] have proposed various schemes to solve the bitline oscillation problem. Their works demand either special bipolar/SOI processes or capacitors within the circuit which will consume large area. Thanks to the advance of semiconductor process, e.g., Taiwan Semiconductor Manufacturing Company (TSMC) 0.25-µm one-poly five-metal (1P5M) CMOS process, dual threshold voltage transistors are available now. In this paper, a novel SRAM architecture using the dual threshold voltage  $(V_{\rm th})$  transistor is proposed. The low threshold voltage is called native  $V_{\rm th}$  $(V_{\rm th} = 0.21 \text{ V})$  and the high threshold voltage is called nominal  $V_{\rm th}$  ( $V_{\rm th}$  = 0.53 V) in this process. Low threshold voltage transistors are capable of supplying large current while high threshold voltage transistors are good in reducing leakage current. Hence, the former is a good bitline driver while the latter is an excellent data latch candidate. If low- $V_{\rm th}$  transistors

Manuscript received July 23, 2002; revised June 17, 2003. This work was supported in part by the National Science Council under Grants NSC 89-2218-E-110-014 and 89-2218-E-110-015, and in part by the Academic Foundation of Taiwan, and Ericsson Company, Ltd.

The authors are with the Department of Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan 80424, R.O.C. (e-mail: ccwang@ee. nsysu.edu.tw).

Digital Object Identifier 10.1109/JSSC.2003.817254

TABLE I Threshold Voltages of Nominal nMOS/pMOS and Native nMOS Transistors (Native pMOS is not Available in the 0.25- $\mu$ m CMOS Process)

|      | Nominal V <sub>th</sub>       | Native V <sub>th</sub>                    |
|------|-------------------------------|-------------------------------------------|
| PMOS | $V_{thNoP} = -0.53 \text{ V}$ | $V_{thNaP} = N/A$                         |
| NMOS | $V_{thNoN} = 0.53 \text{ V}$  | $\mathbf{V}_{thNaN} = 0.21 \; \mathbf{V}$ |

are used as bitline drivers and high- $V_{\rm th}$  transistors are the data latch components, not only can the access time be shortened, the data retention is also enhanced. Also, since the oscillation of the bitline (BL) and a complementary bitline (BL) might introduce unwanted power dissipation due to the large current supplied by low- $V_{\rm th}$  transistors, a possible wrong reading will be produced [9]. In this paper, we also introduce *quenchers* to subside the oscillation to keep the speed of readout operations. On top of quenching the oscillation, the power saving is also verified by HSPICE simulations regardless of MOS models, temperature variations, and input signal frequencies.

# II. DUAL- $V_{\rm th}$ SRAM

Conventional CMOS processes only provide transistors with single threshold voltage. However, the evolution of CMOS technology makes dual threshold voltage transistors currently available. In this paper, dual threshold voltage transistors provided by the 0.25- $\mu$ m 1P5M CMOS process are used to recreate the six-transistor (6-T) SRAM cell. According to our simulation results, the refined 6-T SRAM cell processes the advantages of speed and power efficiency. In the following, the basics characteristics of dual threshold transistors will be introduced as well as the refined SRAM cell.

# A. Current Analysis of Dual-V<sub>th</sub> Transistors

The drain current in the saturation region of a MOSFET transistor is

$$I_D = \frac{k_p}{2} \frac{W_{\text{eff}}}{L_{\text{eff}}} (V_{\text{GS}} - V_{\text{th}})^2 \tag{1}$$

where  $k_p$  is the process parameter and  $W_{\text{eff}}$  and  $L_{\text{eff}}$  are the effective width and length of the transistor, respectively. According to (1), a lower threshold voltage can produce a larger drain current. If we take  $(W_{\text{eff}})/(L_{\text{eff}})$  as a constant, then (1) can be derived as

$$I_D \propto (V_{\rm GS} - V_{\rm th})^2. \tag{2}$$

In this paper, the 0.25- $\mu$ m 1P5M CMOS process is adopted to realize dual threshold voltage transistors. The threshold voltages



Fig. 1. Schematic view of SRAM cell.

TABLE II COMPARISON BETWEEN HIGH AND LOW THRESHOLD VOLTAGE TRANSISTORS

| V <sub>th</sub>  | Characteristic                              | Advantage             |  |
|------------------|---------------------------------------------|-----------------------|--|
| Nominal (0.53 V) | low leakage current                         | data retention        |  |
| Native (0.21 V)  | high output current,<br>fast switching time | driving<br>capability |  |

of nominal (high) nMOS/pMOS and native (low) nMOS are tabulated in Table I.  $V_{\rm GS} = V_{\rm DD} = 2.5$  V, high threshold voltage  $V_{\rm th\,NoN} = 0.53$  V, and low threshold voltage  $V_{\rm th\,NaN} = 0.21$  V. High threshold voltage for pMOS is  $V_{\rm th\,NoP} = -0.53$  V. Thus, we can compute a ratio of  $I_{\rm DH}/I_{\rm DL}$  as

$$\frac{I_{\rm DH}}{I_{\rm DL}} = \frac{\frac{k_p}{2} \frac{W_{\rm eff}}{L_{\rm eff}} (V_{\rm GS} - V_{\rm th\,NoN})^2}{\frac{k_p}{2} \frac{W_{\rm eff}}{L_{\rm eff}} (V_{\rm GS} - V_{\rm th\,NaN})^2} \\
= \frac{(V_{\rm GS} - V_{\rm th\,NoN})^2}{(V_{\rm GS} - V_{\rm th\,NaN})^2} = \frac{(2.5 - 0.53)^2}{(2.5 - 0.21)^2} = \frac{3.8809}{5.2441} \quad (3)$$

where  $I_{\rm DH}$  and  $I_{\rm DL}$  are the drain currents of the high threshold voltage transistor and low threshold voltage transistor, respectively.

Hence, the current increasing rate is about 35.12%, which can be calculated as

Current Increase = 
$$\frac{I_{\rm DL} - I_{\rm DH}}{I_{\rm DH}}$$
  
=  $\frac{5.2441 - 3.8809}{3.8809}$   
 $\approx 35.12\%$ . (4)

With the decreasing of the transistor operating voltage, the threshold voltage is decreasing as well. The subthreshold current is computed as

$$I_{\rm DSUB} = \frac{W_{\rm eff}}{W_o} \cdot I_o \cdot 10^{(V_{\rm GS} - V_{\rm th})/S}$$
(5)

where  $W_o$  and  $I_o$  are the gate width and drain current, respectively. S is the subthreshold swing parameter, which can be calculated as

$$S \approx 2.3 V_T \left[ 1 + \frac{C_d}{C_{\text{ox}}} \right] \tag{6}$$

where  $V_T$  is thermal voltage and  $C_d$  is the junction capactance between source and drain. The leakage current can be obtained by replacing  $V_{\text{GS}}$  with 0, which is

$$I_{\text{leak}} = \frac{W_{\text{eff}}}{W_o} I_o 10^{-V_{\text{th}}/S}.$$
 (7)



Fig. 2. Simulation with different models, temperatures, and threshold voltages. (a) Nominal  $V_{\rm th}$ . (b) Native  $V_{\rm th}$ .

TABLE III CURRENT INCREASE FOR NATIVE  $V_{\rm th}$  VS. Nominal  $V_{\rm th}$ 

| Model         | Current Increase (%) |                    |                    |  |
|---------------|----------------------|--------------------|--------------------|--|
|               | $0^{o}C$             | $25^{o}\mathrm{C}$ | $75^{o}\mathrm{C}$ |  |
| TT            | 39.70                | 36.13              | 33.33              |  |
| $\mathbf{SS}$ | 41.10                | 37.82              | 33.33              |  |
| $\mathbf{SF}$ | 33.51                | 30.60              | 26.19              |  |
| $\mathbf{FS}$ | 45.41                | 42.93              | 38.46              |  |
| $\mathbf{FF}$ | 39.42                | 35.34              | 31.31              |  |

If  $W_o$  remains unchanged,  $I_{\rm DSUB}$  as well as  $I_{\rm leak}$  will be increased when  $I_o$  increases. Thus, the subthreshold current becomes a positive factor of driving wires [7]. In short, a transistor with low  $V_{\rm th}$  is more appropriate to drive wire rather than to store data.

According to the above discussion, we conclude the following.

- High threshold voltage (nominal  $V_{\rm th}$ ) transistors possess the advantage of low leakage current. Hence, they are more appropriate to **store data** in memory designs.
- Low threshold voltage (native  $V_{\rm th}$ ) transistors possess larger drain current. Therefore, it is more suitable to **drive** the bitlines.



Fig. 3. Conventional memory configuration.

By taking advantage of these two different threshold voltage transistors, a refined design of SRAM memory cell is proposed.

# B. Dual-V<sub>th</sub> SRAM Cell

A typical 6-T SRAM cell is shown in Fig. 1. N1 and N2 are, respectively, the bitlines (BL,  $\overline{BL}$ ) drivers which are controlled by the wordline (WL). If the threshold voltage of N1 and N2 is low, the switching time of N1 and N2 will be reduced, which will in turn shorten the access time of the SRAM cell. Hence, we use the native  $V_{\rm th}$  transistors to implement the driving transistors. It will produce a larger driving current than normal or high- $V_{\rm th}$ transistors. By contrast, transistors with high  $V_{\rm th}$  possess low leakage current and subthreshold current. Thus, they are very good to be cross coupled as a data latch as shown in Fig. 1. We, then, use nominal  $V_{\rm th}$  transistors such as P1, P2, N3, and N4 to keep valid data. The difference between high- $V_{\rm th}$  transistors and low- $V_{\rm th}$  transistors is summarized in Table II.

# C. Simulation

To verify the proposed cell, we perform a series of simulations given the temperature of 0 °C, 25 °C, and 75 °C. Different transistor models, such as TT, SS, SF, FS, and FF, are all simulated. The complete simulation results are shown in Fig. 2. As we expected, the native  $V_{\rm th}$  in the simulations provides more driving capability, i.e., current, than nominal  $V_{\rm th}$ . Besides, a current comparison of nominal  $V_{\rm th}$  with native  $V_{\rm th}$  is tabulated in Table III.

According to the simulation results in Fig. 2 and Table III, there is no doubt that the native  $V_{\rm th}$  transistors provide better driving current. They will provide up to 45.41% current increase in the best case, 26.19% in the worst case. Hence, using low threshold voltage driving transistors is proven to be feasible.

## **III.** QUENCHERS

In this section, we point out the reason causing the oscillation of the SRAM bitlines [8], as well as the resolution to squelch the oscillation.

## A. Oscillations on the Bit Lines

Referring to Fig. 3, a conventional current sense amplifier (SA) and the SRAM memory cells are shown. Basically, the



Fig. 4. Oscillation scenario.

datapath from a memory cell to the outputs consists of a current source enabled by the complement of a sense amplifier enable signal,  $\overline{\text{SAEN}}$ , a differential amplifier, an equalizer which is used to pre-equalize the bitlines, and a current sink which is also enabled by  $\overline{\text{SAEN}}$ . The oscillation of the readout operation is illustrated in Fig. 4. The oscillation will be significantly enlarged when the low- $V_{\text{th}}$  nMOSs are used as bitline drivers since they supply large currents. The scenario is summarized as follows.

- Stage1 Wordline (WL) is enabled to activate the memory cell. SAEN is also enabled as soon as WL is enabled. In the meantime, PCH/EQ is disabled. Hence, the voltages on the respective outputs of the bitlines are clearly either pulled up or pulled down.
- Stage2 WL is disabled such that the memory cell is deactivated. Owing to the high gain of the differential amplifier, the difference of the voltages of the bitlines will be enlarged. In the meantime, the SAEN is still kept enabled while PCH/EQ is disabled, which in turn causes the oscillation.



Fig. 5. Memory cells with quenchers.



Fig. 6. Diodes as quenchers.

Stage3  $\overline{\text{SAEN}}$  is switched to 0 after Stage 1 and 2. The entire datapath waits for the next valid WL = 1 and  $\overline{\text{SAEN}} = 1$ .

In the above simulation, Column Selector Y (as shown in Fig. 3) is always enabled, which implies that DL = BL,  $\overline{DL} = \overline{BL}$ . In short, the scenario of the oscillation of the voltages on the bitlines occurs when WL does not enable the memory cell and the SAEN is activated. Particularly, the oscillation becomes very serious if the gain of the sense amplifier is very large, which is originally intended to accelerate the readout. Not only might an error be produced by the unwanted oscillations, but unwanted power consumption also occurs.

# B. Quenchers

By a simple observation, the voltage phases of the signals on the respective bitlines are complementary when the bitlines are activated. We can simply create a unidirectional closed loop which shortcircuits the bitlines at this moment in order to cancel out the out-of-phase ripples of the voltages of the fed signals. Referring to Fig. 5, two back-to-back diodes are used to form such a unidirectional loop between the bitlines. The loop formed by the diode pair is capable of reducing the swing on the bitlines by shortcircuiting the two complementary signals as shown in



Fig. 7. NMOSs as quenchers.

Fig. 10. Thus, power consumption is reduced as well when the quenchers are used in the memory design.

Using the identical simulation conditions as those given in Figs. 4 and 6 shows a significant improvement on the squelch of the oscillation.

#### C. Noise Margin Improvement

Another advantage of the quenchers is the improvement of the noise margin, particularly if the  $V_{OH}$  is critical [9]. Referring to Fig. 3, the values of the bitlines may oscillate when the power supply  $V_{DD}$  is high and the gain of the amplifier is very large. Note that the gain is determined by the size of the transistors in the differential amplifier and those in the current sink. The sensing speed of the current SA increases as the gain grows. However, the output could be incorrectly sensed if the oscillation occurs and the gain is high. This possibility leads to a



Fig. 8. Quenchers = nMOS pass transistors.

X



Fig. 9. Quenchers = pMOS pass transistors.

TABLE IV Comparison of Power Consumption (Unit = mW)

| Sin           | nulati | on  | No Quenchers |       | + Quenchers |       |       | Reduction |       |
|---------------|--------|-----|--------------|-------|-------------|-------|-------|-----------|-------|
| Model         | °C     | VDD | avg          | min   | max         | avg   | min   | $\max$    | %     |
| TT            | 25     | 2.5 | 73.76        | 1.710 | 1149        | 61.06 | 1.710 | 304       | 17.22 |
| $\mathbf{SS}$ | 75     | 2.2 | 35.84        | 0.803 | 209         | 35.33 | 0.803 | 214       | 1.42  |

small noise margin,  $V_{\rm DD} - V_{\rm OH}$ . By contrast, the insertion of a quencher pair suppresses the oscillation such that the noise margin is increased without the hazard of incorrect sensing. Meanwhile, the gain of the current SA is preserved so as not to slow down the readout operation in any case.

#### D. Alternatives to Quenchers

Besides the diode, which is deemed a nonlinear element in a standard CMOS process, other alternatives can be used as the quenchers. The performance of these alternatives turns out to be not worse than that of the diode.

1) NMOS Pass Transistor: NMOSs with gate drive at full  $V_{\rm DD}$  are considered as another alternative. They are easily designed and integrated. Fig. 7 is an example of the quenchers

made by nMOSs. Fig. 8 is the simulation waveform given the same condition.

2) *PMOS Pass Transistor:* In dual respect, pMOSs with gate drive at GND are considered as the last alternative. They are also easily designed and integrated. Fig. 9 is the simulation waveform given the same condition.

#### E. Simulations and Analysis

By employing the same 0.25- $\mu$ m 1P5M CMOS process, we have simulated several corner conditions to attain the power performance. Note that the operating frequency of the WL is 200 MHz. Table IV shows the comparison of average, maximum, and minimum power dissipations given different simulation conditions.



Fig. 10. Current comparison.



Fig. 11. Die photo of the 4-kb SRAM Chip.

It is noted that the proposed quenchers indeed reduce power no matter what the condition is. On top of these simulation results, Fig. 10 also shows the current variations in the conventional design and the proposed quencher design.

#### **IV. IMPLEMENTATION & MEASUREMENT**

## A. Simulation

In order to verify the correctness of the refined SRAM architecture as well as the advantages of the proposed quencher design, we design and implement a 4-kb SRAM by using the 0.25- $\mu$ m process. The die photo of the SRAM chip is shown in Fig. 11. Complete post-layout simulations have been performed to ensure the performance of our design. Table V shows the post-layout simulation results of the chip. The maximum operating clock frequency is 200 MHz, while the minimum operating clock frequency is 166 MHz, according to the simulation results.

 TABLE
 V

 Performance Summary of Post-Layout Simulations.

| Model         | $^{o}C$ | VDD | access time | clock rate           |
|---------------|---------|-----|-------------|----------------------|
| TT            | 25      | 2.5 | 3.49 ns     | 200  MHz             |
|               |         |     | 4.83 ns     | 166  MHz             |
| $\mathbf{FF}$ | 0       | 2.8 | 2.51  ns    | $200 \ \mathrm{MHz}$ |

 TABLE
 VI

 COMPARISON BETWEEN THE PROPOSED ARCHITECTURE AND PRIOR WORKS

| SRAM     | VDD                              | access time | clock rate             | Process           |
|----------|----------------------------------|-------------|------------------------|-------------------|
| Proposed | $2.5 \mathrm{V}$                 | 5  ns       | 100  MHz               | $0.25~\mu{ m m}$  |
| [4]      | $0.5 \mathrm{V}$                 | N/A         | $100 \text{ MHz} \ ^1$ | $0.25~\mu{ m m}$  |
| [10]     | 2.7-3.6 V                        | 70-85 ns    | $1-10 \mathrm{~MHz}$   | $0.18~\mu { m m}$ |
| [11]     | $3.3 \text{ V}{\pm}0.3 \text{V}$ | 10 ns       | $100 \ \mathrm{MHz}$   | $0.18~\mu{\rm m}$ |

<sup>1</sup> Simulation only.

The longest access time is 4.83 ns. Fig. 12 shows the post-layout simulation results given by TimeMill where  $V_{\rm DD}$  is 2.5 V, temperature is 25 °C, and the TT Model is adopted. DL and BL in Fig. 12 are the data line and the bitline, respectively. The data accessing procedures are as follows.

- 1) The address is latched by the address buffer block in Fig. 11 during the rising edge of the CLK signal.
- 2) The address is decoded after the precharge stage in Fig. 12.
- 3) The read data is sensed by the sense amplifier in Fig. 5.
- 4) The data are then magnified by the second-stage sense amplifier (SA/write block in Fig. 11).
- 5) Finally, the data are read out by the I/O buffer in Fig. 11.

#### B. Implementation & Measurement

The chip implemented by the 0.25- $\mu$ m process is shown in Fig. 11. To test and verify its physical performance, we used the HP 1660CP logic analyzer and the IMS 200 test platform [12] to perform chip test and verification. The maximum operating clock frequency supported by the mentioned instruments is 100 MHz, as shown in Fig. 13. The worst case accessing time



Fig. 12. Post-layout simulation waveforms.



Fig. 13. IMS 200 measurement result (100 MHz).

is measured to be 5 ns, which indicates that our chip can operate given a clock as high as 200 MHz.

## C. Comparison

A performance comparison with prior works and currently commercial SRAM products is shown in Table VI. It should be noted that although Itoh's work was claimed to be able to operate at 100-MHz frequency with 0.5-V  $V_{\rm CC}$  supply, the result was obtained merely from simulations. A PL (power) node was used to provide a high threshold voltage  $V_{\rm T}$  in Itoh's work. At the PL node, the bulk is connected to the source of the pMOS which is driven by  $V_{\rm CH}$ ,  $V_{\rm CH} > V_{\rm CC}$ . Such a design demands

|               | Expected                                    | Measured                                    |
|---------------|---------------------------------------------|---------------------------------------------|
| Area          | $1.28 \times 1.25 \text{ mm}^2$ (with pads) | $1.28 \times 1.25 \text{ mm}^2$ (with pads) |
| Maximum Freq. | 200 MHz (TT)                                | 100 MHz (*)                                 |
| Avg. Power    | 61.06 mW (200 MHz, TT, 2.5V)                | 59.5  mW (100  MHz, 2.5 V)                  |
| Access Time   | 3.49 ns (TT)                                | 5 ns                                        |
| Standby Power | 10.44  mW (200  MHz, TT, 2.5V)              | 26.5 mW (100 MHz, 2.5V)                     |

TABLE VIII COMPARISON OF POWER-DELAY PRODUCT

|          | CMOS process     | Power   | Delay   | Power-Delay Product |
|----------|------------------|---------|---------|---------------------|
| Proposed | $0.25~\mu{ m m}$ | 26.5 mW | 5  ns   | 132.5               |
| [4]      | $0.25~\mu{ m m}$ | N/A     | N/A     | N/A                 |
| [13]     | $0.25~\mu{ m m}$ | 425  mW | 1.24 ns | 527                 |
| [14]     | $0.25~\mu{ m m}$ | 50  mW  | 10 ns   | 500                 |

that a special CMOS process, e.g., multiple n-well layers, is required to implement the pMOS used at the PL node as well as the normal pMOSs to implement the cross-coupled FETs. Otherwise, Itoh's multi- $V_{\rm T}$  scheme cannot be feasible. However, the CMOS process was not mentioned in detail, nor was physical implementation provided. Thus, the simulation results in Itoh's design did not ensure good performance if a chip were really implemented.

On the contrary, the proposed dual- $V_{\rm th}$  SRAM architecture takes advantage of the special CMOS process. This CMOS process provides different MOSs for different threshold voltages. In addition, we performed a series of complete simulations regarding all corner conditions including 0 °C, 25 °C, and 75 °C as well as TT, SS, SF, FS, and FF models. A real chip was then fabricated using the mentioned CMOS process. The measured results of the SRAM chip not only verify the correctness of the proposed architecture, but also provide a better performance than the prior works.

The characteristics of the proposed SRAM chip are summarized in Table VII, while a comparison of the power-delay product of our design and the prior works is given in Table VIII. It is obvious that the proposed design possesses the smallest power-delay product.

#### V. CONCLUSION

In this paper, an SRAM using dual threshold voltage transistors is proposed. The low- $V_{\rm th}$  transistors are used to increase driving capability and speed. The high- $V_{\rm th}$  transistors, by contrast, are used to construct data storage latches. Meanwhile, a novel quencher design is proposed to be added at the output bitlines of memories, which will reduce unwanted oscillation and will also suppress unwanted power dissipation. According to the simulation results, nMOS pass transistors seem to be a better choice for the quenchers. A 4-kb SRAM is implemented by using the dual threshold transistors and the quenchers. The simulation result demonstrates that the proposed architecture is better than the commercial products using the same or better technology.

### ACKNOWLEDGMENT

The authors would like to express their highest appreciation to the Academic Foundation of Taiwan.

#### REFERENCES

- [1] B. Prince, Semiconductor Memories. New York: Wiley, 1991.
- [2] K. Takeda, Y. Aimoto, N. Nakamura, H. Toyoshima, T. Iwasaki, K. Noda, K. Matsui, S. Itoh, S. Masuoka, T. Horiushi, A. Nakagawa, K. Shimogawa, and H. Takahashi, "A 16-Mb 400-MHz loadless CMOS four-transistor SRAM macro," *IEEE J. Solid-State Circuits*, vol. 35, pp. 1631–1640, Nov. 2000.
- [3] S.-M. Yoo, J. M. Han, E. Hag, S. S. Yoon, S.-J. Jeong, B. C. Kim, J.-H. Lee, T.-S. Jang, H.-D. Kim, C. J. Park, D. H. Seo, C. S. Choi, S.-I. Cho, and C. G. Hwang, "A 256 M DRAM with simplified register control for low power self refresh and rapid burn-in," in *Symp. VLSI Circuits Dig. Tech. Papers*, 1994, pp. 85–86.
- [4] K. Itoh, A. R. Fridi, A. Bellaouar, and M. I. Elmasry, "A deep sub-V, single power-supply SRAM cell with multi-V<sub>T</sub>, boosted storage node and dynamic load," in *Symp. VLSI Circuits Dig. Tech. Papers*, June 1996, pp. 132–133.
- [5] K. Ohhata, Y. Sakurai, H. Nambu, K. Kanetani, Y. Idei, T. Hiramoto, N. Tamba, K. Yamaguchi, M. Odaka, K. Watanabe, T. Ikeda, and N. Homma, "Noise reduction techniques for an ECL-CMOS RAM with a 2 ns write cycle time," in *Proc. IEEE Bipolar/BiCMOS Circuits and Technology Meeting*, Oct. 1992, pp. 174–177.
- [6] M. Horiuchi, T. Sakata, and S. Kimura, "Suppression of bitline-induced disturbance in SOI DRAM/SRAM cells by bipolar embedded source structure (BESS)," in *Symp. VLSI Technology Dig. Tech. Papers*, June 1997, pp. 157–158.
- [7] A. Bellaouar and M. I. Elmasry, *Low-Power Digital VLSI Design Circuits and Systems*. Reading, MA: Kluwer, 1995, pp. 63–90.
- [8] T. Ooishi, M. Asakura, S. Tomishima, H. Hidaka, K. Arimoto, and K. Fujishima, "A well-synchronized sening/equalizing method for sub-1.0-V operating advanced DRAMs," *IEEE J. Solid-State Circuits*, vol. 29, pp. 432–440, Apr. 1994.
- [9] R. J. Baker, H. W. Li, and D. E. Boyce, CMOS—Circuit Design, Layout, and Simulation. Piscataway, NJ: IEEE Press, 1998.
- [10] M5M5W816TP-70HI data sheet, Mitsubishi Electric and Electronics USA, Inc., Cypress, CA, 1996.
- [11] (2001) High-Speed SRAM. Hitachi Ltd., Japan. [Online]. Available: http://www.hitachisemiconductor.com/sic/jsp/japan/eng/products/ memory/sram\_high\_speed.html
- [12] (2001) IMS 200. Integrated Measurement Systems, Inc, Beaverton, OR. [Online]. Available: http://www.ims.com/html/engineering\_test\_stations.html
- [13] K. Ando, K. Higeta, Y. Fujimura, K. Mori, M. Nakayama, H. Nambu, K. Miyamoto, and K. Yamaguchi, "A 0.9-ns-access, 700-MHz SRAM macro using a configurable organization technique with an automatic timing adjuster," in *Symp. VLSI Circuits Dig. Tech. Papers*, 1998, pp. 182–183.
- [14] T. Miwa, J. Yamada, H. Koike, T. Nakura, T. Kobayashi, N. Kasai, and H. Toyoshima, "A 512 kbit low-voltage NV-SRAM with the size of a conventional SRAM," in *Symp. VLSI Circuits Dig. Tech. Papers*, 2001, pp. 129–132.



**Chua-Chin Wang** was born in Taiwan in 1962. He received the B.S. degree in electrical engineering from National Taiwan University, Taipei, Taiwan, R.O.C., in 1984 and the M.S. and Ph.D. degrees in electrical engineering from the State University of New York at Stony Brook in 1988 and 1992, respectively.

He is currently a Professor in the Department of Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan. His recent research interests include low-power and high-speed logic

circuit design, VLSI design, neural networks, and interfacing I/O circuits.



**Po-Ming Lee** was born in Taiwan in 1973. He received the B.S. degree in computer science and engineering from Yuan-Ze University, Taiwan, R.O.C., in 1995 and the M.S. degree in electrical engineering from National Sun Yat-Sen University, Taiwan, in 1999. He is currently working toward the Ph.D. degree in electrical engineering at National Sun Yat-Sen University.

His research interests include VLSI design, computer graphics, and consumer electronics.



**Kuo-Long Chen** was born in Taiwan in 1977. He received the B.S. and M.S. degrees in electrical engineering from National Sun Yat-Sen University, Taiwan, R.O.C., in 2000 and 2002, respectively. He is currently a Hardware Design Engineer with the ASUSTEK Computer Inc., Taipei, Taiwan.