

# Enhanced Gain, Low Voltage, Rail-To-Rail Buffer Amplifier Suitable For Energy Harvesting

Ali Far

ali.t.far@ieee.org

**Abstract** — A CMOS subthreshold rail-to-rail input-output buffer amplifier suitable for energy harvesting applications is presented, having high gain ( $A_v$ ) of  $\sim 130$ dB, consuming ultra low currents ( $I_{DD}$ ) of  $\sim 150$ nA, and operating with low power supply voltage ( $V_{DD}$ )  $> \sim 0.8$ v. Contributions of this work are the synthesis of the following attributes: First, using a single transistor, the amplifier input stage's tail current is steered between the two PMOSFET input pairs, while one of the PMOSFET pairs is level shifted by a pair of NMOSFET source followers, which keeps the amplifier's input stage transconductance ( $gm$ ) roughly constant while the inputs span rail-to-rail. Second, to boost folded cascode transconductance amplifier's (FCTA)  $A_v$ , the proposed plurality of regulated cascode (RGC) current mirrors (CM) utilize a small size auxiliary amplifier, containing the same type and un-scaled FETs as that of the cascaded CMs employed within the FCTA. As such, the boosting of  $A_v$  is less impeded by the otherwise higher impedance and high capacitance associated with scaled FETs, utilized in most prior art, in the RGC's auxiliary amplifier's signal path. Moreover, the RGC-CM utilizing the same FET, as that of the FCTA's CM, provides more consistency in FCTA's DC, AC, and dynamic response over process and operating condition variations. Third, a buffer driver containing a Minimum Current Selector (MCS) and an inverting current mirror amplifier (ICMA) controls the quiescent current of the inactive output transistors (FETs), while a complementary non-inverting current mirror (CNICM) curbs the current waste attributed to monitoring the FET's (external load) currents. The output buffer driver is inherently fast and can work at low  $V_{DD}$ , since it operates mainly in current mode. Montecarlo (MC) and worst case (WC) simulations indicate the following specifications are achievable: input voltage range rail to rail; output voltage range  $\sim 10$ mV from the rails; resistive load ( $R_L$ ) 5K ohms capability; unity gain frequency ( $f_u$ )  $\sim 200$ KHz and phase margin (PM)  $\sim 40$  degrees; power supply rejection ratio (PSRR)  $\sim -90$ dB; common mode rejection ratio (CMRR)  $\sim -110$ dB; slew rate (SR)  $\sim 3$ V/ 10uS; settling time (ts)  $\sim 15$ uS; size  $\sim 130$ um/side.

**Keywords** — ultra low power, energy harvesting, self powered IC, transconductance, bioelectronics, amplifier, class AB, buffer, wireless sensor network, rail-to-rail, sink-source, enhanced gain, regulated cascode, auxiliary amplifier, high speed, fast, low power, low current, low voltage, subthreshold, RFID, IoT, wearable electronics, biometrics, sensor array, smart sensor array, smart dust, SSA, batteryless, resistorless, analog.

## I. INTRODUCTION

Class AB buffers utilizing folded cascode transconductance amplifiers (FCTA) are suitable for low voltage applications in lieu of their wide rail-to-rail input-output range and being structurally self-compensating, which is generously covered in the analog literature [1]-[27][29]. In ultra low power wireless batteryless applications the extra current, that can lift the gain

TABLE I: AMPLIFIER TYPICAL SPECIFICATION &amp; COMPARISONS

| Typical Specs                                   | [1]<br>Sim'     | [4]<br>SiO2      | [6]<br>SiO2      | This<br>Work<br>Sim' | See Fig     |
|-------------------------------------------------|-----------------|------------------|------------------|----------------------|-------------|
| FOM ( $f_u \times L^2 / I_{DD}$ )               | 11.1            | 2.8              | 2.3              | 43.2                 |             |
| $V_{DD}$ min (V)                                | 2.5             | 1                | 1                | 0.8                  | 2           |
| $I_{DD}$ (nA)                                   | 50E3            | 35E4             | 15E4             | 150                  | 3,4         |
| $L_{CMOS}$ (μm)                                 | 0.18            | 0.35             | 0.35             | 0.18                 |             |
| $A = \text{Area} (\mu\text{m})^2$               | 94 <sup>2</sup> | 230 <sup>2</sup> | 315 <sup>2</sup> | 130 <sup>2</sup>     | 22          |
| Rails - $\Delta V_{IN}$                         | 0               | 0                | 0                | 0                    | 7,8,16      |
| Rails - $\Delta V_{OUT}$                        | np              | ?0               | np               | 10mv                 | 7,8,16      |
| SR $^{+/-}$ (V/μs)                              | 11              | 2.7/5            | 0.17             | 0.3                  | 18,19,20,21 |
| $R_L (\Omega)$                                  | np              | np               | 1K               | 5K                   | 9,10,19,21  |
| $t_S$ (μs) to 1%                                | 0.1             | np               | np               | 20                   | 18,19,20,21 |
| $f_u$ (MHz)                                     | 17.3            | 8                | 0.28             | 0.2                  | 10,11,12    |
| PM = φ (°)                                      | 67.7            | 60               | np               | 40                   | 10,11,12    |
| PSRR $^{+/-}$ (dB)                              | 93              | 45/40            | np               | 90                   | 13,14       |
| CMRR <sub>1Hz</sub> (db)                        | 86.5            | 75               | np               | 110                  | 15          |
| $A_v$ (db)                                      | 99.6            | 86               | 90               | 130                  | 10,11,12    |
| Noise <sub>1KHz</sub> (nV/ $\sqrt{\text{Hz}}$ ) | np              | 903              | np               | 1400                 | 16          |

of an amplifier, is a scarcity. As such, keeping the currents ultra low, while adopting a gain enhancement strategy is useful, where utilizing RGC-CM is an effective mean to boost a FCTA's gain [11-27]. The following additional capabilities of RGC-CM would be beneficial for energy harvesting applications: First, operating with low  $V_{DD}$ ; second, having wide input-output voltage span; third, providing high PSRR and fast power up and down transient responses, keeping in mind potentially less tidy and jittery  $V_{DD}$  patterns in some power harvesting applications; and fourth, exhibiting consistent DC, AC, and transient performance over process and operating variations.

Considering the need, in for example wireless batteryless sensor networks, for an amplifier to drive resistive sensors, the following capabilities are also useful for an output buffer driver: First, operate with low  $V_{DD}$  at ultra low  $I_{DD}$ ; second (to allow FCTA gain stage to dominate the frequency response) the buffer should have a much faster dynamic response than the FCTA; third, having wide input-output voltage span; and fourth, controlling the quiescent current in the inactive output FET driver while having high currents drive capability for the resistive sensor. This work aims to optimize the cost-performance trade-offs in providing a buffer amplifier (patent pending) with the attributes noted above.

## II. INPUT AND BIAS STAGE

Note that the entire buffer amplifier operates in the subthreshold region. The input stage of the proposed buffer amplifier depicted in Fig. 1 contains two pairs of PMOSFETs ( $P_{i1} - P_{i2,3}$  and  $P_{i4} - P_{i6,7}$ ), where one input pair ( $P_{i4} - P_{i6,7}$ ) is DC level shifted by NMOSFET source followers ( $N_{i1} - N_{i2}$ ) to accommodates rail-to-rail common mode input range [3][10-11][29]. Before the input voltage nears  $V_{DD}$ , and at a voltage pre-determined by  $P_{i9}$  and  $N_{i5}$ , a single PMOSFET ( $P_{i10}$ ) is activated to steer the same tail current ( $P_{i8}$ ) between the source summing junction of the two PMOSFET input pairs,  $P_{i1} - P_{i2,3}$  and  $P_{i4} - P_{i6,7}$  which keeps the input stage's  $g_m$  roughly constant throughout the rail-to-rail input span.

The bias stage is amply covered in prior art [28-29], and here is a brief description: It operates with a minimum  $V_{DD} \geq V_{GS} + 2V_{DS}$ . The  $\Delta VGS_{Pb3,4} \approx \eta V_T \ln(8)$  is applied across the drain-source terminals of  $P_{b1}$  which is in the triode region. Hence, a bias current is generated:  $I_B \propto f(\eta_{PMOS} \times \beta_{PMOS} \times V_T^2 \times K_b)$ , where  $V_T$  is thermal voltage,  $\beta_{PMOS}$  is the FET's gain, and  $K_b$  is set by FET aspect ratio. Note that the amplifier's  $I_B$  is mostly a function of  $V_T$  and  $\mu_{PMOS}$  which are more tightly controlled in fabrication compared to  $V_{TH}$ .

## III. GAIN STAGE

Please see Fig. 1. As a general description, the common source amplifier (CSA) section, contained in the enhanced gain FCTA, feeds the  $ID_{Pi1} + ID_{Pi4}$  onto the common gate amplifier's (CGA) section of the FCTA's gain stage, containing  $N_{g2} - N_{g1}$  [12-27]. Similarly, FCTA's CSA section feeds  $ID_{Pi7,3} + ID_{Pi6,2}$  onto FCTA's CGAs containing  $N_{g3,5} - N_{g4,6}$ . The outputs of CGAs (i.e.,  $I_{DS}$  of  $N_{g1} - N_{g2}$  and  $N_{g3,5} - N_{g4,6}$ ) are fed onto the current mirror containing  $P_{g1} - P_{g2}$  and  $P_{g3,5} - N_{g4,6}$ , and summed at node(s)  $Z_{N,P}$  which is the high-gain high-impedance output node of the FCTA.

Plurality of RGC-CM are utilized in the CGA section of FCTA to enhance its gain via the auxiliary amplifiers centered

around NMOSFETs  $N_{z4} - N_{g2}$  and  $N_{z15,16} - N_{g4,6}$  and their complementary PMOSFET counterparts  $P_{z4} - P_{g2}$ , and  $P_{z15,16} - P_{g4,6}$ . A brief description of the role of auxiliary amplifier, centered around  $N_{z15,16} - N_{g4,6}$ , in RGC-CM is provided next, which is also applicable to the other RGC-CM's contained in the buffer amplifier. The basic idea of RGC current mirror (CM) is to raise the effective output impedance of a CM ( $N_{g4,6}$ ) by using negative feedback of an auxiliary amplifier (formed by  $N_{z15,16} - P_{z17}$ ) with a gain ( $G \propto gm_{Nz15,16} \times R_{Op_{z17}} \propto V_A/V_T$  in subthreshold) to keep the  $VDS_{Ng3,5}$  nearly constant, irrespective of variations at the CM's output voltage ( $V_{DNg4,6}$ ). As such, the output impedance at the drain terminal of the cascaded current source ( $N_{g4,6}$ ) within the FCTA current mirror network is increased, which enhances the FCTA's gain.

To meet the DC span (rail-to-rail operations at low  $V_{DD}$ ) objectives, the auxiliary amplifier's (centered around  $N_{z15,16} - N_{g4,6}$ ) input signal is shifted by  $VGS_{Nz12,18}$  before it is applied to  $N_{z15,16}$  whose drain terminal(s) regulate the  $V_{DNg4,6}$ . Note that  $VDS_{Ng3,5} = VGS_{Nz8} - VGS_{Nz10} + VGS_{Nz15,16} - VGS_{Nz12,18} \approx \eta_{NMOS} \times V_T \ln(32)$ .

Excluding the contribution of output buffer driver to the overall buffer amplifier's gain and frequency response, note that the subthreshold FCTA's bandwidth is roughly  $f_u \propto (I_B/V_T)/C_o$ , and approximately  $A_V \propto (V_A/V_T)^3$ , where  $C_o$  is the effective capacitance at node(s)  $Z_{n,p}$  that is dominated by the aggregate of the sink-source FET drivers  $C_{gate}$ [29].

For brevity, the same description as above applies to the other plurality of RGC-CMs (within the CGA section). Please note that single FET current sources are shown in the simplified Fig. 1, where the actual circuit contains cascaded FETs for CM's higher output impedance.

In summary, the benefits of the proposed enhanced gain FCTA stage: First, in setting  $VDS_{Ng3,5}$  to widen the RGC-CM input-output voltage span, no FETs with long length (L) or wide width (W) are used in the auxiliary amplifications signal paths, except for scaling  $N_{z8}$  to establish a bias voltage. Being



Figure 1. Simplified Amplifier Circuit (output buffer driver stage and RGC-CM patents pending)

mindful of offset and noise considerations, the auxiliary amplifier FETs (e.g.,  $N_{z12,18}$  and  $N_{z15,16}$ ) can be sized closer to a minimum geometry for RGC-CM's faster speed, by avoiding low admittance or high capacitance FETs in the auxiliary amplifier's signal path. Second, the same type FETs are used in each of the auxiliary amplifiers and their associated cascoded FETs (in the FCTA's CGA current mirror network) whose device parameters match and track each other. Hence, the amplifier's DC, AC, and transient specifications exhibit more consistency over fabrication process, temperature, and power supply variations. Third, low power supply can be accommodated with the minimum  $V_{DD} \geq V_{GS} + 2V_{DS}$ . Fourth, as stated earlier, energy harvesting applications may require ICs with more rugged and stable transient responses. Availability of harvestable energy may be unpredictable. The  $V_{DD}$  patterns may be less orderly for reasons such as power off, power save, or for frequent switching from a magnetic to solar to kinetic power harvesting sources. Hence, the RGC-CM utilizing few FETs with no complicated AC loops, as is the case with the proposed design, could provide more rugged, predictable, and a more stable AC and transient responses.

#### IV. Output Buffer Driver

Please see Fig. 1. As an introductory description, the CNICM monitors and feeds the MCS, which provides the CMA with the smallest of the output driver FET (sink or source) currents. As such, the loop containing the output buffer driver's elements, including CNICM, MCS, and CMA, in concert with the FCTA controls the minimum operating current in the inactive sink or source FETs [1-10].

The CNICM's inputs ( $P_{m1}, N_{m1}$ ) mirror and scales down the output FET's ( $P_{o1}, N_{o1}$ ) sink-source currents, and curb them ( $P_{m5}, N_{m5}, P_{m6}$ ) at  $2I_B$  maximum before heading onto the inputs of MCS ( $P_{s1}, P_{s2}$ ). During steady state,  $I_{Pm5} \approx I_{Nm5} \approx I_B \rightarrow$  MCS's inputs are equalized  $\rightarrow$  MCS's output select min ( $I_{Pm5}, I_{Pm6} \approx I_B \rightarrow I_B$  fed onto the input of CMA via  $N_{a1}$ ). The current equilibrium is established at nodes  $Z_n$  and  $Z_p$  when in the FCTA:  $(I_{Pg6} \approx I_{Ng6}) \approx (I_{Pg4} \approx I_{Ng4})$ , and when in the CMA:  $I_{Na4} \approx I_{Pa2} \approx I_B \rightarrow I_{Po1} \approx s \times I_B \approx I_{No1} \approx k \times I_B$ .

When output FET is serving excess sink-source currents to an external load ( $R_L$ ), but before steady-state, when either ( $I_{Po1} \gg I_{No1}$ ) or ( $I_{No1} \gg I_{Po1}$ )  $\rightarrow$  ( $I_{Pm5} \gg I_{Pm6}$ ) or ( $I_{Pm6} \gg I_{Pm5}$ )  $\rightarrow$  min ( $I_{Pm5}, I_{Pm6} \approx 0$  or min ( $I_{Pm5}, I_{Pm6} \ll I_B$   $\rightarrow$  before steady-state, there is unsustainable current imbalance at nodes  $Z_n$  and  $Z_p$ ). To neutralize such current imbalance, the loop containing FCTA, MCS, and CMA, would regulate  $V_{GNo1}$ ,  $V_{GNa4}$ , and  $V_{GPo1}$  so that  $I_{Pa2} \approx I_{Na4} \approx I_B$  and  $(I_{Pg6} \approx I_{Ng6}) \approx (I_{Pg4} \approx I_{Ng4})$ .

In the current sink mode,  $I_{No1} \gg I_{Po1}$  and before steady-state  $\rightarrow$  MCS selects the min ( $I_{Pm5}, I_{Pm6} \approx 0 \approx I_{Na1} \approx I_{Na2} \rightarrow I_{Na3} \approx I_{Na4} \approx 2I_B \rightarrow I_{Na4} > I_B$  and  $I_{Na4} > I_{Pa2}$  that is unsustainable, which the loop regulates until steady-state is arrived by equalizing  $I_{Pa2} \approx I_{Na4} \approx I_B$ . Therefore, the loop regulates the inactive source FET current when  $I_{Na1} \approx I_B \approx$



Fig. 2. Preliminary layout and rough area estimate  $\sim 130\mu\text{m}^2$

$I_{Pm5} \approx I_{Pm1} \approx I_{Po1} \times 1/s$  while the active sink FET,  $N_{o1}$ , sinks the excess current needed for  $R_L$ .

Similarly, in the current source mode,  $I_{Po1} \gg I_{No1}$  and before steady-state  $\rightarrow$  MCS selects the min ( $I_{Pm5}, I_{Pm6} \approx 0$  and the loop regulate node  $Z_n$  and  $Z_p$  by an infinitesimal lift in ( $I_{Ng4} \approx I_{Ng6}$ ) above ( $I_{Pg4} \approx I_{Pg6}$ ) which increases the  $Z_n$  and  $Z_p$  voltages tiny enough in order for  $N_{o1}$  (and its mirror  $N_{m1}$ ,  $N_{m5}$ , and  $P_{m6}$ ) turn back on enough so that steady-state current equilibrium is reached and  $I_{Pa2} \approx I_{Na4} \approx I_B$ . As such, the loop regulates the inactive sink FET current where  $I_{Na1} \approx I_B \approx I_{Pm6} \approx I_{Nm6} \approx I_{Nm5} \approx I_{Nm1} \approx I_{No1} \times 1/k$  while the active source FET,  $P_{o1}$ , sources the excess current needed for  $R_L$ .

Here is a summary of the benefits of the proposed output buffer driver: First, it can work with low power supply minimum  $V_{DD} \geq 2V_{DS} + V_{GS}$ . Second, all elements of the output buffer driver namely CNICM, MCS, and CMA mostly operate in the current mode, which inherently makes the buffer's dynamic response fast. Third, it is low cost, given the low number of FET. Fourth, it can accommodate rail-to-rail input-output voltage span. Fifth, it can control the output FET's steady state quiescent current, and regulate the current in the inactive sink or source FET. Sixth, CNICM curbs the current that is otherwise wasted, while monitoring the FET's sink-source associated with variety of  $R_L$ s.

#### V. MONTE CARLO & WORST CASE SIMULATIONS:

A set of performance specification comparisons, with selected prior arts, are provided in Tables I. The WC and MC simulations are performed based on BSIM3v3.1 MOSIS/TSMC equivalent  $0.18\mu\text{m}$  CMOS device models. Fig. 2 is a rough layout area ( $\sim 130\mu\text{m}^2$ ). The amplifier simulations, which are illustrated in Fig. 3 through Fig. 26, are configured in unity gain,  $V_{DD} = 2\text{v}$ , Temperature ( $T$ ) =  $27^\circ\text{C}$ , unless otherwise noted. Figure 3 is WC simulations showing minimum  $V_{DD} \sim 0.8\text{v}$  measured by increasing  $V_{DD}$  until the steady state for  $V_{in,OFF}$  and  $I_{DD}$  is reached. Figure 4 is MC with average  $I_{DD} \approx 149\text{nA}$ . Figure 5 shows WC  $I_{DD} \leq 195\text{nA}$  as a function of temperature  $-35^\circ\text{C} \leq T \leq +65^\circ\text{C}$ . Figure 6 is MC for  $-4.6\text{mv} \leq V_{in,OFF} \leq +4.5\text{mv}$ . Figure 7 shows WC  $V_{in,OFF} \leq 300\mu\text{V}$  as a function of temperature  $-35^\circ\text{C} \leq T \leq +65^\circ\text{C}$ .



Fig. 3. WC Min'  $V_{DD}$  vs.  $I_{DD}$  &  $V_{OFS}$



Fig. 4. MC  $I_{DD}$



Fig. 5. WC  $I_{DD}$  vs.  $T$



Fig. 6. MC  $V_{OFS}$



Fig. 7. WC  $V_{OFS}$  vs.  $T$



Fig. 8. WC  $V_{OFS}$  vs.  $V_{in}$ ,  $V_{DD} = 2V$



Fig. 9. MC  $V_{OFS}$  vs.  $V_{in}$ ,  $V_{DD} = 1V$



Fig. 10. WC  $V_{OFS}$  vs.  $V_{in}$ ,  $V_{DD} = 2V$



Fig. 11. WC  $gm$  vs  $V_{in}$



Fig. 12.  $V_{in}$  vs.  $R_L$  ( $I_{SINK-SOURCE}$ )



Fig. 13. WC AC reponse



Fig. 14. MC AC reponse



Fig. 15. AC response vs.  $R_L$ ,  $V_{DD} = 2V$



Fig. 16. AC response vs.  $R_L$ ,  $V_{DD} = 1V$



Fig. 17. WC PSRR  $V_{DD}$  &  $V_{SS}$



Fig. 18. MC PSRR  $V_{DD}$  &  $V_{SS}$



Fig. 19. WC CMRR



Fig. 20. MC CMRR



Fig. 21. Noise input & output



Fig. 22. WC start-up  $V_{DD} = 0.12V$

## AMPLIFIER'S SIMULATION SUMMARY CONTINUED

Fig. 23. WC  $\tau_s$  & SR,  $V_{DD}=2V$ Fig. 24.  $\tau_s$  & SR vs.  $R_L, V_{DD}=2V$ Fig. 25.  $\tau_s$  & SR vs.  $C_L, V_{DD}=2V$ Fig. 26.  $\tau_s$  & SR vs.  $R_L, V_{DD}=1V$ 

Figure 8 is MC for  $V_{IN,OFFS}$  at  $V_{DD} = 2V$  while spanning  $5mv \leq V_{IN} = V_{OUT} \leq 1.995v$ , results in  $-6.5mv \leq V_{IN,OFFS} \leq +5mv$  due mostly to mismatch between the first PMOS versus the second PMOS+NMOS input pairs resulted when  $V_{IN}$  nears the rails. Figure 9, similar to Fig. 8, is the MC for  $V_{IN,OFFS}$  at  $V_{DD} = 1V$  while spanning  $10mv \leq V_{IN} = V_{OUT} \leq 0.99v$ , which results in  $-8mv \leq V_{IN,OFFS} \leq +8mv$ . Figure 10 is WC for  $V_{IN,OFFS}$  with  $V_{DD} = 2V$  where spanning  $50mv \leq V_{IN} = V_{OUT} \leq 1.95mv$ , results in  $-160\mu v \leq V_{IN,OFFS} \leq -80\mu v$ . Figure 11 is WC  $\Delta g_{mT}$  vs.  $V_{IN}$  indicating a  $\sim 2\%$  variation in amplifier's input stage  $g_m$  as a function of the change in PMOSFET input pair's current versus  $V_{IN} = V_{OUT}$  spanning rail-to-rail. Figure 12 is sink-source output FET current,  $I_{SINK}$  and  $I_{SOURCE}$ , as a function of external load  $R_L = 5K\Omega$ ,  $50K\Omega$ , and  $500K\Omega$  versus  $50mv \leq V_{IN} = V_{OUT} \leq 1.95mv$ . Figure 13 is WC AC response which indicates  $115dB < A_V < 135dB$  with unity gain frequency  $f_u \approx 200kHz$ , and phase margin  $\phi \approx 40^\circ$ . Figure 16 is AC response as a function of  $R_L$  with  $V_{DD} = 1V$ . Figure 17 is WC PSRR indicating at DC:  $-90dB < PSRR_{VDD} < -135db$  and  $-92dB < PSRR_{VSS} < -130db$ . Figure 18 is MC PSRR indicating at DC:  $-80dB < PSRR_{VDD} < -110db$  and  $-83dB < PSRR_{VSS} < -120db$ . Figure 19 is WC CMRR indicating at DC:  $-114dB < CMRR < -125db$ . Figure 20 is MC CMRR indicating at DC:  $-84dB < CMRR < -108db$ . Figure 21 is typical  $V_{O,Noise} = 1.4\mu V/\sqrt{Hz}$  at  $1kHz$ ,  $V_{I,Noise} = 475\mu V/\sqrt{Hz}$  at  $1kHz$  which is based on very preliminary noise models. Figure 22 is WC  $t_{start-up} \leq 350\mu s$  with  $V_{DD}$  pulsed to  $2V$  from  $0$  (and  $V_{in} = 1/2V_{DD}$  and amplifier is in unity gain). Figure 23 is WC transient response (TR) for  $V_{in} = V_o$  that is stepped to  $1.9v$  from  $0.1v$  which shows  $SR > \sim 1v / 7\mu S$  and  $\tau_s < \sim 20 \mu S$ , with  $V_{DD} = 2v$ . Figure 24 is TR vs.  $R_L$  for  $V_{in} = V_o$  stepped to  $1.9v$  from  $0.1v$ , for  $R_L = 5K\Omega$ ,  $50K\Omega$ , and  $500K\Omega$ , with  $V_{DD} = 2v$ . Figure 25 is TR vs.  $C_L$  for  $V_{in} = V_o$  stepped to  $1.5v$  from  $0.5v$ , for  $C_L = 0pF$ ,  $5pF$ , and  $10pF$ . Figure 26 is TR vs.  $R_L$  for  $V_{DD} = 1v$   $V_{in} = V_o$  stepped to  $950mv$  from  $50mv$ , for  $R_L = 5K\Omega$ ,  $50K\Omega$ ,  $500K\Omega$ , and  $5M\Omega$ .

## VI. SPECIFICATIONS RISK FROM SIMULATION TO FABRICATION &amp; DESIGN GAURD-BANDS

Given the ultra low currents, the transient injection could cause prolonged settling time within the amplifier's (high impedance) bias network. To guard band against such jitters, each of the biasing circuits for the input, gain, and output buffer driver stages are segregated. For conservatism, a power-on-reset (POR) function is added to the bias stage, although simulations indicate self start-up due to source-drain junction leakage currents. The PSRR and CMRR results in silicon are subject to substrate currents, FET parasitic leakages, and FET mismatches, which are not completely modeled in SPICE. The amplifier is optimized to drive resistive loads (e.g., use of resistors sensors in sensor array networks utilizing a power harvesting source) and it is not suited to drive capacitive loads. Also, cross-over distortion will be high at the output stage operating at ultra low currents. The input offset voltage mismatch between the PMOS and PMOS+NMOS paths will appear as distortion. Operating at ultra low currents elevates noise, and noise models are approximate. Amplifier's performance at high temperature will be degraded due to leakage, exacerbated by operating the amplifier at ultra low currents. FETs with small W and L mismatch more and are more noisy, while smaller FETs with smaller source-drain junction areas leak less. Smallest W/L used in this design is 3 times larger than minimum. Offset mismatch over temperatures is not modeled and not captured in the simulations. Metal mask options with compensation capacitors near  $Z_{n,p}$  and  $V_{out}$ , and extra dummy FETs (e.g., larger  $N_{01}$  and  $P_{01}$ ) will be provided in layout as a back-up plan for stability and extra output current drive capability.

## VII. CONCLUSION

A rail-to-rail input-output buffer amplifier is presented that consumes  $150nA$  of current, operates with low  $V_{DD} \sim 0.8v$ , has gain of  $130dB$ , and can drive a  $5K\Omega$  load while accurately controlling the inactive sink-source FET currents. The input stage uses two pairs of PMOSFETs, with one pair that is DC level shifted using a NMOSFET source follower to facilitate input rail-to-rail span, while a single FET steers the tail current between the two PMOSFET to keep the amplifier's input transconductance stable during rail-to-rail input span. The gain

stage utilizes a simple regulated cascode current mirror topology, that uses the same type FETs as the folded cascode amplifier's current mirrors, to enhance its gain, accommodate rail-to-rail performance at low  $V_{DD}$ , and to provide consistent AC and DC performance over the span of process and operating variations. The output stage runs mostly in current mode that help the speed at low  $V_{DD}$ , with input-output rail-to-rail span, and capable of curbing current waste associated with monitoring the output stage FET driver currents.

## REFERENCES

- [1] Y. Zhang, Q. Meng, Z. Wang and S. Chen, "Constant-gm low-power rail-to-rail operational amplifier," *2009 International Conference on Wireless Communications & Signal Processing*, Nanjing, 2009, pp. 1-4.
- [2] A. Far, "Amplifier for energy harvesting: Low voltage, ultra low current, rail-to-rail input-output, high speed," *2016 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC)*, Ixtapa, 2016, pp. 1-6
- [3] A. Far, "Small size class AB amplifier for energy harvesting with ultra low power, high gain, and high CMRR," *2016 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC)*, Ixtapa, 2016, pp. 1-5
- [4] C. W. Lu and C. M. Hsiao, "1 V rail-to-rail constant-gm CMOS op amp," in *Electronics Letters*, vol. 45, no. 11, pp. 529-530, May 21 2009.
- [5] J. M. Carrillo, G. Torelli, R. Perez-Aloe Valverde and J. F. Duque-Carrillo, "1-V Rail-to-Rail CMOS OpAmp With Improved Bulk-Driven Input Stage," in *IEEE JSSC*, vol. 42, no. 3, pp. 508-517, March 2007.
- [6] R. Hogervorst, J. P. Tero, R. G. H. Eschauzier and J. H. Huijsing, "A compact power-efficient 3 V CMOS rail-to-rail input/output operational amplifier for VLSI cell libraries," in *IEEE Journal of Solid-State Circuits*, vol. 29, no. 12, pp. 1505-1513, Dec 1994.
- [7] K. J. De Langen and J. H. Huijsing, "Compact low-voltage power-efficient operational amplifier cells for VLSI," in *IEEE Journal of Solid-State Circuits*, vol. 33, no. 10, pp. 1482-1496, Oct 1998.
- [8] J. Huijsing, R. Hogervorst and K. J. De Langen, "Low-power low-voltage VLSI operational amplifier cells," in *IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications*, 1995.
- [9] Minsheng Wang, T. L. Mayhugh, S. H. K. Embabi and E. Sanchez-Sinencio, "Constant-gm rail-to-rail CMOS op-amp input stage with overlapped transition regions," in *IEEE Journal of Solid-State Circuits*, vol. 34, no. 2, pp. 148-156, Feb 1999.
- [10] A. Far, "Low Noise Rail-To-Rail Amplifier Runs Fast at Ultra Low Currents And Targets Energy Harvesting", submitted to ROPEC 2017 for considerations
- [11] A. Far, "Compact Ultra Low Power Class AB Buffer Amplifier", submitted to ROPEC 2017 for considerations
- [12] S. Yan and E. Sanchez-Sinencio, "Low voltage analog circuit design: A tutorial", *IEICE Trans. Analog Integrated Circuits and Systems*, vol. E83A, no. 2, pp. 179-196, 2000.
- [13] E. Sackinger and W. Guggenbuhl, "A versatile building block: the CMOS differential difference amplifier," in *IEEE Journal of Solid-State Circuits*, vol. 22, no. 2, pp. 287-294, Apr 1987.
- [14] A. L. Coban and P. E. Allen, "A low-voltage CMOS op amp with rail-to-rail constant-gm input stage and high-gain output stage," *Circuits and Systems, 1995. ISCAS '95, 1995 IEEE International Symposium on, Seattle, WA, 1995*, pp. 1548-1551 vol.2.
- [15] E. Sackinger and W. Guggenbuhl, "A high-swing, high-impedance MOS cascode circuit," in *IEEE Journal of Solid-State Circuits*, vol. 25, no. 1, pp. 289-298, Feb 1990.
- [16] K. Gulati and Hae-Seung Lee, "A high-swing CMOS telescopic operational amplifier," in *IEEE Journal of Solid-State Circuits*, vol. 33, no. 12, pp. 2010-2019, Dec 1998.
- [17] T. Serrano and B. Linares-Barranco, "The active-input regulated-cascode current mirror," in *IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications*, vol. 41, no. 6, Jun 1994.
- [18] F. You, S. H. K. Embabi, E. Sanchez-Sinencio and A. Ganesan, "A design scheme to stabilize the active gain enhancement amplifier," *Circuits and Systems, 1995. ISCAS '95, 1995 IEEE International Symposium on, Seattle, WA, 1995*, pp. 1976-1979 vol.3.
- [19] K. Nakamura and L. R. Carley, "An enhanced fully differential folded-cascode op amp," in *IEEE Journal of Solid-State Circuits*, vol. 27, no. 4, pp. 563-568, Apr 1992.
- [20] M. Akbari and O. Hashemipour, "Enhancing transconductance of ultra-low-power two-stage folded cascode OTA," in *Electronics Letters*, vol. 50, no. 21, pp. 1514-1516, October 9 2014.
- [21] D. S. Lee, A. Dadashi and K. Y. Lee, "A high DC-gain modified CMOS OTA," *2013 International SoC Design Conference (ISOCC)*, Busan, 2013, pp. 329-332.
- [22] T. Loeliger and W. Guggenbuhl, "Cascode circuits for low-voltage and low-current applications," *Proceedings of Third International Conference on Electronics, Circuits, and Systems, Rodos, 1996*, pp. 1029-1032 vol.2.
- [23] Lei Ge and E. El-Masry, "A novel technique to enhance the open loop gain of CMOS folded cascode OTA," *The 2nd Annual IEEE Northeast Workshop on Circuits and Systems, 2004. NEWCAS 2004*, 2004.
- [24] G. N. Lu and G. Sou, "A CMOS op amp using a regulated-cascode transimpedance building block for high-gain, low-voltage achievement," *Circuits and Systems, 1997. ISCAS '97, Proceedings of 1997 IEEE International Symposium on, 1997*, pp. 165-168 vol.1.
- [25] M. Akbari, O. Hashemipour and A. Javid, "An ultra-low voltage, ultra-low power fully recycling folded cascode amplifier," *2014 22nd Iranian Conference on Electrical Engineering (ICEE), Tehran, 2014*.
- [26] T. Burger and Qiuting Huang, "On the optimum design of regulated cascode operational transconductance amplifiers," *Proceedings. 1998 International Symposium on Low Power Electronics and Design (IEEE Cat. No.98TH8379)*, Monterey, CA, USA, 1998, pp. 203-208.
- [27] D. B. Ribner and M. A. Copeland, "Design techniques for cascaded CMOS op amps with improved PSRR and common-mode input range," in *IEEE JSSC*, vol. 19, no. 6, pp. 919-925, Dec 1984.
- [28] A. T. Far, "ultra-low power bias current generation and utilization in current and voltage source and regulator devices," U.S. Patent 9 519 304, December 13, 2016.
- [29] Allen P.E. , "Analog integrated circuit design", 1st. Ed., John Wiley & Sons, New York, 1997



Ali Far received the B.S. in EECS from UC Berkeley in 1983. He got his MSEE in 1986, MBA. in 1989, Juris Doctor in Law in 1997, and M.A. in Psychology in 2010, all from Santa Clara University. He is working on his M.A. in Philosophy at SFSU. He joined Plantronics in 1982 as an Audio Engineer. Between 1983 to 1985 he was a data converter Design Engineer with PMI (now part of ADI). From 1985 to 1992, he worked at MPS (now part of MaxLinear) as Design and Marketing Manager working on converters, S/Hs, high-speed amplifiers, and regulators. Then, Ali joined MVT (now part of Creative Labs) as Director of Design, making multi-media chips. Between 1994 to 1997, he worked at TelCom (now part of Microchip) as VP of Design, where he developed converters, high-power regulators, and power management chips. In 1998, Ali joined Prudential Securities, as a Semiconductor Analyst and Vice President covering investments in the analog industry. Between 2000 to 2006, he worked at Galleon, where he was an Analyst and Managing Director covering technology equities broadly, including semiconductors, storage, wireless, and networking sectors. In 2008, he founded Spherix where he worked as an Analyst and Portfolio Manager focused on technology investments. Ali has published several papers in IEEE, has 13 patents and some pending in the area of analog ICs, and is currently researching analog chips for energy harvesting.