## nature electronics

**Article** 

https://doi.org/10.1038/s41928-024-01251-8

# Monolithic and heterogeneous three-dimensional integration of two-dimensional materials with high-density vias

Received: 29 January 2024

Accepted: 22 August 2024

Published online: 9 October 2024



Check for updates

Subir Ghosh 1, Yikai Zheng 1, Zhiyu Zhang 1, Yongwen Sun 1, Thomas F. Schranghamer 1, Najam U Sakib, Aaryan Oberoi 1, Chen Chen, Joan M. Redwing  $oldsymbol{\mathbb{D}}^{2,3,4,5}$ , Yang Yang  $oldsymbol{\mathbb{D}}^{1,5,6}$  & Saptarshi Das  $oldsymbol{\mathbb{D}}^{1,2,4,5}$ 

Monolithic three-dimensional (M3D) integration is being increasingly adopted by the semiconductor industry as an alternative to traditional through-silicon via technology as a way to increase the density of stacked, heterogenous electronic components. M3D integration can also provide transistor-level partitioning and material heterogeneity. However, there are few large-area demonstrations of M3D integration using non-silicon materials. Here, we report heterogeneous M3D integration of two-dimensional materials using a dense inter-via structure with an interconnect (I/O) density of 62,500 I/O per mm<sup>2</sup>. Our M3D stack consists of graphene-based chemisensors in tier 2 and molybdenum disulfide (MoS<sub>2</sub>) memtransistor-based programmable circuits in tier 1, with more than 500 devices in each tier. Our process allows the physical proximity between sensors and computing elements to be reduced to 50 nm, providing reduced latency in near-sensor computing applications. Our manufacturing process also stays below 200 °C and is thus compatible with back-end-of-line integration.

Three-dimensional (3D) integration<sup>1,2</sup> can be used to increase transistor count per unit area and create processors with increased computational power<sup>3</sup>. It can also be used to enhance chip functionalities beyond the traditional approach of increasing transistor density. Various technologies-such as analogue devices, radiofrequency devices, sensors, memories and microelectromechanical systems-can be integrated alongside digital components in this manner<sup>4</sup>.

There are various approaches to 3D integration. Through-silicon via (TSV) stacking offers benefits such as enhanced bandwidth and reduced interconnect lengths. TSV-based 3D integrated circuits (ICs) were originally pioneered by companies such as IBM, Samsung and Micron, and primarily concentrated on flash memories and dynamic random-access memory stacks<sup>5-8</sup>, while other commercial providers used TSV electrodes in 3D-stacked complementary metal-oxide-semiconductor (CMOS) image sensors<sup>9,10</sup>. Recently, technologies such as Intel's Foveros have made it possible to create dense TSVs with a standard pitch of 50 μm, resulting in an interconnect (I/O) density of up to 400 I/O per mm<sup>2</sup> (ref. 11). The I/O metric is a critical performance benchmark for 3D ICs. To further increase I/O density, a transition from macro-to micro-3D heterogeneous integration via hybrid bonding is needed. This technology facilitates direct copper-to-copper pad connections with a TSV pitch of less than 10  $\mu$ m, achieving 10,000 I/O per mm<sup>2</sup> (refs. 12,13).

Monolithic 3D (M3D) integration can achieve vias with pitches of less than 1 μm for even higher I/O density<sup>14,15</sup>. In M3D integration,

<sup>1</sup>Engineering Science and Mechanics, The Pennsylvania State University, University Park, PA, USA. <sup>2</sup>2D Crystal Consortium Materials Innovation Platform, The Pennsylvania State University, University Park, PA, USA. 3 Materials Science and Engineering, The Pennsylvania State University, University Park, PA, USA. <sup>4</sup>Electrical Engineering, The Pennsylvania State University, University Park, PA, USA. <sup>5</sup>Materials Research Institute, The Pennsylvania State University, University Park, PA, USA. 6Nuclear Engineering, The Pennsylvania State University, University, Park, PA, USA. 🖂 e-mail: sud70@psu.edu

functionally diverse layers of devices are sequentially stacked on the same wafer to enhance routability and design flexibility while reducing inter-tier signal delay. Thus, the approach enables transistor-level heterogeneity; for example, stacking silicon p-type field-effect transistors (p-FETs) on gallium nitride (GaN) n-type FETs (n-FETs) allows for efficient power delivery and radiofrequency solutions <sup>16</sup>. Similarly, integration of high-performance germanium p-FETs with Sin-FETs can advance CMOS logic applications <sup>17</sup>. Two-dimensional (2D) materials have been integrated at the CMOS back-end for memristive applications <sup>18,19</sup>. Silicon-free M3D integration was initially achieved by stacking carbon nanotube transistors and resistive memory devices <sup>20</sup>. However, more recent efforts have involved M3D integration of transistors made of 2D transition metal dichalcogenides such as molybdenum disulfide (MoS<sub>2</sub>) and tungsten diselenide (WSe<sub>2</sub>)<sup>21-23</sup>.

In this Article, we report M3D integration of graphene-based chemitransistors with monolayer  $MoS_2$ -based memtransistors for near-sensor computing. The M3D stack includes more than  $500\,MoS_2$  memtransistors and  $500\,$  graphene chemitransistors on each tier, which are used for data processing and acquisition, respectively, with a vertical separation between processors and sensors of less than  $50\,$  nm. By exclusively using 2D materials, we demonstrate inter-tier vias measuring  $3\times3\,\mu\text{m}^2$  with a pitch of  $4\,\mu\text{m}$ , allowing us to achieve an interconnect density of  $62,500\,$  I/O per mm². The entire stack is fabricated at temperatures below 200 °C, making it compatible with standard back-end-of-line (BEOL) integration processes. Table 1 highlights the advances achieved in via pitch, I/O density and BEOL compatibility compared to previous technologies.

We selected monolayer  $MoS_2$  and graphene for our demonstration because both are among the most mature 2D materials and can be grown at the wafer-scale<sup>24</sup>. Furthermore,  $MoS_2$  transistors have exhibited excellent device performance and can meet the standards for advanced technology nodes<sup>25–29</sup>, as well as enable various neuromorphic and bio-inspired applications<sup>30–36</sup>. Graphene-based sensors offer versatility in detecting gases, biomolecules and various chemical species due to their electrochemically inert basal plane<sup>37–39</sup>. Similarly, the high carrier mobility of graphene and emerging properties in stacked graphene layers open new possibilities for expanding the functionalities of  $3D ICs^{40,41}$ .

## Heterogeneous M3D chip using 2D materials

M3D integration of monolayer  $MoS_2$  memtransistors and graphene chemitransistors necessitates large-area synthesis of these materials. A metal-organic chemical vapour deposition (MOCVD) technique was used to grow monolayer  $MoS_2$  on a 5 cm (2 inch) sapphire substrate in a cold-wall horizontal reactor (see the Methods section for details), whereas graphene was commercially procured on copper (Cu) foil. Figure 1a,b show an optical image of the as-grown  $MoS_2$  film and its Raman spectra obtained using a 532 nm laser, respectively. The in-plane  $E_{2g}^1$  (387 cm<sup>-1</sup>) and out-of-plane  $A_{1g}$  (404 cm<sup>-1</sup>) Raman active vibrational modes have a separation of 17 cm<sup>-1</sup>, confirming the monolayer nature of the  $MoS_2$  film. Similarly, Fig. 1c,d show an optical image of the graphene on Cu foil and its Raman spectra, respectively. The distinct Raman peaks at around 1,583 and 2,674 cm<sup>-1</sup>, corresponding to the G band and 2D band, respectively, support the presence of monolayer graphene.

To construct the heterogeneous M3D stack, we selected a commercially available substrate composed of 285 nm SiO $_2$  on p\*\*-Si. However, it should be noted that any other substrate compatible with our fabrication process flow could also be used. The monolayer-Mo S $_2$ -memtransistor-based computational circuits were allocated to tier 1 and the graphene-chemitransistor-based sensing circuits were positioned in tier 2. This arrangement was achieved using a sequential fabrication method, as depicted schematically in Fig. 1e. To start, local back-gate electrodes (2 nm Ti/18 nm Pt) were patterned using e-beam lithography and deposited using e-beam evaporation. This was followed by atomic layer deposition (ALD) of 15 nm Al $_2$ O $_3$ /7 nm

HfO<sub>2</sub>/3 nm Al<sub>2</sub>O<sub>3</sub> to serve as the back-gate dielectric stack for the tier 1 MoS<sub>2</sub> devices. In this structure, the HfO<sub>2</sub> layer with a smaller bandgap functions as a charge trapping layer positioned between two layers of Al<sub>2</sub>O<sub>3</sub>, which have larger bandgaps. This stack resembles the floating-gate stack found in traditional flash memory devices, enabling non-volatile programming of the channel conductance via charge trapping and detrapping phenomena. Next, access to the back-gate metal electrodes was achieved by etching the floating-gate stack with a boron trichloride (BCl<sub>3</sub>) plasma etch. The MOCVD-grown monolayer MoS<sub>2</sub> film was then transferred from the growth substrate to the prefabricated local back-gate islands using a polymethyl methacrylate (PMMA) assisted transfer process. The MoS<sub>2</sub> film was then patterned through e-beam lithography and etched with sulfur hexafluoride (SF<sub>6</sub>) to define the channel areas. Subsequently, the source-drain contacts and connections for the comparator circuits were delineated using e-beam lithography, followed by the deposition of 20 nm Au/20 nm Ni/10 nm Au and a subsequent lift-off process.

After finishing the fabrication of the tier 1 devices, a 50 nm layer of Al<sub>2</sub>O<sub>3</sub> was deposited to act as the interlayer dielectric (ILD) separating tier 1 and tier 2. A 1 nm-thick seed layer of evaporated aluminium was applied to improve the nucleation of ALD-deposited Al<sub>2</sub>O<sub>3</sub> to overcome the inert basal plane of 2D materials. Vias with lengths of 3 µm and widths of 3 µm were patterned into the ILD using e-beam lithography and then opened with a BCl<sub>3</sub> plasma etch. A subsequent lithography step allowed for the deposition of 2 nm Ti/28 nm Ni/30 nm Au to fill these vias. Note that, while we used a via pitch of 4 µm for the work discussed here, our technique permits even smaller via sizes and pitches, highlighting the advantages of M3D integration. Following via formation, graphene was transferred onto the chip from the Cu foil using a PMMA-assisted wet etching transfer method. The graphene devices in tier 2 do not require dedicated back-gate electrodes because they are designed for chemisensing applications in liquid environments, with the liquid effectively functioning as a top-gate. Subsequently, the graphene film was patterned using e-beam lithography and etched with oxygen plasma to define the channel regions. As with the tier 1, this step was followed by the patterning of source-drain contacts and gate electrodes, again using e-beam lithography. A stack of 2 nm Ti/28 nm Ni/30 nm Au was deposited to form these electrodes, with the excess metal being removed in a lift-off process. The final step in the fabrication of the tier 2 graphene chemitransistors involved the deposition of a 70 nm Al<sub>2</sub>O<sub>3</sub> capping layer that covered the source-drain contacts to prevent leakage while leaving the gate area exposed to ensure direct contact with any liquid placed onto the chip.

The entire fabrication process occurs within a thermal budget of 180 °C. This temperature limit ensures compatibility with BEOL requirements, allowing for the possible addition of further tiers without compromising the integrity of lower ones. It also facilitates future integration with silicon front-end devices. While the fabrication process may seem straightforward, creating an M3D stack with diverse materials and dense vias requires overcoming substantial lithographic challenges. As more layers are added and structures are miniaturized, precise layer alignment is critical. It is also vital to ensure each layer is electrically isolated by the ILD to avoid leakage. This requires optimization of the fabrication process, including choosing suitable resists, etching recipes and deposition conditions for dielectrics and metals to maintain the structural and electrical integrity of the 3D structure. (Further details on the fabrication process flow are available in the Methods section.)

Figure 2a shows an optical image of a densely packed array of M3D-integrated two-tier cells based on monolayer  $MoS_2$  memtransistors and graphene chemitransistors. The enlarged scanning electron microscopy (SEM) images shown in Fig. 2b,c reveal that each cell in the array contains four devices, namely two graphene chemitransistors located above two  $MoS_2$  memtransistors. These two graphene chemitransistors form a chemisensor, while the two  $MoS_2$  memtransistors

Table 1 | A comparison of different technologies based on inventor, materials used, via pitch ( $\mu$ m), I/O per mm<sup>2</sup>, application and BEOL compatibility

| Inventor                                                                         | Material                                        | Via pitch (µm) | I/O per mm² | Application                       | Integration type         | Notable<br>feature                                                                | Integration technique                      | BEOL compatibility | Reference |
|----------------------------------------------------------------------------------|-------------------------------------------------|----------------|-------------|-----------------------------------|--------------------------|-----------------------------------------------------------------------------------|--------------------------------------------|--------------------|-----------|
| Samsung                                                                          | Si                                              | 4              | 62,500      | No                                | Die-to-wafer             | Reliable,<br>void-free vias<br>High<br>alignment<br>accuracy                      | Hybrid copper<br>bonding                   | Compatible         | 59        |
| IBM                                                                              | Si                                              | 40             | 625         | Yes                               | Chip-to-interposer       | Optimization on fine-pitch micro bumps                                            | Cu-pillar<br>micro-bump<br>solder joint    | Compatible         | 60        |
| Intel                                                                            | Si                                              | 10             | 10,000      | No                                | Die-to-die               | Reduced<br>parasitic<br>capacitance<br>and higher<br>interconnect<br>density      | Hybrid<br>bonding                          | Not mentioned      | 61        |
| IMEC                                                                             | Si                                              | 1              | 1,000,000   | No                                | Wafer-to-wafer           | Low resistance<br>Void-free<br>bonding                                            | Hybrid<br>bonding<br>(SiCN and Cu)         | Compatible         | 62        |
| Silicon Austria<br>Laboratories                                                  | Si                                              | 10             | 10,000      | No                                | Die-to-die               | Shorter<br>processing<br>times<br>Lower<br>bonding<br>temperatures                | Bump-less Cu<br>bonding                    | Compatible         | 63        |
| Institute of<br>Microelectronics<br>(IME)                                        | Si                                              | 20             | 2,500       | No                                | Chip-to-wafer            | High<br>throughput<br>and accuracy                                                | Cu micro-pillar<br>array                   | Compatible         | 64        |
| UCLA                                                                             | Si                                              | 7              | 20,400      | No                                | Die-to-wafer             | High overlay<br>accuracy<br>Low resistance                                        | Cu-Cu<br>thermal<br>compression<br>bonding | Compatible         | 65        |
| TSMC                                                                             | Si                                              | 24             | 1,740       | High-<br>performance<br>computing | Chip-to-interposer       | Low resistance<br>High-density<br>IPD                                             | CoWoS-R <sup>+</sup>                       | Compatible         | 66        |
| Hitachi                                                                          | Si                                              | 20             | 2,500       | No                                | Chip-to-wafer            | High thermal<br>stability<br>chemical<br>mechanical<br>polishing<br>compatibility | Hybrid<br>bonding<br>(polyimide<br>and Cu) | Compatible         | 67        |
| Intel                                                                            | Ge/Si                                           | -              | -           | No                                | Bottom-up<br>fabrication | Monolithic integration                                                            | Layer transfer                             | Not mentioned      | 17        |
| Intel                                                                            | GaN/Si                                          | -              | -           | Radiofrequency                    | Bottom-up<br>fabrication | Monolithic integration                                                            | Layer transfer                             | Compatible         | 16        |
| MIT and Stanford                                                                 | CNT and Si                                      | 2              | 250,000     | Gas sensing and classification    | Bottom-up<br>fabrication | Monolithic integration                                                            | Layer transfer                             | Compatible         | 20        |
| KAUST, Tsinghua<br>University<br>and Institute<br>of Micro and<br>Nanotechnology | h-BN and Si                                     | 7              | 20,400      | Memristive<br>application         | Bottom-up<br>fabrication | Monolithic<br>integration                                                         | Layer transfer                             | Compatible         | 18        |
| MIT, Washington<br>University in St.<br>Louis and Yonsei<br>University           | MoS <sub>2</sub> , WSe <sub>2</sub><br>and h-BN | -              | -           | Al processing                     | Bottom-up<br>fabrication | Monolithic integration                                                            | Layer transfer                             | Compatible         | 22        |
| Penn State                                                                       | MoS <sub>2</sub> /WSe <sub>2</sub>              | 250            | 16          | No                                | Bottom-up<br>fabrication | Monolithic integration                                                            | Layer transfer                             | Compatible         | 21        |
| Penn State                                                                       | MoS <sub>2</sub> /<br>graphene                  | 4              | 62,500      | Near sensing computing            | Bottom-up fabrication    | Monolithic integration                                                            | Layer transfer                             | Compatible         | This Work |

The bold font highlights features of our work on M3D integration.

constitute a comparator circuit. The chemisensors in tier 2 are connected by  $3 \times 3 \, \mu m^2$  vias to the comparators in tier 1. The SEM images also show a via separation of 1  $\mu$ m and thus a pitch of 4  $\mu$ m between the cells. Extended Data Fig. 1 shows a schematic of the M3D IC for near-sensor compute applications. Figure 2d,e show cross-sectional images at different magnifications, obtained using scanning transmission electron microscopy (STEM) in high-angle annular dark-field

(HAADF) mode, taken at the location marked with the white dashed line in Fig. 2b. The presence of the Ti/Pt gate electrode,  $Al_2O_3/HfO_2/Al_2O_3$  floating-gate stack,  $MoS_2$  channel, tier 1 source–drain contacts,  $Al_2O_3$  ILD, graphene channel and  $Al_2O_3$  capping layer are indicated. Figure 2f displays the energy dispersive X-ray spectroscopy (EDS) elemental mapping of the area highlighted by the light orange dashed line in the magnified view shown in the rightmost HAADF image of Fig. 2e. The



**Fig. 1**| Characterization of 2D materials and fabrication process flow for M3D integration. a, Optical image of a 2 inch sapphire wafer with MOCVD-grown  $MoS_2$ . Scale bar, 2.5 cm (1 inch) **b**, The corresponding Raman spectrum with the characteristic  $E_{3p}^1$  peak at 387 cm<sup>-1</sup> and  $A_{1g}$  peak at 404 cm<sup>-1</sup>. **c**, Optical image of

commercially purchased monolayer graphene film on a copper substrate. Scale bar, 40 mm.  $\bf d$ , The corresponding Raman spectra obtained using a 532 nm laser.  $\bf e$ , Fabrication process flow of the 3D monolithic and heterogeneous integration of monolayer-MoS $_{2}$ - and graphene-based devices.

most notable achievement showcased by these TEM images show the precise placement and dense integration of the scaled vias. Extended Data Fig. 2 shows atomic force microscopy (AFM) images after the fabrication of tier 1 MoS $_2$  devices, post-ILD deposition and after via formation and fabrication of tier 2 graphene devices. Note that the topographic features are preserved after each fabrication step. Furthermore, the surface roughness remained less than 1 nm throughout the fabrication process. This clearly shows that via fabrication does not introduce variations in topography despite the etching and deposition processes involved. Also note that we have not implemented any surface planarization in this work. Although chemical mechanical polishing, a standard step in semiconductor fabrication, will probably be adopted for M3D integration when stacking many tiers, it was found to be unnecessary for the two-tier integration demonstrated here.

## MoS<sub>2</sub> memtransistors and comparator circuit

In our M3D IC, tier 1 is composed of monolayer  $MoS_2$  memtransistors that serve as the primary computing elements. These devices are used for the construction of comparator circuits responsible for processing chemical signals detected by the graphene-based chemisensors in tier 2. Figure 3a shows the transfer characteristics, that is, drain current  $(I_{DS})$  plotted against the back-gate voltage  $(V_{BG})$ , at a constant drain voltage,  $V_{DS}$ , of  $1\,V\,for\,50\,MoS_2$  memtransistors. Here, all  $MoS_2$  memtransistors

have a channel length ( $L_{CH}$ ) and width ( $W_{CH}$ ) of 500 nm and 1  $\mu$ m, respectively. Figure 3b-d, respectively, show the distribution of field-effect mobility ( $\mu_{FF}$ ) extracted from peak transconductance, subthreshold slope (SS) extracted for three orders of magnitude change in  $I_{DS}$  and threshold voltage  $(V_{TH})$  extracted using the iso-current method at 100 nA  $\mu$ m<sup>-1</sup> for the 50 devices. The median values for  $\mu_{FF}$ , SS and  $V_{TH}$ were found to be  $\sim 3.3 \, \text{cm}^2 \, \text{V}^{-1} \, \text{s}^{-1}$ , 150 mV dec<sup>-1</sup> and 0.2 V, respectively. The median  $\mu_{FE}$  value here is lower than that of exfoliated flakes<sup>42,43</sup>, which is expected since MOCVD-grown MoS<sub>2</sub> generally shows smaller grain sizes with more impurities and defects<sup>44</sup>. Although we have previously achieved higher  $\mu_{FF}$  values through thorough optimization of growth and device design<sup>24,45,46</sup>, the focus of our current study was not solely on enhancing the individual device performance. The SS was also higher than the ideal value of ~60 mV dec<sup>-1</sup> owing to the use of a thicker oxide stack and the presence of non-idealities such as interface traps. By using high-k dielectric materials and a thinner stack, it is possible to improve SS. Similarly,  $V_{\rm TH}$  can be engineered through careful selection of the dielectric interface and metal gate work function. Extended Data Fig. 3 shows the output characteristics of a representative memtransistor with the ON-current reaching as high as 70  $\mu$ A  $\mu$ m<sup>-1</sup> for a  $V_{DS}$  of 5 V. The lower ON-current values can be attributed to the higher contact resistance,  $R_C$ , of 7 k $\Omega$  µm, associated with Ni contacts to MoS<sub>2</sub>. Extended Data Fig. 4 shows the full  $R_c$  extraction using a transmission



**Fig. 2** | **Monolithic and heterogeneous 3D ICs. a**, Optical image of a densely packed array of M3D-integrated two-tier cells based on monolayer  $MoS_2$  and graphene. Scale bar, 25  $\mu$ m. **b**, Enlarged SEM image showing that each cell in the array contains four devices, including two graphene chemitransistors above two  $MoS_2$  memtransistors. Scale bar, 3  $\mu$ m. **c**, An angled SEM image. Notably, the via width and pitch achieved in this work are 3 and 4  $\mu$ m, respectively. Scale bar,

 $3~\mu m.~d$ , Cross-sectional STEM–HAADF image. Scale bar,  $1~\mu m.~e$ , Enlarged version of the HAADF images taken at the location marked with light orange dashed line, showing the entire M3D stack involving both MoS $_2$  and graphene channels with a separation of 50 nm. f, Enlarged version of EDS mass percentage elemental mapping showing the MoS $_2$  memtransistor and its floating-gate stack. Scale bars are 20 nm.



**Fig. 3** | **Characterization of MoS<sub>2</sub> memtransistors in tier 1. a**, Transfer characteristics, that is, source-to-drain current,  $I_{\rm DS}$  versus back-gate voltage,  $V_{\rm BG}$ , taken at a constant drain voltage,  $V_{\rm DS}$ , of 1 V, for 50 MoS<sub>2</sub> memtransistors with a  $L_{\rm CH}$  and  $W_{\rm CH}$  of 1 μm. **b**–**d**, Histograms showing the  $\mu_{\rm FE}$  (**b**), SS (**c**) and  $V_{\rm TH}$  (**d**) for these 50 devices. **e**, **f**, Transfer characteristics of a representative memtransistor after programming or erasing with positive (**e**) and negative (**f**) voltage pulses of varying magnitudes for 100 μs each. **g**, Non-volatile retention for high and low conductance states measured using a  $V_{\rm BG}$  of 0 V and a  $V_{\rm DS}$  of 1 V for 6,000 s.

**h**, Analogue programming and retention for four distinct conductance states. **i**, Circuit diagrams for a comparator consisting of two MoS<sub>2</sub> memtransistors (MT1 and MT2). **j**, Transfer curve for the comparator, that is, the output voltage,  $V_{\rm OUT}$ , as a function of the input voltage,  $V_{\rm IN}$ . Switching occurs at a  $V_{\rm IN}$  of 200 mV, which is denoted as the reference voltage,  $V_{\rm REF}$ , for the comparator. **k**, Different  $V_{\rm REF}$  obtained by programming MT2. **l**,  $V_{\rm OUT}$  achieved in response to an arbitrary input waveform for a  $V_{\rm REF}$  of 0 V and a  $V_{\rm DD}$  of 2 V.

line model geometry. This  $R_{\rm C}$  value is consistent with previous literature on Ni–MoS $_2$  contacts  $^{46,47}$ . However, it is much higher than recent reports using bismuth (Bi) and antimony (Sb) as contacts to MoS $_2$  (refs. 26,48). Nevertheless, integrating these exotic materials into standard fabrication processes poses compatibility challenges. Although enhancing the performance of MoS $_2$  memtransistors in future M3D ICs is a desirable goal, the current performance level is sufficient for the applications intended in this work.

A key feature of our MoS<sub>2</sub> memtransistors is their programmability, which allows reconfiguration of circuits on the basis of application need. Figure 3e,f shows the transfer characteristics of a representative MoS<sub>2</sub> memtransistor after programming with positive and negative voltage pulses applied to the local back gate, with varying magnitudes ranging from 4 to 12 V and -4 to -12 V, respectively, with the same pulse time for 100  $\mu$ s each. The resulting shifts in  $V_{TH}$  can be attributed to the trapping and detrapping of carriers in the floating-gate stack. Figure 3g illustrates the non-volatile retention for the high and low conductance states measured using a  $V_{BG}$  of 0 V and a  $V_{DS}$  of 1 V for 6,000 s. The memory ratio (MR) between the two states show minimal degradation from  $\sim 4 \times 10^3$  to  $\sim 10^3$ . The projected retention before MR decays to 1 was found to be around 1 day on the basis of the experimental fit. Similarly, Fig. 3h shows non-volatile retention characteristics for four distinct analogue conductance states, each for 1,000 s. Extended Data Fig. 5 presents the programming endurance over 1,000 cycles. Although

demonstrating longer-term retention and additional endurance cycles would be ideal, the current performance is adequate for numerous edge applications.

Figure 3i shows the circuit diagrams for a comparator consisting of two MoS<sub>2</sub> memtransistors (MT1 and MT2), connected in series. Note that MT1 is made to serve as a depletion load by shorting its gate terminal to its source terminal. Figure 3j shows the transfer curve for the comparator, that is, the output voltage,  $V_{OUT}$ , measured at node, N2, as a function of the input voltage,  $V_{IN}$ , applied to node N3 (the gate terminal of MT2). For  $V_{IN} = -1$  V, MT2 is in the OFF-state (open circuit), pulling up  $V_{OUT}$  to the  $V_{DD}$  of 5 V, which is applied to the source terminal of MT1 (node N1). Similarly, for  $V_{IN} = 1 \text{ V}$ , MT2 is in the ON-state (short circuit), pulling down  $V_{\text{OUT}}$  to the  $V_{\text{GND}}$  of 0 V, which is applied to the drain terminal of MT2 (node N4). This explains why  $V_{\text{OUT}}$  switches from 5 to 0 V as  $V_{\rm in}$  is swept from –1 to 1 V. This switching occurs at  $V_{\rm IN}$  = 200 mV, which is denoted as the reference voltage,  $V_{\rm REF}$ , of the comparator. Interestingly,  $V_{REF}$  can be adjusted by programming MT2 as shown in Fig. 3k. Extended Data Fig. 6 shows the comparator output for different  $V_{\rm DD}$ . Finally, Fig. 31 shows the output of the comparator in response to an arbitrary input waveform for a  $V_{REF}$  of 0 V and a  $V_{DD}$  of 2 V. It is important to note here that, while the ILD affects the performance of the MoS<sub>2</sub> memtransistors due to n-type surface charge transfer doping from ALD-grown Al<sub>2</sub>O<sub>3</sub> (ref. 49), as shown in Extended Data Fig. 7, the functionalities of MoS<sub>2</sub> memtransistor-based circuits are not adversely



**Fig. 4** | **Electrical characterization of graphene chemitransistors. a**, Transfer characteristics of 130 graphene chemitransistors taken at a  $V_{\rm DS}$  of 500 mV. **b**-**d**, Distribution of  $V_{\rm Dirac}(\mathbf{b})$ ,  $\mu_{\rm FE,N}(\mathbf{c})$  and  $\mu_{\rm FE,P}(\mathbf{d})$  for all 130 graphene chemitransistors. **e**, Schematic diagram of a graphene-based chemisensor

consisting of two graphene chemitransistors, GC1 and GC2, connected in series.  $\mathbf{f}$ ,  $V_{\text{Gr}}$  as a function of  $V_{\text{LTG}}$  with a  $V_{\text{DD,Gr}}$  of 500 mV under different concentration of NaCl solution (20, 40, 60 and 100 mM).  $\mathbf{g}$ , Monotonic  $V_{\text{Gr}}$  evolution with increasing NaCl concentration obtained at two different  $V_{\text{LTG}}$  values of 0.45 and 0.55 V.

affected. For example, the observed shift in  $V_{\rm TH}$  is compensated using the programming capability of the floating-gate stack to ensure proper logic levels for the intended applications.

## Graphene chemitransistor-based sensing module

Tier 2 of our M3D IC incorporates graphene chemitransistors that are specifically engineered to allow direct application of chemical solutions onto the chip for chemisensing. These solutions play the role as a liquid top-gate for the graphene chemitransistors, due to the formation of an electric double layer at the graphene channel and chemical solution interface. This electric double layer functions as an ultra-thin dielectric layer and is therefore crucial for controlling the conductance of the channel when an electrical bias is applied to the solution<sup>50</sup>. Figure 4a shows the transfer characteristics, that is,  $I_{\rm DS}$  plotted as a function of the liquid top-gate voltage ( $V_{\rm LTG}$ ), for 130 graphene chemitransistors in aqueous solution at a constant  $V_{DS}$  of 500 mV. The observed transfer curves are quintessential to graphene with the global minima referred to as the Dirac points. Additionally, the graphene chemitransistors exhibit ambipolar transport, that is, they demonstrate both electron and hole conduction, which is a direct consequence of graphene being a zero-bandgap semiconductor. Figure 4b-d, respectively, depict the distribution of the Dirac voltage  $(V_{\rm Dirac})$ , that is, the applied  $V_{\rm LTG}$  that results in a minimum  $I_{\rm DS}$ , and the electron and hole mobility values ( $\mu_{\text{FE},N}$  and  $\mu_{\text{FE},P}$ ) extracted from their respective peak transconductances for these 130 graphene chemitransistors. The median values for  $V_{\rm Dirac}$ ,  $\mu_{\rm FE,N}$  and  $\mu_{\rm FE,P}$  were found to be 1.1 V,  $126 \text{ cm}^2 \text{ V}^{-1} \text{ s}^{-1}$  and  $219 \text{ cm}^2 \text{ V}^{-1} \text{ s}^{-1}$ , respectively. The device-to-device variation can be ascribed to imperfections introduced in the graphene during growth, transfer and/or chemitransistor fabrication processes<sup>51-53</sup>. Although such variation is generally undesirable for most applications, here it presents an opportunity for designing a chemisensor, as discussed in the following section.

Each chemisensor in our M3D IC comprises two graphene chemitransistors (GC1 and GC2) connected in series as shown using the circuit schematic in Fig. 4e. The output from the chemisensor is obtained at node N2. Figure 4f shows the output voltage ( $V_{\rm Gr}$ ) as a function of

 $V_{\rm LTG}$  with a supply voltage ( $V_{\rm DD,Gr}$ ) of 1 V when deionized (DI) water with different concentrations of NaCl is used as the liquid solution. The distinct shapes of these response curves directly arise from the device-to-device variation. Without this variation, the output of the chemisensor would be a horizontal straight line with a  $V_{\rm Gr}$  of 500 mV irrespective of the chemical solution. Nevertheless, the  $V_{\rm Gr}$  obtained at  $V_{\rm LTG}$  values of 0.45 and 0.55 V show monotonic changes with the increasing concentration of NaCl in DI water, as illustrated using the bar plots in Fig. 4g. This confirms the functioning of the graphene-based circuit as a chemisensor, offering the ability to adjust the  $V_{\rm LTG}$  for optimal sensitivity.

Additionally, it should be noted that the characteristics of the voltage transfer curve can vary depending on the position of the Dirac points in individual graphene chemitransistors, as illustrated in Extended Data Fig. 8. While the exact shape of these curves is not critical for the application demonstrated here, consistent performance is crucial for future practical use. This will require further development of graphene devices. Initially, minimizing device-to-device variation through optimized CVD growth conditions and improved transfer processes is key. Subsequently, we can reintroduce variation between two graphene chemitransistors by utilising the memristive properties of graphene, as detailed in our previous work Although our current study intentionally uses device-to-device variation to achieve the desired curve shape, our aim is to control this variation to guarantee reproducibility.

## M3D chip for near-sensor computing applications

Near-sensor computing is a transformative approach that allows processing of data close to where it is generated, as opposed to transmitting it over long distances to central processing units: that is, cloud-based computing. Such a shift enables real-time analysis of data, leading to lower latency and faster decision making, and contributes to better bandwidth, accuracy and energy efficiency. As a result, this proximity-driven approach has far-reaching implications in many applications, with extra consideration to the field of chemical sensing. For example, in environmental monitoring and



 $\label{eq:fig.5} \textbf{Fig. 5} \ | \ \textbf{Near-sensor compute using M3D ICs. a}, \ \textbf{Optical image of the M3D chip} \\ \textbf{with a chemical solution on top. b}, \ \textbf{3D circuit layout illustrating the connection} \\ \textbf{between a graphene-chemitransistor-based chemical sensor in tier 2 and an } \\ \textbf{MoS}_2\text{-memtransistor-based comparator in tier 1, enabled by an inter-tier via.} \\ \textbf{c}, \ \textbf{Transfer curves for the chemisensor in response to two different sugar} \\ \textbf{solutions. d}, \ \textbf{Temporal evolution of the transfer curves for the dilute sugar} \\ \end{aligned}$ 

solution when left to evaporate for 15 min. **e**,  $V_{\rm Gr}$  measured at a  $V_{\rm LTG}$  of 0.75 V (**e**) and corresponding output from the comparator,  $V_{\rm OUT}$  (**f**), with different programmed  $V_{\rm REF}$ , as a function of time. **g**, **h**, Transfer curves for 16 chemisensors in response to four different chemicals, C1 to C4 (**g**), and corresponding  $V_{\rm Gr}$  extracted at a  $V_{\rm LTG}$  of 0.6 V (**h**). **i.j**, One-dimensional (**i**) and 2D (**j**) digital code for each chemical obtained using the same circuit architecture shown in **b**.

industrial process control, the ability to detect chemical events or anomalies in real-time enables early warning systems and proactive measures to mitigate potential risks and safety issues. We illustrate this concept by using our M3D IC to detect instances where the concentration of a certain chemical in a solution rises above a set limit, as shown in Fig. 5a.

The near-sensor computing architecture used for the task is illustrated in Fig. 5b. Here, the output node of a graphene-chemitransistorbased chemisensor in tier 2 is connected to the input node of an MoS<sub>2</sub>-memtransistor-based comparator circuit in tier 1, as facilitated by an inter-tier via. Figure 5c displays the transfer curves for the chemisensor when exposed to two different sugar solutions created by mixing a sugary beverage with water in ratios of 1:1 (dilute, red curve) and 2:1 (concentrated, blue curve). These curves were recorded immediately after the solutions were applied to the chip. In contrast, Fig. 5d illustrates how these response curves evolve over time as a dilute sugar solution is left to evaporate for 15 min, gradually increasing its sugar concentration. Figure 5e shows  $V_{Gr}$  measured at  $V_{LTG} = 0.75$  V as a function of time as the solution evaporates. As can be seen,  $V_{Gr}$  starts at ~0.7 V and eventually decreases to 0.5 V after 10 min; this can be read as if the device is shifting from the dilute transfer characteristics shown in Fig. 5c to the more concentrated transfer characteristics due to an increase in sugar concentration. This establishes that  $V_{Gr}$  serves as an indication of sugar level in the solution. Note that while we have not functionalized individual graphene chemisensors, functionalized multiplexed arrays<sup>55</sup> can enhance the selectivity and sensitivity of our proposed architecture when incorporating more complex and diverse chemicals.

Next, to develop an alert system, the analogue output voltage  $(V_{Gr})$  needs to be transformed into a digital signal to trigger subsequent modules. This is accomplished using the programmable comparator based on MoS<sub>2</sub> memtransistors depicted in Fig. 5f, for several distinct reference voltages,  $V_{\rm REF}$ . Notably, the output logic shifts when the sugar concentration surpasses a specific allowable limit predefined by  $V_{REF}$ . As anticipated, the time required to activate the alert system is shorter when the objective is to signal at lower concentrations and longer at higher concentrations. The non-idealities observed in the digitization process can be ascribed to the lower gain of the comparator circuit reducing the abruptness of state transition from 0 V to  $V_{\rm DD}$ , that is, digital 0 state to the digital 1 state. The gain can be improved either by using a CMOS inverter, which will require integration of both n- and p-type 2D memtransistors<sup>56</sup>, or by cascading multiple depletion-mode inverters. Extended Data Fig. 9 shows the results of digitization using a three-stage-cascaded-inverter-based comparator circuit.

We also show how the 3D IC array shown in Fig. 2a can be exploited for chemical codification by harnessing the response variation among the graphene-based chemisensors. Figure 5g shows the transfer curves for 16 chemisensors in response to four different chemicals, C1 to C4. Figure 5h shows the corresponding  $V_{\rm Gr}$  values extracted at  $V_{\rm LTG}$  of 0.6 V, which form an analogue code for each chemical. Using the same circuit architecture shown in Fig. 5b, this analogue code can be converted to a one-dimensional or 2D digital code as shown Fig. 5i, j, respectively. The comparator  $V_{RFF}$  was set to 250 mV. Note that by tuning the read voltage,  $V_{\rm LTG}$ , or by using the programmability of the MoS<sub>2</sub>-memtransistor-based comparator to adjust  $V_{REF}$ , it is possible to generate different codes for the same chemicals, as shown in Extended Data Fig. 10. Nevertheless, while these demonstrations are straightforward, they have broad implications for near-sensor computing, offering potential application in more complex scenarios through the integration of intricate circuits and additional sensors. Also, the physical proximity of less than 50 nm between sensing and computing modules achieved in our heterogeneous M3D chip based on 2D materials is better than state-of-the-art packaging solutions that use distinct technologies in these components.

## **Conclusions**

We have reported a monolithic heterogeneous 3D integration of graphene-based chemitransistors and monolayer-MoS $_2$ -based memtransistors across two tiers using a dense inter-tier via structure with an interconnect density of 62,500 I/O per mm $^2$ . The M3D stack can be fabricated at temperatures below 200 °C, making it compatible with BEOL integration. Notably, the vertical proximity between sensors and computing elements in our stack is 50 nm, surpassing current 3D packaging solutions and potentially reducing computational latency and improving bandwidth.

## **Methods**

#### Large-area monolayer MoS<sub>2</sub> film growth

Monolayer  $MoS_2$  was deposited on epi-ready 2-inch c-sapphire substrate by MOCVD. An inductively heated graphite susceptor equipped with wafer rotation in a cold-wall horizontal reactor (https://doi.org/10.60551/znh3-mj13) was used to achieve uniform monolayer deposition. Molybdenum hexacarbonyl ( $Mo(CO)_6$ ) and hydrogen sulfide ( $H_2S$ ) were used as precursors.  $Mo(CO)_6$  maintained at 25 °C and 625 torr in a stainless-steel bubbler was used to deliver  $4.7 \times 10^{-3}$  sccm of the metal precursor for the growth, while 400 sccm of  $H_2S$  was used for the process.  $MoS_2$  deposition was carried out at 1,000 °C and 50 torr in  $H_2$  ambient, with monolayer growth being achieved in 11 min. Before growth, the substrate was baked at 1,000 °C in  $H_2$  for 10 min. Following growth, the substrate was cooled in  $H_2S$  to 300 °C to inhibit the decomposition of the  $MoS_2$  film. More details on the growth process can be found in an earlier study <sup>57</sup>.

## MoS, film transfer to local back-gate island

Film transfer from the growth substrate to the application substrate was performed using a PMMA-assisted wet transfer process<sup>58</sup>. First, the as-grown MoS<sub>2</sub> on the sapphire substrate was spin-coated with PMMA and left to sit for 24 h to ensure good PMMA-MoS<sub>2</sub> adhesion. The corners of the spin-coated film were then scratched using a razor blade and immersed in DI water kept at 50 °C for 2 h. Capillary action caused the DI water to be preferentially drawn into the substrate-MoS<sub>2</sub> interface, owing to the hydrophilic nature of sapphire and hydrophobic nature of MoS<sub>2</sub> and PMMA, separating the PMMA–MoS<sub>2</sub> stack from the sapphire substrate. The separated film was then fished from the DI water using a clean glass slide and rinsed in three separate water baths for 15 min each before finally being transferred onto the application substrate. Subsequently, the substrate was baked at 50 and 70 °C for 15 min each to remove moisture and promote film adhesion, thus ensuring a pristine interface, before the PMMA was removed by immersing the sample in acetone for 1 h and the substrate was cleaned with a subsequent 30 min isopropyl alcohol (IPA) bath.

#### SEM

SEM of the 2D MoS $_2$  transistors used in this study was conducted using a Zeiss Gemini 500 field emission SEM system at an accelerating voltage of 5 kV.

## **TEM sample preparation**

The TEM sample depicted in Fig. 2d was prepared using a Thermo Fisher Scientific (TFS) Scios 2 DualBeam focused ion beam SEM instrument. The sample was coated with two carbon layers initially: the first layer,  $\sim\!0.5\,\mu m$  thick, was deposited by a 1.6 nA electron beam to protect the surface of  $MoS_2$  and  $WSe_2$  layer from the following Ga ion beam damage during the second layer of carbon deposition. The second layer, approximately 3  $\mu m$  thick, was deposited using a 0.3 nA Ga ion beam to provide surface protection for later ion beam milling and sample thinning. After this coating process, the sample with a 2- $\mu m$ -thick cross-section from the region of interest was extracted and transferred in situ to a copper half-grid. Then the lamella was thinned using a Ga ion beam at progressively lower voltages (30,16,8,5 and 2 kV) to minimize ion beam damage as the sample became thinner.

#### STEM characterization of the cross-section

In this study, the TFS Titan3 G2 60-300 S/TEM was used to perform STEM and EDS analyses, as shown in Fig. 2d–f. To mitigate carbon deposition, a 15 min beam shower was applied in the STEM setting, with a dwell time of 0.05  $\mu$ s,  $\times$ 5,000 magnification and 150  $\mu$ m C2 aperture. The STEM and EDS analyses were operated with an acceleration voltage of 300 kV, featuring a spot size of 6, a C2 aperture of 70  $\mu$ m and a convergent angle of 25.2 mrad. Elemental mapping was performed using the Super-X EDS system in STEM mode. A series of HAADF-STEM images of the 3D IC were captured at a beam current of 0.07 nA, followed by EDS mappings at a beam current of 0.30 nA. EDS analysis was conducted using Esprit software, incorporating 1/8 Q-Map preprocessing and the series fit deconvolution method. Peak-to-background (P/B) ZAF quantification was used to generate mass percentage (norm.) elemental maps, with a postfilter averaging 9 pixels.

#### **AFM**

AFM was used to study the surface morphology, coverage and thickness of the deposited layers. Scanasyst air probe AFM tips with a nominal tip radius of about 2 nm and spring constant of 0.4 N m $^{-1}$  were used for the measurements, and the images were collected using peak-force tapping mode with a peak force of 14 nN and a scan speed of 2 Hz.

## Raman spectroscopy

Raman characterization on  $MoS_2$  and graphene was taken using a Horiba LabRAM HR Evolution confocal Raman microscope with a 532 nm laser. The power was 34 mW filtered at 1%. The objective magnification was ×100 with a numerical aperture of 0.9, and the grating had a spacing of 1,800 gr mm<sup>-1</sup> for Raman.

#### **Electrical characterization**

Electrical characterization of the fabricated devices was performed using a semi-automated Formfactor 12000 probe station under atmospheric conditions with a Keysight B1500A parameter analyser. A continuous wave white light source was used for all experiments involving light illumination unless otherwise stated.

## Data availability

Data on samples produced in the 2DCC-MIP facility, including growth recipes and characterization data, are available at https://doi.org/10.26207/f095-ha45. Other data that support the finding of this study are available from the corresponding author on reasonable request.

## **Code availability**

The codes used for plotting the data are available from the corresponding authors on reasonable request.

## References

- Choudhury, D. 3D integration technologies for emerging microsystems. In 2010 IEEE MTT-S International Microwave Symposium 1–4 (IEEE, 2010).
- Koester, S. J. et al. Wafer-level 3D integration technology. IBM J. Res. Dev. 52, 583–597 (2008).
- Clavelier, L. et al. Engineered substrates for future More Moore and More than Moore integrated devices. In 2010 International Electron Devices Meeting 2–6 (IEEE, 2010).
- Radojcic, R. More-than-Moore 2.5 D and 3D SiP Integration (Springer, 2017).
- Sun, H. et al. Design of 3D DRAM and its application in 3D integrated multi-core computing systems. *IEEE Des. Test Comput.* 26, 36–47 (2009).
- Pawlowski, J. T. Hybrid memory cube (HMC). In 2011 IEEE Hot Chips 23 Symposium (HCS) 1–24 (IEEE, 2011).

- Lee, J. C. et al. High bandwidth memory (HBM) with TSV technique. In 2016 International SoC Design Conference (ISOCC) 181–182 (IEEE. 2016).
- Woo, D. H., Seong, N. H., Lewis, D. L. & Lee, H. H. An optimized 3D-stacked memory architecture by exploiting excessive, high-density TSV bandwidth. In HPCA-16 2010 The Sixteenth International Symposium on High-Performance Computer Architecture 1–12 (IEEE, 2010).
- 9. Kurino, H. et al. Intelligent image sensor chip with three dimensional structure. In *International Electron Devices Meeting* 1999. Technical Digest (Cat. No. 99CH36318) 879–882 (IEEE, 1999).
- Henry, D. et al. Through silicon vias technology for CMOS image sensors packaging. In 2008 58th Electronic Components and Technology Conference 556–562 (IEEE, 2008).
- Lee, H. J., Mahajan, R., Sheikh, F., Nagisetty, R. & Deo, M. Multi-die integration using advanced packaging technologies.
   In 2020 IEEE Custom Integrated Circuits Conference (CICC) 1–7 (IEEE, 2020).
- Gao, G. et al. Scaling package interconnects below 20µm pitch with hybrid bonding. In 2018 IEEE 68th Electronic Components and Technology Conference (ECTC) 314–322 (IEEE, 2018).
- Sheikh, F., Nagisetty, R., Karnik, T. & Kehlet, D. 2.5 D and 3D heterogeneous integration: emerging applications. *IEEE Solid-State Circuits Mag.* 13, 77–87 (2021).
- Vinet, M. et al. Monolithic 3D integration: A powerful alternative to classical 2D scaling. In 2014 SOI-3D-Subthreshold Microelectronics Technology Unified Conference (S3S) 1–3 (IEEE, 2014).
- Dhananjay, K., Shukla, P., Pavlidis, V. F., Coskun, A. & Salman, E. Monolithic 3D Integrated circuits: recent trends and future prospects. *IEEE Trans. Circuits Syst. Express Briefs* 68, 837–843 (2021).
- Then, H. W. et al. 3D heterogeneous integration of high performance high-K metal gate GaN NMOS and Si PMOS transistors on 300mm high-resistivity Si substrate for energy-efficient and compact power delivery, RF (5G and beyond) and SoC applications. In 2019 IEEE International Electron Devices Meeting (IEDM) 17.3.1–17.3.4 (IEEE, 2019).
- Rachmady, W. et al. 300mm heterogeneous 3D integration of record performance layer transfer germanium PMOS with silicon NMOS for low power high performance logic applications. In 2019 IEEE International Electron Devices Meeting (IEDM) 29.7.1–29.7.4 (IEEE, 2019).
- Zhu, K. et al. Hybrid 2D–CMOS microchips for memristive applications. *Nature* 618, 57–62 (2023).
- Kim, K.-H. et al. Scalable CMOS back-end-of-line-compatible AlScN/two-dimensional channel ferroelectric field-effect transistors. Nat. Nanotechnol. 18, 1044-1050 (2023).
- 20. Shulaker, M. M. et al. Three-dimensional integration of nanotechnologies for computing and data storage on a single chip. *Nature* **547**, 74–78 (2017).
- Jayachandran, D. et al. Three-dimensional integration of two-dimensional field-effect transistors. *Nature* 625, 276–281 (2024).
- 22. Kang, J.-H. et al. Monolithic 3D integration of 2D materials-based electronics towards ultimate edge computing solutions. *Nat. Mater.* **22**, 1470–1477 (2023).
- 23. Pendurthi, R., Sakib, N.U. & Sadaf, M.U.K. et al. Monolithic three-dimensional integration of complementary two-dimensional field-effect transistors. *Nat. Nanotechnol.* **19**, 970–977 (2024).
- Sebastian, A., Pendurthi, R., Choudhury, T. H., Redwing, J. M. & Das, S. Benchmarking monolayer MoS<sub>2</sub> and WS2 field-effect transistors. *Nat. Commun.* 12, 693 (2021).

- English, C. D., Smithe, K. K., Xu, R. L. & Pop, E. Approaching ballistic transport in monolayer MoS₂ transistors with self-aligned 10 nm top gates. In 2016 IEEE International Electron Devices Meeting (IEDM) 5–6 (IEEE, 2016).
- Shen, P.-C. et al. Ultralow contact resistance between semimetal and monolayer semiconductors. *Nature* 593, 211–217 (2021).
- Nikonov, D. E. & Young, I. A. Benchmarking of beyond-CMOS exploratory devices for logic integrated circuits. *IEEE J. Explor.* Solid-State Comput. Devices Circuits 1, 3–11 (2015).
- Sylvia, S. S., Alam, K. & Lake, R. K. Uniform benchmarking of low-voltage van der Waals FETs. *IEEE J. Explor. Solid-State* Comput. Devices Circuits 2, 28–35 (2016).
- Lee, C.S., Cline, B., Sinha, S., Yeric, G. & Wong, H. S. 32-bit processor core at 5-nm technology: Analysis of transistor and interconnect impact on VLSI system performance. In 2016 IEEE international electron devices meeting (IEDM) 28.3.1–28.3.4 (IEEE, 2016).
- Sebastian, A., Pannone, A., Radhakrishnan, S. S. & Das, S. Gaussian synapses for probabilistic neural networks. Nat. Commun. 10, 4199 (2019).
- Jayachandran, D. et al. A low-power biomimetic collision detector based on an in-memory molybdenum disulfide photodetector. Nat. Electron. https://doi.org/10.1038/s41928-020-00466-9 (2020).
- 32. Dodda, A. et al. Stochastic resonance in MoS2 photodetector. *Nat. Commun.* **11**, 4406 (2020).
- Arnold, A. J. et al. Mimicking neurotransmitter release in chemical synapses via hysteresis engineering in MoS<sub>2</sub> transistors. ACS Nano 11, 3110–3118 (2017).
- 34. Nasr, J. R. et al. Low-power and ultra-thin MoS<sub>2</sub> photodetectors on glass. *ACS Nano* **14**, 15440–15449 (2020).
- Das, S., Dodda, A. & Das, S. A biomimetic 2D transistor for audiomorphic computing. Nat. Commun. 10, 3450 (2019).
- Das, S. Two dimensional electrostrictive field effect transistor (2D-EFET): a sub-60mV/decade steep slope device with high ON current. Sci. Rep. 6, 34811 (2016).
- Xue, M. et al. Integrated biosensor platform based on graphene transistor arrays for real-time high-accuracy ion sensing. Nat. Commun. 13, 5064 (2022).
- Hayasaka, T. et al. An electronic nose using a single graphene FET and machine learning for water, methanol, and ethanol. *Microsyst. Nanoeng.* 6, 50 (2020).
- Ghosh, S. et al. An all 2D bio-inspired gustatory circuit for mimicking physiology and psychology of feeding behavior. Nat. Commun. 14, 6021 (2023).
- Akinwande, D. et al. Graphene and two-dimensional materials for silicon technology. *Nature* 573, 507–518 (2019).
- 41. Lin, Y.-M. et al. Wafer-scale graphene integrated circuit. *Science* **332**, 1294–1297 (2011).
- Yu, Z. et al. Realization of room-temperature phonon-limited carrier transport in monolayer MoS<sub>2</sub> by dielectric and carrier screening. Adv. Mater. 28, 547–552 (2016).
- Radisavljevic, B., Radenovic, A., Brivio, J., Giacometti, V. & Kis, A. Single-layer MoS<sub>2</sub> transistors. Nat. Nanotechnol. 6, 147–150 (2011).
- Kwon, J. et al. 200-mm-wafer-scale integration of polycrystalline molybdenum disulfide transistors. *Nat. Electron.* https://doi. org/10.1038/s41928-024-01158-4 (2024).
- Dodda, A. et al. Active pixel sensor matrix based on monolayer MoS<sub>2</sub> phototransistor array. Nat. Mater. 21, 1379–1387 (2022).
- Schranghamer, T. F. et al. Ultrascaled contacts to monolayer MoS<sub>2</sub> field effect transistors. Nano Lett. 23, 3426–3434 (2023).
- Schulman, D. S., Arnold, A. J. & Das, S. Contact engineering for 2D materials and devices. *Chem. Soc. Rev.* https://doi.org/10.1039/ c7cs00828g (2018).

- Chou, A. S. et al. Antimony semimetal contact with enhanced thermal stability for high performance 2D electronics. In 2021 IEEE International Electron Devices Meeting (IEDM) 7.2.1–7.2.4 (IEEE. 2021).
- 49. McClellan, C. J., Yalon, E., Smithe, K. K., Suryavanshi, S. V. & Pop, E. High current density in monolayer  $MoS_2$  doped by AlO x. ACS Nano **15**, 1587–1596 (2021).
- 50. Chen, F., Qing, Q., Xia, J., Li, J. & Tao, N. Electrochemical gate-controlled charge transport in graphene in ionic liquid and aqueous solution. *J. Am. Chem. Soc.* **131**, 9908–9909 (2009).
- 51. Yazyev, O. V. & Louie, S. G. Electronic transport in polycrystalline graphene. *Nat. Mater.* **9**, 806–809 (2010).
- Chen, H. J. et al. Defect scattering in graphene. *Phys. Rev. Lett.* 102, 236805 (2009).
- 53. Dodda, A. et al. Graphene-based physically unclonable functions that are reconfigurable and resilient to machine learning attacks. *Nat. Electron.* **4**, 364–374 (2021).
- 54. Schranghamer, T. F., Oberoi, A. & Das, S. J. N. C. Graphene memristive synapses for high precision neuromorphic computing. *Nat. Commun.* **11**, 5474 (2020).
- 55. Hwang, Y. J. et al. Multiplexed DNA-functionalized graphene sensor with artificial intelligence-based discrimination performance for analyzing chemical vapor compositions. *Microsyst. Nanoeng.* **9**, 28 (2023).
- Pendurthi, R. et al. Heterogeneous integration of atomically thin semiconductors for non-von Neumann CMOS. Small 18, 2202590 (2022).
- 57. Chen, C. et al. Effect of growth temperature on the microstructure and properties of epitaxial MoS<sub>2</sub> monolayers grown by metalorganic chemical vapor deposition. *J. Vac. Sci. Technol. A* **42**, 022201 (2024).
- 58. Sebastian, A. et al. Electrochemical polishing of two-dimensional materials. *ACS Nano* **13**, 78–86 (2018).
- Kim Y, Kim J, Kim H, Lee H, Kim D, Seo SK, Jo C, Kim DW. Die to wafer hybrid Cu bonding for fine pitch 3D-IC applications. In 2023 IEEE 73rd Electronic Components and Technology Conference (ECTC) 1043–1047 (IEEE, 2023).
- Sakuma, K. et al. Heterogeneous integration on organic interposer substrate with fine-pitch RDL and 40 micron pitch micro-bumps. In Proc. 2023 IEEE 73rd Electronic Components and Technology Conference (ECTC) 872–877 (IEEE, 2023).
- 61. Elsherbini, A. et al. Hybrid bonding interconnect for advanced heterogeneously integrated processors. In *Proc. 2021 IEEE 71st Electronic Components and Technology Conference (ECTC)* 1014–1019 (IEEE, 2021).
- 62. Kim, S. W. et al. Novel Cu/SiCN surface topography control for 1 µm pitch hybrid wafer-to-wafer bonding. In 2020 IEEE 70th Electronic Components and Technology Conference (ECTC) 216–222 (IEEE, 2020).
- 63. Roshanghias, A. et al. 3D lintegration via D2D bump-less Cu bonding with protruded and recessed topographies. *ECS J. Solid State Sci. Technol.* **12**, 084001 (2023).
- 64. Xie, L., Wickramanayaka, S., Chong, S. C., Sekhar, V. N. & Cereno, D. I. High-throughput Thermal Compression Bonding of 20 um Pitch Cu Pillar with Gas Pressure Bonder for 3D IC Stacking. In 2016 IEEE 66th Electronic Components and Technology Conference (ECTC) 108–114 (IEEE, 2016).
- Sahoo, K., Ren, H. & Iyer, S. S. A high throughput two-stage die-to-wafer thermal compression bonding scheme for heterogeneous integration. In Proc. 2023 IEEE 73rd Electronic Components and Technology Conference (ECTC) 362–366 (IEEE, 2023).
- Lin, M. L. et al. Organic interposer CoWoS-R+ (plus) technology.
  In Proc. 2022 IEEE 72nd Electronic Components and Technology Conference (ECTC) 1–6 (IEEE, 2022).

67. Yoneda, S. et al. A novel photosensitive polyimide adhesive material for hybrid bonding processing. In 2021 IEEE 71st Electronic Components and Technology Conference (ECTC) 680–686 (IEEE, 2021).

## **Acknowledgements**

The MOCVD-grown  ${\rm MoS_2}$  monolayer samples were provided by the 2D Crystal Consortium Materials Innovation Platform (2DCC-MIP) facility at the Pennsylvania State University, which is funded by the National Science Foundation (NSF) under cooperative agreement no. DMR-2039351. The authors also acknowledge funding support from the NSF under Award EECS-2042154.

#### **Author contributions**

S.D. conceived the idea and designed the experiments. S.G. and Y.Z. fabricated all the 3D chips. S.D., S.G. and Y.Z. performed the experiments, analysed the data, discussed the results and agreed on their implications. C.C. grew the 2D materials under the supervision of J.M.R. Z.Z. and Y.S. performed the focused ion beam and TEM for the 3D chip under the supervision of Y.Y. T.F.S. performed the SEM of the 3D chip. N.U.S. performed the Raman experiment. A.O. performed the AFM measurements. S.G., Y.Z. and S.D. contributed to the preparation of the paper.

## **Competing interests**

The authors declare no competing interests.

## **Additional information**

**Extended data** is available for this paper at https://doi.org/10.1038/s41928-024-01251-8.

**Correspondence and requests for materials** should be addressed to Saptarshi Das.

**Peer review information** *Nature Electronics* thanks Sang-Hoon Bae, Xurui Mao and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

**Reprints and permissions information** is available at www.nature.com/reprints.

**Publisher's note** Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

© The Author(s), under exclusive licence to Springer Nature Limited 2024



 $\textbf{Extended Data Fig. 1} | \textbf{Monolithic and heterogeneous 3D integration of 2D materials.} \\ \textbf{Schematic showing the M3D stack comprising graphene chemitransistor-based chemisensors in tier 2 connected to $MoS_2$-memtransistor-based comparator in tier 1 for near sensor computing application.} \\$ 



**Extended Data Fig. 2** | **Surface topography and roughness of M3D IC.** Atomic force microscope (AFM) images and height profiles after (a) fabrication of tier  $1\,\text{MoS}_2$  devices, (b) post-ILD deposition, and (c) after the via formation and fabrication of tier  $2\,\text{graphene}$  devices. (d) Surface roughness on top of the ILD over  $5\,\mu\text{m} \times 5\,\mu\text{m}$  area showing a mean roughness of 270 pm.



 $\textbf{Extended Data Fig. 3} \ | \ \textbf{Output characteristics for an MoS}_2 \ \textbf{memtransistor.} \ \textbf{O} \ \textbf{utput characteristics, that is, source-to-drain current, } \ l_{DS}, \ \textbf{versus drain voltage, V}_{DS}, \ \textbf{at different back-gate voltage, V}_{BG}, \ \textbf{ranging from 0 to 7 V in steps of 1 V, for a representative MoS}_2 \ \textbf{memtransistor.}$ 



**Extended Data Fig. 4** | **Extraction of contact resistance for MoS<sub>2</sub> memtransistors.** (a) Via resistance as a function of via-area. (b) SEM image of the traditional transmission line measurement (TLM) design showing channel lengths of 100 nm, 200 nm, 500 nm, and 1000 nm used to extract the contact resistance. (c) Corresponding transfer characteristics for  $MoS_2$  memtransistors

obtained from 25 TLM structures. **(d)** Extracted total resistance ( $R_T$ ) as a function of  $L_{CH}$  for an inversion carrier density,  $n_s = 5 \times 10^{12} \, \mathrm{cm}^{-2}$ . **(e)**  $R_C$  extracted from the y-intercept of the  $R_T$  versus  $L_{CH}$  plots as a function of  $n_s$ .  $R_T = R_{CH} + 2R_C$ .  $R_{CH}$  is proportional to  $L_{CH}$  and inversely proportional to the carrier density ( $n_s$ ),  $R_C$ , however, is independent of  $L_{CH}$ .



 $\label{eq:continuous_problem} \textbf{Extended Data Fig. 5} \ | \ \textbf{Endurance measurements for an MoS}_2 \ \textbf{memtransistor.} \\ The read current measured at a $V_{BG}$ of 0 V using a $V_{DS}$ of 1 V every time after programming a representative $MoS}_2$ memtransistor in its high and low$ 

conductance states for a total of 1000 cycles. No degradation in the memory ratio (MR) highlights the fact that our  $\rm MoS_2$  memtransistors offer high endurance.



 $\textbf{Extended Data Fig. 6} \ | \ \textbf{Supply voltage dependence of an MoS}_2 \ \textbf{memtransistor based comparator.} \ \textbf{Transfer characteristics of a representative MoS}_2 \ \textbf{-memtransistor-based comparator for different V}_{DD}.$ 



**Extended Data Fig. 7** | **Impact of ILD on MoS<sub>2</sub> memtransistors.** Transfer characteristics and extracted distributions for threshold voltage ( $V_{TH}$ ), field-effect mobility ( $\mu_{FE}$ ), and subthreshold swing (SS) for 50 MoS<sub>2</sub> memtransistors with an  $L_{CH}$  of 500 nm (a) before and (b) after the ILD deposition. We observed a negative 4.5 V shift in the median  $V_{TH}$  value, which can be ascribed to n-type surface charge transfer doping (SCTD) from ALD  $Al_2O_3$ . Additionally,

there was an improvement in the median  $\mu_{FE}$  from  $3.31\,cm^2V^{-1}s^{-1}$  to  $7.27\,cm^2V^{-1}s^{-1}$ . However, the median SS experienced a degradation from 150 mV/dec to 375 mV/dec. Despite these changes, it is important to note that the functionalities of MoS $_2$  memtransistors were not adversely affected. For example, the observed shift in  $V_{TH}$  could be compensated using the programming capability of the FG stack to ensure proper logic levels for the intended applications.



 $\textbf{Extended Data Fig. 8} | \textbf{Impact of variability on chemisensor output.} \ Voltage \ transfer \ curves \ appear \ complementary \ when \ GC1 \ and \ GC2 \ are \ swapped.$ 



**Extended Data Fig. 9** | **Digitization using a three-stage cascaded inverter-based comparator.** (a) Schematic of a three-stage-inverter-based comparator circuit. Voltage transfer characteristics measured at the output of **(b)** stage 1, **(c)** stage 2, and **(d)** stage 3. Clearly, the gain improves from 172 in stage-1 to 519 in

stage-2 to 644 in stage-3. **(e)** Analogue output voltage ( $V_{Gr}$ ) from graphene chemisensor. Results of digitization at the output of **(f)** stage 1, **(g)** stage 2, and **(h)** stage 3. Clearly, cascading a higher number of inverters to construct the comparator allows better digitization.



 $\textbf{Extended Data Fig. 10} \ | \ \textbf{2D digital codes using M3D integration platform.} \ \textbf{D} \ if ferent \ 2D \ digital codes generated from the same 4 chemicals (C1, C2, C3, and C4) under different \ \textbf{V}_{LTG}, ranging from 0.2 \ V \ to 1.4 \ V.$