Diffractive optical neural networks (DONNs) are emerging as high‐throughput and energy‐efficient hardware platforms to perform all‐optical machine learning (ML) in machine vision systems. However, the current demonstrated applications of DONNs are largely image classification tasks, which undermine the prospect of developing and utilizing such hardware for other ML applications. Herein, the deployment of an all‐optical reconfigurable DONNs system for scientific computing is demonstrated numerically and experimentally, including guiding two‐dimensional quantum material synthesis, predicting the properties of two‐dimensional quantum materials and small molecular cancer drugs, predicting the device response of nanopatterned integrated photonic power splitters, and the dynamic stabilization of an inverted pendulum with reinforcement learning. Despite a large variety of input data structures, a universal feature engineering approach is developed to convert categorical input features to images that can be processed in the DONNs system. The results open up new opportunities for employing DONNs systems for a broad range of ML applications.
more »
« less
Effects of interlayer reflection and interpixel interaction in diffractive optical neural networks
Multilayer diffractive optical neural networks (DONNs) can perform machine learning (ML) tasks at the speed of light with low energy consumption. Decreasing the number of diffractive layers can reduce inevitable material and diffraction losses to improve system performance, and incorporating compact devices can reduce the system footprint. However, current analytical DONN models cannot accurately describe such physical systems. Here we show the ever-ignored effects of interlayer reflection and interpixel interaction on the deployment performance of DONNs through full-wave electromagnetic simulations and terahertz (THz) experiments. We demonstrate that the drop of handwritten digit classification accuracy due to reflection is negligible with conventional low-index THz polymer materials, while it can be substantial with high-index materials. We further show that one- and few-layer DONN systems can achieve high classification accuracy, but there is a trade-off between accuracy and model-system matching rate because of the fast-varying spatial distribution of optical responses in diffractive masks. Deep DONNs can break down such a trade-off because of reduced mask spatial complexity. Our results suggest that new accurate and trainable DONN models are needed to advance the development and deployment of compact DONN systems for sophisticated ML tasks.
more »
« less
- Award ID(s):
- 1936729
- PAR ID:
- 10474423
- Publisher / Repository:
- OSA
- Date Published:
- Journal Name:
- Optics Letters
- Volume:
- 48
- Issue:
- 2
- ISSN:
- 0146-9592
- Page Range / eLocation ID:
- 219
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
All‐optical and fully reconfigurable transmissive diffractive optical neural network (DONN) architectures emerge as high‐throughput and energy‐efficient machine learning (ML) hardware accelerators in broad applications. However, current device and system implementations have limited performance. In this work, a novel transmissive diffractive device architecture, a digitized phase‐change material (PCM) heterostack, which consists of multiple nonvolatile PCM layers with different thicknesses, is demonstrated. Through this architecture, the advantages of PCM electrical and optical properties can be leveraged and challenges associated with multilevel operations in a single PCM layer can be mitigated. Through proof‐of‐concept experiments, the electrical tuning of one PCM layer is demonstrated in a transmissive spatial light modulation device, and thermal analysis guides the design of multilayer devices and DONN systems to avoid thermal cross talk if individual heterostacks are assembled into an array. Further, a heterostack containing three PCM layers is designed based on experimental results to produce a large‐phase modulation range and uniform coverage, and the ML performance of DONN systems with the designed heterostack is evaluated. The developed device architecture is practically feasible and scalable for future energy‐efficient, fast‐reconfigured, and compact transmissive DONN systems.more » « less
-
This paper presents a design approach for the modeling and simulation of ultra-low power (ULP) analog computing machine learning (ML) circuits for seizure detection using EEG signals in wearable health monitoring applications. In this paper, we describe a new analog system modeling and simulation technique to associate power consumption, noise, linearity, and other critical performance parameters of analog circuits with the classification accuracy of a given ML network, which allows to realize a power and performance optimized analog ML hardware implementation based on diverse application-specific needs. We carried out circuit simulations to obtain non-idealities, which are then mathematically modeled for an accurate mapping. We have modeled noise, non-linearity, resolution, and process variations such that the model can accurately obtain the classification accuracy of the analog computing based seizure detection system. Noise has been modeled as an input-referred white noise that can be directly added at the input. Device process and temperature variations were modeled as random fluctuations in circuit parameters such as gain and cut-off frequency. Nonlinearity was mathematically modeled as a power series. The combined system level model was then simulated for classification accuracy assessments. The design approach helps to optimize power and area during the development of tailored analog circuits for ML networks with the ability to potentially trade power and performance goals while still ensuring the required classification accuracy. The simulation technique also enables to determine target specifications for each circuit block in the analog computing hardware. This is achieved by developing the ML hardware model, and investigating the effect of circuit nonidealities on classification accuracy. Simulation of an analog computing EEG seizure detection block shows a classification accuracy of 91%. The proposed modeling approach will significantly reduce design time and complexity of large analog computing systems. Two feature extraction approaches are also compared for an analog computing architecture.more » « less
-
Abstract Yellow organic crystals, like BNA, MNA, and NMBA, can be used to generate terahertz (THz) pulses of light through optical rectification of infrared ultrafast laser pulses. When producing THz with these organic crystals, one needs to consider that 1) their damage thresholds are low due to having low melting points and 2) Fresnel reflection losses due to multiple interfaces reduce the efficiency of the generated THz output. In this work, new heterogeneous multi‐layer “sandwich” structures are developed with these yellow organic crystals by 1) fusing them to sapphire plates to permit the crystal to withstand higher laser fluences and 2) using an index‐matching fluid (liquid crystal MBBA) to decrease Fresnel reflection losses and improve the THz output. It is shown that the sapphire plates increase the damage threshold of these yellow organic crystals by a factor of two or more, thus allowing the crystals to generate higher THz electric fields. Furthermore, it is shown that the THz light output efficiency increases by assembling the yellow crystals into multi‐layered sandwich structures. For some yellow organic crystals, the sandwich structures increase the THz intensity by more than a factor of two.more » « less
-
Abstract Solving partial differential equations (PDEs) is the cornerstone of scientific research and development. Data-driven machine learning (ML) approaches are emerging to accelerate time-consuming and computation-intensive numerical simulations of PDEs. Although optical systems offer high-throughput and energy-efficient ML hardware, their demonstration for solving PDEs is limited. Here, we present an optical neural engine (ONE) architecture combining diffractive optical neural networks for Fourier space processing and optical crossbar structures for real space processing to solve time-dependent and time-independent PDEs in diverse disciplines, including Darcy flow equation, the magnetostatic Poisson’s equation in demagnetization, the Navier-Stokes equation in incompressible fluid, Maxwell’s equations in nanophotonic metasurfaces, and coupled PDEs in a multiphysics system. We numerically and experimentally demonstrate the capability of the ONE architecture, which not only leverages the advantages of high-performance dual-space processing for outperforming traditional PDE solvers and being comparable with state-of-the-art ML models but also can be implemented using optical computing hardware with unique features of low-energy and highly parallel constant-time processing irrespective of model scales and real-time reconfigurability for tackling multiple tasks with the same architecture. The demonstrated architecture offers a versatile and powerful platform for large-scale scientific and engineering computations.more » « less
An official website of the United States government

