This content will become publicly available on October 31, 2025

Title: Controllable Data Generation by Deep Learning: A Review

Designing and generating new data with targeted properties underpins critical applications such as molecule design, image editing, and speech synthesis. Traditional hand-crafted approaches rely heavily on expert experience and intensive human effort, yet still suffer from insufficient scientific knowledge and from throughput too low to support effective and efficient data generation. Recently, advances in deep learning have made it possible for expressive methods to learn the underlying representation and properties of data. This capability provides new ways of determining the mutual relationship between the structural patterns and functional properties of the data, and of leveraging that relationship to generate structural data with desired properties. This article is a systematic review of this promising research area, commonly known as controllable deep data generation. First, the article raises the potential challenges and provides preliminaries. Then it formally defines controllable deep data generation, proposes a taxonomy of the various techniques, and summarizes the evaluation metrics in this domain. After that, it introduces applications of controllable deep data generation and experimentally analyzes and compares existing works. Finally, it highlights promising future directions of controllable deep data generation and identifies five potential challenges.

 
Award ID(s):
2318831 2113350 2403312 2103592
NSF-PAR ID:
10520955
Publisher / Repository:
ACM
Journal Name:
ACM Computing Surveys
Volume:
56
Issue:
9
ISSN:
0360-0300
Page Range / eLocation ID:
1 to 38
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Large-scale and controllable fabrication is an indispensable step for the industrialization and commercialization of halide perovskite nanocrystals, which are new-generation semiconductor materials for optoelectronic applications. Microfluidics, which provides continuous and precise synthesis, is considered a promising technique for meeting this need. Research over the past decades has seen microfluidics advance as a powerful tool for fabricating halide perovskite nanocrystals. In this Perspective, we first introduce the state-of-the-art research based on microfluidics, including the synthesis of functional structures and materials, devices, and the interdisciplinary interactions between microfluidics and artificial intelligence and machine learning. We then detail the issues and challenges hindering progress in these areas. Finally, we provide future directions and trends for the technology to achieve its full potential. This Perspective is expected to benefit collaborative efforts between the fields of nanomaterials and microfluidics in advanced manufacturing.

     
  2. Self-powered biosensors are innovative devices that can detect and analyze biological or chemical substances without the need for an external power source. These biosensors can convert energy from the surrounding environment or the analyte itself into electrical signals for sensing and data transmission. The self-powered nature of these biosensors offers several advantages, such as portability, autonomy, and reduced waste generation from disposable batteries. They find applications in various fields, including healthcare, environmental monitoring, food safety, and wearable devices. While self-powered biosensors are a promising technology, there are still challenges to address, such as improving energy efficiency, sensitivity, and stability to make them more practical and widely adopted. This review article explores the evolving trends in self-powered biosensor design, outlining potential advantages and limitations. Focusing on enzymatic biofuel cell power generation, it describes various sensing mechanisms in which the analyte serves as the substrate or fuel that the biocatalyst uses to generate current. Technical aspects of biofuel cells are also examined. Research and development in the field of self-powered biosensors is ongoing, and this review identifies promising, underexplored areas that could benefit from further investigation.
  3. Abstract

    The hydrologic community has experienced a surge in interest in machine learning in recent years. This interest is primarily driven by rapidly growing hydrologic data repositories, as well as the success of machine learning in various academic and commercial applications, now possible due to increasing accessibility of enabling hardware and software. This overview is intended for readers new to the field of machine learning. It provides a non‐technical introduction, placed within a historical context, to commonly used machine learning algorithms and deep learning architectures. Applications in hydrologic sciences are summarized next, with a focus on recent studies. They include the detection of patterns and events such as land use change, approximation of hydrologic variables and processes such as rainfall‐runoff modeling, and mining relationships among variables for identifying controlling factors. The use of machine learning is also discussed in the context of integration with process‐based modeling for parameterization, surrogate modeling, and bias correction. Finally, the article highlights challenges of extrapolating robustness, physical interpretability, and small sample size in hydrologic applications.

    This article is categorized under:

    Science of Water

     
  4. Designing molecules with specific structural and functional properties (e.g., drug-likeness and water solubility) is central to advancing drug discovery and material science, but it poses outstanding challenges in both wet and dry laboratories. The search space is vast and rugged. Recent advances in deep generative models are motivating new computational approaches that build on deep learning to tackle the molecular space. Despite rapid advancements, state-of-the-art deep generative models for molecule generation have many limitations, including a lack of interpretability. In this paper we address this limitation by proposing a generic framework for interpretable molecule generation based on novel disentangled deep graph generative models with property control. Specifically, we propose a disentanglement enhancement strategy for graphs. We also propose a new deep neural architecture that achieves this learning objective, supporting efficient inference and generation for variable-size graphs. Extensive experimental evaluation demonstrates the superiority of our approach in various critical aspects, such as accuracy, novelty, and disentanglement.
  5. The ability of phase-change materials to reversibly and rapidly switch between two stable phases has driven their use in a number of applications such as data storage and optical modulators. Incorporating such materials into metasurfaces enables new approaches to the control of optical fields. In this article we present the design of novel switchable metasurfaces that enable the control of nonclassical two-photon quantum interference. These structures require no static power consumption, operate at room temperature, and have high switching speed. For the first adaptive metasurface presented in this article, tunable nonclassical two-photon interference from −97.7% (anti-coalescence) to 75.48% (coalescence) is predicted. For the second adaptive geometry, the quantum interference switches from −59.42% (anti-coalescence) to 86.09% (coalescence) upon a thermally driven crystallographic phase transition. The development of compact and rapidly controllable quantum devices is opening up promising paths to new quantum applications, as well as the possibility of improving free-space quantum logic gates, linear-optics Bell experiments, and quantum phase estimation systems.

     