skip to main content


Title: Data-driven discovery of Green’s functions with human-understandable deep learning
Abstract

There is an opportunity for deep learning to revolutionize science and technology by revealing its findings in a human interpretable manner. To do this, we develop a novel data-driven approach for creating a human–machine partnership to accelerate scientific discovery. By collecting physical system responses under excitations drawn from a Gaussian process, we train rational neural networks to learn Green’s functions of hidden linear partial differential equations. These functions reveal human-understandable properties and features, such as linear conservation laws and symmetries, along with shock and singularity locations, boundary effects, and dominant modes. We illustrate the technique on several examples and capture a range of physics, including advection–diffusion, viscous shocks, and Stokes flow in a lid-driven cavity.

 
more » « less
Award ID(s):
2045646
NSF-PAR ID:
10364146
Author(s) / Creator(s):
; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Volume:
12
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    We extend the Adaptive Antoulas-Anderson () algorithm to develop a data-driven modeling framework for linear systems with quadratic output (). Such systems are characterized by two transfer functions: one corresponding to the linear part of the output and another one to the quadratic part. We first establish the joint barycentric representations and the interpolation theory for the two transfer functions of systems. This analysis leads to the proposed algorithm. We show that by interpolating the transfer function values on a subset of samples together with imposing a least-squares minimization on the rest, we construct reliable data-driven models. Two numerical test cases illustrate the efficiency of the proposed method.

     
    more » « less
  2. Abstract

    Machine learning (ML) has become commonplace in educational research and science education research, especially to support assessment efforts. Such applications of machine learning have shown their promise in replicating and scaling human‐driven codes of students' work. Despite this promise, we and other scholars argue that machine learning has not yet achieved its transformational potential. We argue that this is because our field is currently lacking frameworks for supporting creative, principled, and critical endeavors to use machine learning in science education research. To offer considerations for science education researchers' use of ML, we present a framework, Distributing Epistemic Functions and Tasks (DEFT), that highlights the functions and tasks that pertain to generating knowledge that can be carried out by either trained researchers or machine learning algorithms. Such considerations are critical decisions that should occur alongside those about, for instance, the type of data or algorithm used. We apply this framework to two cases, one that exemplifies the cutting‐edge use of machine learning in science education research and another that offers a wholly different means of using machine learning and human‐driven inquiry together. We conclude with strategies for researchers to adopt machine learning and call for the field to rethink how we prepare science education researchers in an era of great advances in computational power and access to machine learning methods.

     
    more » « less
  3. The applicability of computational models to the biological world is an active topic of debate. We argue that a useful path forward results from abandoning hard boundaries between categories and adopting an observer-dependent, pragmatic view. Such a view dissolves the contingent dichotomies driven by human cognitive biases (e.g., a tendency to oversimplify) and prior technological limitations in favor of a more continuous view, necessitated by the study of evolution, developmental biology, and intelligent machines. Form and function are tightly entwined in nature, and in some cases, in robotics as well. Thus, efforts to re-shape living systems for biomedical or bioengineering purposes require prediction and control of their function at multiple scales. This is challenging for many reasons, one of which is that living systems perform multiple functions in the same place at the same time. We refer to this as “polycomputing”—the ability of the same substrate to simultaneously compute different things, and make those computational results available to different observers. This ability is an important way in which living things are a kind of computer, but not the familiar, linear, deterministic kind; rather, living things are computers in the broad sense of their computational materials, as reported in the rapidly growing physical computing literature. We argue that an observer-centered framework for the computations performed by evolved and designed systems will improve the understanding of mesoscale events, as it has already done at quantum and relativistic scales. To develop our understanding of how life performs polycomputing, and how it can be convinced to alter one or more of those functions, we can first create technologies that polycompute and learn how to alter their functions. Here, we review examples of biological and technological polycomputing, and develop the idea that the overloading of different functions on the same hardware is an important design principle that helps to understand and build both evolved and designed systems. Learning to hack existing polycomputing substrates, as well as to evolve and design new ones, will have massive impacts on regenerative medicine, robotics, and computer engineering. 
    more » « less
  4. Societal Impact Statement

    Citrus are intrinsically connected to human health and culture, preventing human diseases like scurvy and inspiring sacred rituals. Citrus fruits come in a stunning number of different sizes and shapes, ranging from small clementines to oversized pummelos, and fruits display a vast diversity of flavors and aromas. These qualities are key in both traditional and modern medicine and in the production of cleaning and perfume products. By quantifying and modeling overall fruit shape and oil gland distribution, we can gain further insight into citrus development and the impacts of domestication and improvement on multiple characteristics of the fruit.

    Summary

    Citrus come in diverse sizes and shapes, and play a key role in world culture and economy. Citrus oil glands in particular contain essential oils which include plant secondary metabolites associated with flavor and aroma. Capturing and analyzing nuanced information behind the citrus fruit shape and its oil gland distribution provide a morphology‐driven path to further our insight into phenotype–genotype interactions.

    We investigated the shape of citrus fruit of 51 accessions based on 3D X‐ray computed tomography (CT) scan reconstructions. Accessions include members of the three ancestral citrus species as well as related genera, and several interspecific hybrids. We digitally separate and compare the size of fruit endocarp, mesocarp, exocarp, and oil gland tissue. Based on the centers of the oil glands, overall fruit shape is approximated with an ellipsoid. Possible oil gland distributions on this ellipsoid surface are explored using directional statistics.

    There is a strong allometry along fruit tissues; that is, we observe a strong linear relationship between the logarithmic volume of any pair of major tissues. This suggests that the relative growth of fruit tissues with respect to each other follows a power law. We also observe that on average, glands distance themselves from their nearest neighbor following a square root relationship, which suggests normal diffusion dynamics at play.

    The observed allometry and square root models point to the existence of biophysical developmental constraints that govern novel relationships between fruit dimensions from both evolutionary and breeding perspectives. Understanding these biophysical interactions prompts an exciting research path on fruit development and breeding.

     
    more » « less
  5. Abstract

    Using a hybrid-kinetic particle-in-cell simulation, we study the evolution of an expanding, collisionless, magnetized plasma in which strong Alfvénic turbulence is persistently driven. Temperature anisotropy generated adiabatically by the plasma expansion (and consequent decrease in the mean magnetic-field strength) gradually reduces the effective elasticity of the field lines, causing reductions in the linear frequency and residual energy of the Alfvénic fluctuations. In response, these fluctuations modify their interactions and spatial anisotropy to maintain a scale-by-scale “critical balance” between their characteristic linear and nonlinear frequencies. Eventually the plasma becomes unstable to kinetic firehose instabilities, which excite rapidly growing magnetic fluctuations at ion-Larmor scales. The consequent pitch-angle scattering of particles maintains the temperature anisotropy near marginal stability, even as the turbulent plasma continues to expand. The resulting evolution of parallel and perpendicular temperatures does not satisfy double-adiabatic conservation laws, but is described accurately by a simple model that includes anomalous scattering. Our results have implications for understanding the complex interplay between macro- and microscale physics in various hot, dilute, astrophysical plasmas, and offer predictions concerning power spectra, residual energy, ion-Larmor-scale spectral breaks, and non-Maxwellian features in ion distribution functions that may be tested by measurements taken in high-beta regions of the solar wind.

     
    more » « less