Search for: All records

Creators/Authors contains: "Liu, Licheng"


Knowledge-guided machine learning can improve carbon cycle quantification in agroecosystems

https://doi.org/10.1038/s41467-023-43860-5

Liu, Licheng; Zhou, Wang; Guan, Kaiyu; Peng, Bin; Xu, Shaoming; Tang, Jinyun; Zhu, Qing; Till, Jessica; Jia, Xiaowei; Jiang, Chongya; et al (December 2024, Nature Communications)

Accurate and cost-effective quantification of the carbon cycle for agroecosystems at decision-relevant scales is critical to mitigating climate change and ensuring sustainable food production. However, conventional process-based or data-driven modeling approaches alone have large prediction uncertainties due to the complex biogeochemical processes to model and the lack of observations to constrain many key state and flux variables. Here we propose a Knowledge-Guided Machine Learning (KGML) framework that addresses the above challenges by integrating knowledge embedded in a process-based model, high-resolution remote sensing observations, and machine learning (ML) techniques. Using the U.S. Corn Belt as a testbed, we demonstrate that KGML can outperform conventional process-based and black-box ML models in quantifying carbon cycle dynamics. Our high-resolution approach quantitatively reveals 86% more spatial detail of soil organic carbon changes than conventional coarse-resolution approaches. Moreover, we outline a protocol for improving KGML via various paths, which can be generalized to develop hybrid models to better predict complex earth system dynamics.
Full Text Available
Knowledge Guided Machine Learning for Extracting, Preserving, and Adapting Physics-aware Features

https://doi.org/10.1137/1.9781611978032.82

He, Erhu; Xie, Yiqun; Liu, Licheng; Jin, Zhenong; Zhang, Dajun; Jia, Xiaowei (April 2024, SIAM International Conference on Data Mining (SDM) 2024)

Training machine learning (ML) models for scientific problems is often challenging due to limited observation data. To overcome this challenge, prior works commonly pre-train ML models using simulated data before having them fine-tuned with small real data. Despite the promise shown in initial research across different domains, these methods cannot ensure improved performance after fine-tuning because (i) they are not designed for extracting generalizable physics-aware features during pre-training, and (ii) the features learned from pre-training can be distorted by the fine-tuning process. In this paper, we propose a new learning method for extracting, preserving, and adapting physics-aware features. We build a knowledge-guided neural network (KGNN) model based on known dependencies amongst physical variables, which facilitates extracting physics-aware feature representation from simulated data. Then we fine-tune this model by alternately updating the encoder and decoder of the KGNN model to enhance the prediction while preserving the physics-aware features learned through pre-training. We further propose to adapt the model to new testing scenarios via a teacher-student learning framework based on the model uncertainty. The results demonstrate that the proposed method outperforms many baselines by a good margin, even using sparse training data or under out-of-sample testing scenarios.
Full Text Available
A flexible and efficient knowledge-guided machine learning data assimilation (KGML-DA) framework for agroecosystem prediction in the US Midwest

https://doi.org/10.1016/j.rse.2023.113880

Yang, Qi; Liu, Licheng; Zhou, Junxiong; Ghosh, Rahul; Peng, Bin; Guan, Kaiyu; Tang, Jinyun; Zhou, Wang; Kumar, Vipin; Jin, Zhenong (December 2023, Remote Sensing of Environment)

Full Text Available
Task-Adaptive Meta-Learning Framework for Advancing Spatial Generalizability

https://doi.org/10.1609/aaai.v37i12.26680

Liu, Zhexiong; Liu, Licheng; Xie, Yiqun; Jin, Zhenong; Jia, Xiaowei (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Spatio-temporal machine learning is critically needed for a variety of societal applications, such as agricultural monitoring, hydrological forecast, and traffic management. These applications greatly rely on regional features that characterize spatial and temporal differences. However, spatio-temporal data often exhibit complex patterns and significant data variability across different locations. The labels in many real-world applications can also be limited, which makes it difficult to separately train independent models for different locations. Although meta learning has shown promise in model adaptation with small samples, existing meta learning methods remain limited in handling a large number of heterogeneous tasks, e.g., a large number of locations with varying data patterns. To bridge the gap, we propose task-adaptive formulations and a model-agnostic meta-learning framework that transforms regionally heterogeneous data into location-sensitive meta tasks. We conduct task adaptation following an easy-to-hard task hierarchy in which different meta models are adapted to tasks of different difficulty levels. One major advantage of our proposed method is that it improves the model adaptation to a large number of heterogeneous tasks. It also enhances the model generalization by automatically adapting the meta model of the corresponding difficulty level to any new tasks. We demonstrate the superiority of our proposed framework over a diverse set of baselines and state-of-the-art meta-learning frameworks. Our extensive experiments on real crop yield data show the effectiveness of the proposed method in handling spatial-related heterogeneous tasks in real societal applications.
Full Text Available
Physics Guided Neural Networks for Time-Aware Fairness: An Application in Crop Yield Prediction

https://doi.org/10.1609/aaai.v37i12.26664

He, Erhu; Xie, Yiqun; Liu, Licheng; Chen, Weiye; Jin, Zhenong; Jia, Xiaowei (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

This paper proposes a physics-guided neural network model to predict crop yield and maintain the fairness over space. Failures to preserve the spatial fairness in predicted maps of crop yields can result in biased policies and intervention strategies in the distribution of assistance or subsidies in supporting individuals at risk. Existing methods for fairness enforcement are not designed for capturing the complex physical processes that underlie the crop growing process, and thus are unable to produce good predictions over large regions under different weather conditions and soil properties. More importantly, the fairness is often degraded when existing methods are applied to different years due to the change of weather conditions and farming practices. To address these issues, we propose a physics-guided neural network model, which leverages the physical knowledge from existing physics-based models to guide the extraction of representative physical information and discover the temporal data shift across years. In particular, we use a reweighting strategy to discover the relationship between training years and testing years using the physics-aware representation. Then the physics-guided neural network will be refined via a bi-level optimization process based on the reweighted fairness objective. The proposed method has been evaluated using real county-level crop yield data and simulated data produced by a physics-based model. The results demonstrate that this method can significantly improve the predictive performance and preserve the spatial fairness when generalized to different years.
Full Text Available
Mini-Batch Learning Strategies for modeling long term temporal dependencies: A study in environmental applications

https://doi.org/10.1137/1.9781611977653.ch73

Xu, Shaoming; Khandelwal, Ankush; Li, Xiang; Jia, Xiaowei; Liu, Licheng; Willard, Jared; Ghosh, Rahul; Cutler, Kelly; Steinbach, Michael; Duffy, Christopher; et al (April 2023, Proceedings of the 2023 SIAM International Conference on Data Mining (SDM))
Shekhar, Shashi; Zhou, Zhi-Hua; Chiang, Yao-Yi; Stiglic, Gregor (Ed.)
In many environmental applications, recurrent neural networks (RNNs) are often used to model physical variables with long temporal dependencies. However, due to minibatch training, temporal relationships between training segments within the batch (intra-batch) as well as between batches (inter-batch) are not considered, which can lead to limited performance. Stateful RNNs aim to address this issue by passing hidden states between batches. Since Stateful RNNs ignore intra-batch temporal dependency, there exists a trade-off between training stability and capturing temporal dependency. In this paper, we provide a quantitative comparison of different Stateful RNN modeling strategies, and propose two strategies to enforce both intra- and inter-batch temporal dependency. First, we extend Stateful RNNs by defining a batch as a temporally ordered set of training segments, which enables intra-batch sharing of temporal information. While this approach significantly improves the performance, it leads to much larger training times due to highly sequential training. To address this issue, we further propose a new strategy which augments a training segment with an initial value of the target variable from the timestep right before the starting of the training segment. In other words, we provide an initial value of the target variable as additional input so that the network can focus on learning changes relative to that initial value. By using this strategy, samples can be passed in any order (mini-batch training) which significantly reduces the training time while maintaining the performance. In demonstrating the utility of our approach in hydrological modeling, we observe that the most significant gains in predictive accuracy occur when these methods are applied to state variables whose values change more slowly, such as soil water and snowpack, rather than continuously moving flux variables such as streamflow.
Full Text Available
Distinct driving mechanisms of non-growing season N2O emissions call for spatial-specific mitigation strategies in the US Midwest

https://doi.org/10.1016/j.agrformet.2022.109108

Yang, Yufeng; Liu, Licheng; Zhou, Wang; Guan, Kaiyu; Tang, Jinyun; Kim, Taegon; Grant, Robert F.; Peng, Bin; Zhu, Peng; Li, Ziyi; et al (September 2022, Agricultural and Forest Meteorology)

Full Text Available
Improved global wetland carbon isotopic signatures support post-2006 microbial methane emission increase

https://doi.org/10.1038/s43247-022-00488-5

Oh, Youmi; Zhuang, Qianlai; Welp, Lisa R.; Liu, Licheng; Lan, Xin; Basu, Sourish; Dlugokencky, Edward J.; Bruhwiler, Lori; Miller, John B.; Michel, Sylvia E.; et al (December 2022, Communications Earth & Environment)

Atmospheric concentrations of methane, a powerful greenhouse gas, have strongly increased since 2007. Measurements of stable carbon isotopes of methane can constrain emissions if the isotopic compositions are known; however, isotopic compositions of methane emissions from wetlands are poorly constrained despite their importance. Here, we use a process-based biogeochemistry model to calculate the stable carbon isotopic composition of global wetland methane emissions. We estimate a mean global signature of −61.3 ± 0.7‰ and find that tropical wetland emissions are enriched by ~11‰ relative to boreal wetlands. Our model shows improved resolution of regional, latitudinal and global variations in isotopic composition of wetland emissions. Atmospheric simulation scenarios with the improved wetland isotopic composition suggest that increases in atmospheric methane since 2007 are attributable to rising microbial emissions. Our findings substantially reduce uncertainty in the stable carbon isotopic composition of methane emissions from wetlands and improve understanding of the global methane budget.
Full Text Available
Characterizing Performance of Freshwater Wetland Methane Models Across Time Scales at FLUXNET‐CH₄ Sites Using Wavelet Analyses

https://doi.org/10.1029/2022JG007259

Zhang, Zhen; Bansal, Sheel; Chang, Kuang‐Yu; Fluet‐Chouinard, Etienne; Delwiche, Kyle; Goeckede, Mathias; Gustafson, Adrian; Knox, Sara; Leppänen, Antti; Liu, Licheng; et al (November 2023, Journal of Geophysical Research: Biogeosciences)

Process‐based land surface models are important tools for estimating global wetland methane (CH₄) emissions and projecting their behavior across space and time. So far there are no performance assessments of model responses to drivers at multiple time scales. In this study, we apply wavelet analysis to identify the dominant time scales contributing to model uncertainty in the frequency domain. We evaluate seven wetland models at 23 eddy covariance tower sites. Our study first characterizes site‐level patterns of freshwater wetland CH₄ fluxes (FCH₄) at different time scales. A Monte Carlo approach was developed to incorporate flux observation error to avoid misidentification of the time scales that dominate model error. Our results suggest that (a) significant model‐observation disagreements are mainly at multi‐day time scales (<15 days); (b) most of the models can capture the CH₄ variability at monthly and seasonal time scales (>32 days) for the boreal and Arctic tundra wetland sites but have significant bias in variability at seasonal time scales for temperate and tropical/subtropical sites; (c) model errors exhibit increasing power spectrum as time scale increases, indicating that biases at time scales <5 days could contribute to persistent systematic biases on longer time scales; and (d) differences in error pattern are related to model structure (e.g., proxy of CH₄ production). Our evaluation suggests the need to accurately replicate FCH₄ variability, especially at short time scales, in future wetland CH₄ model developments.
Full Text Available