NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis

https://doi.org/10.1145/3690624.3709235

Wang, Jiaqi; Yin, Ziyi; You, Quanzeng; Lyu, Lingjuan; Ma, Fenglong (July 2025, ACM)

Free, publicly-accessible full text available July 20, 2026
An uncertainty-aware digital shadow for underground multimodal CO2 storage monitoring

https://doi.org/10.1093/gji/ggaf176

Gahlot, Abhinav_Prakash; Orozco, Rafael; Yin, Ziyi; Bruer, Grant; Herrmann, Felix_J (May 2025, Geophysical Journal International)

SUMMARY Geological Carbon Storage (GCS) is one of the most viable climate-change mitigating net-negative CO2-emission technologies for large-scale CO2 sequestration. However, subsurface complexities and reservoir heterogeneity demand a systematic approach to uncertainty quantification to ensure both containment and conformance, as well as to optimize operations. As a step toward a digital twin for monitoring and control of underground storage, we introduce a new machine-learning-based data-assimilation framework validated on realistic numerical simulations. The proposed digital shadow combines simulation-based inference (SBI) with a novel neural adaptation of a recently developed nonlinear ensemble filtering technique. To characterize the posterior distribution of CO2 plume states (saturation and pressure) conditioned on multimodal time-lapse data, consisting of imaged surface seismic and well-log data, a generic recursive scheme is employed, where neural networks are trained on simulated ensembles for the time-advanced state and observations. Once trained, the digital shadow infers the state as time-lapse field data become available. Unlike ensemble Kalman filtering, corrections to predicted states are computed via a learned nonlinear prior-to-posterior mapping that supports non-Gaussian statistics and nonlinear models for the dynamics and observations. Training and inference are facilitated by the combined use of conditional invertible neural networks and bespoke physics-based summary statistics. Starting with a probabilistic permeability model derived from a baseline seismic survey, the digital shadow is validated against unseen simulated ground-truth time-lapse data. Results show that injection-site-specific uncertainty in permeability can be incorporated into state uncertainty, and the highest reconstruction quality is achieved when conditioning on both seismic and wellbore data. Despite incomplete permeability knowledge, the digital shadow accurately tracks the subsurface state throughout a realistic CO2 injection project. This work establishes the first proof-of-concept for an uncertainty-aware, scalable digital shadow, laying the foundation for a digital twin to optimize underground storage operations.
more » « less
WISER: Multimodal variational inference for full-waveform inversion without dimensionality reduction

https://doi.org/10.1190/geo2024-0483.1

Yin, Ziyi; Orozco, Rafael; Herrmann, Felix J (March 2025, GEOPHYSICS)

We develop a semiamortized variational inference (VI) framework designed for computationally feasible uncertainty quantification in full-waveform inversion to explore the multimodal posterior distribution without dimensionality reduction. The framework is called full-waveform VI via subsurface extensions with refinements (WISER). WISER builds on top of a supervised generative artificial intelligence method that performs approximate amortized inference that is low-cost albeit showing an amortization gap. This gap is closed through nonamortized refinements that make frugal use of wave physics. Case studies illustrate that WISER is capable of full-resolution, computationally feasible, and reliable uncertainty estimates of velocity models and imaged reflectivities.
more » « less
Free, publicly-accessible full text available March 1, 2026
Shadow-Activated Backdoor Attacks on Multimodal Large Language Models

https://doi.org/10.18653/v1/2025.findings-acl.248

Yin, Ziyi; Ye, Muchao; Cao, Yuanpu; Wang, Jiaqi; Chang, Aofei; Liu, Han; Chen, Jinghui; Wang, Ting; Ma, Fenglong (January 2025, Association for Computational Linguistics)

Full Text Available
UniT: A Unified Look at Certified Robust Training against Text Adversarial Perturbation

Ye, Muchao; Yin, Ziyi; Zhang, Tianrong; Du, Tianyu; Chen, Jinghui; Wang, Ting; Ma, Fenglong (September 2024, Annual Conference on Neural Information Processing Systems)

Full Text Available
WISE: Full-waveform variational inference via subsurface extensions

https://doi.org/10.1190/geo2023-0744.1

Yin, Ziyi; Orozco, Rafael; Louboutin, Mathias; Herrmann, Felix J (July 2024, GEOPHYSICS)

We introduce a probabilistic technique for full-waveform inversion, using variational inference and conditional normalizing flows to quantify uncertainty in migration-velocity models and its impact on imaging. Our approach integrates generative artificial intelligence with physics-informed common-image gathers, reducing reliance on accurate initial velocity models. Considered case studies demonstrate its efficacy producing realizations of migration-velocity models conditioned by the data. These models are used to quantify amplitude and positioning effects during subsequent imaging.
more » « less
Full Text Available
Recent Advances in Predictive Modeling with Electronic Health Records

https://doi.org/10.24963/ijcai.2024/914

Wang, Jiaqi; Luo, Junyu; Ye, Muchao; Wang, Xiaochen; Zhong, Yuan; Chang, Aofei; Huang, Guanjie; Yin, Ziyi; Xiao, Cao; Sun, Jimeng; et al (August 2024, International Joint Conferences on Artificial Intelligence Organization)

The development of electronic health records (EHR) systems has enabled the collection of a vast amount of digitized patient data. However, utilizing EHR data for predictive modeling presents several challenges due to its unique characteristics. With the advancements in machine learning techniques, deep learning has demonstrated its superiority in various applications, including healthcare. This survey systematically reviews recent advances in deep learning-based predictive models using EHR data. Specifically, we introduce the background of EHR data and provide a mathematical definition of the predictive modeling task. We then categorize and summarize predictive deep models from multiple perspectives. Furthermore, we present benchmarks and toolkits relevant to predictive modeling in healthcare. Finally, we conclude this survey by discussing open challenges and suggesting promising directions for future research.
more » « less
Full Text Available
VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models

https://doi.org/10.1609/aaai.v38i7.28499

Yin, Ziyi; Ye, Muchao; Zhang, Tianrong; Wang, Jiaqi; Liu, Han; Chen, Jinghui; Wang, Ting; Ma, Fenglong (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Visual Question Answering (VQA) is a fundamental task in computer vision and natural language process fields. Although the “pre-training & finetuning” learning paradigm significantly improves the VQA performance, the adversarial robustness of such a learning paradigm has not been explored. In this paper, we delve into a new problem: using a pre-trained multimodal source model to create adversarial image-text pairs and then transferring them to attack the target VQA models. Correspondingly, we propose a novel VQATTACK model, which can iteratively generate both im- age and text perturbations with the designed modules: the large language model (LLM)-enhanced image attack and the cross-modal joint attack module. At each iteration, the LLM-enhanced image attack module first optimizes the latent representation-based loss to generate feature-level image perturbations. Then it incorporates an LLM to further enhance the image perturbations by optimizing the designed masked answer anti-recovery loss. The cross-modal joint attack module will be triggered at a specific iteration, which updates the image and text perturbations sequentially. Notably, the text perturbation updates are based on both the learned gradients in the word embedding space and word synonym-based substitution. Experimental results on two VQA datasets with five validated models demonstrate the effectiveness of the proposed VQATTACK in the transferable attack setting, compared with state-of-the-art baselines. This work revealsa significant blind spot in the “pre-training & fine-tuning” paradigm on VQA tasks. The source code can be found in the link https://github.com/ericyinyzy/VQAttack.
more » « less
Full Text Available
MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation

https://doi.org/10.1137/1.9781611978032.58

Zhong, Yuan; Cui, Suhan; Wang, Jiaqi; Wang, Xiaochen; Yin, Ziyi; Wang, Yaqing; Xiao, Houping; Huai, Mengdi; Wang, Ting; Ma, Fenglong (January 2024, Proceedings of the 2024 SIAM International Conference on Data Mining (SDM))
Optimized time-lapse acquisition design via spectral gap ratio minimization

https://doi.org/10.1190/geo2023-0024.1

Zhang, Yijun; Yin, Ziyi; López, Oscar; Siahkoohi, Ali; Louboutin, Mathias; Kumar, Rajiv; Herrmann, Felix J. (July 2023, GEOPHYSICS)

Modern-day reservoir management and monitoring of geologic carbon storage increasingly call for costly time-lapse seismic data collection. We demonstrate how techniques from graph theory can be used to optimize acquisition geometries for low-cost sparse 4D seismic data. Based on midpoint-offset-domain connectivity arguments, our algorithm automatically produces sparse nonreplicated time-lapse acquisition geometries that favor wavefield recovery.
more » « less
Full Text Available

« Prev Next »

Search for: All records