NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Tuning screw-to-edge dislocation slip discrepancy via graph neural network–accelerated exploration of local ordered states in refractory high-entropy alloys

https://doi.org/10.1016/j.scriptamat.2025.116962

Yao, Yi; Zhang, Zhengyu; Cappola, Jonathan; Wu, Xue; Gong, Jiaqi; Yan, Feng; Cai, Wenjun; Li, Lin (August 2025, Scripta materialia)

Free, publicly-accessible full text available August 27, 2026
StoryStudio: Enhancing Data Science Education with Explainable, Narrative-Driven Storytelling

https://doi.org/10.1145/3724389.3730811

Henry, Ryan; Hassan, Taha; Gong, Jiaqi (June 2025, ACM)

Free, publicly-accessible full text available June 13, 2026
Story Studio: A Coaching Tool to Support the Development of Data Storytelling Competency at Scale

https://doi.org/10.1145/3641555.3704771

Chen, Lujie Karen; Yarnall, Louise; Gong, Jiaqi (February 2025, ACM)

Free, publicly-accessible full text available February 18, 2026
Learning Motion Primitives for the Quantification and Diagnosis of Mobility Deficits

https://doi.org/10.1109/TBME.2024.3404357

Yan, Fujian; Gong, Jiaqi; Zhang, Qiang; He, Hongsheng (December 2024, IEEE Transactions on Biomedical Engineering)

Full Text Available
StructuGraphRAG: Structured Document-Informed Knowledge Graphs for Retrieval-Augmented Generation

https://doi.org/10.1609/aaaiss.v4i1.31798

Zhu, Xishi; Guo, Xiaoming; Cao, Shengting; Li, Shenglin; Gong, Jiaqi (November 2024, Proceedings of the AAAI Symposium Series)

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external data sources beyond their training sets and querying predefined knowledge bases to generate accurate, context-rich responses. Most RAG implementations use vector similarity searches, but the effectiveness of this approach and the representation of knowledge bases remain underexplored. Emerging research suggests knowledge graphs as a promising solution. Therefore, this paper presents StructuGraphRAG, which leverages document structures to inform the extraction process and constructs knowledge graphs to enhance RAG for social science research, specifically using NSDUH datasets. Our method parses document structures to extract entities and relationships, constructing comprehensive and relevant knowledge graphs. Experimental results show that StructuGraphRAG outperforms traditional RAG methods in accuracy, comprehensiveness, and contextual relevance. This approach provides a robust tool for social science researchers, facilitating precise analysis of social determinants of health and justice, and underscores the potential of structured document-informed knowledge graph construction in AI and social science research.
more » « less
Full Text Available
Show and Tell: Exploring Large Language Model's Potential in Formative Educational Assessment of Data Stories

https://doi.org/10.1109/GEN4DS63889.2024.00007

Sivakumar, Naren; Chen, Lujie Karen; Papasani, Pravalika; Majmundar, Vigna; Feng, Jinjuan Heidi; Yarnall, Louise; Gong, Jiaqi (October 2024, IEEE)

Full Text Available
Foundational Tools for Coaching Data Storytelling

https://doi.org/10.1145/3626253.3633432

Chen, Lujie Karen; Gong, Jiaqi; Yarnall, Louise (March 2024, ACM)

Data storytelling is the skill to communicate data effectively and efficiently. Effective data storytelling goes beyond data visualization and focuses on explanation with clear rhetorical functions. It starts with a set of data insights collected from the data science workflow and involves iterative and interactive processes of filtering those insights into story slices, from which data stories can be created through ordering, organizing and narration. Data storytelling is an integral component of a well-rounded data science education, which complements foundational skills like quantitative reasoning and programming. Despite its significance, solid understanding of the theory and practice of developing data storytelling competency is lacking. Data storytelling is often perceived as a mythical process where quantitative information magically transforms into compelling narratives. Designing scalable coaching tools for data storytelling requires leveraging multidisciplinary expertise from learning science, computer science, data science, communication science, and human-centered design. In this workshop, we will share some initial findings and reflections from our interdisciplinary team searching for effective coaching methods and tools to support coaching data storytelling at scale. We will present results from literature reviews and expert interviews which will be packaged into a set of foundational tools such as mental model, cognitive processes and schema for story construction, assessment strategy, as well as preliminary ideas of tools to support data storytelling coaching. We hope to use this workshop to build a community of researchers and practitioners in coaching data storytelling in postsecondary formal and informal learning context.
more » « less
Machine-Learning-Based Precipitation Reconstructions: A Study on Slovenia’s Sava River Basin

https://doi.org/10.3390/hydrology10110207

Ramírez_Molina, Abel Andrés; Bezak, Nejc; Tootle, Glenn; Wang, Chen; Gong, Jiaqi (November 2023, Hydrology)

The Sava River Basin (SRB) includes six countries (Slovenia, Croatia, Bosnia and Herzegovina, Serbia, Albania, and Montenegro), with the Sava River (SR) being a major tributary of the Danube River. The SR originates in the mountains (European Alps) of Slovenia and, because of a recent Slovenian government initiative to increase clean, sustainable energy, multiple hydropower facilities have been constructed within the past ~20 years. Given the importance of this river system for varying demands, including hydropower (energy production), information about past (paleo) dry (drought) and wet (pluvial) periods would provide important information to water managers and planners. Recent research applying traditional regression techniques and methods developed skillful reconstructions of seasonal (April–May–June–July–August–September or AMJJAS) streamflow using tree-ring-based proxies. The current research intends to expand upon these recent research efforts and investigate developing reconstructions of seasonal (AMJJAS) precipitation applying novel Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) techniques. When comparing the reconstructed AMJJAS precipitation datasets, the AI/ML/DL techniques statistically outperformed traditional regression techniques. When comparing the SRB AMJJAS precipitation reconstruction developed in this research to the SRB AMJJAS streamflow reconstruction developed in previous research, the temporal variability of the two reconstructions compared favorably. However, pluvial magnitudes of extreme periods differed, while drought magnitudes of extreme periods were similar, confirming drought is likely better captured in tree-ring-based proxy reconstructions of hydrologic variables.
more » « less
Full Text Available
Full Waveform Inversion-Based Ultrasound Computed Tomography Acceleration Using Two-Dimensional Convolutional Neural Networks

https://doi.org/10.1115/1.4062092

Kleman, Christopher; Anwar, Shoaib; Liu, Zhengchun; Gong, Jiaqi; Zhu, Xishi; Yunker, Austin; Kettimuthu, Rajkumar; He, Jiaze (November 2023, Journal of Nondestructive Evaluation, Diagnostics and Prognostics of Engineering Systems)

Abstract Ultrasound computed tomography (USCT) shows great promise in nondestructive evaluation and medical imaging due to its ability to quickly scan and collect data from a region of interest. However, existing approaches are a tradeoff between the accuracy of the prediction and the speed at which the data can be analyzed, and processing the collected data into a meaningful image requires both time and computational resources. We propose to develop convolutional neural networks (CNNs) to accelerate and enhance the inversion results to reveal underlying structures or abnormalities that may be located within the region of interest. For training, the ultrasonic signals were first processed using the full waveform inversion (FWI) technique for only a single iteration; the resulting image and the corresponding true model were used as the input and output, respectively. The proposed machine learning approach is based on implementing two-dimensional CNNs to find an approximate solution to the inverse problem of a partial differential equation-based model reconstruction. To alleviate the time-consuming and computationally intensive data generation process, a high-performance computing-based framework has been developed to generate the training data in parallel. At the inference stage, the acquired signals will be first processed by FWI for a single iteration; then the resulting image will be processed by a pre-trained CNN to instantaneously generate the final output image. The results showed that once trained, the CNNs can quickly generate the predicted wave speed distributions with significantly enhanced speed and accuracy.
more » « less
Full Text Available
Machine Learning-Guided Exploration of Glass-Forming Ability in Multicomponent Alloys

https://doi.org/10.1007/s11837-022-05549-w

Yao, Yi; Sullivan, Timothy; Yan, Feng; Gong, Jiaqi; Li, Lin (October 2022, JOM)

Full Text Available

« Prev Next »

Search for: All records