NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

RAG Pipeline for Domain Specific Applications: A Case Study in Disseminating Dementia Care Practices

Cummings, Aaron; Zhang, Xinyue; Olaniran, Mercy; Akintomide, Modupe (June 2025, ACM/IEEE International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE '25))

In closed-domain Question Answering (QA), Large Language Models (LLMs) often fail to deliver responses specialized enough for niche subdomains. Broadly trained models may not capture the nuanced terminology and contextual precision required in these fields, which frequently lack domain-specific conversational data and face computational constraints. To address this, we propose a methodology leveraging a Retrieval-Augmented Generation (RAG) framework that integrates data extraction with fine-tuning using domain-specific question-answer pairs. Our approach employs Question-Answer Generation (QAG) to create tailored training datasets, enabling fine-tuned models to incorporate specialized jargon and context while remaining computationally accessible to domain experts. To exemplify this methodology, we demonstrate its application within the medical domain through a case study centered on the creation of a dementia care chat assistant. A significant benefit of this approach lies in its ease of replication across various domains and scalability for integration into diverse user groups, making it a versatile solution for enhancing chat assistants.
more » « less
Free, publicly-accessible full text available June 24, 2026
FedKDShap: Enhancing Federated Learning via Shapley Values Driven Knowledge Distillation on Non-IID Data

https://doi.org/10.1145/3701716.3717645

Shadin, Nazmus Shakib; Zhang, Xinyue (May 2025, ACM)

Free, publicly-accessible full text available May 8, 2026
A Machine Learning Approach for Emergency Detection in Medical Scenarios Using Large Language Models

Akaybicen, Ferit; Cummings, Aaron; Iwuagwu, Lota; Zhang, Xinyue; Adewuyi, Modupe (March 2025, International Symposium on Intelligent Computing and Networking (ISICN 2025))

Free, publicly-accessible full text available March 17, 2026
Towards More Robust and Scalable Deep Learning Systems for Medical Image Analysis

https://doi.org/10.1109/BigData62323.2024.10825626

Yenumala, Akshaj; Zhang, Xinyue; Lo, Dan (December 2024, IEEE)

Deep learning (DL) has attracted interest in healthcare for disease diagnosis systems in medical imaging analysis (MedIA) and is especially applicable in Big Data environments like federated learning (FL) and edge computing. However, there is little research into mitigating the vulnerabilities and robustness of such systems against adversarial attacks, which can force DL models to misclassify, leading to concerns about diagnosis accuracy. This paper aims to evaluate the robustness and scalability of DL models for MedIA applications against adversarial attacks while ensuring their applicability in FL settings with Big Data. We fine-tune three state-of-the-art transfer learning models, DenseNet121, MobileNet-V2, and ResNet50, on several MedIA datasets of varying sizes and show that they are effective at disease diagnosis. We then apply the Fast Gradient Sign Method (FGSM) to attack the models and utilize adversarial training (AT) and knowledge distillation to defend them. We provide a performance comparison of the original transfer learning models and the defended models on the clean and perturbed data. The experimental results show that the defensive techniques can improve the robustness of the models to the FGSM attack and be scaled for Big Data as well as utilized for edge computing environments.
more » « less
Free, publicly-accessible full text available December 15, 2025

Search for: All records