NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Groups Matter: Investigating the Effects of Homophily in Child Interactions in an Inclusive Classroom.

Tian, Y; Vitale, L; Sarker, D; Perry, L; Messinger, D; Hung, H (September 2025, IEEE International Conference on Development and Learning (ICDL))

Free, publicly-accessible full text available September 16, 2026
SoK: Towards Effective Automated Vulnerability Repair

Li, Y; Shezan, F; Wei, B; Wang, G; Tian, Y (August 2025, Usenix Security Symposium 2025)

Free, publicly-accessible full text available August 1, 2026
Chimera: Creating Digitally Signed Fake Photos by Fooling Image Recapture and Deepfake Detectors

Park, S; Vilesov, A; Zhang, J; Khalili, H; Tian, Y; Kadambi, A; Sehatbakhsh, N (August 2025, Usenix Security Symposium)

Free, publicly-accessible full text available August 1, 2026
EIA: ENVIRONMENTAL INJECTION ATTACK ON GENERALIST WEB AGENTS FOR PRIVACY LEAKAGE

Liao, Z; Mo, L; Xu, C; Kang, M; Zhang, J; Xiao, C; Tian, Y; Li, B; Sun, H (April 2025, International Conference on Learning Representations (ICLR 2025))

Free, publicly-accessible full text available April 1, 2026
Towards the theory of unsupervised federated learning: non-asymptotic analysis of federated EM algorithms

Tian, Y; Weng, H; Feng, Y (July 2024, Proceedings of Machine Learning Research)

Full Text Available
Cyber-physical systems in chemical and energy processes

https://doi.org/10.1016/bs.mcps.2024.08.001

Liu, Y; Akundi, SS; Braniff, A; Dantas, B; Tian, Y; Niknezhad, SS; Khan, FI; Pistikopoulos, EN (October 2024, Elsevier)

Full Text Available
A Real-Time Risk-Based Optimization Framework for Safe and Smart Operations

Braniff, A; Akundi, S; Liu, Y; Khan, F; Pistikopoulos, E N; Tian, Y (June 2024, Computer Aided Chemical Engineering)

Full Text Available
Low-severity spruce beetle infestation mapped from high-resolution satellite imagery with a convolutional network

https://doi.org/10.1016/j.isprsjprs.2024.05.013

Zwieback, S; Young-Robertson, J; Robertson, M; Tian, Y; Chang, Q; Morris, M; White, J; Moan, J (June 2024, ISPRS Journal of Photogrammetry and Remote Sensing)

Full Text Available
Towards Real-time Voice Interaction Data Collection Monitoring and Ambient Light Privacy Notification for Voice-controlled Services

Le, T; Wang, Z; Huang, D; Yao, Y; Tian, Y (March 2024, Symposium on Usable Security and Privacy (USEC) 2024)

Full Text Available
PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software

https://doi.org/10.1145/3643991.3644907

Jiang, W; Yasmin, J; Jones, J; Synovic, N; Kuo, J; Bielanski, N; Tian, Y; Thiruvathukal, G K; Davis, J C (May 2024, 2024 IEEE/ACM 21st International Conference on Mining Software Repositories (MSR))

The development and training of deep learning models have become increasingly costly and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for their downstream applications. The dynamics of the PTM supply chain remain largely unexplored, signaling a clear need for structured datasets that document not only the metadata but also the subsequent applications of these models. Without such data, the MSR community cannot comprehensively understand the impact of PTM adoption and reuse.This paper presents the PeaTMOSS dataset, which comprises metadata for 281,638 PTMs and detailed snapshots for all PTMs with over 50 monthly downloads (14,296 PTMs), along with 28,575 open-source software repositories from GitHub that utilize these models. Additionally, the dataset includes 44,337 mappings from 15,129 downstream GitHub repositories to the 2,530 PTMs they use. To enhance the dataset’s comprehensiveness, we developed prompts for a large language model to automatically extract model metadata, including the model’s training datasets, parameters, and evaluation metrics. Our analysis of this dataset provides the first summary statistics for the PTM supply chain, showing the trend of PTM development and common shortcomings of PTM package documentation. Our example application reveals inconsistencies in software licenses across PTMs and their dependent projects. PeaTMOSS lays the foundation for future research, offering rich opportunities to investigate the PTM supply chain. We outline mining opportunities on PTMs, their downstream usage, and cross-cutting questions.Our artifact is available at https://github.com/PurdueDualityLab/PeaTMOSS-Artifact. Our dataset is available at https://transfer.rcac.purdue.edu/file-manager?origin_id=ff978999-16c2-4b50-ac7a-947ffdc3eb1d&origin_path=%2F.
more » « less
Full Text Available

« Prev Next »

Search for: All records