skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "He, Yi"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available August 16, 2026
  2. Free, publicly-accessible full text available July 15, 2026
  3. Free, publicly-accessible full text available July 13, 2026
  4. Online Anomaly Detection (OAD) is critical for identifying rare yet important data points in large, dynamic, and complex data streams. A key challenge lies in achieving accurate and consistent detection of anomalies while maintaining computational and memory efficiency. Conventional OAD approaches, which depend on distributional deviations and static thresholds, struggle with model update delays and catastrophic forgetting, leading to missed detections and high false positive rates. To address these limitations, we propose a novel Streaming Anomaly Detection (SAD) method, grounded in a sparse active online learning framework. Our approach uniquely integrates ℓ1,2-norm sparse online learning with CUR decomposition-based active learning, enabling simultaneous fast feature selection and dynamic instance selection. The efficient CUR decomposition further supports real-time residual analysis for anomaly scoring, eliminating the need for manual threshold settings about temporal data distributions. Extensive experiments on diverse streaming datasets demonstrate SAD's superiority, achieving a 14.06% reduction in detection error rates compared to five state-of-the-art competitors. 
    more » « less
    Free, publicly-accessible full text available September 1, 2026
  5. Abstract Ebola virus (EBOV) and Marburg virus (MARV) are zoonotic filoviruses that cause hemorrhagic fever in humans. Correlative data implicate bats as natural EBOV hosts, but neither a full-length genome nor an EBOV isolate has been found in any bats sampled. Here, we model filovirus infection in the Jamaican fruit bat (JFB),Artibeus jamaicensis,by inoculation with either EBOV or MARV through a combination of oral, intranasal, and subcutaneous routes. Infection with EBOV results in systemic virus replication and oral shedding of infectious virus. MARV replication is transient and does not shed. In vitro, JFB cells replicate EBOV more efficiently than MARV, and MARV infection induces innate antiviral responses that EBOV efficiently suppresses. Experiments using VSV pseudoparticles or replicating VSV expressing the EBOV or MARV glycoprotein demonstrate an advantage for EBOV entry and replication early, respectively, in JFB cells. Overall, this study describes filovirus species-specific phenotypes for both JFB and their cells. 
    more » « less
    Free, publicly-accessible full text available December 1, 2026
  6. Group Fairness-aware Continual Learning (GFCL) aims to eradicate discriminatory predictions against certain demographic groups in a sequence of diverse learning tasks.This paper explores an even more challenging GFCL problem – how to sustain a fair classifier across a sequence of tasks with covariate shifts and unlabeled data. We propose the MacFRL solution, with its key idea to optimizethe sequence of learning tasks. We hypothesize that high-confident learning can be enabled in the optimized task sequence, where the classifier learns from a set of prioritized tasks to glean knowledge, thereby becoming more capable to handle the tasks with substantial distribution shifts that were originally deferred. Theoretical and empirical studies substantiate that MacFRL excels among its GFCL competitors in terms of prediction accuracy and group fair-ness metrics. 
    more » « less
    Free, publicly-accessible full text available April 11, 2026
  7. Free, publicly-accessible full text available February 25, 2026
  8. This paper presents HGEN that pioneers ensemble learning for heterogeneous graphs. We argue that the heterogeneity in node types, nodal features, and local neighborhood topology poses significant challenges for ensemble learning, particularly in accommodating diverse graph learners. Our HGEN framework ensembles multiple learners through a meta-path and transformation-based optimization pipeline to uplift classification accuracy. Specifically, HGEN uses meta-path combined with random dropping to create Allele Graph Neural Networks (GNNs), whereby the base graph learners are trained and aligned for later ensembling. To ensure effective ensemble learning, HGEN presents two key components:1) a residual-attention mechanism to calibrate allele GNNs of different meta-paths, thereby enforcing node embeddings to focus on more informative graphs to improve base learner accuracy, and 2) a correlation-regularization term to enlarge the disparity among embedding matrices generated from different meta-paths, thereby enriching base learner diversity. We analyze the convergence of HGEN and attest its higher regularization magnitude over simple voting. Experiments on five heterogeneous networks validate that HGEN consistently outperforms its state-of-the-art competitors by substantial margin. Codes are available at https://github.com/Chrisshen12/HGEN. 
    more » « less
    Free, publicly-accessible full text available September 1, 2026
  9. Globersons, Amir; Mackey, Lester; Belgrave, Danielle; Fan, Angela; Paquet, Ulrich; Tomczak, Jakub M; Zhang, Cheng (Ed.)
    Free, publicly-accessible full text available December 10, 2025
  10. Free, publicly-accessible full text available December 9, 2025