NSF PAR Search | NSF Public Access Repository


Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition

Wu, Wenhan; Guo, Zhishuai; Chen, Chen; Xue, Hongfei; Lu, Aidong (October 2025, International Conference on Computer Vision (ICCV))

Free, publicly-accessible full text available October 16, 2026
Towards an interoperable perovskite description or how to keep track of 300 perovskite ions

https://doi.org/10.1038/s41467-025-64325-x

Maqsood, Ayman; Näsström, Hampus; Chen, Chen; Qiutong, Li; Luo, Jingshan; Chakraborty, Rayan; Blum, Volker; Unger, Eva; Draxl, Claudia; Márquez, José A; et al (December 2025, Nature Communications)

Abstract Hybrid perovskites are interesting optoelectronic materials. The perovskite ABX₃ structure offers a vast compositional space, and we have identified over 300 perovskite ions. This flexibility enables tuneable properties and has significantly contributed to the success of perovskite optoelectronics. However, this diversity also leads to confusion, ambiguity, and inconsistencies causing challenges for data mining and machine learning applications. To address this issue, we propose guidelines and a JSON schema to standardize the reporting of perovskite compositions. The schema adheres to IUPAC recommendations and is designed to make data both human- and machine-readable. It captures key descriptors such as perovskite composition, molecular formula, SMILES representation, IUPAC name, and CAS number for each ion. To facilitate adoption, we have developed utilities to automatically generate comprehensive and standardized perovskite descriptions from standard ion abbreviations and stoichiometric coefficients. Additionally, we provide a curated database of all identified perovskite ions with associated descriptive data.
Free, publicly-accessible full text available December 1, 2026
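
The perovskite abstract above mentions utilities that generate standardized descriptions from ion abbreviations and stoichiometric coefficients. A minimal sketch of that idea follows. The ion table, the function name `describe_perovskite`, and every field name are hypothetical illustrations; they are not the schema or utilities published with the paper, which also record SMILES, IUPAC names, and CAS numbers.

```python
import json

# Hypothetical mini ion table (assumption, not the paper's curated database).
IONS = {
    "MA": {"name": "methylammonium", "formula": "CH6N", "charge": 1},
    "FA": {"name": "formamidinium", "formula": "CH5N2", "charge": 1},
    "Cs": {"name": "cesium", "formula": "Cs", "charge": 1},
    "Pb": {"name": "lead(II)", "formula": "Pb", "charge": 2},
    "I": {"name": "iodide", "formula": "I", "charge": -1},
    "Br": {"name": "bromide", "formula": "Br", "charge": -1},
}


def describe_perovskite(sites):
    """Build a description from {site: [(abbreviation, coefficient), ...]}."""
    ions, label, net_charge = [], "", 0.0
    for site in ("A", "B", "X"):
        for abbreviation, coefficient in sites[site]:
            entry = IONS[abbreviation]  # KeyError flags an unknown abbreviation
            ions.append({"site": site, "abbreviation": abbreviation,
                         "coefficient": coefficient, **entry})
            # Omit a coefficient of 1, as in conventional formulas.
            label += abbreviation + ("" if coefficient == 1 else f"{coefficient:g}")
            net_charge += coefficient * entry["charge"]
    return {"composition": label, "ions": ions,
            "charge_balanced": abs(net_charge) < 1e-9}


example = describe_perovskite(
    {"A": [("FA", 0.5), ("MA", 0.5)], "B": [("Pb", 1)], "X": [("I", 3)]}
)
print(json.dumps(example, indent=2))
```

Keeping the per-ion data in one lookup table means that a composition only needs abbreviations and coefficients, and a simple charge-balance check can catch many transcription errors.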
Electrochemistry as a Tool for Redox‐Based Bio‐Information Processing

https://doi.org/10.1002/advs.202510184

Kim, Eunkyoung; Chen, Chen‐yu; Zakaria, Fauziah Rahma; Motabar, Dana; Kang, Mijeong; Kelly, Deanna L; Napolitano, Alessandra; Bentley, William E; Payne, Gregory F (August 2025, Advanced Science)

Abstract Redox, a native modality in biology involving the flow of electrons, energy, and information, is used for energy‐harvesting, biosynthesis, immune‐defense, and signaling. Because electrons (in contrast to protons) are not soluble in the medium, electron‐flow through the redox modality occurs through redox reactions that are sometimes organized into pathways and networks (e.g., redox interactomes). Redox is also accessible to electrochemistry, which enables electrodes to receive and transmit electrons to exchange energy and information with biology. In this Perspective, efforts to develop electrochemistry as a tool for redox‐based bio‐information processing: to interconvert redox‐based molecular attributes into interpretable electronic signals, are described. Using a series of Case Studies, how the information‐content of the measurements can be enriched using: diffusible mediators; tuned electrical input sequences; and cross‐modal measurements (e.g., electrical plus spectral), is shown. Also, theory‐guided feature engineering approaches to compress the information in the electronic signals into quantitative metrics (i.e., features) that can serve as correlating variables for pattern recognition by data‐driven analysis are described. Finally, how redox provides a modality for electrogenetic actuation is illustrated. It is suggested that electrochemistry's capabilities to provide real‐time, low‐cost, and high‐content data in an electronic format allow the feedback‐control needed for autonomous learning and deployable sensing/actuation.
Free, publicly-accessible full text available August 22, 2026
Unified Designs of Multi-Rail-Aware MPI Allreduce and Alltoall Operations Across Diverse GPU and Interconnect Systems

https://doi.org/10.1109/IPDPS64566.2025.00088

Chen, Chen-Chun; Yao, Jinghan; Xu, Lang; Subramoni, Hari; Panda, Dhabaleswar K (June 2025, IEEE)

Free, publicly-accessible full text available June 3, 2026
ST-FiT: Inductive Spatial-Temporal Forecasting with Limited Training Data

https://doi.org/10.1609/aaai.v39i11.33310

Lei, Zhenyu; Dong, Yushun; Li, Jundong; Chen, Chen (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Spatial-temporal graphs are widely used in a variety of real-world applications. Spatial-Temporal Graph Neural Networks (STGNNs) have emerged as a powerful tool to extract meaningful insights from this data. However, in real-world applications, most nodes may not possess any available temporal data during training. For example, the pandemic dynamics of most cities on a geographical graph may not be available due to the asynchronous nature of outbreaks. Such a phenomenon conflicts with the training requirements of most existing spatial-temporal forecasting methods, which jeopardizes their effectiveness and thus blocks broader deployment. In this paper, we propose to formulate a novel problem of inductive forecasting with limited training data. In particular, given a spatial-temporal graph, we aim to learn a spatial-temporal forecasting model that can be easily generalized to those nodes without any available temporal training data. To handle this problem, we propose a principled framework named ST-FiT. ST-FiT consists of two key learning components: temporal data augmentation and spatial graph topology learning. With such a design, ST-FiT can be used on top of any existing STGNNs to achieve superior performance on the nodes without training data. Extensive experiments verify the effectiveness of ST-FiT from multiple key perspectives.
Free, publicly-accessible full text available April 11, 2026
Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning

https://doi.org/10.1609/aaai.v39i16.33830

Fu, Xingbo; Chen, Zihan; He, Yinhan; Wang, Song; Zhang, Binchi; Chen, Chen; Li, Jundong (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Federated Graph Learning (FGL) enables multiple clients to jointly train powerful graph learning models, e.g., Graph Neural Networks (GNNs), without sharing their local graph data for graph-related downstream tasks, such as graph property prediction. In the real world, however, the graph data can suffer from significant distribution shifts across clients as the clients may collect their graph data for different purposes. In particular, graph properties are usually associated with invariant label-relevant substructures (i.e., subgraphs) across clients, while label-irrelevant substructures can appear in a client-specific manner. The issue of distribution shifts of graph data hinders the efficiency of GNN training and leads to serious performance degradation in FGL. To tackle the aforementioned issue, we propose a novel FGL framework entitled FedVN that eliminates distribution shifts through client-specific graph augmentation strategies with multiple learnable Virtual Nodes (VNs). Specifically, FedVN lets the clients jointly learn a set of shared VNs while training a global GNN model. To eliminate distribution shifts, each client trains a personalized edge generator that determines how the VNs connect local graphs in a client-specific manner. Furthermore, we provide theoretical analyses indicating that FedVN can eliminate distribution shifts of graph data across clients. Comprehensive experiments on four datasets under five settings demonstrate the superiority of our proposed FedVN over nine baselines.
Free, publicly-accessible full text available April 11, 2026
Demystify Epidemic Containment in Directed Networks: Theory and Algorithms

https://doi.org/10.1145/3701551.3703575

He, Yinhan; Chen, Chen; Wang, Song; Min, Guanghui; Li, Jundong (March 2025, ACM)

Epidemic containment has long been a crucial task in many high-stakes application domains, ranging from public health to misinformation dissemination. Existing studies for epidemic containment are primarily focused on undirected networks, assuming that the infection rate is constant throughout the contact network regardless of the strength and direction of contact. However, such an assumption can be unrealistic given the asymmetric nature of the real-world infection process. To tackle the epidemic containment problem in directed networks, simply grafting the methods designed for undirected networks can be problematic, as most of the existing methods rely on the orthogonality and Lipschitz continuity in the eigensystem of the underlying contact network, which do not hold for directed networks. In this work, we derive a theoretical analysis on the general epidemic threshold condition for directed networks and show that such a threshold condition can be used as an optimization objective to control the spread of the disease. Based on the epidemic threshold, we propose an asymptotically greedy algorithm DINO (DIrected NetwOrk epidemic containment) to identify the most critical nodes for epidemic containment. The proposed algorithm is evaluated on real-world directed networks, and the results validate its effectiveness and efficiency.
Free, publicly-accessible full text available March 10, 2026
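
The epidemic containment abstract above uses a threshold condition as an optimization objective but does not state its form. As a rough illustration only, the sketch below assumes the classical spectral-radius threshold from the SIS literature (an outbreak tends to die out when the infection-to-recovery ratio is below the reciprocal of the adjacency matrix's spectral radius) and applies a brute-force greedy node removal. This is not the DINO algorithm; the function names are hypothetical.

```python
import numpy as np


def spectral_radius(adj):
    """Largest eigenvalue magnitude of a (possibly asymmetric) adjacency matrix."""
    if adj.size == 0:
        return 0.0
    return float(np.max(np.abs(np.linalg.eigvals(adj))))


def greedy_containment(adj, k):
    """Remove up to k nodes, each time picking the node whose removal
    lowers the spectral radius of the remaining directed graph the most."""
    adj = np.asarray(adj, dtype=float)
    remaining = list(range(adj.shape[0]))
    removed = []
    for _ in range(min(k, len(remaining))):
        best_node, best_radius = None, float("inf")
        for node in remaining:
            keep = [n for n in remaining if n != node]
            radius = spectral_radius(adj[np.ix_(keep, keep)])
            # The small margin makes ties resolve to the earliest node.
            if radius < best_radius - 1e-12:
                best_node, best_radius = node, radius
        removed.append(best_node)
        remaining.remove(best_node)
    return removed


# Directed 3-cycle 0 -> 1 -> 2 -> 0 with an extra edge 2 -> 3.
adj = np.zeros((4, 4))
adj[0, 1] = adj[1, 2] = adj[2, 0] = adj[2, 3] = 1.0
print(spectral_radius(adj), greedy_containment(adj, 1))
```

Recomputing eigenvalues for every candidate is expensive on large graphs, which is one reason practical methods rely on approximations rather than this exhaustive loop.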
Disco: A Compact Index for LSM-trees

https://doi.org/10.1145/3709683

Zhong, Wenshao; Chen, Chen; Wu, Xingbo; Eriksson, Jakob (February 2025, Proceedings of the ACM on Management of Data)

Many key-value stores and database systems use log-structured merge-trees (LSM-trees) as their storage engines because of their excellent write performance. However, the read performance of LSM-trees is suboptimal due to the overlapping sorted runs. Most existing efforts rely on filters to reduce unnecessary I/Os, but filters fundamentally do not help locate items and often become the bottleneck of the system. We identify that the lack of an efficient index is the root cause of subpar read performance in LSM-trees. In this paper, we propose Disco: a compact index for LSM-trees. Disco indexes all the keys in an LSM-tree, so a query does not have to search every run of the LSM-tree. It records compact key representations to minimize the number of key comparisons so as to minimize cache misses and I/Os for both point and range queries. Disco guarantees that both point queries and seeks issue at most one I/O to the underlying runs, achieving an I/O efficiency close to a B⁺-tree. Disco improves upon REMIX's pioneering multi-run index design with additional compact key representations to help improve read performance. The representations are compact so the cost of persisting Disco to disk is small. Moreover, while a traditional LSM-tree has to choose a more aggressive compaction policy that slows down write performance to have better read performance, a Disco-indexed LSM-tree can employ a write-efficient policy and still have good read performance. Experimental results show that Disco can save I/Os and improve point and range query performance by up to 220% over RocksDB while maintaining efficient writes.
Free, publicly-accessible full text available February 10, 2026
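
The Disco abstract above contrasts searching every sorted run with consulting one index that covers all keys. The toy sketch below illustrates only that contrast, with in-memory lists standing in for on-disk runs. It stores full keys in the index, so it does not reproduce Disco's compact key representations; all names are hypothetical.

```python
from bisect import bisect_left


def get_by_scanning_runs(runs, key):
    """Baseline: probe runs from newest to oldest; return (value, runs probed)."""
    probes = 0
    for run in runs:
        probes += 1
        keys = [k for k, _ in run]
        pos = bisect_left(keys, key)
        if pos < len(keys) and keys[pos] == key:
            return run[pos][1], probes
    return None, probes


def build_global_index(runs):
    """Sorted list of (key, run_id, offset) over all runs.
    run_id 0 is the newest run, so the newest version of a key sorts first."""
    return sorted(
        (key, run_id, offset)
        for run_id, run in enumerate(runs)
        for offset, (key, _) in enumerate(run)
    )


def get_via_index(runs, index, key):
    """Locate the key in the index, then touch exactly one run entry."""
    pos = bisect_left(index, (key, -1, -1))
    if pos == len(index) or index[pos][0] != key:
        return None
    _, run_id, offset = index[pos]
    return runs[run_id][offset][1]


# Two sorted runs, newest first; key "b" appears in both.
runs = [[("b", 20), ("d", 40)], [("a", 1), ("b", 2), ("c", 3)]]
index = build_global_index(runs)
print(get_via_index(runs, index, "b"), get_by_scanning_runs(runs, "c"))
```

With the index, a lookup reads at most one entry from the runs regardless of how many runs exist, whereas the baseline may probe each run in turn.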
FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition

Wu, Wenhan; Wang, Pengfei; Chen, Chen; Lu, Aidong (January 2025, IEEE International Conference on Automatic Face and Gesture Recognition)

Transformer-based human skeleton action recognition has been developed for years. However, the complexity and high parameter count demands of these models hinder their practical applications, especially in resource-constrained environments. In this work, we propose FreqMixFormerV2, which builds upon the Frequency-aware Mixed Transformer (FreqMixFormer) for identifying subtle and discriminative actions with pioneering frequency-domain analysis. We design a lightweight architecture that maintains robust performance while significantly reducing the model complexity. This is achieved through a redesigned frequency operator that optimizes high-frequency and low-frequency parameter adjustments, and a simplified frequency-aware attention module. These improvements result in a substantial reduction in model parameters, enabling efficient deployment with only a minimal sacrifice in accuracy. Comprehensive evaluations on standard datasets (NTU RGB+D, NTU RGB+D 120, and NW-UCLA) demonstrate that the proposed model achieves a superior balance between efficiency and accuracy, outperforming state-of-the-art methods with only 60% of the parameters.
Free, publicly-accessible full text available January 30, 2026
Manipulable Semantic Components: A Computational Representation of Data Visualization Scenes

https://doi.org/10.1109/TVCG.2024.3456296

Liu, Zhicheng; Chen, Chen; Hooker, John (January 2025, IEEE Transactions on Visualization and Computer Graphics)

Free, publicly-accessible full text available January 1, 2026
