NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Scientific Equation Discovery using Modular Symbolic Regression via Vision-Language Guidance

Fatemi, Sepideh; Neog, Abhilash; Marchisin, Emma; Dutta, Amartya; Sawhney, Medha; Hanson, Paul; Karpatne, Anuj (June 2025, CVPR 2025 Workshop on Computer Vision for Science)

Free, publicly-accessible full text available June 11, 2026
Physics-Guided Hierarchical Neural Networks for Maxwell’s Equations in Plasmonic Metamaterials

https://doi.org/10.1021/acsphotonics.5c00552

Lynch, Sean; LaMountain, Jacob; Fan, Bo; Bu, Jie; Raju, Amogh; Wasserman, Daniel; Karpatne, Anuj; Podolskiy, Viktor_A (July 2025, ACS Photonics)
Masking the Gaps: An Imputation-Free Approach to Time Series Modeling with Missing Data

Neog, Abhilash; Daw, Arka; Khorasgani, Sepideh Fatemi; Karpatne, Anuj (December 2024, NeurIPS 2024 Workshop on Time Series in the Age of Large Models)

Full Text Available
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis

Chowdhury, Arpita; Paul, Dipanjyoti; Mai, Zheda; Gu, Jianyang; Zhang, Ziheng; Mehrab, Kazi Sajeed; Campolongo, Elizabeth G; Rubenstein, Daniel; Stewart, Charles V; Karpatne, Anuj; et al (June 2025, Proceedings of the Computer Vision and Pattern Recognition Conference)

We present a simple approach to make pre-trained Vision Transformers (ViTs) interpretable for fine-grained analysis, aiming to identify and localize the traits that distinguish visually similar categories, such as bird species. Pre-trained ViTs, such as DINO, have demonstrated remarkable capabilities in extracting localized, discriminative features. However, saliency maps like Grad-CAM often fail to identify these traits, producing blurred, coarse heatmaps that highlight entire objects instead. We propose a novel approach, Prompt Class Attention Map (Prompt-CAM), to address this limitation. Prompt-CAM learns class-specific prompts for a pre-trained ViT and uses the corresponding outputs for classification. To correctly classify an image, the true-class prompt must attend to unique image patches not present in other classes' images (i.e., traits). As a result, the true class's multi-head attention maps reveal traits and their locations. Implementation-wise, Prompt-CAM is almost a "free lunch," requiring only a modification to the prediction head of Visual Prompt Tuning (VPT). This makes Prompt-CAM easy to train and apply, in stark contrast to other interpretable methods that require designing specific models and training processes. Extensive empirical studies on a dozen datasets from various domains (e.g., birds, fishes, insects, fungi, flowers, food, and cars) validate the superior interpretation capability of Prompt-CAM. The source code and demo are available at https://github.com/Imageomics/Prompt_CAM.
more » « less
Free, publicly-accessible full text available June 1, 2026
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images

Mehrab, Kazi Sajeed; Maruf, M; Daw, Arka; Neog, Abhilash; Manogaran, Harish Babu; Khurana, Mridul; Feng, Zhenyang; Altintas, Bahadir; Bakis, Yasin; Campolongo, Elizabeth; et al (June 2025, CVPR)

The availability of large datasets of organism images combined with advances in artificial intelligence (AI) has significantly enhanced the study of organisms through images, unveiling biodiversity patterns and macro-evolutionary trends. However, existing machine learning (ML)-ready organism datasets have several limitations. First, these datasets often focus on species classification only, overlooking tasks involving visual traits of organisms. Second, they lack detailed visual trait annotations, like pixel-level segmentation, that are crucial for in-depth biological studies. Third, these datasets predominantly feature organisms in their natural habitats, posing challenges for aquatic species like fish, where underwater images often suffer from poor visual clarity, obscuring critical biological traits. This gap hampers the study of aquatic biodiversity patterns which is necessary for the assessment of climate change impacts, and evolutionary research on aquatic species morphology. To address this, we introduce the Fish-Visual Trait Analysis (Fish-Vista) dataset—a large, annotated collection of about 80K fish images spanning 3000 different species, supporting several challenging and biologically relevant tasks including species classification, trait identification, and trait segmentation. These images have been curated through a sophisticated data processing pipeline applied to a cumulative set of images obtained from various museum collections. Fish-Vista ensures that visual traits of images are clearly visible, and provides fine-grained labels of various visual traits present in each image. It also offers pixel-level annotations of 9 different traits for about 7000 fish images, facilitating additional trait segmentation and localization tasks. The ultimate goal of Fish-Vista is to provide a clean, carefully curated, high-resolution dataset that can serve as a foundation for accelerating biological discoveries using advances in AI. Finally, we provide a comprehensive analysis of state-of-the-art deep learning techniques on Fish-Vista.
more » « less
Free, publicly-accessible full text available June 15, 2026
LakeBeD-US: a benchmark dataset for lake water quality time series and vertical profiles

https://doi.org/10.5194/essd-17-3141-2025

McAfee, Bennett J; Pradhan, Aanish; Neog, Abhilash; Fatemi, Sepideh; Hensley, Robert T; Lofton, Mary E; Karpatne, Anuj; Carey, Cayelan C; Hanson, Paul C (January 2025, Earth System Science Data)

Abstract. Water quality in lakes is an emergent property of complex biotic and abiotic processes that differ across spatial and temporal scales. Water quality is also a determinant of ecosystem services that lakes provide and is thus of great interest to ecologists. Machine learning and other computer science techniques are increasingly being used to predict water quality dynamics as well as to gain a greater understanding of water quality patterns and controls. To benefit the sciences of both ecology and computer science, we have created a benchmark dataset of lake water quality time series and vertical profiles. LakeBeD-US contains over 500 million unique observations of lake water quality collected by multiple long-term monitoring programs across 17 water quality variables from 21 lakes in the United States. There are two published versions of LakeBeD-US: the “Ecology Edition” published in the Environmental Data Initiative repository (https://doi.org/10.6073/pasta/c56a204a65483790f6277de4896d7140, McAfee et al., 2024) and the “Computer Science Edition” published in the Hugging Face repository (https://doi.org/10.57967/hf/3771, Pradhan et al., 2024). Each edition is formatted in a manner conducive to inquiries and analyses specific to each domain. For ecologists, LakeBeD-US: Ecology Edition provides an opportunity to study the spatial and temporal dynamics of several lakes with varying water quality, ecosystem, and landscape characteristics. For computer scientists, LakeBeD-US: Computer Science Edition acts as a benchmark dataset that enables the advancement of machine learning for water quality prediction.
more » « less
Full Text Available
Physics-guided hierarchical Neural Networks for Maxwell’s equations in metamaterials

https://doi.org/10.1364/CLEO_FS.2024.FW4Q.2

Lynch, Sean; LaMountain, Jacob; Fan, Bo; Bu, Jie; Raju, Amogh; Wasserman, Daniel; Karpatne, Anuj; Podolskiy, Viktor (January 2024, Optica Publishing Group)

We develop a hierarchical approach to building a Physics-guided neural network (PGNN) for scalable solutions of Maxwell equations with high spatial resolution and illustrate the developed formalism on a metamaterial photonic funnel example.
more » « less
Full Text Available
Improving biosensor accuracy and speed using dynamic signal change and theory-guided deep learning

https://doi.org/10.1016/j.bios.2023.115829

Zhang, Junru; Srivatsa, Purna; Ahmadzai, Fazel Haq; Liu, Yang; Song, Xuerui; Karpatne, Anuj; Kong, Zhenyu; Johnson, Blake N. (February 2024, Biosensors and Bioelectronics)

Full Text Available
MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments

Sawhney, Medha; Karmarkar, Bhas; Leaman, Eric J.; Daw, Arka; Karpatne, Anuj; Behkam, Bahareh (October 2023, arXivorg)

Tracking microrobots is challenging, considering their minute size and high speed. As the field progresses towards developing microrobots for biomedical applications and conducting mechanistic studies in physiologically relevant media (e.g., collagen), this challenge is exacerbated by the dense surrounding environments with feature size and shape comparable to microrobots. Herein, we report Motion Enhanced Multi-level Tracker (MEMTrack), a robust pipeline for detecting and tracking microrobots using synthetic motion features, deep learning-based object detection, and a modified Simple Online and Real-time Tracking (SORT) algorithm with interpolation for tracking. Our object detection approach combines different models based on the object's motion pattern. We trained and validated our model using bacterial micro-motors in collagen (tissue phantom) and tested it in collagen and aqueous media. We demonstrate that MEMTrack accurately tracks even the most challenging bacteria missed by skilled human annotators, achieving precision and recall of 77% and 48% in collagen and 94% and 35% in liquid media, respectively. Moreover, we show that MEMTrack can quantify average bacteria speed with no statistically significant difference from the laboriously-produced manual tracking data. MEMTrack represents a significant contribution to microrobot localization and tracking, and opens the potential for vision-based deep learning approaches to microrobot control in dense and low-contrast settings. All source code for training and testing MEMTrack and reproducing the results of the paper have been made publicly available this https URL.
more » « less
Full Text Available
A SIMPLE INTERPRETABLE TRANSFORMER FOR FINEGRAINED IMAGE CLASSIFICATION AND ANALYSIS

Paul, Dipanjyoti; Chowdhury, Arpita; Xiong, Xinqi; Chang, Feng-Ju; Carlyn, David; Stevens, Samuel; Provost, Kaiya; Karpatne, Anuj; Carstens, Bryan; Rubenstein, Daniel; et al (May 2024, ICLR)

We present a novel usage of Transformers to make image classification interpretable. Unlike mainstream classifiers that wait until the last fully connected layer to incorporate class information to make predictions, we investigate a proactive approach, asking each class to search for itself in an image. We realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR). We learn “class-specific” queries (one for each class) as input to the decoder, enabling each class to localize its patterns in an image via cross-attention. We name our approach INterpretable TRansformer (INTR), which is fairly easy to implement and exhibits several compelling properties. We show that INTR intrinsically encourages each class to attend distinctively; the cross-attention weights thus provide a faithful interpretation of the prediction. Interestingly, via “multi-head” cross-attention, INTR could identify different “attributes” of a class, making it particularly suitable for fine-grained classification and analysis, which we demonstrate on eight datasets. Our code and pre-trained models are publicly accessible at the Imageomics Institute GitHub site: https://github.com/Imageomics/INTR.
more » « less
Full Text Available

« Prev Next »

Search for: All records