NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Improving Multimodal Large Language Models Using Continual Learning

Srivastava, S; Harun, MY; Singh, R; Kanan, C (August 2025, Proc. Conference on Lifelong Learning Agents (CoLLAs))

Generative large language models (LLMs) exhibit impressive capabilities, which can be further augmented by integrating a pre-trained vision model into the original LLM to create a multimodal LLM (MLLM). However, this integration often significantly decreases performance on natural language understanding and generation tasks, compared to the original LLM. This study investigates this issue using the LLaVA MLLM, treating the integration as a continual learning problem. We evaluate five continual learning methods to mitigate forgetting and identify a technique that enhances visual understanding while minimizing linguistic performance loss. Our approach reduces linguistic performance degradation by up to 15% over the LLaVA recipe, while maintaining high multimodal accuracy. We also demonstrate the robustness of our method through continual learning on a sequence of vision-language tasks, effectively preserving linguistic skills while acquiring new multimodal capabilities.
more » « less
Free, publicly-accessible full text available August 11, 2026
Image Segmentation by Latent Space Phase-Gating with Applications in High-Content Screening

Yu, J; Singh, R (January 2025, Springer)
Bebis, G (Ed.)
Schistosomiasis is a parasitic disease with significant global health and socio-economic implications. Drug discovery for schistosomiasis typically involves high-content whole-organism screening. In this approach, parasites are ex-posed to various chemical compounds and their systemic, whole-organism-level responses are captured via microscopy and analyzed to obtain a quanti-tative assessment of chemical effect. These effects are multidimensional and time-varying, impacting shape, appearance, and behavior. Accurate identifi-cation of object boundaries is essential for preparing images for subsequent analysis in high-content studies. Object segmentation is one of the most deeply studied problems in computer vision where recent efforts have incor-porated deep learning. Emerging results indicate that acquiring robust fea-tures in spectral domain using Fast Fourier Transform (FFT) within Deep Neural Networks (DNNs) can enhance segmentation accuracy. In this paper, we explore this direction further and propose a latent space Phase-Gating (PG) method that builds upon FFT and leverages phase information to effi-ciently identify globally significant features. While the importance of phase in analyzing signals has long been known, technical difficulties in calculat-ing phase in manners that are invariant to imaging parameters has limited its use. A key result of this paper is to show how phase information can be in-corporated in neural architectures that are compact. Experiments conducted on complex HCS datasets demonstrate how this idea leads to improved seg-mentation accuracy, while maintaining robustness against commonly en-countered noise (blurring) in HCS. The compactness of the proposed method also makes it well-suited for application specific architectures (ASIC) de-signed for high-content screening.
more » « less
Free, publicly-accessible full text available January 22, 2026
Finite Time Logarithmic Regret Bounds for Self-Tuning Regulation

Singh, R; Mete, A; Kar, A; Kumar, P R (July 2024, Proceedings of the 41st International Conference on Machine Learning)

We establish the first finite-time logarithmic regret bounds for the self-tuning regulation problem. We introduce a modified version of the certainty equivalence algorithm, which we call PIECE, that clips inputs in addition to utilizing probing inputs for exploration. We show that it has a ClogT upper bound on the regret after T time-steps for bounded noise, and Clog3T in the case of sub-Gaussian noise, unlike the LQ problem where logarithmic regret is shown to be not possible. The PIECE algorithm is also designed to address the critical challenge of poor initial transient performance of reinforcement learning algorithms for linear systems. Comparative simulation results illustrate the improved performance of PIECE.
more » « less
Full Text Available
Spin Echo, Fidelity, and the Quantum Critical Fan in ${TmVO}_{4}$

https://doi.org/10.1103/PhysRevLett.132.216502

Nian, Y-H; Vinograd, I.; Green, T.; Chaffey, C.; Massat, P.; Singh, R. R. P.; Zic, M. P.; Fisher, I. R.; Curro, N. J. (May 2024, Physical Review Letters)
An Integrated Shape-Texture Descriptor for Modeling Whole-Organism Phenotypes in Drug Screening

Yu, J; Singh, R (January 2023, Springer)
Bebis, G (Ed.)
Full Text Available
An Integrated Shape-Texture Descriptor for Modeling Whole-Organism Phenotypes in Drug Screening

Yu, J; Singh, R (January 2023, Springer)
Bebis, G (Ed.)
Full Text Available
Analysis of SARS-CoV-2 Temporal Molecular Networks using Global and Local Topological Characteristics

Senchyna, F; Singh, R (October 2022, Springer)

Full Text Available
Using Authorship Verification to Mitigate Abuse in Online Communities

Weerasinghe, J.; Singh, R.; Greenstadt, R. (May 2022, Proceedings of the International AAAI Conference on Weblogs and Social Media)

Social media has become an important method for information sharing. This has also created opportunities for bad actors to easily spread disinformation and manipulate public opinion. This paper explores the possibility of applying Authorship Verification on online communities to mitigate abuse by analyzing the writing style of online accounts to identify accounts managed by the same person. We expand on our similarity-based authorship verification approach, previously applied on large fanfictions, and show that it works in open-world settings, shorter documents, and is largely topic-agnostic. Our expanded model can link Reddit accounts based on the writing style of only 40 comments with an AUC of 0.95, and the performance increases to 0.98 given more content. We apply this model on a set of suspicious Reddit accounts associated with the disinformation campaign surrounding the 2016 U.S. presidential election and show that the writing style of these accounts are inconsistent, indicating that each account was likely maintained by multiple individuals. We also apply this model to Reddit user accounts that commented on the WallStreetBets subreddit around the 2021 GameStop short squeeze and show that a number of account pairs share very similar writing styles. We also show that this approach can link accounts across Reddit and Twitter with an AUC of 0.91 even when training data is very limited.
more » « less
Full Text Available
A simple and general debiased machine learning theorem with finite-sample guarantees

https://doi.org/10.1093/biomet/asac033

Chernozhukov, V; Newey, W K; Singh, R (June 2022, Biometrika)

Debiased machine learning is a meta-algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals, i.e., scalar summaries, of machine learning algorithms. For example, an analyst may seek the confidence interval for a treatment effect estimated with a neural network. We present a non-asymptotic debiased machine learning theorem that encompasses any global or local functional of any machine learning algorithm that satisfies a few simple, interpretable conditions. Formally, we prove consistency, Gaussian approximation and semiparametric efficiency by finite-sample arguments. The rate of convergence is $$n^{-1/2}$$ for global functionals, and it degrades gracefully for local functionals. Our results culminate in a simple set of conditions that an analyst can use to translate modern learning theory rates into traditional statistical inference. The conditions reveal a general double robustness property for ill-posed inverse problems.
more » « less
Full Text Available
Identifying and Characterizing Opioid Addiction States Using Social Media Posts

https://doi.org/10.1109/BIBM52615.2021.9669628

Jha, D; La Marca, S; Singh R (December 2021, Proceedings IEEE International Conference on Bioinformatics and Biomedicine Workshops)

Opioid addiction constitutes a significant contemporary health crisis that is multifarious in its complexity. Modeling the epidemiology of any addiction is challenging in its own right. For opioid addiction, the challenge is exacerbated due to the difficulties in collecting real-time data and the circumscribed nature of information opioid users may disclose owing to stigma associated with prescription misuse. Given this context, identifying the progression of individuals through the stages of (opioid) addiction is one of the more acute problems in epidemiological modeling whose solution is crucial for designing specific interventions at both personal and population levels. We describe a computational approach for determining and characterizing addiction stages of opioid users from their social media posts. The proposed approach combines recurrent neural network learning with information-theoretic analysis of word-associations and context-based word embedding to determine addiction stage-specific language usage. Users who have a high likelihood for relapsing back to drug-use are identified and characterized using propensity score matching and logistic regression. Experimental evaluations indicate that the proposed approach can distinguish between various addiction stages and identify users prone to relapse with high accuracy as evidenced by F1 scores of 0.88 and 0.79 respectively
more » « less
Full Text Available

« Prev Next »

Search for: All records