NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Continual Learning Using a Kernel-Based Method Over Foundation Models

Momeni, Saleh; Mazumder, Sahisnu; Liu, Bing (February 2025, The 39th Annual AAAI Conference on Artificial Intelligence)

Continual learning (CL) learns a sequence of tasks incre- mentally. This paper studies the challenging CL setting of class-incremental learning (CIL). CIL has two key chal- lenges: catastrophic forgetting (CF) and inter-task class sep- aration (ICS). Despite numerous proposed methods, these issues remain persistent obstacles. This paper proposes a novel CIL method, called Kernel Linear Discriminant Analy- sis (KLDA), that can effectively avoid CF and ICS problems. It leverages only the powerful features learned in a foundation model (FM). However, directly using these features proves suboptimal. To address this, KLDA incorporates the Radial Basis Function (RBF) kernel and its Random Fourier Fea- tures (RFF) to enhance the feature representations from the FM, leading to improved performance. When a new task ar- rives, KLDA computes only the mean for each class in the task and updates a shared covariance matrix for all learned classes based on the kernelized features. Classification is performed using Linear Discriminant Analysis. Our empir- ical evaluation using text and image classification datasets demonstrates that KLDA significantly outperforms baselines. Remarkably, without relying on replay data, KLDA achieves accuracy comparable to joint training of all classes, which is considered the upper bound for CIL performance. The KLDA code is available at https://github.com/salehmomeni/klda.
more » « less
Free, publicly-accessible full text available February 25, 2026
Continual Learning Using Only Large Language Model Prompting

Qiu, Jiabao; Ke, Zixuan; Liu, Bing (January 2025, The 31st International Conference on Computational Linguistics (COLING-2025))

We introduce CLOB, a novel continual learning (CL) paradigm wherein a large language model (LLM) is regarded as a black box. Learning is done incrementally via only verbal prompting. CLOB does not fine-tune any part of the LLM or add any trainable parameters to it. It is particularly suitable for LLMs that are accessible via APIs. We also propose a new CL technique, called CIS, based on incremental summarization that also overcomes the LLM’s input length limit. Experiments show CIS outperforms baselines by a very large margin.
more » « less
Free, publicly-accessible full text available January 19, 2026
In-Context Continual Learning Assisted by an External Continual Learner

Momeni, Saleh; Mazumder, Sahisnu; Ke, Zixuan; Liu, Bing (January 2025, The 31st International Conference on Computational Linguistics (COLING-2025))

Existing continual learning (CL) methods mainly rely on fine-tuning or adapting large language mod- els (LLMs). They still suffer from catastrophic for- getting (CF). Little work has been done to exploit in-context learning (ICL) to leverage the extensive knowledge within LLMs for CL without updating any parameters. However, incrementally learning each new task in ICL necessitates adding training examples from each class of the task to the prompt, which hampers scalability as the prompt length in- creases. This issue not only leads to excessively long prompts that exceed the input token limit of the underlying LLM but also degrades the model’s performance due to the overextended context. To address this, we introduce InCA, a novel approach that integrates an external continual learner (ECL) with ICL to enable scalable CL without CF. The ECL is built incrementally to pre-select a small subset of likely classes for each test instance. By restricting the ICL prompt to only these selected classes, InCA prevents prompt lengths from becom- ing excessively long, while maintaining high per- formance. Experimental results demonstrate that InCA significantly outperforms existing CL base- lines, achieving substantial performance gains.
more » « less
Free, publicly-accessible full text available January 19, 2026
Open-world continual learning: Unifying novelty detection and continual learning

https://doi.org/10.1016/j.artint.2024.104237

Kim, Gyuhak; Xiao, Changnan; Konishi, Tatsuya; Ke, Zixuan; Liu, Bing (January 2025, Artificial Intelligence)

As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (a) they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is called class incremental learning (CIL), which is a setting of continual learning (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called closed-world OOD detection) is necessary for successful CIL. We show this by decomposing CIL into two sub-problems: within-task prediction (WP) and task-id prediction (TP), and proving that TP is correlated with closed-world OOD detection. The key theoretical result is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are necessary and sufficient conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the closed-world CIL as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to open-world CIL, which is the proposed open-world continual learning, that can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.
more » « less
Free, publicly-accessible full text available January 1, 2026
Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting Approach

Niu, Chaoxi; Pang, Guansong; Chen, Ling; Liu, Bing (December 2024, The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS))

Class-incremental learning (CIL) aims to continually learn a sequence of tasks, with each task consisting of a set of unique classes. Graph CIL (GCIL) follows the same setting but needs to deal with graph tasks (e.g., node classification in a graph). The key characteristic of CIL lies in the absence of task identifiers (IDs) during inference, which causes a significant challenge in separating classes from different tasks (i.e., inter-task class separation). Being able to accurately predict the task IDs can help address this issue, but it is a challenging problem. In this paper, we show theoretically that accurate task ID prediction on graph data can be achieved by a Laplacian smoothing-based graph task profiling approach, in which each graph task is modeled by a task prototype based on Laplacian smoothing over the graph. It guarantees that the task prototypes of the same graph task are nearly the same with a large smoothing step, while those of different tasks are distinct due to differences in graph structure and node attributes. Further, to avoid the catastrophic forgetting of the knowledge learned in previous graph tasks, we propose a novel graph prompting approach for GCIL which learns a small discriminative graph prompt for each task, essentially resulting in a separate classification model for each task. The prompt learning requires the training of a single graph neural network (GNN) only once on the first task, and no data replay is required thereafter, thereby obtaining a GCIL model being both replay-free and forget-free. Extensive experiments on four GCIL benchmarks show that i) our task prototype-based method can achieve 100% task ID prediction accuracy on all four datasets, ii) our GCIL model significantly outperforms state-of-the-art competing methods by at least 18% in average CIL accuracy, and iii) our model is fully free of forgetting on the four datasets. Code is available at https://github.com/mala-lab/TPP.
more » « less
Free, publicly-accessible full text available December 10, 2025
Open-world continual learning: Unifying novelty detection and continual learning

Kim, Gyuhak; Xiao, Changnan; Konishi, Tatsuya; Ke, Zixuan; Liu, Bing (October 2024, Artificial intelligence)

Full Text Available
Sentiment Analysis in the Era of Large Language Models: A Reality Check

Zhang, Wenxuan; Deng, Yue; Liu, Bing; Pan, Sinno J; Bing, Lidong (June 2024, ACL)

Full Text Available
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation

Zhou, Yue; Di_Eugenio, Barbara; Ziebart, Brian; Sharp, Lisa; Liu, Bing; Agadakos, Nikolaos (May 2024, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024))

Health coaching helps patients achieve personalized and lifestyle-related goals, effectively managing chronic conditions and alleviating mental health issues. It is particularly beneficial, however cost-prohibitive, for low-socioeconomic status populations due to its highly personalized and labor-intensive nature. In this paper, we propose a neuro-symbolic goal summarizer to support health coaches in keeping track of the goals and a text-units-text dialogue generation model that converses with patients and helps them create and accomplish specific goals for physical activities. Our models outperform previous state-of-the-art while eliminating the need for predefined schema and corresponding annotation. We also propose a new health coaching dataset extending previous work and a metric to measure the unconventionality of the patient’s response based on data difficulty, facilitating potential coach alerts during deployment.
more » « less
Full Text Available
Maximizing Entanglement Routing Rate in Quantum Networks: Approximation Algorithms

https://doi.org/10.1109/TNSE.2025.3542332

Nguyen, Dung_H P; Hunt, Ethan; Horton, Dillon J; Nguyen, Tu N; Liu, Bing-Hong (May 2025, IEEE Transactions on Network Science and Engineering)

Free, publicly-accessible full text available May 1, 2026
Service Recovery in NFV-Enabled Networks: Algorithm Design and Analysis

https://doi.org/10.1109/TCC.2024.3402185

Nguyen, Dung_H P; Lin, Chih-Chieh; Nguyen, Tu N; Chu, Shao-I; Liu, Bing-Hong (April 2024, IEEE Transactions on Cloud Computing)

Full Text Available

« Prev Next »

Search for: All records