NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

State Space Models are Strong Text Rerankers

https://doi.org/10.18653/v1/2025.repl4nlp-1.12

Xu, Zhichao; Yan, Jinghua; Gupta, Ashim; Srikumar, Vivek (May 2025, Proceedings of the 10th Workshop on Representation Learning for NLP (RepL4NLP 2025), Association for Computational Linguistics)

Free, publicly-accessible full text available May 4, 2026
In-Context Example Ordering Guided by Label Distributions

https://doi.org/10.18653/v1/2024.findings-naacl.167

Xu, Zhichao; Cohen, Daniel; Wang, Bei; Srikumar, Vivek (June 2024, Association for Computational Linguistics, Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL))

Full Text Available
An In-depth Investigation of User Response Simulation for Conversational Search

https://doi.org/10.1145/3589334.3645447

Wang, Zhenduo; Xu, Zhichao; Srikumar, Vivek; Ai, Qingyao (May 2024, ACM)

Full Text Available
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression

https://doi.org/10.18653/v1/2024.findings-emnlp.901

Xu, Zhichao; Gupta, Ashim; Li, Tao; Bentham, Oliver; Srikumar, Vivek (January 2024, Association for Computational Linguistics)

Full Text Available
Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals

https://doi.org/10.18653/v1/2024.findings-emnlp.205

Elazar, Yanai; Paranjape, Bhargavi; Peng, Hao; Wiegreffe, Sarah; Chandu, Khyathi; Srikumar, Vivek; Singh, Sameer; Smith, Noah A (January 2024, Association for Computational Linguistics)

The inevitable appearance of spurious correlations in training datasets hurts the generalization of NLP models on unseen data. Previous work has found that datasets with paired inputs are prone to correlations between a specific part of the input (e.g., the hypothesis in NLI) and the label; consequently, models trained only on those outperform chance. Are these correlations picked up by models trained on the full input data? To address this question, we propose a new evaluation method, Counterfactual Attentiveness Test (CAT). CAT uses counterfactuals by replacing part of the input with its counterpart from a different example (subject to some restrictions), expecting an attentive model to change its prediction. Using CAT, we systematically investigate established supervised and in-context learning models on ten datasets spanning four tasks: natural language inference, reading comprehension, paraphrase detection, and visual & language reasoning. CAT reveals that reliance on such correlations is mainly data-dependent. Surprisingly, we find that GPT3 becomes less attentive with an increased number of demonstrations, while its accuracy on the test data improves. Our results demonstrate that augmenting training or demonstration data with counterfactuals is effective in improving models’ attentiveness. We show that models’ attentiveness measured by CAT reveals different conclusions from solely measuring correlations in data.
more » « less
Full Text Available
TopoBERT: Exploring the topology of fine-tuned word representations

https://doi.org/10.1177/14738716231168671

Rathore, Archit; Zhou, Yichu; Srikumar, Vivek; Wang, Bei (July 2023, Information Visualization)

Transformer-based language models such as BERT and its variants have found widespread use in natural language processing (NLP). A common way of using these models is to fine-tune them to improve their performance on a specific task. However, it is currently unclear how the fine-tuning process affects the underlying structure of the word embeddings from these models. We present TopoBERT, a visual analytics system for interactively exploring the fine-tuning process of various transformer-based models – across multiple fine-tuning batch updates, subsequent layers of the model, and different NLP tasks – from a topological perspective. The system uses the mapper algorithm from topological data analysis (TDA) to generate a graph that approximates the shape of a model’s embedding space for an input dataset. TopoBERT enables its users (e.g. experts in NLP and linguistics) to (1) interactively explore the fine-tuning process across different model-task pairs, (2) visualize the shape of embedding spaces at multiple scales and layers, and (3) connect linguistic and contextual information about the input dataset with the topology of the embedding space. Using TopoBERT, we provide various use cases to exemplify its applications in exploring fine-tuned word embeddings. We further demonstrate the utility of TopoBERT, which enables users to generate insights about the fine-tuning process and provides support for empirical validation of these insights.
more » « less
Full Text Available
VERB: Visualizing and Interpreting Bias Mitigation Techniques Geometrically for Word Representations

Rathore, Archit; Dev, Sunipa; Phillips, Jeff M.; Srikumar, Vivek; Zheng, Yan; Yeh, Chin-Chia Michael; Wang, Junpeng; Zhang, Wei; Wang, Bei (January 2024, ACM transactions on interactive intelligent systems)

Word vector embeddings have been shown to contain and amplify biases in the data they are extracted from. Consequently, many techniques have been proposed to identify, mitigate, and attenuate these biases in word representations. In this paper, we utilize interactive visualization to increase the interpretability and accessibility of a collection of state-of-the-art debiasing techniques. To aid this, we present the Visualization of Embedding Representations for deBiasing (“VERB”) system, an open-source web-based visualization tool that helps users gain a technical understanding and visual intuition of the inner workings of debiasing techniques, with a focus on their geometric properties. In particular, VERB offers easy-to-follow examples that explore the effects of these debiasing techniques on the geometry of high-dimensional word vectors. To help understand how various debiasing techniques change the underlying geometry, VERB decomposes each technique into interpretable sequences of primitive transformations and highlights their effect on the word vectors using dimensionality reduction and interactive visual exploration. VERB is designed to target natural language processing (NLP) practitioners who are designing decision-making systems on top of word embeddings, and also researchers working with the fairness and ethics of machine learning systems in NLP. It can also serve as a visual medium for education, which helps an NLP novice understand and mitigate biases in word embeddings.
more » « less
Full Text Available
A machine learning approach to identifying suicide risk among text-based crisis counseling encounters

https://doi.org/10.3389/fpsyt.2023.1110527

Broadbent, Meghan; Medina Grespan, Mattia; Axford, Katherine; Zhang, Xinyao; Srikumar, Vivek; Kious, Brent; Imel, Zac (March 2023, Frontiers in Psychiatry)

IntroductionWith the increasing utilization of text-based suicide crisis counseling, new means of identifying at risk clients must be explored. Natural language processing (NLP) holds promise for evaluating the content of crisis counseling; here we use a data-driven approach to evaluate NLP methods in identifying client suicide risk. MethodsDe-identified crisis counseling data from a regional text-based crisis encounter and mobile tipline application were used to evaluate two modeling approaches in classifying client suicide risk levels. A manual evaluation of model errors and system behavior was conducted. ResultsThe neural model outperformed a term frequency-inverse document frequency (tf-idf) model in the false-negative rate. While 75% of the neural model’s false negative encounters had some discussion of suicidality, 62.5% saw a resolution of the client’s initial concerns. Similarly, the neural model detected signals of suicidality in 60.6% of false-positive encounters. DiscussionThe neural model demonstrated greater sensitivity in the detection of client suicide risk. A manual assessment of errors and model performance reflected these same findings, detecting higher levels of risk in many of the false-positive encounters and lower levels of risk in many of the false negatives. NLP-based models can detect the suicide risk of text-based crisis encounters from the encounter’s content.
more » « less
Full Text Available
VERB: Visualizing and Interpreting Bias Mitigation Techniques Geometrically for Word Representations

https://doi.org/10.1145/3604433

Rathore, Archit; Dev, Sunipa; Phillips, Jeff M.; Srikumar, Vivek; Zheng, Yan; Yeh, Chin-Chia Michael; Wang, Junpeng; Zhang, Wei; Wang, Bei (June 2023, ACM Transactions on Interactive Intelligent Systems)

Word vector embeddings have been shown to contain and amplify biases in the data they are extracted from. Consequently, many techniques have been proposed to identify, mitigate, and attenuate these biases in word representations. In this paper, we utilize interactive visualization to increase the interpretability and accessibility of a collection of state-of-the-art debiasing techniques. To aid this, we present the Visualization of Embedding Representations for deBiasing (“VERB”) system, an open-source web-based visualization tool that helps users gain a technical understanding and visual intuition of the inner workings of debiasing techniques, with a focus on their geometric properties. In particular, VERB offers easy-to-follow examples that explore the effects of these debiasing techniques on the geometry of high-dimensional word vectors. To help understand how various debiasing techniques change the underlying geometry, VERB decomposes each technique into interpretable sequences of primitive transformations and highlights their effect on the word vectors using dimensionality reduction and interactive visual exploration. VERB is designed to target natural language processing (NLP) practitioners who are designing decision-making systems on top of word embeddings, and also researchers working with the fairness and ethics of machine learning systems in NLP. It can also serve as a visual medium for education, which helps an NLP novice understand and mitigate biases in word embeddings.
more » « less
Full Text Available
Logic-driven Indirect Supervision: An Application to Crisis Counseling

https://doi.org/10.18653/v1/2023.acl-long.654

Medina Grespan, Mattia; Broadbent, Meghan; Zhang, Xinyao; Axford, Katherine; Kious, Brent; Imel, Zac; Srikumar, Vivek (January 2023, Association for Computational Linguistics)

Ensuring the effectiveness of text-based crisis counseling requires observing ongoing conversations and providing feedback, both labor-intensive tasks. Automatic analysis of conversations—at the full chat and utterance levels—may help support counselors and provide better care. While some session-level training data (e.g., rating of patient risk) is often available from counselors, labeling utterances requires expensive post hoc annotation. But the latter can not only provide insights about conversation dynamics, but can also serve to support quality assurance efforts for counselors. In this paper, we examine if inexpensive—and potentially noisy—session-level annotation can help improve label utterances. To this end, we propose a logic-based indirect supervision approach that exploits declaratively stated structural dependencies between both levels of annotation to improve utterance modeling. We show that adding these rules gives an improvement of 3.5% f-score over a strong multi-task baseline for utterance-level predictions. We demonstrate via ablation studies how indirect supervision via logic rules also improves the consistency and robustness of the system.
more » « less
Full Text Available

« Prev Next »

Search for: All records