NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

An Empirical Analysis of External and Internal Factors Affecting Manufacturing Firm Failure and Resilience

https://doi.org/10.1002/jcaf.22797

Yeh, Ting‐Tsen; Xiao, Yuanzhang; Daniel, Shirley J; Nguyen, Minh (May 2025, Journal of Corporate Accounting & Finance)

ABSTRACT We develop machine learning models that incorporate both external (deterministic) and internal (voluntaristic) factors affecting firm failure and survival. Using structured and unstructured data, we empirically investigate the external and internal factors that affect the US manufacturing firms’ business failure. We also examine how the interactions between external shocks and firm responses impact business failure. Our findings indicate that while external factors can significantly impact the likelihood that firms fail, specific management responses to these challenges can effectively mitigate the negative effects and contribute to firm survival.
more » « less
Free, publicly-accessible full text available May 29, 2026
REAL: Response Embedding-based Alignment for LLMs

Zhang, Honggen; Zhao, Xufeng; Molybog, Igor; Zhang, June (July 2025, arXiV)

Aligning large language models (LLMs) to human preferences is a crucial step in building helpful and safe AI tools, which usually involve training on supervised datasets. Popular algorithms such as Direct Preference Optimization (DPO) rely on pairs of AI-generated responses ranked according to human annotation. The response pair annotation process might bring human bias. Building a correct preference dataset is the costly part of the alignment pipeline. To improve annotation efficiency and quality in the LLMs alignment, we propose REAL:Response Embedding-based Alignment for LLMs, a strategy for constructing a high-quality training dataset that focuses on acquiring the less ambiguous preference pairs for labeling out of a set of response candidates. Our selection process is based on the similarity of embedding responses independently of prompts, which guarantees the selection process in an off-policy setting, avoiding adaptively measuring the similarity during the training. Experimental results on real-world dataset SHP2 and synthetic HH-RLHF benchmarks indicate that choosing dissimilar response pairs enhances the direct alignment of LLMs while reducing inherited labeling errors. The model aligned with dissimilar response pairs obtained a better margin and win rate on the dialogue task. Our findings suggest that focusing on distinct pairs can reduce the label error and improve LLM alignment efficiency, saving up to 65% of annotators’ work. The code of the work can be found https://github.com/ honggen-zhang/REAL-Alignment.
more » « less
Free, publicly-accessible full text available July 16, 2026
Autoformulation of Mathematical Optimization Models Using LLMs

Astorg, Nicolas; Liu, Tennison; Xiao, Yuanzhang; van_der_Schaar, Mihaela (July 2025, Proceedings of Machine Learning Research)

Mathematical optimization is fundamental to decision-making across diverse domains, from operations research to healthcare. Yet, translating real-world problems into optimization models remains a difficult task, often demanding specialized expertise. This paper approaches the problem of : the automated creation of solver-ready optimization models from natural language problem descriptions. We identify three core challenges of autoformulation: the vast, problem-dependent hypothesis space, efficient and diverse exploration of this space under uncertainty, and evaluation of formulation correctness against problem description. To address these challenges, we present a novel method leveraging (LLMs) with , exploiting the hierarchical nature of optimization modeling to generate and systematically explore possible formulations. To enhance search efficiency, we introduce symbolic pruning to eliminate trivially equivalent search paths (branches), and employ LLM-based evaluation of partial formulations to guide search. Empirical analysis on linear and mixed-integer programming benchmarks demonstrates our method's effectiveness, with significant performance gains from both LLM-based value estimation and symbolic pruning techniques.
more » « less
Free, publicly-accessible full text available July 14, 2026
A Novel Multidisciplinary Graduate Education Program in Data Science

Kuh, Anthony; Santhanam, Narayana; Port, Daniel; Thompson, London; Lee, Mary (July 2025, nternational Joint Conference on Neural Networks)

There has been an explosion of growth in using AI, data science, and machine learning in all aspects of our daily life. There is a global competition among governments, industry, and academic institutions to lead research and development in this area. This paper discusses a novel multidisciplinary graduate education and research program at our institution to help develop a trained workforce to meet the demands required to understand and develop AI, data science and machine learning technologies. The program brings together faculty and students in engineering, computer science, and social science to build a traineeship program where cohort teams study fundamental and applied data science research, using compact modules across courses to personalize instruction and prepare each trainee with skills tailored to their prior experience and future career goals.
more » « less
Free, publicly-accessible full text available July 7, 2026
On logistic regression and maximum entropy approaches

Santhanam, Narayana; Zhang, Yixin (June 2025, IEEE Symposium on Information Theory)

Logistic Regression is a widely used generalized linear model applied in classification settings to assign probabilities to class labels. It is also well known that logistic regression is a maximum entropy procedure subject to what are sometimes called the balance conditions. The dominant view in existing explanations are all discriminative, i.e., modeling labels given the data. This paper adds to the maximum entropy interpretation, establishing a generative, maximum entropy explanation for the commonly used logistic regression training and optimization procedures. We show that logistic regression models the conditional distribution on the instance space given class labels with a maximum entropy model subject to a first moment constraint on the training data, and that the commonly used fitting procedure would be a Monte-Carlo fit for the generative view.
more » « less
Free, publicly-accessible full text available June 27, 2026
Enhancing Contrastive Representation Learning through Data

Zhang, Honggen (May 2025, ProQuest Dissertation)

Contrastive learning learns input representation by pushing similar data together and pulling dissimilar data away, along with data augmentation and pretext task construction. It enhances the large model learning due to its ability to use a large amount of unlabeled data. It has been suc- cessfully applied to large language models, pre-trained image models, and multimodal models. In addition, contrastive learning learns a representation from modeling the explainable structure of the latent space, which has a broad application in scientific discovery and interpretable Artificial Intelligence (AI). The primary focus of this thesis is to explore contrastive learning from a data construction perspective in real-world problems to fill the gap between the principle of contrastive learning and its application. The challenges, such as sampling bias and data quality, will largely affect the representations learned by contrastive learning. This thesis analyzes the data construction chanlledges and limitations in 1) the negative sampling of knowledge graph embedding (KGE), 2) high-quliaty preference data labeling of Large Language Models (LLMs) alignment, 3) data augmentation in Non-linear dynamic system modeling, and 4) data properties in functions of mesange RNA (mRNA) sequence. To solve the challenges 1), a hardness and structure-based objective function was proposed by considering sampling bias in hard negative sampling. For challenge 2), the similarity of response embedding is used to evaluate the quality of preference pairs to mitigate the labeling error of humans when they face an ambiguous response pair. Chal- lenge 3) is solved by systematically considering the physical system and contrastive learning. A data augmentation strategy by partitioning the full sequence is used for learning the transition matrix in the latent linear space. Challenge 4) is common to see in the biological domain due to the high cost of lab experiments. Pre-trained model will advantage the limited dataset su- pervised learning by learning general features from domain knowledge. A contrastive learning based teacher-student framework is proposed for mRNA sequence learning by contrasting the unmasked sequence and the hard-masked sequence. By providing careful data construction or data sampling, contrastive learning will be boosted to solve tasks in reality. For the KGE, the novel contrastive loss function learns the boundary between negative samples and positive samples to improve the link prediction task in the knowl- edge graph; For the LLM alignment, in the same labeling cost, the selected dissimilar responses will improve the vanilla direct preference optimization (DPO) alignment; The data augmentation with contrastive loss play crucial role to learn more accuracy dynamic system, which explained by the learned the continiues eigenfunction; By considering the tearch-student framework with hard-masked strategy, the pre-trained model achieve the state-of-the-art result by fine-tuning on limited downstrame task data. Overall, this thesis provides a broad data-driven contrastive learning methodology to enhance representation learning in different domains. The methodology consists of a imprived objective function in the face of data bias, a better data selection reducing labeling error, and proper data augmentation for a particular application domain. This methodology improve the learning result compare to traditional method.
more » « less
Free, publicly-accessible full text available May 2, 2026
Unsupervised AoA Estimation Based on Dual-Path Knowledge-Aware Auto-Encoders

https://doi.org/10.1109/TWC.2025.3540470

Guo, Zhiheng; Xiao, Yuanzhang; Wang, Xijun; Chen, Xiang (May 2025, IEEE Transactions on Wireless Communications)

Free, publicly-accessible full text available May 1, 2026
Adapting to health crisis at Japanese hot springs: Understanding visitor travel intention and perceptions of countermeasures

https://doi.org/10.1016/j.ijdrr.2025.105399

Nguyen, David N; Tarui, Nori; Sasaki, Daisuke; Tsukuda, Haruka; Suppasri, Anawat; Imamura, Fumihiko (April 2025, International Journal of Disaster Risk Reduction)

Japan’s hot spring tourism, vital for rural economies, faced major setbacks during the COVID-19 pandemic. While research on travel intentions during health crises exists, there is limited exploration of public perceptions of health risk countermeasures in hot spring tourism. This study aims to fill this gap by examining the countermeasures implemented by hot spring operators in Japan and their perceived effectiveness by the public. A case study in disaster-affected areas reveals the challenges operators faced and how countermeasures influenced travel intentions, with demographic factors playing a key role in perceptions of effectiveness. This study makes several contributions: it is the first to explore public perceptions of health countermeasures in hot spring tourism, advancing the field of adaptive tourism by highlighting the importance of health protocols in rebuilding tourism industries after a crisis. Findings suggest that sanitation measures were viewed as the most effective, and operators can better allocate resources by focusing on these areas. Moreover, clear communication about countermeasures is crucial for boosting visitor confidence and facilitating recovery. Despite its focus on Japan and reliance on self-reported data, this research provides valuable insights for hot spring managers worldwide. The study’s findings offer practical guidance on prioritizing countermeasures.
more » « less
Free, publicly-accessible full text available April 1, 2026
A Bound for Learning Lossless Source Coding with Online Learning

https://doi.org/10.23919/ISITA60732.2024.10858204

Høst-Madsen, Anders; Amirani, Mohammad Zaeri; Santhanam, Narayana Prasad (November 2024, IEEE)

Free, publicly-accessible full text available November 10, 2025
Adaptive Top-K in SGD for Communication-Efficient Distributed Learning in Multi-Robot Collaboration

https://doi.org/10.1109/JSTSP.2024.3381373

Ruan, Mengzhe; Yan, Guangfeng; Xiao, Yuanzhang; Song, Linqi; Xu, Weitao (April 2024, IEEE Journal of Selected Topics in Signal Processing)

Full Text Available

« Prev Next »

Search for: All records