Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Code search is vital in the maintenance and extension of software systems. Past works have used separate language models for the natural language and programming language artifacts on models with multiple encoders and different loss functions. Similarly, this work approaches code search for Python as a translation retrieval problem while the natural language queries and the programming language are treated as two types of languages. By using dual encoders, these two types of language sequences are projected onto a shared embedding space, in which the distance reflects the similarity between a given pair of query and code. However, in contrast to previous work, this approach uses a unified language model, and a dual encoder structure with a cosine similarity loss function. A unified language model helps the model take advantage of the considerable overlap of words between the artifacts, making the learning much easier. On the other hand, the dual encoders trained with cosine similarity loss helps the model learn the underlining patterns of which terms are important for predicting linked pairs of artifacts. Evaluation shows the proposed model achieves performance better than state-of-the-art code search models. In addition, this model is much less expensive in terms of time and complexity, offering a cheaper, faster, and better alternative.more » « lessFree, publicly-accessible full text available January 1, 2026
-
In a power system, when the participation factors of generators are computed to rank their participations into an oscillatory mode, a model-based approach is conventionally used on the linearized system model by means of the corresponding right and left eigenvectors. This paper proposes a new approach for estimating participation factors directly from measurement data on generator responses under selected disturbances. The approach computes extended participation factors that coincide with accurate model-based participation factors when the measured responses satisfy an ideally symmetric condition. This paper relaxes this symmetric condition with the original measurement space by identifying and utilizing a coordinate transformation to a new space optimally recovering the symmetry. Thus, the optimal estimates of participation factors solely from measurements are achieved, and the accuracy and influencing factors are discussed. The proposed approach is first demonstrated in detail on a two-area system and then tested on an NPCC 48-machine power system. The penetration of inverter-based resources is also considered.more » « lessFree, publicly-accessible full text available January 1, 2026
-
This research seeks to benefit the software engineering society by providing a simple yet effective pre-processing approach to achieve equalized odds fairness in machine learning software. Fairness issues have attracted increasing attention since machine learning software is increasingly used for high-stakes and high-risk decisions. It is the responsibility of all software developers to make their software accountable by ensuring that the machine learning software do not perform differently on different sensitive demographic groups—satisfying equalized odds. Different from prior works which either optimize for an equalized odds related metric during the learning process like a black-box, or manipulate the training data following some intuition; this work studies the root cause of the violation of equalized odds and how to tackle it. We found that equalizing the class distribution in each demographic group with sample weights is a necessary condition for achieving equalized odds without modifying the normal training process. In addition, an important partial condition for equalized odds (zero average odds difference) can be guaranteed when the class distributions are weighted to be not only equal but also balanced (1:1). Based on these analyses, we proposed FairBalance, a pre-processing algorithm which balances the class distribution in each demographic group by assigning calculated weights to the training data. On eight real-world datasets, our empirical results show that, at low computational overhead, the proposed pre-processing algorithm FairBalance can significantly improve equalized odds without much, if any damage to the utility. FairBalance also outperforms existing state-of-the-art approaches in terms of equalized odds. To facilitate reuse, reproduction, and validation, we made our scripts available at https://github.com/hil-se/FairBalance.more » « lessFree, publicly-accessible full text available September 1, 2025
-
Abstract We performed a rigorous reverberation-mapping analysis of the broad-line region (BLR) in a highly accreting (L/LEdd= 0.74–3.4) active galactic nucleus, Markarian 142 (Mrk 142), for the first time using concurrent observations of the inner accretion disk and the BLR to determine a time lag for the Hβλ4861 emission relative to the ultraviolet (UV) continuum variations. We used continuum data taken with the Niel Gehrels Swift Observatory in theUVW2 band, and the Las Cumbres Observatory, Dan Zowada Memorial Observatory, and Liverpool Telescope in thegband, as part of the broader Mrk 142 multiwavelength monitoring campaign in 2019. We obtained new spectroscopic observations covering the Hβbroad emission line in the optical from the Gemini North Telescope and the Lijiang 2.4 m Telescope for a total of 102 epochs (over a period of 8 months) contemporaneous to the continuum data. Our primary result states a UV-to-Hβtime lag of days in Mrk 142 obtained from light-curve analysis with a Python-based running optimal average algorithm. We placed our new measurements for Mrk 142 on the optical and UV radius–luminosity relations for NGC 5548 to understand the nature of the continuum driver. The positions of Mrk 142 on the scaling relations suggest that UV is closer to the “true” driving continuum than the optical. Furthermore, we obtain = 6.32 ± 0.29 assuming UV as the primary driving continuum.more » « less
-
Ensuring the stability of power systems is gaining more attention today than ever before due to the rapid growth of uncertainties in load and increased renewable energy penetration. Lately, wide-area measurement system (WAMS)-based centralized controlling techniques are offering flexibility and more robust control to keep the system stable. WAMS-based controlling techniques, however, face pressing challenges of irregular delays in long-distance communication channels and subsequent responses of equipment to control actions. This paper presents an innovative control strategy for damping down low-frequency oscillations in transmission systems. The method uses a reinforcement learning technique to overcome the challenges of communication delays and other non-linearity in wide-area damping control. It models the traditional problem of oscillation damping control as a novel faster exploration-based deep deterministic policy gradient (DDPG-S). An effective reward function is designed to capture necessary features of oscillations enabling timely damping of such oscillations, even under various kinds of uncertainties. A detailed analysis and a systematically designed numerical validation are presented to prove feasibility, scalability, interpretability, and comparative performance of the modelled low-frequency oscillation damping controller. The benefit of the technique is that stability is ensured even when uncertainties of load and generation are on the rise.more » « less