Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
The potential for pre-trained large language models (LLMs) to use natural language feedback at inference time has been an exciting recent development. We build upon this observation by formalizing an algorithm for learning from natural language feedback at training time instead, which we call Imitation learning from Language Feedback (ILF). ILF requires only a small amount of human-written feedback during training and does not require the same feedback at test time, making it both user-friendly and sample-efficient. We further show that ILF can be seen as a form of minimizing the KL divergence to the target distribution and demonstrate proof-of-concepts on text summarization and program synthesis tasks. For code generation, ILF improves a Codegen-Mono 6.1B model’s pass@1 rate from 22% to 36% on the MBPP benchmark, outperforming both fine-tuning on MBPP and on human- written repaired programs. For summarization, we show that ILF can be combined with learning from human preferences to improve a GPT-3 model’s summarization performance to be comparable to human quality, outperforming fine-tuning on human-written summaries. Overall, our results suggest that ILF is both more effective and sample-efficient than training exclusively on demonstrations for improving an LLM’s performance on a variety of tasks.more » « less
-
Large language models (LLMs) have achieved widespread success on a variety of in-context few shot tasks, but this success is typically evaluated via correctness rather than consistency. We argue that self-consistency is an important criteria for valid multi-step reasoning in tasks where the solution is composed of the answers to multiple sub-steps. We propose two types of self consistency that are particularly important for multi-step reasoning – hypothetical consistency (a model’s ability to predict what its output would be in a hypothetical other context) and compositional consistency (consistency of a model’s final outputs when intermediate sub-steps are replaced with the model’s outputs for those steps). We demonstrate that multiple variants of the GPT-3/-4 models exhibit poor consistency rates across both types of consistency on a variety of tasks.more » « less
-
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to develop guided diffusion models for structure with inverse folding to recover sequences. In this work, we propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models that follows gradients in the hidden states of the denoising network. NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods, including scarce data and challenging inverse design. Moreover, we use NOS to generalize LaMBO, a Bayesian optimization procedure for sequence design that facilitates multiple objectives and edit-based constraints. The resulting method, LaMBO-2, enables discrete diffusions and stronger performance with limited edits through a novel application of saliency maps. We apply LaMBO-2 to a real-world protein design task, optimizing antibodies for higher expression yield and binding affinity to several therapeutic targets under locality and developability constraints, attaining a 99% expression rate and 40% binding rate in exploratory in vitro experiments.more » « less
-
Abstract Recent rapid thinning of West Antarctic ice shelves are believed to be caused by intrusions of warm deep water that induce basal melting and seaward meltwater export. This study uses data from three bottom-mounted mooring arrays to show seasonal variability and local forcing for the currents moving into and out of the Dotson ice shelf cavity. A southward flow of warm, salty water had maximum current velocities along the eastern channel slope, while northward outflows of freshened ice shelf meltwater spread at intermediate depth above the western slope. The inflow correlated with the local ocean surface stress curl. At the western slope, meltwater outflows followed the warm influx along the eastern slope with a ~2–3 month delay. Ocean circulation near Dotson Ice Shelf, affected by sea ice distribution and wind, appears to significantly control the inflow of warm water and subsequent ice shelf melting on seasonal time-scales.more » « less
-
null (Ed.)We present a framework for compactly summarizing many recent results in efficient and/or biologically plausible online training of recurrent neural networks (RNN). The framework organizes algorithms according to several criteria: (a) past vs. future facing, (b) tensor structure, (c) stochastic vs. deterministic, and (d) closed form vs. numerical. These axes reveal latent conceptual connections among several recent advances in online learning. Furthermore, we provide novel mathematical intuitions for their degree of success. Testing various algorithms on two synthetic tasks shows that performances cluster according to our criteria. Although a similar clustering is also observed for gradient alignment, alignment with exact methods does not alone explain ultimate performance, especially for stochastic algorithms. This suggests the need for better comparison metrics.more » « less
-
We present the measurement of -argon inelastic cross sections using the ProtoDUNE single-phase liquid argon time projection chamber in the incident kinetic energy range of 500–800 MeV in multiple exclusive channels (absorption, charge exchange, and the remaining inelastic interactions). The results of this analysis are important inputs to simulations of liquid argon neutrino experiments such as the Deep Underground Neutrino Experiment and the Short Baseline Neutrino program at Fermi National Accelerator Laboratory. They will be employed to improve the modeling of final state interactions within neutrino event generators used by these experiments, as well as the modeling of -argon secondary interactions within the liquid argon. This is the first measurement of -argon absorption at this kinetic energy range as well as the first ever measurement of -argon charge exchange.more » « less
-
We report measurements of production cross sections for , , , , , , , , , , , , , , and in collisions at a center-of-mass energy near 10.58 GeV. The data were recorded by the Belle experiment, consisting of at 10.58 GeV and at 10.52 GeV. Production cross sections are extracted as a function of the fractional hadron momentum . The measurements are compared to Monte Carlo generator predictions with various fragmentation settings, including those that have increased fragmentation into vector mesons over pseudoscalar mesons. The cross sections measured for light hadrons are consistent with no additional increase of vector over pseudoscalar mesons. The charmed-meson cross sections are compared to earlier measurements—when available—including older Belle results, which they supersede. They are in agreement before application of an improved initial-state radiation correction procedure that causes slight changes in their shapes. Published by the American Physical Society2025more » « less
-
The Deep Underground Neutrino Experiment (DUNE) is a next-generation neutrino experiment with a rich physics program that includes searches for the hypothetical phenomenon of proton decay. Utilizing liquid-argon time-projection chamber technology, DUNE is expected to achieve world-leading sensitivity in the proton decay channels that involve charged kaons in their final states. The first DUNE demonstrator, ProtoDUNE Single-Phase, was a 0.77 kt detector that operated from 2018 to 2020 at the CERN Neutrino Platform, exposed to a mixed hadron and electron test-beam with momenta ranging from 0.3 to . We present a selection of low-energy kaons among the secondary particles produced in hadronic reactions, using data from the 6 and beam runs. The selection efficiency is 1% and the sample purity 92%. The initial energies of the selected kaon candidates encompass the expected energy range of kaons originating from proton decay events in DUNE (below ). In addition, we demonstrate the capability of this detector technology to discriminate between kaons and other particles such as protons and muons, and provide a comprehensive description of their energy loss in liquid argon, which shows good agreement with the simulation. These results pave the way for future proton decay searches at DUNE.more » « less
An official website of the United States government

Full Text Available