NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning from Natural Language Feedback

Chen, A; Scheurer, J; Campos, JA; Korbak, T; Chan, JS; Bowman, SR; Cho, K; Perez, E (February 2024, Transactions on machine learning research)

The potential for pre-trained large language models (LLMs) to use natural language feedback at inference time has been an exciting recent development. We build upon this observation by formalizing an algorithm for learning from natural language feedback at training time instead, which we call Imitation learning from Language Feedback (ILF). ILF requires only a small amount of human-written feedback during training and does not require the same feedback at test time, making it both user-friendly and sample-efficient. We further show that ILF can be seen as a form of minimizing the KL divergence to the target distribution and demonstrate proof-of-concepts on text summarization and program synthesis tasks. For code generation, ILF improves a Codegen-Mono 6.1B model’s pass@1 rate from 22% to 36% on the MBPP benchmark, outperforming both fine-tuning on MBPP and on human- written repaired programs. For summarization, we show that ILF can be combined with learning from human preferences to improve a GPT-3 model’s summarization performance to be comparable to human quality, outperforming fine-tuning on human-written summaries. Overall, our results suggest that ILF is both more effective and sample-efficient than training exclusively on demonstrations for improving an LLM’s performance on a variety of tasks.
more » « less
Full Text Available
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs

Chen, A; Phang, J; Parrish, A; Padmakumar, V; Zhao, C; Bowman, SR; Cho, K (January 2024, Transactions on machine learning research)

Large language models (LLMs) have achieved widespread success on a variety of in-context few shot tasks, but this success is typically evaluated via correctness rather than consistency. We argue that self-consistency is an important criteria for valid multi-step reasoning in tasks where the solution is composed of the answers to multiple sub-steps. We propose two types of self consistency that are particularly important for multi-step reasoning – hypothetical consistency (a model’s ability to predict what its output would be in a hypothetical other context) and compositional consistency (consistency of a model’s final outputs when intermediate sub-steps are replaced with the model’s outputs for those steps). We demonstrate that multiple variants of the GPT-3/-4 models exhibit poor consistency rates across both types of consistency on a variety of tasks.
more » « less
Full Text Available
Protein Design with Guided Discrete Diffusion

Gruver, N; Stanton, S; Frey, N; Rudner, T; Hotzel, I; Lafrance-Vanasse, J; Rajpal, A; Cho, K; Wilson, AG (December 2023, Advances in Neural Information Processing Systems)

Full Text Available
Protein Design with Guided Discrete Diffusion

Gruver, N; Stanton, S; Frey, Nathan C; Rudner, Tim G; Hotzel, I; Lafrance-Vanasse, J; Rajpal, A; Cho, K; Wilson, Andrew G (December 2023, Advances in Neural Information Processing Systems)

A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to develop guided diffusion models for structure with inverse folding to recover sequences. In this work, we propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models that follows gradients in the hidden states of the denoising network. NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods, including scarce data and challenging inverse design. Moreover, we use NOS to generalize LaMBO, a Bayesian optimization procedure for sequence design that facilitates multiple objectives and edit-based constraints. The resulting method, LaMBO-2, enables discrete diffusions and stronger performance with limited edits through a novel application of saliency maps. We apply LaMBO-2 to a real-world protein design task, optimizing antibodies for higher expression yield and binding affinity to several therapeutic targets under locality and developability constraints, attaining a 99% expression rate and 40% binding rate in exploratory in vitro experiments.
more » « less
Full Text Available
Seasonal variability of ocean circulation near the Dotson Ice Shelf, Antarctica

https://doi.org/10.1038/s41467-022-28751-5

Yang, H. W.; Kim, T.-W.; Dutrieux, Pierre; Wåhlin, A. K.; Jenkins, Adrian; Ha, H. K.; Kim, C. S.; Cho, K.-H.; Park, T.; Lee, S. H.; et al (December 2022, Nature Communications)

Abstract Recent rapid thinning of West Antarctic ice shelves are believed to be caused by intrusions of warm deep water that induce basal melting and seaward meltwater export. This study uses data from three bottom-mounted mooring arrays to show seasonal variability and local forcing for the currents moving into and out of the Dotson ice shelf cavity. A southward flow of warm, salty water had maximum current velocities along the eastern channel slope, while northward outflows of freshened ice shelf meltwater spread at intermediate depth above the western slope. The inflow correlated with the local ocean surface stress curl. At the western slope, meltwater outflows followed the warm influx along the eastern slope with a ~2–3 month delay. Ocean circulation near Dotson Ice Shelf, affected by sea ice distribution and wind, appears to significantly control the inflow of warm water and subsequent ice shelf melting on seasonal time-scales.
more » « less
Full Text Available
Production cross sections of light and charmed mesons in $e^{+} e^{-}$ annihilation near 10.58 GeV

https://doi.org/10.1103/PhysRevD.111.052003

Seidl, R; Adachi, I; Aihara, H; Aushev, T; Ayad, R; Banerjee, Sw; Belous, K; Bennett, J; Bessner, M; Bhuyan, B; et al (March 2025, Physical Review D)

We report measurements of production cross sections for $ρ^{+}$ , $ρ^{0}$ , $ω$ , $K^{* +}$ , $K^{* 0}$ , $ϕ$ , $η$ , $K_{S}^{0}$ , $f_{0} (980)$ , $D^{+}$ , $D^{0}$ , $D_{s}^{+}$ , $D^{* +}$ , $D^{* 0}$ , and $D_{s}^{* +}$ in $e^{+} e^{-}$ collisions at a center-of-mass energy near 10.58 GeV. The data were recorded by the Belle experiment, consisting of $571 {fb}^{- 1}$ at 10.58 GeV and $74 {fb}^{- 1}$ at 10.52 GeV. Production cross sections are extracted as a function of the fractional hadron momentum $x_{p}$ . The measurements are compared to Monte Carlo generator predictions with various fragmentation settings, including those that have increased fragmentation into vector mesons over pseudoscalar mesons. The cross sections measured for light hadrons are consistent with no additional increase of vector over pseudoscalar mesons. The charmed-meson cross sections are compared to earlier measurements—when available—including older Belle results, which they supersede. They are in agreement before application of an improved initial-state radiation correction procedure that causes slight changes in their $x_{p}$ shapes. Published by the American Physical Society2025
more » « less
Free, publicly-accessible full text available March 1, 2026
Suppression and reactivation of transformation and twinning induced plasticity in laser powder bed fusion additively manufactured Ti-10V-2Fe-3Al

https://doi.org/10.1016/j.addma.2021.102406

Mantri, S.A.; Nartu, M.S.K.K.Y.; Dasari, S.; Sharma, A.; Agrawal, P.; Salloom, R.; Sun, F.; Ivanov, E.; Cho, K.; McWilliams, B.; et al (December 2021, Additive Manufacturing)

Full Text Available
A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks

Marschall, O.; Cho, K.; Savin, C. (January 2020, Journal of machine learning research)
null (Ed.)
We present a framework for compactly summarizing many recent results in efficient and/or biologically plausible online training of recurrent neural networks (RNN). The framework organizes algorithms according to several criteria: (a) past vs. future facing, (b) tensor structure, (c) stochastic vs. deterministic, and (d) closed form vs. numerical. These axes reveal latent conceptual connections among several recent advances in online learning. Furthermore, we provide novel mathematical intuitions for their degree of success. Testing various algorithms on two synthetic tasks shows that performances cluster according to our criteria. Although a similar clustering is also observed for gradient alignment, alignment with exact methods does not alone explain ultimate performance, especially for stochastic algorithms. This suggests the need for better comparison metrics.
more » « less
Full Text Available
Search for $h_{b} (2 P) \to γ χ_{b J} (1 P)$ at $\sqrt{s} = 10.860 GeV$

https://doi.org/10.1103/PhysRevD.111.L011102

Boschetti, A; Mussa, R; Tamponi, U; Adachi, I; Aihara, H; Asner, D M; Aushev, T; Ayad, R; Banerjee, Sw; Belous, K; et al (January 2025, Physical Review D)

In the bottomonium sector, the hindered magnetic dipole transitions between P-wave states $h_{b} (2 P) \to χ_{b J} (1 P) γ$ , $J = 0$ , 1, 2, are expected to be severely suppressed according to the relativized quark model, due to the spin flip of the $b$ quark. Nevertheless, a recent model following the coupled-channel approach predicts the corresponding branching fractions to be enhanced by orders of magnitude. In this Letter, we report the first search for such transitions. We find no significant signals and set upper limits at 90% confidence level on the corresponding branching fractions: $B [h_{b} (2 P) \to γ χ_{b 0} (1 P)] < 2.7 \times 10^{- 1}$ , $B [h_{b} (2 P) \to γ χ_{b 1} (1 P)] < 5.4 \times 10^{- 3}$ and $B [h_{b} (2 P) \to γ χ_{b 2} (1 P)] < 1.3 \times 10^{- 2}$ . These values help to constrain the parameters of the coupled-channel models. The results are obtained using a $121.4 {fb}^{- 1}$ data sample taken around $\sqrt{s} = 10.860 GeV$ with the Belle detector at the KEKB asymmetric-energy $e^{+} e^{-}$ collider. Published by the American Physical Society2025
more » « less
Free, publicly-accessible full text available January 1, 2026
Evidence of $h_{b} (2 P) \to ϒ (1 S) η$ Decay and Search for $h_{b} (1 P, 2 P) \to ϒ (1 S) π^{0}$ with the Belle Detector

https://doi.org/10.1103/PhysRevLett.133.261901

Kovalenko, E; Adachi, I; Aihara, H; Asner, D M; Aushev, T; Ayad, R; Babu, V; Banerjee, Sw; Belous, K; Bennett, J; et al (December 2024, Physical Review Letters)

We report the first evidence for the $h_{b} (2 P) \to ϒ (1 S) η$ transition with a significance of 3.5 standard deviations. The decay branching fraction is measured to be $B [h_{b} (2 P) \to ϒ (1 S) η] = ({7.1}_{- 3.2}^{+ 3.7} \pm 0.8) \times 10^{- 3}$ , which is noticeably smaller than expected. We also set upper limits on $π^{0}$ transitions of $B [h_{b} (2 P) \to ϒ (1 S) π^{0}] < 1.8 \times 10^{- 3}$ , and $B [h_{b} (1 P) \to ϒ (1 S) π^{0}] < 1.8 \times 10^{- 3}$ , at the 90% confidence level. These results are obtained with a $131.4 {fb}^{- 1}$ data sample collected near the $ϒ (5 S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^{+} e^{-}$ collider. Published by the American Physical Society2024
more » « less
Free, publicly-accessible full text available December 1, 2025

« Prev Next »

Search for: All records