Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Existing efforts on quantifying privacy implications for large language models (LLMs) solely focus on measuring leakage of training data. In this work, we shed light on the often-overlooked interactive settings where an LLM receives information from multiple sources and generates an output to be shared with other entities, creating the potential of exposing sensitive input data in inappropriate contexts. In these scenarios, humans nat- urally uphold privacy by choosing whether or not to disclose information depending on the context. We ask the question “Can LLMs demonstrate an equivalent discernment and reasoning capability when considering privacy in context?” We propose CONFAIDE, a benchmark grounded in the theory of contextual integrity and designed to identify critical weaknesses in the privacy reasoning capabilities of instruction-tuned LLMs. CONFAIDE consists of four tiers, gradually increasing in complexity, with the final tier evaluating contextual privacy reasoning and theory of mind capabilities. Our experiments show that even commercial models such as GPT-4 and ChatGPT reveal private information in contexts that humans would not, 39% and 57% of the time, respectively, highlighting the urgent need for a new direction of privacy-preserving approaches as we demonstrate a larger underlying problem stemmed in the models’ lack of reasoning capabilities.more » « lessFree, publicly-accessible full text available May 15, 2025
-
Free, publicly-accessible full text available January 16, 2025
-
Biased associations have been a challenge in the development of classifiers for detecting toxic language, hindering both fairness and accuracy. As potential solutions, we investigate recently introduced debiasing methods for text classification datasets and models, as applied to toxic language detection. Our focus is on lexical (e.g., swear words, slurs, identity mentions) and dialectal markers (specifically African American English). Our comprehensive experiments establish that existing methods are limited in their ability to prevent biased behavior in current toxicity detectors. We then propose an automatic, dialect-aware data correction method, as a proof-of-concept. Despite the use of synthetic labels, this method reduces dialectal associations with toxicity. Overall, our findings show that debiasing a model trained on biased toxic language data is not as effective as simply relabeling the data to remove existing biases.more » « less
-
Abstract Temperature sensitivity of soil organic carbon (
SOC ) decomposition is one of the major uncertainties in predicting climate‐carbon (C) cycle feedback. Results from previous studies are highly contradictory with old soil C decomposition being more, similarly, or less sensitive to temperature than decomposition of young fractions. The contradictory results are partly from difficulties in distinguishing old from youngSOC and their changes over time in the experiments with or without isotopic techniques. In this study, we have conducted a long‐term field incubation experiment with deep soil collars (0–70 cm in depth, 10 cm in diameter ofPVC tubes) for excluding root C input to examine apparent temperature sensitivity ofSOC decomposition under ambient and warming treatments from 2002 to 2008. The data from the experiment were infused into a multi‐pool soil C model to estimate intrinsic temperature sensitivity ofSOC decomposition and C residence times of threeSOC fractions (i.e., active, slow, and passive) using a data assimilation (DA ) technique. As activeSOC with the short C residence time was progressively depleted in the deep soil collars under both ambient and warming treatments, the residences times of the wholeSOC became longer over time. Concomitantly, the estimated apparent and intrinsic temperature sensitivity ofSOC decomposition also became gradually higher over time as more than 50% of activeSOC was depleted. Thus, the temperature sensitivity of soil C decomposition in deep soil collars was positively correlated with the mean C residence times. However, the regression slope of the temperature sensitivity against the residence time was lower under the warming treatment than under ambient temperature, indicating that other processes also regulated temperature sensitivity ofSOC decomposition. These results indicate that oldSOC decomposition is more sensitive to temperature than young components, making the old C more vulnerable to future warmer climate.