skip to main content

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Thursday, January 16 until 2:00 AM ET on Friday, January 17 due to maintenance. We apologize for the inconvenience.


Search for: All records

Creators/Authors contains: "Zhou, Xuhui"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Existing efforts on quantifying privacy implications for large language models (LLMs) solely focus on measuring leakage of training data. In this work, we shed light on the often-overlooked interactive settings where an LLM receives information from multiple sources and generates an output to be shared with other entities, creating the potential of exposing sensitive input data in inappropriate contexts. In these scenarios, humans nat- urally uphold privacy by choosing whether or not to disclose information depending on the context. We ask the question “Can LLMs demonstrate an equivalent discernment and reasoning capability when considering privacy in context?” We propose CONFAIDE, a benchmark grounded in the theory of contextual integrity and designed to identify critical weaknesses in the privacy reasoning capabilities of instruction-tuned LLMs. CONFAIDE consists of four tiers, gradually increasing in complexity, with the final tier evaluating contextual privacy reasoning and theory of mind capabilities. Our experiments show that even commercial models such as GPT-4 and ChatGPT reveal private information in contexts that humans would not, 39% and 57% of the time, respectively, highlighting the urgent need for a new direction of privacy-preserving approaches as we demonstrate a larger underlying problem stemmed in the models’ lack of reasoning capabilities. 
    more » « less
    Free, publicly-accessible full text available May 15, 2025
  2. Free, publicly-accessible full text available January 16, 2025
  3. Biased associations have been a challenge in the development of classifiers for detecting toxic language, hindering both fairness and accuracy. As potential solutions, we investigate recently introduced debiasing methods for text classification datasets and models, as applied to toxic language detection. Our focus is on lexical (e.g., swear words, slurs, identity mentions) and dialectal markers (specifically African American English). Our comprehensive experiments establish that existing methods are limited in their ability to prevent biased behavior in current toxicity detectors. We then propose an automatic, dialect-aware data correction method, as a proof-of-concept. Despite the use of synthetic labels, this method reduces dialectal associations with toxicity. Overall, our findings show that debiasing a model trained on biased toxic language data is not as effective as simply relabeling the data to remove existing biases. 
    more » « less
  4. null (Ed.)
  5. Abstract

    Temperature sensitivity of soil organic carbon (SOC) decomposition is one of the major uncertainties in predicting climate‐carbon (C) cycle feedback. Results from previous studies are highly contradictory with old soil C decomposition being more, similarly, or less sensitive to temperature than decomposition of young fractions. The contradictory results are partly from difficulties in distinguishing old from youngSOCand their changes over time in the experiments with or without isotopic techniques. In this study, we have conducted a long‐term field incubation experiment with deep soil collars (0–70 cm in depth, 10 cm in diameter ofPVCtubes) for excluding root C input to examine apparent temperature sensitivity ofSOCdecomposition under ambient and warming treatments from 2002 to 2008. The data from the experiment were infused into a multi‐pool soil C model to estimate intrinsic temperature sensitivity ofSOCdecomposition and C residence times of threeSOCfractions (i.e., active, slow, and passive) using a data assimilation (DA) technique. As activeSOCwith the short C residence time was progressively depleted in the deep soil collars under both ambient and warming treatments, the residences times of the wholeSOCbecame longer over time. Concomitantly, the estimated apparent and intrinsic temperature sensitivity ofSOCdecomposition also became gradually higher over time as more than 50% of activeSOCwas depleted. Thus, the temperature sensitivity of soil C decomposition in deep soil collars was positively correlated with the mean C residence times. However, the regression slope of the temperature sensitivity against the residence time was lower under the warming treatment than under ambient temperature, indicating that other processes also regulated temperature sensitivity ofSOCdecomposition. These results indicate that oldSOCdecomposition is more sensitive to temperature than young components, making the old C more vulnerable to future warmer climate.

     
    more » « less