skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A Silicon Valley Love Triangle: Hiring Algorithms, Pseudo-Science, and the Quest for Auditability
In this paper, we suggest a systematic approach for developing socio-technical assessment for hiring ADS. We suggest using a matrix to expose underlying assumptions rooted in pseudoscientific essentialized understandings of human nature and capability, and to critically investigate emerging auditing standards and practices that fail to address these assumptions.  more » « less
Award ID(s):
1704369
PAR ID:
10283950
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
CHI ’21: ACM CHI Virtual Conference on Human Factors in Computing Systems
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. We survey a collection of closely related methods for generalizing fans of toric varieties, include skeletons, Kato fans, Artin fans, and polyhedral cone complexes, all of which apply in the wider context of logarithmic geometry. Under appropriate assumptions these structures are equivalent, but their different realizations have provided for surprisingly disparate uses. We highlight several current applications and suggest some future possibilities. 
    more » « less
  2. Adversarial training has emerged as a popular approach for training models that are robust to inference-time adversarial attacks. However, our theoretical understanding of why and when it works remains limited. Prior work has offered generalization analysis of adversarial training, but they are either restricted to the Neural Tangent Kernel (NTK) regime or they make restrictive assumptions about data such as (noisy) linear separability or robust realizability. In this work, we study the stability and generalization of adversarial training for two-layer networks without any data distribution assumptions and beyond the NTK regime. Our findings suggest that for networks with any given initialization and sufficiently large width, the generalization bound can be effectively controlled via early stopping. We further improve the generalization bound by leveraging smoothing using Moreau’s envelope. 
    more » « less
  3. We study the prediction problem in the context of the high-dimensional linear regression model. We focus on the practically relevant framework where a fraction of the linear measurements is corrupted while the columns of the design matrix can be moderately correlated. Our findings suggest that for most sparse signals, the Lasso estimator admits strong performance guarantees under more easily verifiable and less stringent assumptions on the design matrix compared to much of the existing literature. 
    more » « less
  4. null (Ed.)
    Soils store more carbon than the biosphere and atmosphere combined, and the efficiency to which soil microorganisms allocate carbon to growth rather than respiration is increasingly considered a proxy for the soil capacity to store carbon. This carbon use efficiency (CUE) is measured via different methods, and more recently, the 18O-H2O method has been embraced as a significant improvement for measuring CUE of soil microbial communities. Based on extrapolating 18O incorporation into DNA to new biomass, this measurement makes various implicit assumptions about the microbial community at hand. Here we conducted a literature review to evaluate how viable these assumptions are and then developed a mathematical model to test how violating them affects estimates of the growth component of CUE in soil. We applied this model to previously collected data from two kinds of soil microbial communities. By changing one parameter at a time, we confirmed our previous observation that CUE was reduced by fungal removal. Our results also show that depending on the microbial community composition, there can be substantial discrepancies between estimated and true microbial growth. Of the numerous implicit assumptions that might be violated, not accounting for the contribution of sources of oxygen other than extracellular water to DNA leads to a consistent underestimation of CUE. We present a framework that allows researchers to evaluate how their experimental conditions may influence their 18O-H2O-based CUE measurements and suggest the parameters that need further constraining to more accurately quantify growth and CUE. 
    more » « less
  5. Villata, S. (Ed.)
    The European Union’s General Data Protection Regulation (GDPR) has compelled businesses and other organizations to update their privacy policies to state specific information about their data practices. Simultaneously, researchers in natural language processing (NLP) have developed corpora and annotation schemes for extracting salient information from privacy policies, often independently of specific laws. To connect existing NLP research on privacy policies with the GDPR, we introduce a mapping from GDPR provisions to the OPP-115 annotation scheme, which serves as the basis for a growing number of projects to automatically classify privacy policy text. We show that assumptions made in the annotation scheme about the essential topics for a privacy policy reflect many of the same topics that the GDPR requires in these documents. This suggests that OPP-115 continues to be representative of the anatomy of a legally compliant privacy policy, and that the legal assumptions behind it represent the elements of data processing that ought to be disclosed within a policy for transparency. The correspondences we show between OPP-115 and the GDPR suggest the feasibility of bridging existing computational and legal research on privacy policies, benefiting both areas. 
    more » « less