NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Understanding Fixed Predictions via Confined Regions

Lawless, Connor; Weng, Tsui-Wei; Ustun, Berk; Udell, Madeleine (July 2025, ICML 2025)

Free, publicly-accessible full text available July 14, 2026
Randomized Nyström Preconditioning

https://doi.org/10.1137/21M1466244

Frangella, Zachary; Tropp, Joel A.; Udell, Madeleine (June 2023, SIAM Journal on Matrix Analysis and Applications)

Full Text Available
Robust Non-Linear Matrix Factorization for Dictionary Learning, Denoising, and Clustering

https://doi.org/10.1109/TSP.2021.3062988

Fan, Jicong; Yang, Chengrun; Udell, Madeleine (January 2021, IEEE Transactions on Signal Processing)
null (Ed.)
Full Text Available
Randomized Sketching Algorithms for Low-Memory Dynamic Optimization

https://doi.org/10.1137/19M1272561

Muthukumar, Ramchandran; Kouri, Drew P.; Udell, Madeleine (January 2021, SIAM Journal on Optimization)
null (Ed.)
Full Text Available
Scalable Semidefinite Programming

https://doi.org/10.1137/19M1305045

Yurtsever, Alp; Tropp, Joel A.; Fercoq, Olivier; Udell, Madeleine; Cevher, Volkan (January 2021, SIAM Journal on Mathematics of Data Science)
null (Ed.)
Full Text Available
Approximate Cross-Validation with Low-Rank Data in High Dimensions

Stephenson, William; Udell, Madeleine; Broderick, Tamara (January 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
Factor Group-Sparse Regularization for Efficient Low-Rank Matrix Recovery

Fan, Jicong; Ding, Lijun; Chen, Yudong; Udell, Madeleine (December 2019, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019))

Full Text Available
Low-Rank Tucker Approximation of a Tensor from Streaming Data

https://doi.org/10.1137/19M1257718

Sun, Yiming; Guo, Yang; Luo, Charlene; Tropp, Joel; Udell, Madeleine (January 2020, SIAM Journal on Mathematics of Data Science)
null (Ed.)
Full Text Available
Sparse Data Reconstruction, Missing Value and Multiple Imputation through Matrix Factorization

https://doi.org/10.1177/00811750221125799

Sengupta, Nandana; Udell, Madeleine; Srebro, Nathan; Evans, James (October 2022, Sociological Methodology)

Social science approaches to missing values predict avoided, unrequested, or lost information from dense data sets, typically surveys. The authors propose a matrix factorization approach to missing data imputation that (1) identifies underlying factors to model similarities across respondents and responses and (2) regularizes across factors to reduce their overinfluence for optimal data reconstruction. This approach may enable social scientists to draw new conclusions from sparse data sets with a large number of features, for example, historical or archival sources, online surveys with high attrition rates, or data sets created from Web scraping, which confound traditional imputation techniques. The authors introduce matrix factorization techniques and detail their probabilistic interpretation, and they demonstrate these techniques’ consistency with Rubin’s multiple imputation framework. The authors show via simulations using artificial data and data from real-world subsets of the General Social Survey and National Longitudinal Study of Youth cases for which matrix factorization techniques may be preferred. These findings recommend the use of matrix factorization for data reconstruction in several settings, particularly when data are Boolean and categorical and when large proportions of the data are missing.
more » « less
Fairness Under Unawareness: Assessing Disparity When Protected Class Is Unobserved

https://doi.org/10.1145/3287560.3287594

Chen, Jiahao; Kallus, Nathan; Mao, Xiaojie; Svacha, Geoffry; Udell, Madeleine (January 2019, Proceedings of the Conference on Fairness, Accountability, and Transparency)

Assessing the fairness of a decision making system with respect to a protected class, such as gender or race, is challenging when class membership labels are unavailable. Probabilistic models for predicting the protected class based on observable proxies, such as surname and geolocation for race, are sometimes used to impute these missing labels for compliance assessments. Empirically, these methods are observed to exaggerate disparities, but the reason why is unknown. In this paper, we decompose the biases in estimating outcome disparity via threshold-based imputation into multiple interpretable bias sources, allowing us to explain when over- or underestimation occurs. We also propose an alternative weighted estimator that uses soft classification, and show that its bias arises simply from the conditional covariance of the outcome with the true class membership. Finally, we illustrate our results with numerical simulations and a public dataset of mortgage applications, using geolocation as a proxy for race. We confirm that the bias of threshold-based imputation is generally upward, but its magnitude varies strongly with the threshold chosen. Our new weighted estimator tends to have a negative bias that is much simpler to analyze and reason about.
more » « less
Full Text Available

Search for: All records