Search for: All records

Creators/Authors contains: "Yan, Chao"

« Prev Next »

Total Resources

38

Resource Type
Conference Paper

4

Conference Proceeding

1

Dataset

0

Journal Article

33

Workshop Report

0

Availability
Full Text / Resource Available

33

Citation Only

5

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Private Everlasting Prediction

Naor, Moni Naor ; Nissim, Kobbi Nissim ; Stemmer, Uri ; Yan, Chao ( December 2023 , proceedings of 37th NeurIPS 2023)

A private learner is trained on a sample of labeled points and generates a hypothesis that can be used for predicting the labels of newly sampled points while protecting the privacy of the training set [Kasiviswannathan et al., FOCS 2008]. Past research uncovered that private learners may need to exhibit significantly higher sample complexity than non-private learners as is the case of learning of one-dimensional threshold functions [Bun et al., FOCS 2015, Alon et al., STOC 2019]. We explore prediction as an alternative to learning. A predictor answers a stream of classification queries instead of outputting a hypothesis. Earlier work has considered a private prediction model with a single classification query [Dwork and Feldman, COLT 2018]. We observe that when answering a stream of queries, a predictor must modify the hypothesis it uses over time, and in a manner that cannot rely solely on the training set. We introduce private everlasting prediction taking into account the privacy of both the training set and the (adaptively chosen) queries made to the predictor. We then present a generic construction of private everlasting predictors in the PAC model. The sample complexity of the initial training sample in our construction is quadratic (up to polylog factors) in the VC dimension of the concept class. Our construction allows prediction for all concept classes with finite VC dimension, and in particular threshold functions over infinite domains, for which (traditional) private learning is known to be impossible.
more » « less
Free, publicly-accessible full text available December 10, 2024
Increasing contribution of nighttime nitrogen chemistry to wintertime haze formation in Beijing observed during COVID-19 lockdowns

https://doi.org/10.1038/s41561-023-01285-1

Yan, Chao ; Tham, Yee Jun ; Nie, Wei ; Xia, Men ; Wang, Haichao ; Guo, Yishuo ; Ma, Wei ; Zhan, Junlei ; Hua, Chenjie ; Li, Yuanyuan ; et al ( November 2023 , Nature Geoscience)

Free, publicly-accessible full text available November 1, 2024
Nitrate Radicals Suppress Biogenic New Particle Formation from Monoterpene Oxidation

https://doi.org/10.1021/acs.est.3c07958

Li, Dandan ; Huang, Wei ; Wang, Dongyu ; Wang, Mingyi ; Thornton, Joel A. ; Caudillo, Lucía ; Rörup, Birte ; Marten, Ruby ; Scholz, Wiebke ; Finkenzeller, Henning ; et al ( January 2024 , Environmental Science & Technology)

Free, publicly-accessible full text available January 23, 2025
Precursor apportionment of atmospheric oxygenated organic molecules using a machine learning method

https://doi.org/10.1039/d2ea00128d

Qiao, Xiaohui ; Li, Xiaoxiao ; Yan, Chao ; Sarnela, Nina ; Yin, Rujing ; Guo, Yishuo ; Yao, Lei ; Nie, Wei ; Huang, Dandan ; Wang, Zhe ; et al ( January 2023 , Environmental Science: Atmospheres)

Gas-phase oxygenated organic molecules (OOMs) can contribute significantly to both atmospheric new particle growth and secondary organic aerosol formation. Precursor apportionment of atmospheric OOMs connects them with volatile organic compounds (VOCs). Since atmospheric OOMs are often highly functionalized products of multistep reactions, it is challenging to reveal the complete mapping relationships between OOMs and their precursors. In this study, we demonstrate that the machine learning method is useful in attributing atmospheric OOMs to their precursors using several chemical indicators, such as O/C ratio and H/C ratio. The model is trained and tested using data acquired in controlled laboratory experiments, covering the oxidation products of four main types of VOCs (isoprene, monoterpenes, aliphatics, and aromatics). Then, the model is used for analyzing atmospheric OOMs measured in both urban Beijing and a boreal forest environment in southern Finland. The results suggest that atmospheric OOMs in these two environments can be reasonably assigned to their precursors. Beijing is an anthropogenic VOC dominated environment with ∼64% aromatic and aliphatic OOMs, and the other boreal forested area has ∼76% monoterpene OOMs. This pilot study shows that machine learning can be a promising tool in atmospheric chemistry for connecting the dots.
more » « less
Full Text Available
NO at low concentration can enhance the formation of highly oxygenated biogenic molecules in the atmosphere

https://doi.org/10.1038/s41467-023-39066-4

Nie, Wei ; Yan, Chao ; Yang, Liwen ; Roldin, Pontus ; Liu, Yuliang ; Vogel, Alexander L. ; Molteni, Ugo ; Stolzenburg, Dominik ; Finkenzeller, Henning ; Amorim, Antonio ; et al ( December 2023 , Nature Communications)

Abstract The interaction between nitrogen monoxide (NO) and organic peroxy radicals (RO 2 ) greatly impacts the formation of highly oxygenated organic molecules (HOM), the key precursors of secondary organic aerosols. It has been thought that HOM production can be significantly suppressed by NO even at low concentrations. Here, we perform dedicated experiments focusing on HOM formation from monoterpenes at low NO concentrations (0 – 82 pptv). We demonstrate that such low NO can enhance HOM production by modulating the RO 2 loss and favoring the formation of alkoxy radicals that can continue to autoxidize through isomerization. These insights suggest that HOM yields from typical boreal forest emissions can vary between 2.5%-6.5%, and HOM formation will not be completely inhibited even at high NO concentrations. Our findings challenge the notion that NO monotonically reduces HOM yields by extending the knowledge of RO 2 -NO interactions to the low-NO regime. This represents a major advance towards an accurate assessment of HOM budgets, especially in low-NO environments, which prevails in the pre-industrial atmosphere, pristine areas, and the upper boundary layer.
more » « less
Free, publicly-accessible full text available December 1, 2024
Sulfur Dioxide Transported From the Residual Layer Drives Atmospheric Nucleation During Haze Periods in Beijing

https://doi.org/10.1029/2022GL100514

Wang, Yonghong ; Ma, Yongjing ; Yan, Chao ; Yao, Lei ; Cai, Runlong ; Li, Shuying ; Lin, Zhuohui ; Zhao, Xiujuan ; Yin, Rujing ; Deng, Chenjuan ; et al ( March 2023 , Geophysical Research Letters)

Full Text Available
Role of sesquiterpenes in biogenic new particle formation

https://doi.org/10.1126/sciadv.adi5297

Dada, Lubna ; Stolzenburg, Dominik ; Simon, Mario ; Fischer, Lukas ; Heinritzi, Martin ; Wang, Mingyi ; Xiao, Mao ; Vogel, Alexander L. ; Ahonen, Lauri ; Amorim, Antonio ; et al ( September 2023 , Science Advances)

Biogenic vapors form new particles in the atmosphere, affecting global climate. The contributions of monoterpenes and isoprene to new particle formation (NPF) have been extensively studied. However, sesquiterpenes have received little attention despite a potentially important role due to their high molecular weight. Via chamber experiments performed under atmospheric conditions, we report biogenic NPF resulting from the oxidation of pure mixtures of β-caryophyllene, α-pinene, and isoprene, which produces oxygenated compounds over a wide range of volatilities. We find that a class of vapors termed ultralow-volatility organic compounds (ULVOCs) are highly efficient nucleators and quantitatively determine NPF efficiency. When compared with a mixture of isoprene and monoterpene alone, adding only 2% sesquiterpene increases the ULVOC yield and doubles the formation rate. Thus, sesquiterpene emissions need to be included in assessments of global aerosol concentrations in pristine climates where biogenic NPF is expected to be a major source of cloud condensation nuclei.

more » « less
Free, publicly-accessible full text available September 8, 2024
Dynamically adjusting case reporting policy to maximize privacy and public health utility in the face of a pandemic

https://doi.org/10.1093/jamia/ocac011

Brown, J Thomas ; Yan, Chao ; Xia, Weiyi ; Yin, Zhijun ; Wan, Zhiyu ; Gkoulalas-Divanis, Aris ; Kantarcioglu, Murat ; Malin, Bradley A ( February 2022 , Journal of the American Medical Informatics Association)

Abstract Objective Supporting public health research and the public’s situational awareness during a pandemic requires continuous dissemination of infectious disease surveillance data. Legislation, such as the Health Insurance Portability and Accountability Act of 1996 and recent state-level regulations, permits sharing deidentified person-level data; however, current deidentification approaches are limited. Namely, they are inefficient, relying on retrospective disclosure risk assessments, and do not flex with changes in infection rates or population demographics over time. In this paper, we introduce a framework to dynamically adapt deidentification for near-real time sharing of person-level surveillance data. Materials and Methods The framework leverages a simulation mechanism, capable of application at any geographic level, to forecast the reidentification risk of sharing the data under a wide range of generalization policies. The estimates inform weekly, prospective policy selection to maintain the proportion of records corresponding to a group size less than 11 (PK11) at or below 0.1. Fixing the policy at the start of each week facilitates timely dataset updates and supports sharing granular date information. We use August 2020 through October 2021 case data from Johns Hopkins University and the Centers for Disease Control and Prevention to demonstrate the framework’s effectiveness in maintaining the PK11 threshold of 0.01. Results When sharing COVID-19 county-level case data across all US counties, the framework’s approach meets the threshold for 96.2% of daily data releases, while a policy based on current deidentification techniques meets the threshold for 32.3%. Conclusion Periodically adapting the data publication policies preserves privacy while enhancing public health utility through timely updates and sharing epidemiologically critical features.
more » « less
Full Text Available
How Adversarial Assumptions Influence Re-identification Risk Measures: A COVID-19 Case Study

Wan, Zhiyu ; Yan, Chao ; Brown, J. Thomas ; Xia, Weiyi ; Gkoulalas-Divanis, Aris ; Kantarcioglu, Murat ; Malin, Bradley ( January 2022 , International Conference on Privacy in Statistical Databases)

The COVID-19 pandemic highlights the need for broad dissemination of case surveillance data. Local and global public health agencies have initiated efforts to do so, but there remains limited data available, due in part to concerns over privacy. As a result, current COVID-19 case surveillance data sharing policies are based on strong adversarial assumptions, such as the expectation that an attacker can readily re-identify individuals based on their distinguishability in a dataset. There are various re-identification risk measures to account for adversarial capabilities; however, the current array insufficiently accounts for real world data challenges - particularly issues of missing records in resources of identifiable records that adversaries may rely upon to execute attacks (e.g., 10 50-year-old male in the de-identified dataset vs. 5 50-year-old male in the identified dataset). In this paper, we introduce several approaches to amend such risk measures and assess re-identification risk in light of how an attacker's capabilities relate to missing records. We demonstrate the potential for these measures through a record linkage attack using COVID-19 case surveillance data and voter registration records in the state of Florida. Our findings demonstrate that adversarial assumptions, as realized in a risk measure, can dramatically affect re-identification risk estimation. Notably, we show that the re-identification risk is likely to be substantially smaller than the typical risk thresholds, which suggests that more detailed data could be shared publicly than is currently the case.
more » « less
Full Text Available
Increasing Iridium Oxide Activity for the Oxygen Evolution Reaction with Hafnium Modification

https://doi.org/10.1021/jacs.1c03473

Zhao, Fang ; Wen, Bo ; Niu, Wenhan ; Chen, Zhu ; Yan, Chao ; Selloni, Annabella ; Tully, Christopher G. ; Yang, Xiaofang ; Koel, Bruce E. ( September 2021 , Journal of the American Chemical Society)

Full Text Available

« Prev Next »