skip to main content


Title: Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning
Award ID(s):
1937786
NSF-PAR ID:
10433682
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
European Conference on Computer Vision
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Precise determination of structural organization of semi-conducting polymers is of paramount importance for the further development of these materials in organic electronic technologies. Yet, prior characterization of some of the best-performing materials for transistor and photovoltaic applications, which are based on polymers with rigid backbones, often resulted in conundrums in which X-ray scattering and microscopy yielded seemingly contradicting results. Here we solve the paradox by introducing a new structural model, i.e. , semi-paracrystalline organization. The model establishes that the microstructure of these materials relies on a dense array of small paracrystalline domains embedded in a more disordered matrix. Thus, the overall structural order relies on two parameters: the novel concept of degree of paracrystallinity ( i.e. , paracrystalline volume/mass fraction, introduced here for the first time) and the lattice distortion parameter of paracrystalline domains ( g -parameter from X-ray scattering). Structural parameters of the model are correlated with long-range charge carrier transport, revealing that charge transport in semi-paracrystalline materials is particularly sensitive to the interconnection of paracrystalline domains. 
    more » « less
  2. Genome sequencing generates large volumes of data and hence requires increasingly higher computational resources. The growing data problem is even more acute in metagenomics applications, where data from an environmental sample include many organisms instead of just one for the common single organism sequencing. Traditional taxonomic classification and clustering approaches and platforms - while designed to be computationally efficient - are not capable of incrementally updating a previously trained system when new data arrive, which then requires complete re-training with the augmented (old plus new) data. Such complete retraining is inefficient and leads to poor utilization of computational resources. An ability to update a classification system with only new data offers a much lower run-time as new data are presented, and does not require the approach to be re-trained on the entire previous dataset. In this paper, we propose Incremental VSEARCH (I-VSEARCH) and its semi-supervised version for taxonomic classification, as well as a threshold independent VSEARCH (TI-VSEARCH) as wrappers around VSEARCH, a well-established (unsupervised) clustering algorithm for metagenomics. We show - on a 16S rRNA gene dataset - that I-VSEARCH, running incrementally only on the new batches of data that become available over time, does not lose any accuracy over VSEARCH that runs on the full data, while providing attractive computational benefits. 
    more » « less