skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.
Attention:The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Thursday, May 14 until 2:00 AM ET on Friday, May 15 due to maintenance. We apologize for the inconvenience.


Search for: All records

Creators/Authors contains: "Jain, Anubhav"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Face recognition systems have made significant strides thanks to data-heavy deep learning models, but these models rely on large privacy-sensitive datasets. Recent work in facial analysis and recognition have thus started making use of synthetic datasets generated from GANs and diffusion based generative models. These models, however, lack fairness in terms of demographic representation and can introduce the same biases in the trained downstream tasks. This can have serious societal and security implications. To address this issue, we propose a methodology that generates unbiased data from a biased generative model using an evolutionary algorithm. We show results for StyleGAN2 model trained on the Flicker Faces High Quality dataset to generate data for singular and combinations of demographic attributes such as Black and Woman. We generate a large racially balanced dataset of 13.5 million images, and show that it boosts the performance of facial recognition and analysis systems whilst reducing their biases. We have made our code-base ( https://github.com/anubhav1997/youneednodataset ) public to allow researchers to reproduce our work. 
    more » « less
  2. A dictionary attack in a biometric system entails the use of a small number of strategically generated images or tem- plates to successfully match with a large number of identi- ties, thereby compromising security. We focus on dictionary attacks at the template level, specifically the IrisCodes used in iris recognition systems. We present an hitherto unknown vulnerability wherein we mix IrisCodes using simple bit- wise operators to generate alpha-mixtures —alpha-wolves (combining a set of “wolf” samples) and alpha-mammals (combining a set of users selected via search optimization) that increase false matches. We evaluate this vulnerabil- ity using the IITD, CASIA-IrisV4-Thousand and Synthetic datasets, and observe that an alpha-wolf (from two wolves) can match upto 71 identities @FMR=0.001%, while an alpha-mammal (from two identities) can match upto 133 other identities @FMR=0.01% on the IITD dataset. 
    more » « less
  3. Abstract Deep learning (DL) is one of the fastest-growing topics in materials data science, with rapidly emerging applications spanning atomistic, image-based, spectral, and textual data modalities. DL allows analysis of unstructured data and automated identification of features. The recent development of large materials databases has fueled the application of DL methods in atomistic prediction in particular. In contrast, advances in image and spectral data have largely leveraged synthetic data enabled by high-quality forward models as well as by generative unsupervised DL methods. In this article, we present a high-level overview of deep learning methods followed by a detailed discussion of recent developments of deep learning in atomistic simulation, materials imaging, spectral analysis, and natural language processing. For each modality we discuss applications involving both theoretical and experimental data, typical modeling approaches with their strengths and limitations, and relevant publicly available software and datasets. We conclude the review with a discussion of recent cross-cutting work related to uncertainty quantification in this field and a brief perspective on limitations, challenges, and potential growth areas for DL methods in materials science. 
    more » « less
  4. Computational prediction of good thermoelectric (TE) performance in several n-type doped Zintl phases, combined with successful experimental realization, has sparked interest in discovering new n-type dopable members of this family of materials. However, most known Zintls are typically only p-type dopable; prior successes in finding n-type Zintl phases have been largely serendipitous. Here, we go beyond previously synthesized Zintl phases and perform chemical substitutions in known n-type dopable ABX Zintl phases to discover new ones. We use first-principles calculations to predict their stability, potential for TE performance as well as their n-type dopability. Using this approach, we find 17 new ABX Zintl phases in the KSnSb structure type that are predicted to be stable. Several of these newly predicted phases (KSnBi, RbSnBi, NaGeP) are found to exhibit promising n-type TE performance and are n-type dopable. We propose these compounds for further experimental studies, especially KSnBi and RbSnBi, which are both predicted to be good TE materials with high electron concentrations due to self-doping by native defects, when grown under alkali-rich conditions. 
    more » « less
  5. Half-Heusler materials are strong candidates for thermoelectric applications due to their high weighted mobilities and power factors, which is known to be correlated to valley degeneracy in the electronic band structure. However, there are over 50 known semiconducting half-Heusler phases, and it is not clear how the chemical composition affects the electronic structure. While all the n-type electronic structures have their conduction band minimum at either the Γ - or X -point, there is more diversity in the p-type electronic structures, and the valence band maximum can be at either the Γ -, L -, or W -point. Here, we use high throughput computation and machine learning to compare the valence bands of known half-Heusler compounds and discover new chemical guidelines for promoting the highly degenerate W -point to the valence band maximum. We do this by constructing an “orbital phase diagram” to cluster the variety of electronic structures expressed by these phases into groups, based on the atomic orbitals that contribute most to their valence bands. Then, with the aid of machine learning, we develop new chemical rules that predict the location of the valence band maximum in each of the phases. These rules can be used to engineer band structures with band convergence and high valley degeneracy. 
    more » « less
  6. Abstract Contents 1. Introduction- Methods and software for electronic structure based simulations of chemistry and materials 2. Density Functional Theory: Formalism and Current Directions 3. Density functional methods - implementation, challenges, successes 4. Green’s function based many-body perturbation theory 5. Wave-function theory approaches – explicit approaches to electron correlation 6. Quantum Monte Carlo and stochastic electronic structure methods 7. Heavy element relativity, spin-orbit physics, and magnetism 8. Semiempirical methods 9. Simulating Nuclear Dynamics with Quantum Effects 10. Real-Time Propagation in Electronic Structure Theory 11. Spectroscopy 12. Tools for exploring potential energy surfaces 13. Managing complex computational workflows 14. Current and Future Computer Architectures 15. Electronic structure software engineering 16. Education and Training in Electronic Structure Theory: Navigating an Evolving Landscape 17. Electronic structure theory facing industry and realistic modeling of experiments 18. List of Acronyms 
    more » « less
  7. null (Ed.)
    Accurate density functional theory calculations of the interrelated properties of thermoelectric materials entail high computational cost, especially as crystal structures increase in complexity and size. New methods involving ab initio scattering and transport (AMSET) and compressive sensing lattice dynamics are used to compute the transport properties of quaternary CaAl 2 Si 2 -type rare-earth phosphides RECuZnP 2 (RE = Pr, Nd, Er), which were identified to be promising thermoelectrics from high-throughput screening of 20 000 disordered compounds. Experimental measurements of the transport properties agree well with the computed values. Compounds with stiff bulk moduli (>80 GPa) and high speeds of sound (>3500 m s −1 ) such as RECuZnP 2 are typically dismissed as thermoelectric materials because they are expected to exhibit high lattice thermal conductivity. However, RECuZnP 2 exhibits not only low electrical resistivity, but also low lattice thermal conductivity (∼1 W m −1 K −1 ). Contrary to prior assumptions, polar-optical phonon scattering was revealed by AMSET to be the primary mechanism limiting the electronic mobility of these compounds, raising questions about existing assumptions of scattering mechanisms in this class of thermoelectric materials. The resulting thermoelectric performance ( zT of 0.5 for ErCuZnP 2 at 800 K) is among the best observed in phosphides and can likely be improved with further optimization. 
    more » « less