The excited-state properties of molecular crystals are important for applications in organic electronic devices. The GW approximation and Bethe-Salpeter equation (GW+BSE) is the state-of-the-art method for calculating the excited-state properties of crystalline solids with periodic boundary conditions. We present the PAH101 dataset of GW +BSE calculations for 101 molecular crystals of polycyclic aromatic hydrocarbons (PAHs) with up to ∼500 atoms in the unit cell. The data records include the GW quasiparticle band structure, the fundamental band gap, the static dielectric constant, the first singlet exciton energy (optical gap), the first triplet exciton energy, the dielectric function, and optical absorption spectra for light polarized along the three lattice vectors. In addition, the dataset includes the density functional theory (DFT) single-molecule and crystal features used in Liu et al. [npj Computational Materials, 8, 70 (2022)]. We envision the dataset being used to (i) identify correlations between DFT and GW +BSE quantities, (ii) discover materials with desired electronic/ optical properties in the dataset itself, and (iii) train machine-learned models to help in materials discovery efforts. We provide examples to illustrate these three use cases.
more »
« less
This content will become publicly available on May 14, 2026
Predicting the excited-state properties of crystalline organic semiconductors using GW+BSE and machine learning
Excited-state properties of crystalline organic semiconductors are key to organic electronic device applications. Machine learning (ML) models capable of predicting these properties could significantly accelerate materials discovery. We use the sure-independence-screening-and-sparsifying-operator (SISSO) ML algorithm to generate models to predict the first singlet excitation energy, which corresponds to the optical gap, the first triplet excitation energy, the singlet–triplet gap, and the singlet exciton binding energy of organic molecular crystals. To train the models we use the “PAH101” dataset of many-body perturbation theory calculations within the GW approximation and Bethe–Salpeter equation (GW+BSE) for 101 crystals of polycyclic aromatic hydrocarbons (PAHs). The best performing SISSO models yield predictions within about 0.2 eV of the GW+BSE reference values. SISSO models are selected based on considerations of accuracy and computational cost to construct materials screening workflows for each property. The screening targets are chosen to demonstrate typical use-cases relevant for organic electronic devices. We show that the workflows based on SISSO models can effectively screen out most of the materials that are not of interest and significantly reduce the number of candidates selected for further evaluation using computationally expensive excited-state theory.
more »
« less
- Award ID(s):
- 2323749
- PAR ID:
- 10633436
- Publisher / Repository:
- RSC
- Date Published:
- Journal Name:
- Digital Discovery
- Volume:
- 4
- Issue:
- 5
- ISSN:
- 2635-098X
- Page Range / eLocation ID:
- 1306 to 1322
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The excited-state properties of molecular crystals are important for applications in organic electronic devices. The GW approximation and Bethe-Salpeter equation (GW +BSE) is the state-of-the-art method for calculating the excited-state properties of crystalline solids with periodic boundary conditions. We present the PAH101 dataset of GW +BSE calculations for 101 molecular crystals of polycyclic aromatic hydrocarbons (PAHs) with up to ∼500 atoms in the unit cell. The data records include the GW quasiparticle band structure, the fundamental band gap, the static dielectric constant, the first singlet exciton energy (optical gap), the first triplet exciton energy, the dielectric function, and optical absorption spectra for light polarized along the three lattice vectors. In addition, the dataset includes the density functional theory (DFT) single-molecule and crystal features used in Liu et al. [npj Computational Materials, 8, 70 (2022)]. We envision the dataset being used to (i) identify correlations between DFT and GW +BSE quantities, (ii) discover materials with desired electronic/ optical properties in the dataset itself, and (iii) train machine-learned models to help in materials discovery efforts. We provide examples to illustrate these three use cases.more » « less
-
Abstract The excited-state properties of molecular crystals are important for applications in organic electronic devices. TheGWapproximation and Bethe-Salpeter equation (GW+BSE) is the state-of-the-art method for calculating the excited-state properties of crystalline solids with periodic boundary conditions. We present the PAH101 dataset ofGW+BSE calculations for 101 molecular crystals of polycyclic aromatic hydrocarbons (PAHs) with up to ~500 atoms in the unit cell. To the best of our knowledge, this is the firstGW+BSE dataset for molecular crystals. The data records include theGWquasiparticle band structure, the fundamental band gap, the static dielectric constant, the first singlet exciton energy (optical gap), the first triplet exciton energy, the dielectric function, and optical absorption spectra for light polarized along the three lattice vectors. The dataset can be used to (i) discover materials with desired electronic/optical properties, (ii) identify correlations between DFT andGW+BSE quantities, and (iii) train machine learned models to help in materials discovery efforts.more » « less
-
Abstract Singlet fission (SF), the conversion of one singlet exciton into two triplet excitons, could significantly enhance solar cell efficiency. Molecular crystals that undergo SF are scarce. Computational exploration may accelerate the discovery of SF materials. However, many-body perturbation theory (MBPT) calculations of the excitonic properties of molecular crystals are impractical for large-scale materials screening. We use the sure-independence-screening-and-sparsifying-operator (SISSO) machine-learning algorithm to generate computationally efficient models that can predict the MBPT thermodynamic driving force for SF for a dataset of 101 polycyclic aromatic hydrocarbons (PAH101). SISSO generates models by iteratively combining physical primary features. The best models are selected by linear regression with cross-validation. The SISSO models successfully predict the SF driving force with errors below 0.2 eV. Based on the cost, accuracy, and classification performance of SISSO models, we propose a hierarchical materials screening workflow. Three potential SF candidates are found in the PAH101 set.more » « less
-
Understanding the process of molecular photoexcitation is crucial in various fields, including drug development, materials science, photovoltaics, and more. The electronic vertical excitation energy is a critical property, for example in determining the singlet–triplet gap of chromophores. However, a full understanding of excited-state processes requires additional explorations of the excited-state potential energy surface and electronic properties, which is greatly aided by the availability of analytic energy gradients. Owing to its robust high accuracy over a wide range of chemical problems, equation-of-motion coupled cluster with single and double excitations (EOM-CCSD) is a powerful method for predicting excited-state properties, and the implementation of analytic gradients of many EOM-CCSD variants (excitation energies, ionization potentials, electron attachment energies, etc.) along with numerous successful applications highlights the flexibility of the method. In specific cases where a higher level of accuracy is needed or in more complex electronic structures, the inclusion of triple excitations becomes essential, for example, in the EOM-CCSD* approach of Saeh and Stanton. In this work, we derive and implement for the first time the analytic gradients of EOMEE-CCSD*, which also provides a template for analytic gradients of related excited-state methods with perturbative triple excitations. The capabilities of analytic EOMEE-CCSD* gradients are illustrated by several representative examples.more » « less
An official website of the United States government
