Abstract The emergence of high repetition-rate X-ray free-electron lasers (XFELs) powered by superconducting accelerator technology enables the measurement of significantly more experimental data per day than was previously possible. The European XFEL is expected to provide 27,000 pulses per second, over two orders of magnitude more than any other XFEL. The increased pulse rate is a key enabling factor for single-particle X-ray diffractive imaging, which relies on averaging the weak diffraction signal from single biological particles. Taking full advantage of this new capability requires that all experimental steps, from sample preparation and delivery to the acquisition of diffraction patterns, are compatible with the increased pulse repetition rate. Here, we show that single-particle imaging can be performed using X-ray pulses at megahertz repetition rates. The results obtained pave the way towards exploiting high repetition-rate X-ray free-electron lasers for single-particle imaging at their full repetition rate.
more »
« less
Selecting XFEL single-particle snapshots by geometric machine learning
A promising new route for structural biology is single-particle imaging with an X-ray Free-Electron Laser (XFEL). This method has the advantage that the samples do not require crystallization and can be examined at room temperature. However, high-resolution structures can only be obtained from a sufficiently large number of diffraction patterns of individual molecules, so-called single particles. Here, we present a method that allows for efficient identification of single particles in very large XFEL datasets, operates at low signal levels, and is tolerant to background. This method uses supervised Geometric Machine Learning (GML) to extract low-dimensional feature vectors from a training dataset, fuse test datasets into the feature space of training datasets, and separate the data into binary distributions of “single particles” and “non-single particles.” As a proof of principle, we tested simulated and experimental datasets of the Coliphage PR772 virus. We created a training dataset and classified three types of test datasets: First, a noise-free simulated test dataset, which gave near perfect separation. Second, simulated test datasets that were modified to reflect different levels of photon counts and background noise. These modified datasets were used to quantify the predictive limits of our approach. Third, an experimental dataset collected at the Stanford Linear Accelerator Center. The single-particle identification for this experimental dataset was compared with previously published results and it was found that GML covers a wide photon-count range, outperforming other single-particle identification methods. Moreover, a major advantage of GML is its ability to retrieve single particles in the presence of structural variability.
more »
« less
- Award ID(s):
- 1231306
- PAR ID:
- 10589274
- Publisher / Repository:
- American Institute of Physics
- Date Published:
- Journal Name:
- Structural Dynamics
- Volume:
- 8
- Issue:
- 1
- ISSN:
- 2329-7778
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Using X-ray free-electron lasers (XFELs), it is possible to determine three-dimensional structures of nanoscale particles using single-particle imaging methods. Classification algorithms are needed to sort out the single-particle diffraction patterns from the large amount of XFEL experimental data. However, different methods often yield inconsistent results. This study compared the performance of three classification algorithms: convolutional neural network, graph cut and diffusion map manifold embedding methods. The identified single-particle diffraction data of the PR772 virus particles were assembled in the three-dimensional Fourier space for real-space model reconstruction. The comparison showed that these three classification methods lead to different datasets and subsequently result in different electron density maps of the reconstructed models. Interestingly, the common dataset selected by these three methods improved the quality of the merged diffraction volume, as well as the resolutions of the reconstructed maps.more » « less
-
Ultra-bright femtosecond X-ray pulses generated by X-ray free-electron lasers (XFELs) can be used to image high-resolution structures without the need for crystallization. For this approach, aerosol injection has been a successful method to deliver 70–2000 nm particles into the XFEL beam efficiently and at low noise. Improving the technique of aerosol sample delivery and extending it to single proteins necessitates quantitative aerosol diagnostics. Here a lab-based technique is introduced for Rayleigh-scattering microscopy allowing us to track and size aerosolized particles down to 40 nm in diameter as they exit the injector. This technique was used to characterize the `Uppsala injector', which is a pioneering and frequently used aerosol sample injector for XFEL single-particle imaging. The particle-beam focus, particle velocities, particle density and injection yield were measured at different operating conditions. It is also shown how high particle densities and good injection yields can be reached for large particles (100–500 nm). It is found that with decreasing particle size, particle densities and injection yields deteriorate, indicating the need for different injection strategies to extend XFEL imaging to smaller targets, such as single proteins. This work demonstrates the power of Rayleigh-scattering microscopy for studying focused aerosol beams quantitatively. It lays the foundation for lab-based injector development and online injection diagnostics for XFEL research. In the future, the technique may also find application in other fields that employ focused aerosol beams, such as mass spectrometry, particle deposition, fuel injection and three-dimensional printing techniques.more » « less
-
null (Ed.)Abstract Background Cryo-electron microscopy (Cryo-EM) is widely used in the determination of the three-dimensional (3D) structures of macromolecules. Particle picking from 2D micrographs remains a challenging early step in the Cryo-EM pipeline due to the diversity of particle shapes and the extremely low signal-to-noise ratio of micrographs. Because of these issues, significant human intervention is often required to generate a high-quality set of particles for input to the downstream structure determination steps. Results Here we propose a fully automated approach (DeepCryoPicker) for single particle picking based on deep learning. It first uses automated unsupervised learning to generate particle training datasets. Then it trains a deep neural network to classify particles automatically. Results indicate that the DeepCryoPicker compares favorably with semi-automated methods such as DeepEM, DeepPicker, and RELION, with the significant advantage of not requiring human intervention. Conclusions Our framework combing supervised deep learning classification with automated un-supervised clustering for generating training data provides an effective approach to pick particles in cryo-EM images automatically and accurately.more » « less
-
Axion-like particles may form a network of cosmic strings in the Universe today that can rotate the plane of polarization of cosmic microwave background (CMB) photons. Future CMB observations with improved sensitivity might detect this axion-string-induced birefringence effect, thereby revealing an as-yet unseen constituent of the Universe and offering a new probe of particles and forces that are beyond the Standard Model of Elementary Particle Physics. In this work, we explore how spherical convolutional neural networks (SCNNs) may be used to extract information about the axion string network from simulated birefringence maps. We construct a pipeline to simulate the anisotropic birefringence that would arise from an axion string network, and we train SCNNs to estimate three parameters related to the cosmic string length, the cosmic string abundance, and the axion-photon coupling. Our results demonstrate that neural networks are able to extract information from a birefringence map that is inaccessible with two-point statistics alone (i.e., the angular power spectrum). We also assess the impact of noise on the accuracy of our SCNN estimators, demonstrating that noise at the level anticipated for Stage IV (CMB-S4) measurements would significantly bias parameter estimation for SCNNs trained on noiseless simulated data, and necessitate modeling the noise in the training data.more » « less
An official website of the United States government
