Abstract

Free-electron lasers could enable X-ray imaging of single biological macromolecules and the study of protein dynamics, paving the way for a powerful new imaging tool in structural biology, but a low signal-to-noise ratio and missing regions in the detectors, colloquially termed 'masks', affect data collection and hamper real-time evaluation of experimental data. In this article, the challenges posed by noise and masks are tackled by introducing a neural network pipeline that aims to restore diffraction intensities. For training and testing of the model, a data set of diffraction patterns was simulated from 10 900 different proteins with molecular weights within the range of 10-100 kDa and collected at a photon energy of 8 keV. The method is compared with a simple low-pass filtering algorithm based on autocorrelation constraints. The results show an improvement in the mean-squared error of roughly two orders of magnitude in the presence of masks compared with the noisy data. The algorithm was also tested at increasing mask width, leading to the conclusion that demasking can achieve good results when the mask is smaller than half of the central speckle of the pattern. The results highlight the competitiveness of this model for data processing and the feasibility of restoring diffraction intensities from unknown structures in real time using deep learning methods. Finally, an example is shown of this preprocessing making orientation recovery more reliable, especially for data sets containing very few patterns, using the expansion-maximization-compression algorithm.

Highlights

  • Knowing the structure of biological macromolecules, such as proteins, is fundamental to understanding their mechanisms and function

  • These, and all the denoised images shown in this paper, come from the respective test data sets and the neural network did not have access to them during training

  • The mean-squared error (MSE) is a useful metric for these applications since it is preserved by the Fourier transform and represents the error in the autocorrelation of the pattern

Read more

Summary

Introduction

Knowing the structure of biological macromolecules, such as proteins, is fundamental to understanding their mechanisms and function. X-ray crystallography has been the most successful tool for structure determination, and it is still the most commonly used technique. The major drawback is that the proteins must be crystallized, which is not always feasible and makes it hard to study protein dynamics. Singleparticle imaging (SPI) methods aim to investigate biological structures without the need for crystallization. One SPI method that has shown impressive results is electron microscopy (EM). Cryogenic EM (cryo-EM), in particular, is routinely used to study protein structures and dynamics. Cryo-EM is limited in time resolution, as it cannot reach time scales much faster than milliseconds (Chen & Frank, 2015)

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.