Abstract

Prediction of protein secondary structure from FTIR spectra usually relies on the absorbance in the amide I–amide II region of the spectrum. It assumes that the absorbance in this spectral region, i.e., roughly 1700–1500 cm−1 is solely arising from amide contributions. Yet, it is accepted that, on the average, about 20% of the absorbance is due to amino acid side chains. The present paper evaluates the contribution of amino acid side chains in this spectral region and the potential to improve secondary structure prediction after correcting for their contribution. We show that the β-sheet content prediction is improved upon subtraction of amino acid side chain contributions in the amide I–amide II spectral range. Improvement is relatively important, for instance, the error of prediction of β-sheet content decreases from 5.42 to 4.97% when evaluated by ascending stepwise regression. Other methods tested such as partial least square regression and support vector machine have also improved accuracy for β-sheet content evaluation. The other structures such as α-helix do not significantly benefit from side chain contribution subtraction, in some cases prediction is even degraded. We show that co-linearity between secondary structure content and amino acid composition is not a main limitation for improving secondary structure prediction. We also show that, even though based on different criteria, secondary structures defined by DSSP and XTLSSTR both arrive at the same conclusion: only the β-sheet structure clearly benefits from side chain subtraction. It must be concluded that side chain contribution subtraction benefit for the evaluation of other secondary structure contents is limited by the very rough description of side chain absorbance which does not take into account the variations related to their environment. The study was performed on a large protein set. To deal with the large number of proteins present, we worked on protein microarrays deposited on BaF2 slides and FTIR spectra were acquired with an imaging system.

Highlights

  • Fourier transform infrared (FTIR) spectroscopy has become a global tool for the study of protein structure (Barth 2007; Wang et al 2008), protein glycans (Derenne et al 2020) and lipids (Dreissig et al 2009; Derenne et al 2014)

  • To improve protein secondary structure prediction from FTIR spectra, we recently defined a large reference protein set, called cSP92, containing 92 commercially available proteins for which high-resolution structures are available (De Meutter and Goormaghtigh 2020) and we investigated the potential of partial deuteration to improve secondary structure prediction (De Meutter and Goormaghtigh 2021a) This protein set was designed to span the entire structural space in terms of secondary structures and higher-order structures as described by CATH (Orengo et al 1997)

  • The key issue for the purpose of this work is that the actual amino acid side chain contributions in the amide I–amide II spectral region depends on many parameters

Read more

Summary

Introduction

Fourier transform infrared (FTIR) spectroscopy has become a global tool for the study of protein structure (Barth 2007; Wang et al 2008), protein glycans (Derenne et al 2020) and lipids (Dreissig et al 2009; Derenne et al 2014). The first extensive series of data was reported by Chirgadze et al (1975) and Venyaminov and Kalnin (1991), reviewed in Goormaghtigh et al (1994a) These data have the advantage to express side chain contributions as a sum of Lorentzian/Gaussian bands whose parameters (band position, width, intensity, fraction of Gaussian component) are known. Numerous papers have added information to this first set of data as reviewed extensively by Barth (2000, 2007) Another extensive study of the infrared spectra and molar absorption coefficient of the 20 amino acid side chains was reported by Wolpert and Hellwig (2006). Results cannot be directly used to process experimental data as frequencies, scaling and band widths are not precisely known

Objectives
Methods
Results
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call