A new modeling method in feature construction for the HSQC spectra screening problem

Hiromi Arai, Masayuki Yamamura, Satoru Watanabe, Takanori Kigawa

doi:10.1093/bioinformatics/btn345

Open Access

https://doi.org/10.1093/bioinformatics/btn345

Journal: Bioinformatics
Publication Date: Jul 4, 2008
Citations: 2

Affiliation: Tokyo Institute of Technology

Abstract

Large-scale biological analyses produce huge amounts of data, so the data analysis process needs to be automated. Sample screening in high-throughput NMR protein structure analysis is a typical example. In particular, screening by protein ¹H-¹⁵N heteronuclear single quantum coherence (HSQC) spectra must be done quantitatively by a human expert. One popular solution to this problem is data mining. Machine learning methods can automatically extract rules and achieve high prediction accuracy when a good-quality training dataset is available. However, they tend to be black boxes, and the learned machines risk overfitting to the dataset. We propose a model that evaluates HSQC spectra for feature construction. The model calculates the similarity between the measured chemical shifts and those of a random coil peak model. We applied our feature construction method to the machine-learning discrimination of HSQC spectra of folded proteins from those of unfolded ones, and compared our model-based features with conventional sequence-based features and image-recognition features. The results show that our method has sufficient discrimination power and overfits the training data less than the other methods. In addition, our method succeeded in reducing the complexity of the input data for further investigation. Supplementary data are available at Bioinformatics online.
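To make the idea of a random-coil similarity feature concrete, the following is a minimal sketch and not the authors' model. The abstract does not specify the similarity measure, so everything here is an assumption: the function names, the scaling factor between the ¹H and ¹⁵N axes, the tolerance, and the peak positions, which are illustrative placeholders rather than values from a published random coil table. The feature is simply the fraction of measured peaks that lie close to any peak of the random coil model.

```python
import math

# Hypothetical weight that puts 15N shift differences on a scale
# comparable to 1H shift differences (an assumption of this sketch).
N_SCALE = 0.2


def peak_distance(p, q, n_scale=N_SCALE):
    """Scaled Euclidean distance between two (1H ppm, 15N ppm) peaks."""
    dh = p[0] - q[0]
    dn = (p[1] - q[1]) * n_scale
    return math.hypot(dh, dn)


def random_coil_similarity(measured, coil_model, tolerance=0.1):
    """Fraction of measured peaks within `tolerance` of any model peak.

    A value near 1 means the spectrum looks like the random coil model
    (unfolded-like); a value near 0 means the peaks are dispersed away
    from it (folded-like).
    """
    if not measured or not coil_model:
        raise ValueError("both peak lists must be non-empty")
    hits = sum(
        1
        for p in measured
        if min(peak_distance(p, q) for q in coil_model) <= tolerance
    )
    return hits / len(measured)


# Placeholder peak lists, invented purely for illustration.
coil_model = [(8.2, 120.0), (8.3, 122.0), (8.4, 110.0)]
unfolded_like = [(8.22, 120.1), (8.31, 121.9), (8.38, 110.2)]
folded_like = [(9.4, 128.0), (7.1, 105.0), (8.25, 120.2)]

score_unfolded = random_coil_similarity(unfolded_like, coil_model)
score_folded = random_coil_similarity(folded_like, coil_model)
```

With these placeholder peaks, the unfolded-like spectrum scores higher than the folded-like one. A single scalar of this kind could then be passed to a classifier in place of the raw spectrum, which is one way to read the abstract's claim about reduced input complexity.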

Similar Papers

The Longin SNARE VAMP7/TI-VAMP Adopts a Closed Conformation
Sandro Vivona ... Axel T Brunger
Journal of Biological Chemistry | VOL. 285 | 01 Jun 2010

Interrupted Hydrogen/Deuterium Exchange Reveals the Stable Core of the Remarkably Helical Molten Globule of α-β Parallel Protein Flavodoxin
Sanne M Nabuurs ... Carlo P.M Van Mierlo
Journal of Biological Chemistry | VOL. 285 | 01 Feb 2010

HSQC Spectra Simulation and Matching for Molecular Identification
Martin Priessner ... Anna Tomberg
Journal of Chemical Information and Modeling | VOL. 64 | 27 Mar 2024

Conformational Complexity and Dynamics in a Muscarinic Receptor Revealed by NMR Spectroscopy
Jun Xu ... Brian K Kobilka
Molecular Cell | VOL. 75 | 15 May 2019
