Abstract

Cytochrome P450 monooxygenases (CYPs) are a large, highly diverse protein family with a common fold. The sequences, structures, and functions of CYPs have been extensively studied resulting in more than 53,000 scientific articles. A sequence-based literature mining algorithm was designed to systematically analyze this wealth of information on SNPs, designed mutations, structural interactions, or functional roles of individual residues. Structurally corresponding positions in different CYPs were compared and universal selectivity-determining positions were identified. Based on the Cytochrome P450 Engineering Database (www.CYPED.BioCatNet.de) and a standard numbering scheme for all CYPs, 4000 residues in 168 CYPs mentioned in 2400 articles could be assigned to 440 structurally corresponding standard positions of the CYP fold, covering 96% of all standard positions. Seventeen individual standard positions were mentioned in the context of more than 32 different CYPs. The majority of these most frequently mentioned positions are located on the six substrate recognition sites and are involved in control of selectivity, such as the well-studied position 87 in CYP102A1 (P450(BM-3)) which was mentioned in the articles on 63 different CYPs. The recurrent citation of the 17 frequently mentioned positions for different CYPs suggests their universal functional relevance.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call