New analysis pipeline for high-throughput domain–peptide affinity experiments improves SH2 interaction data

Tom Ronan, Roman Garnett, Kristen M Naegle

doi:10.1074/jbc.ra120.012503

Open Access

Abstract

Protein domain interactions with short linear peptides, such as those of the Src homology 2 (SH2) domain with phosphotyrosine-containing peptide motifs (pTyr), are ubiquitous and important to many biochemical processes of the cell. The desire to map and quantify these interactions has resulted in the development of high-throughput (HTP) quantitative measurement techniques, such as microarray or fluorescence polarization assays. For example, in the last 15 years, experiments have progressed from measuring single interactions to covering 500,000 of the 5.5 million possible SH2-pTyr interactions in the human proteome. However, high variability in affinity measurements and disagreements about positive interactions between published data sets led us here to reevaluate the analysis methods and raw data of published SH2-pTyr HTP experiments. We identified several opportunities for improving the identification of positive and negative interactions and the accuracy of affinity measurements. We implemented model-fitting techniques that are more statistically appropriate for the nonlinear SH2-pTyr interaction data. We also developed a method to account for protein concentration errors due to impurities and degradation or protein inactivity and aggregation. Our revised analysis increases the reported affinity accuracy, reduces the false-negative rate, and increases the amount of useful data by adding reliable true-negative results. We demonstrate improvement in classification of binding versus nonbinding when using machine-learning techniques, suggesting improved coherence in the reanalyzed data sets. We present revised SH2-pTyr affinity results and propose a new analysis pipeline for future HTP measurements of domain-peptide interactions.
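As a rough illustration of what fitting a nonlinear binding model involves, the sketch below fits a one-site saturation curve directly to titration data by least squares. It is not the authors' pipeline: the one-site model, the function names (`one_site`, `fit_one_site`), the search bounds on Kd, and the example concentrations are all assumptions made for this example. Because the signal is linear in the saturation level `f_max` once Kd is fixed, the sketch solves for `f_max` in closed form and searches only over log10(Kd).

```python
import math

def one_site(conc, f_max, kd):
    """Single-site saturation binding signal at protein concentration conc."""
    return f_max * conc / (kd + conc)

def _profile_sse(log_kd, concs, signals):
    """Squared error at a fixed Kd, using the closed-form optimal f_max."""
    kd = 10.0 ** log_kd
    g = [c / (kd + c) for c in concs]
    f_max = sum(y * gi for y, gi in zip(signals, g)) / sum(gi * gi for gi in g)
    sse = sum((y - f_max * gi) ** 2 for y, gi in zip(signals, g))
    return sse, f_max

def fit_one_site(concs, signals, log_kd_min=-3.0, log_kd_max=3.0, steps=121):
    """Least-squares fit of (f_max, Kd); bounds and grid size are assumptions."""
    # Coarse grid over log10(Kd) to locate the neighborhood of the minimum.
    width = (log_kd_max - log_kd_min) / (steps - 1)
    grid = [log_kd_min + i * width for i in range(steps)]
    best = min(range(steps), key=lambda i: _profile_sse(grid[i], concs, signals)[0])
    lo = grid[max(best - 1, 0)]
    hi = grid[min(best + 1, steps - 1)]
    # Golden-section refinement inside the bracketing grid cells.
    phi = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(60):
        a = hi - phi * (hi - lo)
        b = lo + phi * (hi - lo)
        if _profile_sse(a, concs, signals)[0] < _profile_sse(b, concs, signals)[0]:
            hi = b
        else:
            lo = a
    log_kd = (lo + hi) / 2.0
    sse, f_max = _profile_sse(log_kd, concs, signals)
    return f_max, 10.0 ** log_kd, sse

# Synthetic, noise-free titration (concentrations in arbitrary units).
concs = [0.01, 0.03, 0.1, 0.3, 1.0, 3.0, 5.0]
signals = [one_site(c, 100.0, 0.4) for c in concs]
f_max_fit, kd_fit, sse_fit = fit_one_site(concs, signals)
```

On real data the residual error would not vanish, and deciding whether a curve represents binding at all would need a further criterion that this sketch does not attempt.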

Highlights

Replicates, reflecting random noise and experimental error, taken [...] highest likelihood of the true population value of affinity
Protein concentration errors due to batch impurities or degradation can manifest as a range of Kd values in replicate measurements made from different batches of protein, all of which would be equal to or higher than the true Kd, while simultaneously coming from high-quality, low-noise replicate fits.
Because we do not have true information at the batch level or on the activity of each protein sample, these patterns must be inferred from the data. Although these patterns are difficult to spot due to the nature of the experimental design, we find examples of nonrandom run-dependent variations in affinity in the data (Fig. S11).

Summary

Results

In the process of evaluating published high-throughput data, we found significant disagreement between data sets. Protein concentration errors due to batch impurities or degradation can manifest as a range of Kd values in replicate measurements made from different batches of protein, all of which would be equal to or higher than the true Kd, while simultaneously coming from high-quality, low-noise replicate fits. This exact phenomenon has been demonstrated experimentally [31]. Note that although the minimum of each replicate group was selected as most accurately reflecting the true affinity, our revised affinity values are not all lower than those in the original publication. This is primarily due to significant changes at the replicate level, where some original replicates were removed from consideration by changes in the fitting process, and a number of new replicates were included in each replicate set. The improved average performance and lower variability in our revised results suggest improved coherency in our revised analysis over the original published results.
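The one-sided nature of this bias can be sketched with a small example. It assumes a one-site binding model in which protein is titrated in excess over the labeled peptide, and in which only a fraction of the nominal protein concentration is active. Under those assumptions the curve measured against nominal concentration is still a perfect one-site curve, but with an apparent Kd equal to the true Kd divided by the active fraction. The function names and the `active_fraction` parameter are hypothetical and are not taken from the paper.

```python
def apparent_kd(true_kd, active_fraction):
    """Kd recovered when the nominal concentration overstates active protein."""
    if not 0.0 < active_fraction <= 1.0:
        raise ValueError("active_fraction must be in (0, 1]")
    return true_kd / active_fraction

def signal(nominal_conc, true_kd, active_fraction, f_max=1.0):
    """One-site signal driven by the active protein only."""
    active = active_fraction * nominal_conc
    return f_max * active / (true_kd + active)

def best_estimate(replicate_kds):
    """Smallest replicate Kd, the value least inflated by inactive protein."""
    return min(replicate_kds)

# Four hypothetical batches of the same protein with different active fractions.
true_kd = 0.5
batch_kds = [apparent_kd(true_kd, f) for f in (1.0, 0.8, 0.5, 0.25)]
estimate = best_estimate(batch_kds)
```

Every batch value is at or above the true Kd, so averaging the replicates would overestimate it, whereas the minimum is the closest available estimate when at least one batch is close to fully active. Random measurement noise, which can push a replicate below the true value, is deliberately left out of this sketch.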

Discussion

Experimental procedures

Journal: Journal of Biological Chemistry
Publication Date: Aug 1, 2020
Citations: 2
License type: cc-by
