In Vitro Transcription Factor Binding Site Predictions Using Support Vector Machine Classification

Diego A Pomales‐Matos, Emmanuel A Carrasquillo‐Dones, Diego A Rosado‐Tristani, José A Rodríguez‐Martínez

doi:10.1096/fasebj.2022.36.s1.r6070

Open Access

https://doi.org/10.1096/fasebj.2022.36.s1.r6070

Abstract

Transcription factors (TFs) are sequence‐specific DNA‐binding proteins that are essential for regulating gene expression. Determining TF DNA‐binding specificity helps in studying gene regulatory networks within cells and how genetic variation can disrupt normal gene expression. One way to characterize TF specificity is to train Support Vector Machines (SVMs) on chromatin immunoprecipitation followed by DNA‐sequencing (ChIP‐seq) data. SVMs can also be trained on data from Systematic Evolution of Ligands by Exponential Enrichment (SELEX), an in vitro method for determining TF‐DNA binding preferences. In this project, we implemented a large‐scale gapped k‐mer SVM, a sequence‐based SVM for analyzing TF specificity, to study TF‐DNA binding preferences using SELEX‐seq data. The method builds a predictive model trained on bound and unbound sequences from SELEX data. For this project, we used T‐box transcription factor 5 (TBX5). After training the model for TBX5 and testing its performance, we obtained an AUROC of 0.8248, indicating that the model discriminates bound from unbound sequences well above chance (an AUROC of 0.5). Likewise, the highest‐scoring sequences contained TBX5 motifs. Given these results, we concluded that the SVM was successfully implemented. In addition, SELEX data had not previously been used to train SVM‐based predictive models; these results indicate that SELEX data are compatible with, and useful for, developing such models.
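
The two technical ingredients named in the abstract, gapped k‐mer features and the AUROC metric, can be illustrated with a minimal sketch. This is not the authors' pipeline: it trains no SVM, the window length `l=6` and informative‐position count `k=4` are illustrative assumptions rather than values from the abstract, and the function names are hypothetical. The similarity function is a plain normalized dot product of feature counts, which is the idea behind a gapped k‐mer kernel, not the optimized computation used by real tools.

```python
import math
from collections import Counter
from itertools import combinations


def gapped_kmer_counts(seq, l=6, k=4):
    """Count gapped k-mers: each length-l window with l-k positions masked as '.'.

    l and k are illustrative defaults (assumptions), not values from the abstract.
    """
    counts = Counter()
    masks = [set(keep) for keep in combinations(range(l), k)]
    for i in range(len(seq) - l + 1):
        window = seq[i:i + l]
        for keep in masks:
            feature = "".join(c if j in keep else "." for j, c in enumerate(window))
            counts[feature] += 1
    return counts


def gkm_similarity(a, b, l=6, k=4):
    """Normalized dot product of gapped k-mer count vectors (cosine similarity)."""
    ca, cb = gapped_kmer_counts(a, l, k), gapped_kmer_counts(b, l, k)
    dot = sum(v * cb[f] for f, v in ca.items())
    norm = math.sqrt(sum(v * v for v in ca.values()) * sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def auroc(scores, labels):
    """AUROC as the probability that a random positive outscores a random negative.

    Ties count as 0.5. labels use 1 for bound and 0 for unbound sequences.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

For example, `auroc([0.9, 0.2, 0.8, 0.1], [1, 1, 0, 0])` compares each bound score with each unbound score; three of the four pairs are ranked correctly. In a full workflow, the scores would come from a trained classifier applied to held‐out sequences.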

Similar Papers

Transcription Factors and DNA Play Hide and Seek.
David M Suter
Trends in Cell Biology | VOL. 30
07 Apr 2020

Evaluating a linear k-mer model for protein-DNA interactions using high-throughput SELEX data
Juhani Kähärä ... Harri Lähdesmäki
BMC Bioinformatics | VOL. 14
01 Aug 2013

Transcription Factor Binding Affinities and DNA Shape Readout.
Max Schnepf ... Ulrike Gaul
iScience | VOL. 23
15 Oct 2020

In vitro DNA-binding profile of transcription factors: methods and new insights
Jinke Wang ... Jie Lu
Journal of Endocrinology | VOL. 210
09 Mar 2011
