Predicting transcription factor binding sites by a multi-modal representation learning method based on cross-attention network

Yuxiao Wei, Qi Zhang, Liwei Liu

doi:10.1016/j.asoc.2024.112134

Abstract

The prediction of transcription factor binding sites (TFBS) plays a crucial role in studying cellular functions and understanding transcriptional regulatory processes. With the development of chromatin immunoprecipitation sequencing (ChIP-seq) technology, an increasing number of computational TFBS prediction models have emerged. However, integrating multi-modal DNA information and extracting informative features to improve prediction accuracy remains a major challenge. Here, we propose MultiTF, a multi-modal representation learning method based on a cross-attention network for predicting TFBS. Among TFBS prediction methods, MultiTF is the first to use graph neural networks and cross-attention networks for representation learning. MultiTF uses dna2vec to extract global contextual features of DNA sequences, DNAshapeR to extract shape features, and the CDPfold model with a graph attention network to learn and represent DNA structural features. A cross-attention module then fuses the sequence, structural, and shape features interactively. Compared with other state-of-the-art methods on 165 ENCODE ChIP-seq datasets, MultiTF achieves average accuracy (ACC), ROC-AUC, and PR-AUC values of 0.911, 0.978, and 0.982, respectively, exceeding the prediction accuracy of previous TFBS prediction models. In addition, our visual analysis of structural features provides interpretability for the prediction results.
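
To make the fusion step more concrete, the following is a minimal sketch of single-head scaled dot-product cross-attention, in which one modality supplies the queries and another supplies the keys and values. It is not the MultiTF architecture: the function names, feature dimensions, and random projection matrices are all illustrative assumptions, and a trained model would learn the projections rather than sample them.

```python
import math
import random


def matmul(a, b):
    """Multiply an n x k matrix by a k x m matrix (lists of lists)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(query_feats, context_feats, w_q, w_k, w_v):
    """Let each query position attend over all context positions.

    query_feats:   n_q x d_q features of one modality (e.g. sequence)
    context_feats: n_c x d_c features of another modality (e.g. shape)
    w_q, w_k, w_v: projections into a shared d_model-dimensional space
    """
    q = matmul(query_feats, w_q)
    k = matmul(context_feats, w_k)
    v = matmul(context_feats, w_v)
    scale = math.sqrt(len(q[0]))
    scores = matmul(q, transpose(k))
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


def random_matrix(rows, cols, rng):
    """Hypothetical stand-in for learned weights or extracted features."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
seq_feats = random_matrix(6, 8, rng)    # assumed: 6 positions, 8-dim sequence embeddings
shape_feats = random_matrix(4, 5, rng)  # assumed: 4 positions, 5-dim shape features
d_model = 3                             # assumed shared attention dimension
w_q = random_matrix(8, d_model, rng)
w_k = random_matrix(5, d_model, rng)
w_v = random_matrix(5, d_model, rng)

fused, weights = cross_attention(seq_feats, shape_feats, w_q, w_k, w_v)
print(len(fused), len(fused[0]))  # one fused vector per query position
```

Each row of `weights` is a probability distribution over the context positions, so every fused vector is a weighted average of the projected context features. Swapping which modality provides the queries gives the attention in the opposite direction.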

More From: Applied Soft Computing

Similar Papers

MaxATAC: Genome-scale transcription-factor binding prediction from ATAC-seq with deep neural networks.
Tareian A Cazares ... Teresa M Przytycka
PLOS Computational Biology | VOL. 19
31 Jan 2023

Transcription Factor Binding Sites Prediction Based on Modified Nucleosomes
Mohammad Talebzadeh ... Fatemeh Zare-Mirakabad
PLoS ONE | VOL. 9
21 Feb 2014

Evaluierung des phylogenetischen Footprintings und dessen Anwendung zur verbesserten Vorhersage von Transkriptionsfaktor-Bindestellen [Evaluation of phylogenetic footprinting and its application to improved prediction of transcription factor binding sites]
Tilman Sauer
20 Feb 2022

Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes
Shaoqiang Zhang ... Phuc T Pham
BMC Bioinformatics | VOL. 11
23 Jul 2010
