Abstract

Cell type-specific gene expressions during development or in disease are regulated by interactions between transcription factors (TFs) and their binding sites. Recently, many deep learning approaches have been developed to characterize TF-DNA binding within a population of cells. However, determining TF binding sites (TFBSs) in single cells remains challenging due to the sparsity of data. Here, we propose a multi-stage transfer learning framework called STAPLE for single-cell TF-DNA binding prediction and analysis. Specifically, we design the Cell Type Learning to capture the relationship between different TF-DNA binding events in the same cell type. Meanwhile, we present Individual Learning to extract common motif and chromatin accessibility features of a particular binding event in a cellular population. In addition, we leverage Single-cell Learning to annotate TFBSs in each cell without any supervised label. Extensive experiments based on 570 single-cell datasets validate the effectiveness of our framework for considering cellular heterogeneity, outperforming current methods. This work can provide new insight into the relationship between TF-DNA binding and cellular heterogeneity. The source code of STAPLE can be found at https://github.com/ZhangLab312/STAPLE.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.