Alternative splicing events increase the transcriptomic and proteomic complexity in cancers. Overexpression of metastasis-associated lung adenocarcinoma transcript 1 (MALAT1), a highly conserved lncRNA, is widely known to promote cancer development, one mechanism for which may be through the regulation of alternative splicing and, thereby, gene expression. Its regulatory interactions with proteins have been a subject of much interest, yet little research has been carried out on the mechanisms adopted. It has been observed that MALAT1 binds to RNA-binding protein with serine-rich domain 1 (RNPS1), being colocalized in the nuclear speckles, and together, these two binding partners may regulate alternative splicing. Upregulated RNPS1 is predicted to play a key role in the pan-cancer development. Experimental tertiary structure of full-length MALAT1 is currently lacking despite the availability of the 3D structure of 3' expression and nuclear retention element. We hypothesize that the computationally modeled tertiary structures of the specific binding motifs in the M-region, E-region, and full-length structures of MALAT1 may adopt a modular structure and bind to the RNPS1 loop region of RS/P domain involved in exon skipping, interacting in a manner fully consistent with the biochemical experiments. Extensive observations using the powerful molecular dynamics (MD) simulations of MALAT1 regions bound to RNPS1 suggested that all three regions form interactive, yet stable complexes. The ranking of the MM-GBSA- and MM-PBSA-derived binding free energies between these complexes corroborated well in the MD simulations and experiments. Energy decomposition analyses suggested that arginines in the RNPS1 protein are among the major contributors toward the binding free energies as calculated by MM-GBSA present in the Amber package; while among the nucleotides, the major contributors were nucleotides with G and A nucleobases, with more contributory effect in comparison to arginines, across the bound M-region, E-region, and full-length MALAT1. This suggests that specific purines play a greater role in the complex formation, in a loop-specific manner, and the more proactive approach in complexation tilts toward MALAT1. To the best of our knowledge, our studies are the first studies taking a unique approach, utilizing the binding motifs to deduce a tertiary structure of MALAT1, toward our understanding of the lncRNA-protein interactions, stability, and binding on a structural basis. The therapeutic implications of targeting this complex formation to regulate splicing and hence, oncogenesis, is further envisaged.
Read full abstract