Stage 4 neuroblastoma (NBL), a solid tumor of childhood, has a poor prognosis. Despite intensive molecular genetic studies, no targetable gene abnormalities have been identified. Stage 4S NBL has a characteristic of spontaneous regression, and elucidation of the mechanistic differences between stages 4 and 4S may improve treatment. Conventional NBL studies have mainly focused on the detection of abnormalities in individual genes and have rarely examined abnormalities in gene networks. While the gene coexpression network is expected to contribute to the detection of network abnormalities, the fragility of the network due to data noise and the extraction of arbitrary topological structures for the high-dimensional network are issues. The present paper concerns the classification method of stages 4 and 4S NBL patients using highly accurate gene coexpression network analysis based on RNA-sequencing data of transcription factors (TFs). In particular, after applying a noise reduction method RECODE, generalized topological overlapping measure (GTOM), which weighs the connections of nodes in the network structure, succeeded in extracting a cluster of TFs that showed high classification performance for stages 4 and 4S. In addition, we investigated how these clusters correspond to clinical information and to TFs which control the normal adrenal tissue and NBL characters. A clustering method is presented for finding intermediate-scale clusters of TFs that give considerable separation performance for distinguishing between stages 4 and 4S. It is suggested that this method is useful as a way to extract factors that contribute to the separation of groups from multiple pieces of information such as gene expression levels.
Read full abstract