Abstract

BackgroundWith the development of high-throughput sequencing technology, a huge amount of multi-omics data has been accumulated. Although there are many software tools for statistical analysis and visual development of omics data, these tools are not suitable for private data and non-technical users. Besides, most of these tools have specialized in only one or perhaps a few data typesare, without combining clinical information. What’s more, users could not choose data processing and model selection flexibly when using these tools.ResultsTo help non-technical users to understand and analyze private multi-omics data and ensure data security, we developed an interactive desk tool for statistical analysis and visualization of omics and clinical data (shortly IOAT). Our mainly targets csv format data, and combines clinical data with high-dimensional multi-omics data. It also contains various operations, such as data preprocessing, feature selection, risk assessment, clustering, and survival analysis. By using this tool, users can safely and conveniently try a combination of various methods on their private multi-omics data to find a model suitable for their data, conduct risk assessment and determine their cancer subtypes. At the same time, the tool can also provide them with references to genes that are closely related to tumor staging, facilitating the development of precision oncology. We review IOAT’s main features and demonstrate its analysis capabilities on a lung from TCGA.ConclusionsIOAT is a local desktop tool, which provides a set of multi-omics data integration solutions. It can quickly perform a complete analysis of cancer genome data for subtype discovery and biomarker identification without security issues and writing any code. Thus, our tool can enable cancer biologists and biomedicine researchers to analyze their data more easily and safely. IOAT can be downloaded for free from https://github.com/WlSunshine/IOAT-software.

Highlights

  • With the development of high-throughput sequencing technology, a huge amount of multi-omics data has been accumulated

  • (2) They provide a relatively fixed calculation process, which cannot provide users with various flexible methods, including preprocessing, training models, clustering, and so on. They cannot combine different models for users to choose, such as the UCSC Xena [2] and Firehose [3]. We found that these tools do not completely combine multi-omics data with clinical data to carry out molecular subtype research

  • We developed an interactive tool for statistical analysis of omics and clinical data, which enables non-technical users to perform research on private high-dimensional multi-omics data without any programming burden and security risks

Read more

Summary

Introduction

With the development of high-throughput sequencing technology, a huge amount of multi-omics data has been accumulated. There are many software tools for statistical analysis and visual development of omics data, these tools are not suitable for private data and non-technical users. Most of these tools have specialized in only one or perhaps a few data typesare, without combining clinical information. (1) Those tools have traditionally specialized in only one or perhaps a few data types While these complex datasets generate insights individually, integrating with other-omics datasets is crucial to help researchers discover and validate findings. In our test, when web-based tools upload user data (lung cancer data used by IOAT), many tools crash due to the large data set and cross-regional issues, resulting in a very poor user experience

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call