Generation Sequencing Data Analysis Research Articles

Abstract The successful application of Next Generation Sequencing (NGS) to drug discovery requires systems to manage and document each step of the sequencing process, from sample receipt through data generation and data processing. We combined Benchling™, a solution for tracking NGS lab processes, with FONDA (Framework Of Next generation sequencing Data Analysis), an internally developed data processing platform, to support multiple types of NGS data generation and processing. Benchling combines a digital notebook and a laboratory information management system (LIMS). The system documents and automates steps in the NGS process, including sample registration, nucleic acid extraction, library construction, flow cell construction, sequencer sample sheet generation and BCL2FASTQ conversion. This enables wet lab scientists to easily retrieve an appropriate protocol for each sample and sequencing library type. We connected our sequencers to Benchling in order to monitor each sequencing run and to keep track of the quality of NGS data. In addition, Benchling generates an “analysis-ready sample sheet” (containing project and study information, the location of the FASTQ files, sample species and library type) and uploads it to designated S3 buckets for data processing. Benchling dashboards provide overviews of NGS sample preparation, data generation and quality control. In summary, Benchling interconnects the original sample, the labels, the barcodes, the cDNA/DNA, the library, and all the QC results. We process NGS data using pipelines implemented in FONDA on a dockerized Amazon Web Services cloud platform. Analyses can be configured automatically from information exported by Benchling or launched manually. After data processing is completed, output files such as gene expression counts or variant calls are deposited into project-specific folders, ready for secondary analysis.
In the current FONDA version (as of Nov 2020), we have developed pipelines for single-cell multi-omics (CITE-seq and single-cell immune profiling) and bulk RNA-seq. The modular design of FONDA facilitates the development, updating, and extension of pipelines to new sequencing technologies. In summary, Benchling and FONDA enable high-quality sample and NGS data flows from the lab for target identification, understanding mechanism of action, patient stratification and biomarker discovery. Availability and implementation: FONDA is implemented in Java and released under the Apache License 2.0. FONDA can be downloaded from GitHub at https://github.com/epam/fonda. Citation Format: Chandra Sekhar Pedamallu, Joon Sang Lee, Shu Yan, Adalis Maisonet, Aleksandr Sidoruk, Tengui Chen, Yulia Kamyshova, Mariia Zueva, Mark Magid, Quan Wan, Jeffrey Thompson, Valerie Zebrouck, Immanuel Gadaczek, Mikhail Alperovich, Brian McNatt, Alexei Protopopov, Donald Jackson, Jack Pollard. A comprehensive sample tracking and data processing workflow for next generation sequencing [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 2280.
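As an illustration only, the "analysis-ready sample sheet" described in the abstract above could be modeled as one record per sample and serialized to CSV before upload. The sketch below is not the Benchling or FONDA format: the column names, the sample values, the bucket path, and the helper functions `write_sample_sheet` and `group_by_library_type` are all hypothetical, chosen to mirror the kinds of information the abstract lists (project, study, FASTQ location, species, library type).

```python
import csv
import io
from dataclasses import asdict, dataclass, fields


@dataclass
class SampleSheetRow:
    # Hypothetical column names: the abstract lists the kinds of information
    # in the sheet, not its actual schema.
    project: str
    study: str
    sample_id: str
    fastq_location: str
    species: str
    library_type: str


def write_sample_sheet(rows):
    """Serialize rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=[f.name for f in fields(SampleSheetRow)],
        lineterminator="\n",
    )
    writer.writeheader()
    for row in rows:
        writer.writerow(asdict(row))
    return buf.getvalue()


def group_by_library_type(rows):
    """Group sample IDs by library type, e.g. to choose a pipeline per group."""
    groups = {}
    for row in rows:
        groups.setdefault(row.library_type, []).append(row.sample_id)
    return groups


# Made-up example records; the S3 paths are placeholders, not real locations.
rows = [
    SampleSheetRow("PROJ1", "STUDY1", "S1",
                   "s3://example-bucket/run1/S1_R1.fastq.gz", "human", "bulk RNA-seq"),
    SampleSheetRow("PROJ1", "STUDY1", "S2",
                   "s3://example-bucket/run1/S2_R1.fastq.gz", "human", "CITE-seq"),
    SampleSheetRow("PROJ1", "STUDY2", "S3",
                   "s3://example-bucket/run1/S3_R1.fastq.gz", "mouse", "bulk RNA-seq"),
]

sheet_text = write_sample_sheet(rows)
groups = group_by_library_type(rows)
print(sheet_text)
print(groups)
```

Grouping by library type is one plausible way a downstream step might decide which pipeline to launch for each set of samples; the abstract states only that analyses can be configured automatically from information exported by Benchling.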

Abstract Background: Downregulation or loss of TFF1 expression occurs in more than half of human gastric adenocarcinomas through gene deletions, mutations, loss of heterozygosity, or hypermethylation. Our previous studies have shown that Tff1 knockout (KO) in mice induces gastric lesions ranging from low-grade dysplasia (LGD) and high-grade dysplasia (HGD) to adenocarcinoma. BRD2, a member of the BET protein family, promotes aberrant gene expression in a variety of malignant tumors. In this study, through analysis of Tff1 KO mice and human gastric cancer tissue samples, we discovered that loss of Tff1 promotes gastric cancer proliferation and drug resistance by upregulating BRD2 expression. Methods and Results: Integrated Next Generation Sequencing data analysis of both Tff1 KO mice and human gastric cancer tissue samples demonstrated that miR-143-3p was significantly down-regulated in both mouse and human gastric cancer samples (P<0.05). Our findings were further validated by RT-PCR in Tff1 KO mouse gastric lesions (LGD and adenocarcinoma) and 3 different cohorts of human gastric cancer tissue samples from the USA, Chile, and China. Using different gastric cancer cell models, we further confirmed that loss of TFF1 decreases miR-143-3p. Next, we sought to identify the downstream effector of miR-143-3p down-regulation. Western blot data showed that the BRD2 protein level was regulated by miR-143-3p in 4 gastric cancer cell lines. A BRD2 3'UTR luciferase reporter assay further confirmed that miR-143-3p decreased BRD2 protein expression by binding directly to its 3'UTR sites. These data suggest, for the first time, that BRD2 is a direct downstream target of miR-143-3p. Meanwhile, the reconstitution of TFF1 in gastric cancer cells up-regulated miR-143-3p expression, which, in turn, decreased the protein expression levels of BRD2, MYC and BCL-2.
Using western blot, we showed the synergistic effect of a BRD2 inhibitor and CDDP chemotherapy in human gastric cancer cells. Data from 320 human gastric cancer patients demonstrated that high BRD2 expression in gastric cancer tissues significantly decreased the overall patient survival rate (P<0.0001). Conclusion: This study shows, for the first time, that loss of TFF1 promotes BRD2 activation in gastric cancer by decreasing miR-143-3p. This axis presents novel therapeutic opportunities by using approaches that reconstitute miR-143-3p or by utilizing the recently developed clinical trials in human gastric cancer. Citation Format: Zheng Chen, Zheng Li, Mohammed Soutto, Weizhi Wang, Shoumin Zhu, Alejandro Corvalan, Zekuan Xu, Wael El-Rifai. Loss of TFF1 promotes growth and chemotherapeutic resistance in gastric cancer [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 1259.

Articles published on Generation Sequencing Data Analysis

Artificial intelligence for Next generation sequencing data analysis

Virome characterization and identification of a putative parvovirus and poxvirus in bat ectoparasites of Yunnan Province, China

Two Novel Mutations in LAMC2 Gene in Iranian Families Affected by Junctional Epidermolysis Bullosa.

Abstract 2280: A comprehensive sample tracking and data processing workflow for next generation sequencing

Next Generation Sequencing Data Analysis and its Applications in Agriculture

A 3D physio-mimetic interpenetrating network-based platform to decode the pro and anti-tumorigenic properties of cancer-associated fibroblasts

Quantitative profiling of protease specificity.

Deep Learning Applied on Next Generation Sequencing Data Analysis.

Machine Learning in Early Genetic Detection of Multiple Sclerosis Disease: A Survey

Abstract 2995: Dissecting renal cell carcinoma vulnerabilities: Using CRISPR/Cas9 to study resistance to glutaminase inhibition

Accelerating next generation sequencing data analysis: an evaluation of optimized best practices for Genome Analysis Toolkit algorithms.

The Roles of RUNX Family Proteins in Development of Immune Cells.

Abstract 1259: Loss of TFF1 promotes growth and chemotherapeutic resistance in gastric cancer

FISH mapping of 45S rRNA and IHHB genes on autosomes and B chromosome of cichlid fish Astatotilapia latifasciata

Accelerating next generation sequencing data analysis with system level optimizations

Next Generation Sequencing Data Analysis Evaluation in Patients with Parkinsonism from a Genetically Isolated Population

Genetic testing of 248 Chinese aortopathy patients using a panel assay.

Detection of Natural Resistance-Associated Substitutions by Ion Semiconductor Technology in HCV1b Positive, Direct-Acting Antiviral Agents-Naïve Patients.

A novel procedure on next generation sequencing data analysis using text mining algorithm

Detection Copy Number Variants from NGS with Sparse and Smooth Constraints.
