MicroSEC filters sequence errors for formalin-fixed and paraffin-embedded samples

Masachika Ikegami, Taisuke Mori, Toshihide Ueno, Naoki Kanomata, Shigeki Sekine, Hideko Yamauchi, Satoshi Inoue, Yoshihiro Inamoto, Hiroshi Kobayashi, Takeshi Hirose, Sakae Tanaka, Hiroyuki Mano, Shinji Kohsaka, Yasushi Yatabe

doi:10.1038/s42003-021-02930-4

Abstract

The clinical sequencing of tumors is usually performed on formalin-fixed, paraffin-embedded samples and results in many sequencing errors. We identified that most of these errors are detected in chimeric reads caused by single-strand DNA molecules with microhomology. During the end-repair step of library preparation, mutations are introduced by the mis-annealing of two single-strand DNA molecules comprising homologous sequences. The mutated bases are distributed unevenly, near the ends of the individual reads. Our filtering pipeline, MicroSEC, focuses on the uneven distribution of mutations in each read and removes the sequencing errors in formalin-fixed, paraffin-embedded samples without over-eliminating the mutations that are also detected in fresh frozen samples. Amplicon-based sequencing using 97 mutations confirmed that the sensitivity and specificity of MicroSEC were 97% (95% confidence interval: 82–100%) and 96% (95% confidence interval: 88–99%), respectively. Our pipeline will increase the reliability of clinical sequencing and advance cancer research using formalin-fixed, paraffin-embedded samples.
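The abstract reports sensitivity and specificity together with 95% confidence intervals. As a rough illustration of how such an interval can be obtained for a proportion, the sketch below computes an exact (Clopper-Pearson) binomial interval. The choice of method is an assumption, since the text above does not say which interval the authors used, and the function names are hypothetical. The underlying counts are not given here, so any inputs are the reader's own.

```python
from math import comb


def binomial_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k + 1))


def _solve_decreasing(func, target: float) -> float:
    """Bisection on [0, 1] for a function that decreases from 1 to 0."""
    lo, hi = 0.0, 1.0
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if func(mid) > target:
            lo = mid  # value still too high, so the root lies to the right
        else:
            hi = mid
    return (lo + hi) / 2.0


def clopper_pearson(successes: int, trials: int, alpha: float = 0.05):
    """Exact two-sided (1 - alpha) interval for a binomial proportion.

    Assumption: this is one common interval, not necessarily the one
    used in the paper.
    """
    if trials <= 0 or not 0 <= successes <= trials:
        raise ValueError("need trials > 0 and 0 <= successes <= trials")
    # Lower bound: p at which P(X >= successes) equals alpha / 2.
    lower = 0.0 if successes == 0 else _solve_decreasing(
        lambda p: binomial_cdf(successes - 1, trials, p), 1.0 - alpha / 2.0
    )
    # Upper bound: p at which P(X <= successes) equals alpha / 2.
    upper = 1.0 if successes == trials else _solve_decreasing(
        lambda p: binomial_cdf(successes, trials, p), alpha / 2.0
    )
    return lower, upper
```

Sensitivity would be passed as (true positives, all truly mutated sites) and specificity as (true negatives, all true artifacts). With small validation sets the interval is wide, which is why a 97% point estimate can carry a lower bound in the low 80s.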

Highlights

The clinical sequencing of tumors is usually performed on formalin-fixed, paraffin-embedded samples and results in many sequencing errors
Microhomology-induced chimeric reads (MICRs) are formed during the end-repair step of library preparation for clinical sequencing, wherein a considerable amount of extracted DNA is denatured to single-stranded DNA (ssDNA) and behaves like the primers used in site-directed mutagenesis polymerase chain reaction (PCR)[15]
We examined the performance of MicroSEC in distinguishing true mutations from formalin-fixed and paraffin-embedded (FFPE) artifacts with our custom‐made multigene panel test, “Todai OncoPanel”[2]

Summary

Introduction

The clinical sequencing of tumors is usually performed on formalin-fixed, paraffin-embedded samples and results in many sequencing errors. MicroSEC focuses on the uneven distribution of mutations in each read and removes the sequencing errors in formalin-fixed, paraffin-embedded samples without over-eliminating the mutations detected in fresh frozen samples. Based on our theory that artifacts are derived from ssDNA annealing, we have developed the MICR-originating Sequence Error Cleaning pipeline (MicroSEC), a post hoc filtering pipeline that predicts whether a given mutation is an MICR-derived error. The pipeline can process thousands of mutations from targeted sequencing data within hours on a standard PC with 16 gigabytes of memory. MicroSEC requires a list of mutations and the corresponding BAM files, rather than FASTQ files, because it uses the positional bias of the mutations within the mapped reads.
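The positional-bias idea can be illustrated with a minimal sketch. It is not the MicroSEC implementation: the record type, the 10-base window, and the 0.9 cutoff are hypothetical values chosen only for illustration, and the real pipeline reads these positions from BAM alignments and applies its own criteria.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class SupportingRead:
    """One read carrying the candidate mutation (hypothetical record type)."""
    read_length: int      # length of the aligned read in bases
    mutation_offset: int  # 0-based position of the mutated base in the read


def distance_to_nearest_end(read: SupportingRead) -> int:
    """Number of bases between the mutated base and the closer read end."""
    return min(read.mutation_offset,
               read.read_length - 1 - read.mutation_offset)


def fraction_near_end(reads: Sequence[SupportingRead], window: int = 10) -> float:
    """Share of supporting reads whose mutation lies within `window` bases of an end."""
    if not reads:
        raise ValueError("at least one supporting read is required")
    near = sum(1 for r in reads if distance_to_nearest_end(r) < window)
    return near / len(reads)


def looks_end_biased(reads: Sequence[SupportingRead],
                     window: int = 10,
                     max_fraction: float = 0.9) -> bool:
    """Flag a mutation whose supporting reads cluster at read ends.

    Both thresholds are illustrative assumptions, not MicroSEC parameters.
    """
    return fraction_near_end(reads, window) >= max_fraction


# Toy inputs: mutations packed at read ends versus spread along the reads.
suspect = [SupportingRead(100, 2), SupportingRead(100, 97), SupportingRead(100, 5)]
spread = [SupportingRead(100, 50), SupportingRead(100, 3),
          SupportingRead(100, 70), SupportingRead(100, 20)]
```

A genuine variant is expected to fall at varied positions across its supporting reads, whereas an MICR-derived error sits near the read ends in most of them. This is also why aligned reads (BAM) are needed rather than raw FASTQ: the offset of the mutation within each read is only known after mapping.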

Journal: Communications Biology	Publication Date: Dec 1, 2021
Citations: 3	License type: open-access
