Published January 1, 2015 | Version v1
Journal article Open

Robustness of Massively Parallel Sequencing Platforms

  • 1. TUBITAK MAM GMBE Sci & Technol Res Council Turkey, Genet Engn & Biotechnol Inst, Gebze, Kocaeli, Turkey
  • 2. Sci & Technol Res Council Turkey TUBITAK, BILGEM, Adv Genom & Bioinformat Res Grp IGBAM, Gebze, Kocaeli, Turkey
  • 3. Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
  • 4. Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
  • 5. Bilkent Univ, Dept Comp Engn, Ankara, Turkey

Description

The improvements in high throughput sequencing technologies (HTS) made clinical sequencing projects such as ClinSeq and Genomics England feasible. Although there are significant improvements in accuracy and reproducibility of HTS based analyses, the usability of these types of data for diagnostic and prognostic applications necessitates a near perfect data generation. To assess the usability of a widely used HTS platform for accurate and reproducible clinical applications in terms of robustness, we generated whole genome shotgun (WGS) sequence data from the genomes of two human individuals in two different genome sequencing centers. After analyzing the data to characterize SNPs and indels using the same tools (BWA, SAMtools, and GATK), we observed significant number of discrepancies in the call sets. As expected, the most of the disagreements between the call sets were found within genomic regions containing common repeats and segmental duplications, albeit only a small fraction of the discordant variants were within the exons and other functionally relevant regions such as promoters. We conclude that although HTS platforms are sufficiently powerful for providing data for first-pass clinical tests, the variant predictions still need to be confirmed using orthogonal methods before using in clinical applications.

Files

bib-a1cb2f60-ff51-4684-8dfa-3611bcc7099f.txt

Files (187 Bytes)

Name Size Download all
md5:852676f424517eceafb0ea031c123fea
187 Bytes Preview Download