Go to main contents Go to main menus

사용자별 맞춤메뉴

자주찾는 메뉴

추가하기
닫기

Research Articles

contents area

detail content area

Comprehensive Analysis to Improve the Validation Rate for Single Nucleotide Variants Detected by Next-Generation Sequencing
  • Date2018-02-05 17:24
  • Update2018-02-05 17:24
  • CountersignatureDivision of Research Planning
  • Tel043-719-8033
PLOS one, 2014, 01, e86664-1─e86664-9

Comprehensive Analysis to Improve the Validation Rate for Single Nucleotide Variants Detected by Next-Generation Sequencing

Mi-Hyun Park, Hwanseok Rhee, JPark, H Woo, H Kim, J Jung, S Koo

Abstract

    Next-generation sequencing (NGS) has enabled the high-throughput discovery of germline and somatic mutations. However, NGS-based variant detection is still prone to errors, resulting in inaccurate variant calls. Here, we categorized the variants detected by NGS according to total read depth (TD) and SNP quality (SNPQ), and performed Sanger sequencing with 348 selected non-synonymous single nucleotide variants (SNVs) for validation. Using the SAMtools and GATK algorithms, the validation rate was positively correlated with SNPQ but showed no correlation with TD. In addition, common variants called by both programs had a higher validation rate than caller-specific variants. We further examined several parameters to improve the validation rate, and found that strand bias (SB) was a key parameter. SB in NGS data showed a strong difference between the variants passing validation and those that failed validation, showing a validation rate of more than 92% (filtering cutoff value: alternate allele forward [AF]$20 and AF,80 in SAMtools, SB,?10 in GATK). Moreover, the validation rate increased significantly (up to 97?99%) when the variant was filtered together with the suggested values of mapping quality (MQ), SNPQ and SB. This detailed and systematic study provides comprehensive recommendations for improving validation rates, saving time and lowering cost in NGS analyses.


  • ISBN or ISSN: 1932-6203

  • 본 연구는 질병관리본부 연구개발과제(과제번호 2012-N61001-00) 연구비를 지원받아 수행되었습니다.
  • This research was supported by a fund(code 2012-N61001-00) by Research of Korea Centers for Disease Control and Prevention.


This public work may be used under the terms of the public interest source + commercial use prohibition + nonrepudiation conditions This public work may be used under the terms of the public interest source + commercial use prohibition + nonrepudiation conditions
TOP