发表状态 | 已发表Published |
题名 | Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data |
作者 | |
发表日期 | 2018-06-15 |
发表期刊 | Bioinformatics
![]() |
ISSN/eISSN | 1367-4803 |
卷号 | 34期号:12页码:2012-2018 |
摘要 | Motivation Haplotype information is essential to the complete description and interpretation of genomes, genetic diversity and genetic ancestry. The new technologies can provide Single Molecular Sequencing (SMS) data that cover about 90% of positions over chromosomes. However, the SMS data has a higher error rate comparing to 1% error rate for short reads. Thus, it becomes very difficult for SNP calling and haplotype assembly using SMS reads. Most existing technologies do not work properly for the SMS data. Results In this paper, we develop a progressive approach for SNP calling and haplotype assembly that works very well for the SMS data. Our method can handle more than 200 million non-N bases on Chromosome 1 with millions of reads, more than 100 blocks, each of which contains more than 2 million bases and more than 3K SNP sites on average. Experiment results show that the false discovery rate and false negative rate for our method are 15.7 and 11.0% on NA12878, and 16.5 and 11.0% on NA24385. Moreover, the overall switch errors for our method are 7.26 and 5.21 with average 3378 and 5736 SNP sites per block on NA12878 and NA24385, respectively. Here, we demonstrate that SMS reads alone can generate a high quality solution for both SNP calling and haplotype assembly. Availability and implementation Source codes and results are available at https://github.com/guofeieileen/SMRT/wiki/Software. |
DOI | 10.1093/bioinformatics/bty059 |
URL | 查看来源 |
收录类别 | SCIE |
语种 | 英语English |
WOS研究方向 | Biochemistry & Molecular Biology ; Biotechnology & Applied Microbiology ; Computer Science ; Mathematical & Computational Biology ; Mathematics |
WOS类目 | Biochemical Research Methods ; Biotechnology & Applied Microbiology ; Computer Science, Interdisciplinary ApplicationsMathematical & Computational Biology ; Statistics & Probability |
WOS记录号 | WOS:000435461900005 |
Scopus入藏号 | 2-s2.0-85049083989 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | https://repository.uic.edu.cn/handle/39GCC9TT/13126 |
专题 | 个人在本单位外知识产出 |
通讯作者 | Wang, Lusheng |
作者单位 | 1.School of Computer Science and Technology,Tianjin University,Tianjin Haihe Education Park,Tianjin,China 2.Department of Computer Science,City University of Hong Kong,Kowloon Tong,Hong Kong 3.University of Hong Kong Shenzhen Research Institute,Shenzhen Hi-Tech Industrial Park,Shenzhen, Guangdong,Hong Kong |
推荐引用方式 GB/T 7714 | Guo, Fei,Wang, Dan,Wang, Lusheng. Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data[J]. Bioinformatics, 2018, 34(12): 2012-2018. |
APA | Guo, Fei, Wang, Dan, & Wang, Lusheng. (2018). Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data. Bioinformatics, 34(12), 2012-2018. |
MLA | Guo, Fei,et al."Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data". Bioinformatics 34.12(2018): 2012-2018. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论