科研成果详情

发表状态已发表Published
题名Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data
作者
发表日期2018-06-15
发表期刊Bioinformatics
ISSN/eISSN1367-4803
卷号34期号:12页码:2012-2018
摘要

Motivation Haplotype information is essential to the complete description and interpretation of genomes, genetic diversity and genetic ancestry. The new technologies can provide Single Molecular Sequencing (SMS) data that cover about 90% of positions over chromosomes. However, the SMS data has a higher error rate comparing to 1% error rate for short reads. Thus, it becomes very difficult for SNP calling and haplotype assembly using SMS reads. Most existing technologies do not work properly for the SMS data. Results In this paper, we develop a progressive approach for SNP calling and haplotype assembly that works very well for the SMS data. Our method can handle more than 200 million non-N bases on Chromosome 1 with millions of reads, more than 100 blocks, each of which contains more than 2 million bases and more than 3K SNP sites on average. Experiment results show that the false discovery rate and false negative rate for our method are 15.7 and 11.0% on NA12878, and 16.5 and 11.0% on NA24385. Moreover, the overall switch errors for our method are 7.26 and 5.21 with average 3378 and 5736 SNP sites per block on NA12878 and NA24385, respectively. Here, we demonstrate that SMS reads alone can generate a high quality solution for both SNP calling and haplotype assembly. Availability and implementation Source codes and results are available at https://github.com/guofeieileen/SMRT/wiki/Software.

DOI10.1093/bioinformatics/bty059
URL查看来源
收录类别SCIE
语种英语English
WOS研究方向Biochemistry & Molecular Biology ; Biotechnology & Applied Microbiology ; Computer Science ; Mathematical & Computational Biology ; Mathematics
WOS类目Biochemical Research Methods ; Biotechnology & Applied Microbiology ; Computer Science, Interdisciplinary ApplicationsMathematical & Computational Biology ; Statistics & Probability
WOS记录号WOS:000435461900005
Scopus入藏号2-s2.0-85049083989
引用统计
被引频次:21[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/13126
专题个人在本单位外知识产出
通讯作者Wang, Lusheng
作者单位
1.School of Computer Science and Technology,Tianjin University,Tianjin Haihe Education Park,Tianjin,China
2.Department of Computer Science,City University of Hong Kong,Kowloon Tong,Hong Kong
3.University of Hong Kong Shenzhen Research Institute,Shenzhen Hi-Tech Industrial Park,Shenzhen, Guangdong,Hong Kong
推荐引用方式
GB/T 7714
Guo, Fei,Wang, Dan,Wang, Lusheng. Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data[J]. Bioinformatics, 2018, 34(12): 2012-2018.
APA Guo, Fei, Wang, Dan, & Wang, Lusheng. (2018). Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data. Bioinformatics, 34(12), 2012-2018.
MLA Guo, Fei,et al."Progressive approach for SNP calling and haplotype assembly using single molecular sequencing data". Bioinformatics 34.12(2018): 2012-2018.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Guo, Fei]的文章
[Wang, Dan]的文章
[Wang, Lusheng]的文章
百度学术
百度学术中相似的文章
[Guo, Fei]的文章
[Wang, Dan]的文章
[Wang, Lusheng]的文章
必应学术
必应学术中相似的文章
[Guo, Fei]的文章
[Wang, Dan]的文章
[Wang, Lusheng]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。