引用本文: | 王昊,陈挺.第二、三代基因组测序数据混合拼接软件综述[J].生物信息学,2021,19(1):26-34. |
| WANG Hao,CHEN Ting.Review of computational methods for second-and third-generation sequencing hybrid assembly[J].Chinese Journal of Bioinformatics,2021,19(1):26-34. |
|
摘要: |
DNA 测序是生物信息学研究的重要内容之一,对测序序列的从头拼接是其中非常基础而重要的步骤。随着测序技术的不断更新,新的第三代测序数据拥有更长的序列长度、高错误率等性质,针对这些性质,同时使用二代、三代测序数据进行混合拼接是获得更好的拼接结果一种重要方式。本文介绍了现有的混合拼接软件的基本原理,并比较了不同软件拼接结果。最后,本文对选择拼接软件以及提出新的混合拼接方法的研究方向给出了建议。 |
关键词: 生物信息学 全基因组序列拼接 第三代测序技术 混合拼接 |
DOI:10.12113/202003006 |
分类号:TP391;TP301.6 |
文献标识码:A |
基金项目:国家自然科学基金(No.8,3,1,5,1404804);国家重点研发项目(No.2019Yfb1404804). |
|
Review of computational methods for second-and third-generation sequencing hybrid assembly |
WANG Hao1, CHEN Ting1,2
|
(1.Department of Computer Science and Technology,Tsinghua University, Beijing 100084, China;2. Institute for Artificial Intelligence, Tsinghua University, Beijing 100084, China)
|
Abstract: |
DNA sequencing is essential for bioinformatics research, and de novo genome assembly is the first step toward characterization of microbial species. With the continuous development of sequencing technology, the new third-generation sequencing produces longer reads but with higher error rates than the second-generation sequencing. Based on these properties, combining second- and third-generation sequencing data for hybrid assembly promises to obtain better de novo assembly results. This paper introduces the basic principles of state-of-the-art hybrid assembly methods and compares the assembly results of different software tools. Finally, suggestions are given on the selection of assembly software and and the research directions of improving hybrid assembly approaches. |
Key words: Bioinformatics Whole genome assembly Third-generation sequencing technology Hybrid assembly |