引用本文: | 刘俊宏,李 春.DNA序列基于k-字的数值刻画及其应用[J].生物信息学,2013,11(2):142-145. |
| LIU Jun-hong,LI Chun.A numerical characterization of DNA sequences based on k-words and its application[J].Chinese Journal of Bioinformatics,2013,11(2):142-145. |
|
摘要: |
借助DNA序列中k-字的频数,将序列转化成一个340维向量,进而计算物种间的进化距离。作为应用:分别以15个物种的β球蛋白基因、13种汉坦病毒的S片段以及26个闭壳龟线粒体基因为例,构建系统发生树,所得结果与前人的结论一致,说明了该方法的有效性。 |
关键词: k-字 340维向量 系统发生树 |
DOI:10.3969/j.issn.1672-5565.2013-02.20130212 |
分类号: |
基金项目:国家自然科学基金项目(11171042);辽宁省高等学校杰出青年学者成长计划(LJQ2011122)。 |
|
A numerical characterization of DNA sequences based on k-words and its application |
LIU Jun-hong, LI Chun
|
(Department of Mathematics, Bohai University, Liaoning Jinzhou 121013,China )
|
Abstract: |
By means of the frequencies of k-words, the DNA sequence is transformed into a 340-D vector, and then the evolutional distance is obtained. The proposed measure is used to construct phylogenetic trees on three separate sets:the full β-globin genes of 15species, the S segments of 13hantaviruses and 26Cuora mitochodriona genes. The results are consistent with those of previous analyses, which illustrates the utility of the approach. |
Key words: k-words 340-D Vector Phylogenetic Trees |