首页 | 官方网站   微博 | 高级检索  
     

基于流形学习的基因表达谱数据可视化
引用本文:肖传乐,曹槐.基于流形学习的基因表达谱数据可视化[J].生物信息学,2009,7(1):47-51.
作者姓名:肖传乐  曹槐
作者单位:1. 云南大学物理系非线性复杂系统中心,昆明,650091
2. 云南大学现代生物学研究中心,昆明,650091
摘    要:基因表达谱的可视化本质上是高维数据的降维问题。采用流形学习算法来解决基因表达谱的降维数据可视化,讨论了典型的流形学习算法(Isomap和LLE)在表达谱降维中的适用性。通过类内/类间距离定量评价数据降维的效果,对两个典型基因芯片数据集(结肠癌基因表达谱数据集和急性白血病基因表达谱数据集)进行降维分析,发现两个数据集的本征维数都低于3,因而可以用流形学习方法在低维投影空间中进行可视化。与传统的降维方法(如PCA和MDS)的投影结果作比较,显示Isomap流形学习方法有更好的可视化效果。

关 键 词:基因芯片  流形学习  数据降维可视化

The Visualization of Gene Expression Profile Based on Manifold Learning
XIAO Chuan-le,CAO Huai.The Visualization of Gene Expression Profile Based on Manifold Learning[J].China Journal of Bioinformation,2009,7(1):47-51.
Authors:XIAO Chuan-le  CAO Huai
Affiliation:1. Center for Nonlinear Complex Systems, Department of Physics, Yunnan University, Kunming 650091, China; 2. Modem Biological Research Center, Ytmnan University, Kunming 650091, China)
Abstract:The major problem of the visualization of gene expression profile is dimension reduction. In this paper, we discuss the adaptability of two classic methods(Isomap and LLE) of Manifold Learning in dimension reduction and perform an analysis of two typical rnicroarray data (colon cancer and leukaemia gene expression datasets), in which the assessing the dimension reduction effect is quantitative by using the function of the within and the without class distance. The dimeusionality of the two datasets was found less than three, so they can be visualized in low dimension space. Compared with the two traditional linear methods (PCA and MDS), the result shows that Isomap performs better in the gene expression profile visualization.
Keywords:gene expression profile  manifold learning  reduction and visualization of data
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号