

Supervised by: Ministry of Industry and Information Technology. Sponsored by: Harbin Institute of Technology. Editor-in-Chief: REN Nanqi. ISSN 1672-5565. CN 23-1513/Q.

LI Ling-Bo, ZHANG Jing, CHEN Dan. Selection of human tumor information genes based on the support vector machine and mean impact value[J]. Chinese Journal of Bioinformatics, 2013, 11(1): 72-78.
Selection of human tumor information genes based on the support vector machine and mean impact value
LI Ling-Bo, ZHANG Jing, CHEN Dan
(School of Mathematics and Statistics, Yunnan University, Kunming 650091, China)
Selecting information genes for tumor classification from gene expression profiles is a principal means of finding specifically expressed genes and studying their expression patterns. Tumor diagnosis based on information genes obtained from gene expression profiles is becoming an important research field in bioinformatics and is expected to provide a fast and effective method for the molecular diagnosis of tumors in clinical medicine. Considering the characteristics of tumor gene expression profiling data, such as high dimensionality, small sample size, and heavy noise, an algorithm for searching for information genes is proposed that combines the support vector machine (SVM) with the mean impact value (MIV). The advantage of this algorithm is that it can find more information gene subsets that contain fewer genes yet have strong classification capacity. The algorithm is tested on a binary-classification tumor dataset, and the results show that it is feasible and effective for tumor classification. For the colon cancer sample set, only 3 genes are needed to reach 100% accuracy under leave-one-out cross validation (LOOCV). To keep the assessment of classification performance from depending on how the sample set is partitioned, the full cross validation method is further used to assess the information gene subsets, and more credible information gene subsets are thereby selected. Compared with other tumor classification methods, the results are superior in both the number of information genes and classification capacity.
Key words: Gene Expression Profile, Rank-sum Test, Support Vector Machine, Mean Impact Value, Full-fold Cross Validation
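
The abstract does not spell out how the mean impact value is computed, so the following is only a minimal sketch of one common formulation, not the authors' implementation. It assumes each gene's expression is scaled up and down by a fixed fraction (`delta`, here 10%), the classifier's decision output is compared between the two perturbed copies, and the differences are averaged over samples. The names `mean_impact_values`, `rank_genes`, `toy_decision`, `WEIGHTS`, and `SAMPLES` are hypothetical, and the fixed linear function is a stand-in for the decision function of a trained SVM.

```python
from typing import Callable, List, Sequence

Row = Sequence[float]


def mean_impact_values(
    decision: Callable[[Row], float],
    samples: Sequence[Row],
    delta: float = 0.1,  # assumed perturbation size; not given in the abstract
) -> List[float]:
    """Return one MIV per feature (gene) for the given decision function."""
    n_features = len(samples[0])
    mivs = []
    for j in range(n_features):
        total = 0.0
        for row in samples:
            up = list(row)
            up[j] = row[j] * (1 + delta)    # feature j increased by delta
            down = list(row)
            down[j] = row[j] * (1 - delta)  # feature j decreased by delta
            total += decision(up) - decision(down)
        mivs.append(total / len(samples))   # mean impact over all samples
    return mivs


def rank_genes(mivs: Sequence[float]) -> List[int]:
    """Order feature indices by decreasing absolute MIV."""
    return sorted(range(len(mivs)), key=lambda j: abs(mivs[j]), reverse=True)


# Hypothetical stand-in for a trained SVM's decision function:
# a fixed linear score over three made-up "genes".
WEIGHTS = [0.5, -2.0, 0.0]


def toy_decision(row: Row) -> float:
    return sum(w * x for w, x in zip(WEIGHTS, row))


# Made-up expression values (3 samples x 3 genes), for illustration only.
SAMPLES = [
    [1.0, 2.0, 3.0],
    [2.0, 1.0, 5.0],
    [3.0, 3.0, 1.0],
]

mivs = mean_impact_values(toy_decision, SAMPLES)
ranking = rank_genes(mivs)
print(mivs, ranking)
```

In this toy setup the gene at index 1 ranks first because the stand-in decision function weights it most heavily, and the gene with zero weight ranks last. With real data, `decision` would presumably be replaced by the decision output of an SVM trained on the expression profiles, and the top-ranked genes would then be evaluated by cross validation as the abstract describes.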

