首页    期刊浏览 2024年12月13日 星期五
登录注册

文章基本信息

  • 标题:A Comparative Analysis of Feature Selection Methods for Clustering DNA Sequences
  • 本地全文:下载
  • 作者:Mr. B.Umamageswari ; Mr. B.Karthikeyan ; Mr. T.Nalini
  • 期刊名称:International Journal of Computer Science and Security (IJCSS)
  • 电子版ISSN:1985-1553
  • 出版年度:2012
  • 卷号:6
  • 期号:2
  • 页码:120-127
  • 出版社:Computer Science Journals
  • 摘要:Large-scale analysis of genome sequences is in progress around the world, the major application of which is to establish the evolutionary relationship among the species using phylogenetic trees. Hierarchical agglomerative algorithms can be used to generate such phylogenetic trees given the distance matrix representing the dissimilarity among the species. ClustalW and Muscle are two general purpose programs that generates distance matrix from the input DNA or protein sequences. The limitation of these programs is that they are based on Smith-Waterman algorithm which uses dynamic programming for doing the pair-wise alignment. This is an extremely time consuming process and the existing systems may even fail to work for larger input data set. To overcome this limitation, we have used the frequency of codons usage as an approximation to find dissimilarity among species. The proposed technique further reduces the complexity by extracting only the significant features of the species from the mtDNA sequences using the techniques like frequent codons, codons with maximum range value or PCA technique. We have observed that the proposed system produces nearly accurate results in a significantly reduced running time.
  • 关键词:Evolutionary Tree; Hierarchical Clustering; Bioinformatics; Codons; mtDNA
国家哲学社会科学文献中心版权所有