文章基本信息

标题：EVALUATION OF GENOME SIMILARITIES USING INDEPENDENT COMPONENTS
本地全文：下载
作者：Thelma SÁFADI ; Leila Maria FERREIRA
期刊名称：Revista Brasileira de Biometria
印刷版ISSN：0102-0811
电子版ISSN：1983-0823
出版年度：2020
卷号：38
期号：1
页码：92-101
DOI：10.28951/rbb.v38i1.439
出版社：Universidade Federal de Lavras
摘要：We propose the use of independent component analysis to ﬁnd similarities of genomes. Considering diﬀerent numbers of independent components, the complete linkage method was used to identify groups based on the estimated coeﬃcients of the mixing matrix. The sequences analyzed correspond to the strains of the Mycobacterium tuberculosis genome, ten sequences were analyzed, obtained from the National Center for Biotechnology Information (NCBI, 2017). The GC-content of each sequence was evaluated using a sliding window of 10,000 bases. The clustering analysis using the independent components of the analyzed sequences was essential to verify the dissimilarity of the sequences.
关键词：GC-content; Mycobacterium tuberculosis genomes; cluster analysis.