Journal: International Journal of Computer Science & Technology
Print ISSN: 2229-4333
Electronic ISSN: 0976-8491
Publication year: 2011
Volume: 2
Issue: 3 (Version 1)
Publisher: Ayushmaan Technologies
Abstract: Research in bioinformatics has accumulated a large amount of data, and as hardware technology advances, the cost of storage is decreasing. Biological data is available in different formats and is comparatively more complex. Knowledge discovery from these large and complex databases is the key problem of this era. Data mining and machine learning techniques are needed that can scale to the size of the problems and can be customized to biological applications. In the present research work, the open reading frame (ORF) is detected with the help of data mining. Various consensus sequences are gathered, and a clustering algorithm is applied based on local alignment score calculation. Sequences with a higher score are more similar, and this is the basis for applying the clustering algorithm. A second algorithm, which uses the clusters created by the first, computes the percentage match of the consensus sequences with the entered DNA sequence, which in turn leads to the detection of the open reading frame of the entered sequence.
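The abstract does not give the scoring scheme, the clustering rule, or the definition of percentage match, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes a Smith-Waterman style local alignment score with hypothetical parameters (match +2, mismatch -1, linear gap -2), a greedy threshold rule for clustering, and a percentage match defined as the local score relative to the consensus sequence's self-alignment score. All function names and the example sequences are illustrative.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of a and b (Smith-Waterman, score only).

    The scoring parameters are assumptions; the abstract does not state them.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def cluster_by_score(sequences, threshold):
    """Greedy clustering (assumed rule): a sequence joins the first cluster
    whose first member aligns to it with a score of at least `threshold`;
    otherwise it starts a new cluster."""
    clusters = []
    for seq in sequences:
        for cluster in clusters:
            if local_alignment_score(seq, cluster[0]) >= threshold:
                cluster.append(seq)
                break
        else:
            clusters.append([seq])
    return clusters


def percent_match(consensus, dna):
    """Local score of consensus against dna, as a percentage of the score the
    consensus gets when aligned to itself (one possible definition)."""
    max_score = local_alignment_score(consensus, consensus)
    if max_score == 0:
        return 0.0
    return 100.0 * local_alignment_score(consensus, dna) / max_score


if __name__ == "__main__":
    consensus_sequences = ["TATAAT", "TATAAA", "GGGCGG"]  # illustrative inputs
    clusters = cluster_by_score(consensus_sequences, threshold=8)
    query = "GGTATAATGG"  # hypothetical entered DNA sequence
    for cluster in clusters:
        print(cluster, [round(percent_match(c, query), 1) for c in cluster])
```

In this sketch a higher local alignment score places two sequences in the same cluster, mirroring the abstract's statement that higher-scoring sequences are more similar. How the percentage match is then turned into an open reading frame call is not described in the abstract and is left out here.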