首页    期刊浏览 2025年03月01日 星期六
登录注册

文章基本信息

  • 标题:Sketching-Din Elimination of Web Page
  • 本地全文:下载
  • 作者:Sivakumar, P. ; Parvathi, R. M.S.
  • 期刊名称:Journal of Computer Science
  • 印刷版ISSN:1549-3636
  • 出版年度:2011
  • 卷号:7
  • 期号:12
  • 页码:1888-1893
  • DOI:10.3844/jcssp.2011.1888.1893
  • 出版社:Science Publications
  • 摘要:Problem statement: The web content mining used to access lot of web pages, mining of web contents aims to extort positive information or awareness. Approach: There are several type of Web contents which can suggest valuable information to users are accessible in the Web, for instance graphical data, Extensible Markup Language documents, Hyper Text Markup Language documents and simple text. Here, only element of the information is useful for a testing purpose and the remaining information are noises. Results: In this research study, we propose an approach for removing the noises from a given web page which will get better the presentation of web content mining. At first, the web page information is divided into various blocks. Conclusion: From which, the duplicate blocks are removed using sketching. The performance of the proposed approach and results ensure the effectiveness of the proposed approach in classify the main blocks.
  • 关键词:Web mining; web content mining; web cleaning; duplicate blocks; web page information; graphical data; world wide web; Web Structural Mining (WSM); Web Usage Mining (WUM)
国家哲学社会科学文献中心版权所有