Basic Article Information

  • Title: Research on Detection Algorithm of WEB Crawler
  • Author: Hongyan Zhao
  • Journal: International Journal of Security and Its Applications
  • Print ISSN: 1738-9976
  • Year of publication: 2015
  • Volume: 9
  • Issue: 10
  • Pages: 137-146
  • DOI: 10.14257/ijsia.2015.9.10.12
  • Publisher: SERSC
  • Abstract: In Web crawler research, the most important issues are the structural design and the solution of the key technologies. Building on the work of others, we describe the structural design of a distributed Web crawler, including the organization of the hardware and the module partition of the software. In this paper, one PC serves as the main node, and other PCs connected over a LAN serve as common nodes. The software architecture comprises the main node design and the common node design. We then analyze solutions to the major technical problems of a distributed Web crawler, such as how the crawler's nodes cooperate with each other, how tasks are distributed, and how important Web pages are kept fresh. We propose several practical algorithms to solve these problems. In addition, we implement and analyze a robust, extensible, customizable, distributed Web crawler. Finally, we give the results of two experiments: a common test and a site download test.
  • Keywords: Web Crawler; Parallel; Search Engine