期刊名称:International Journal of Data Mining & Knowledge Management Process
印刷版ISSN:2231-007X
电子版ISSN:2230-9608
出版年度:2013
卷号:3
期号:2
出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:The conventional search engines existing over the internet are active in searching the appropriate information. The search engine gets few constraints similar to attainment the information seeked from a different sources. The web crawlers are intended towards a exact lane of the web.Web Crawlers are limited in moving towards a different path as they are protected or at times limited because of the apprehension of threats. It is possible to make a web crawler,which will have the ability of penetrating from side to side the paths of the web, not reachable by the usual web crawlers, so as to get a improved answer in terms of infoemation, time and relevancy for the given search query. The proposed web crawler is designed to attend Hyper Text Transfer Protocol Secure (HTTPS) websites including the web pages,which requires verification to view and index.
关键词:Deep Web Crawler; Hidden Pages; Accessing Secured Databases; Indexing.