Journal: International Journal of Computer Science & Technology
Print ISSN: 2229-4333
Online ISSN: 0976-8491
Publication year: 2013
Volume: 4
Issue: 3
Pages: 403-406
Language: English
Publisher: Ayushmaan Technologies
Abstract: Social networking sites attract more and more web users every day to share their views. Content on online social networking sites differs from traditional web content in many ways; one of the most important differences is its highly temporal nature. Retrieving information from these sites is a significant challenge. This paper presents the architecture of a focused crawler for social networking sites. The crawler processes each profile in parallel, independently of the others. It is currently implemented for the social networking site Google+. The system also avoids redundant crawling.
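
The abstract names two properties of the crawler: profiles are processed in parallel, independently of each other, and no profile is crawled twice. The following is a minimal Python sketch of that general idea only, not the paper's architecture. The names `TOY_GRAPH`, `fetch_profile`, and `crawl` are hypothetical, and an in-memory dictionary stands in for the social networking site, so no real Google+ API is used or implied.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a social network: profile id -> linked profile ids.
TOY_GRAPH = {
    "alice": ["bob", "carol"],
    "bob": ["alice", "dave"],
    "carol": ["dave"],
    "dave": [],
}


def fetch_profile(profile_id):
    """Placeholder for a real profile fetch; returns the linked profile ids."""
    return TOY_GRAPH.get(profile_id, [])


def crawl(seeds, max_workers=4):
    """Crawl level by level, fetching each profile in a level in parallel.

    A visited set ensures every profile is fetched at most once.
    Deduplication happens in the coordinating thread, so no lock is needed.
    """
    visited = set()
    frontier = []
    for pid in seeds:
        if pid not in visited:
            visited.add(pid)
            frontier.append(pid)

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        while frontier:
            # Each profile in the frontier is fetched independently of the others.
            results = pool.map(fetch_profile, frontier)
            next_frontier = []
            for links in results:
                for pid in links:
                    if pid not in visited:  # skip profiles already scheduled
                        visited.add(pid)
                        next_frontier.append(pid)
            frontier = next_frontier
    return visited
```

Calling `crawl(["alice"])` visits every profile reachable from the seed. In this sketch a profile is marked as visited when it is scheduled rather than when its fetch completes, which is one simple way to keep two workers from being handed the same profile.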