[1] China Internet Network Information Center (CNNIC). The 34th Statistical Report on Internet Development in China [EB/OL]. http://www.cnnic.net.cn/hlwfzyj/hlwxzbg/hlwtjbg/201407/P020140721507223212132.pdf, 2014-07-23.
[2] Du Yajun, Pen Qiangqiang, Gao Zhaoqiong. A topic-specific crawling strategy based on semantics similarity [J]. Data & Knowledge Engineering, 2013, 88: 75-93.
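[3] Chang Zhirong. Research and implementation of the integrated application of the Nutch search engine in digital libraries [D]. Beijing: Beijing University of Posts and Telecommunications, 2010.
[4] Zeng Ming. Application and research of vertical search technology in social networking sites [D]. Beijing: Beijing University of Posts and Telecommunications, 2013.
[5] Luo Lei. Research on methods for detecting and tracking hot topics in microblog public opinion [D]. Hangzhou: Hangzhou Dianzi University, 2013.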
[6] Tang T T, Hawking D, Craswell N, et al. Focused crawling for both topical relevance and quality of medical information [C]// Proceedings of the 14th ACM International Conference on Information and Knowledge Management. 2005: 147-154.
[7] Pirkola A. Focused Crawling: A Means to Acquire Biological Data from the Web [EB/OL]. http://www.researchgate.net/publication/228783841_Focused_Crawling_A_Means_to_Acquire_Biological_Data_from_the_Web, 2015-03-30.
[8] Chakrabarti S, van den Berg M, Dom B. Focused crawling: A new approach to topic-specific Web resource discovery [J]. Computer Networks, 1999, 31(11-16): 1623-1640.
[9] Page L, Brin S, Motwani R, et al. The PageRank Citation Ranking: Bringing Order to the Web [R]. California: Stanford University, 1998.
[10] Kleinberg J M. Authoritative sources in a hyperlinked environment [J]. Journal of the ACM, 1999, 46(5): 604-632.
[11] De Bra P, Post R. Information retrieval in the World Wide Web: Making client-based searching feasible [C]// Proceedings of the 1st International World Wide Web Conference. 1994: 183-192.
[12] Hersovici M, Jacovi M, Maarek Y, et al. The shark-search algorithm. An application: Tailored Web site mapping [C]// Proceedings of the 7th International World Wide Web Conference. 1998: 317-326.
[13] Wang Can, Guan Zi-yu, Chen Chun, et al. On-line topical importance estimation: An effective focused crawling algorithm combining link and content analysis [J]. Journal of Zhejiang University (Science A), 2009, 10(8): 1114-1124.
[14] Abiteboul S, Preda M, Cobena G. Adaptive on-line page importance computation [C]// Proceedings of the 12th International World Wide Web Conference. 2003: 280-290.
[15] Bergmark D, Lagoze C, Sbityakov A. Focused crawls, tunneling, and digital libraries [C]// Proceedings of the 6th European Conference on Digital Libraries. 2002: 91-106.