科研成果详情

发表状态已发表Published
题名Development of an intelligent distributed news retrieval system
作者
发表日期2012
发表期刊International Journal of Knowledge-Based and Intelligent Engineering Systems
ISSN/eISSN1327-2314
卷号16期号:2页码:129-140
摘要

Currently available web news retrieval systems face a number of problems in that web-based news retrieval requires the ability to quickly and accurately process and update a very large amount of data which are constantly being updated. In this paper, we present the development of an intelligent distributed web news retrieval system the goal of which is to accurately retrieve and organize the web news information. It includes: a novel optimized crawler algorithm whose fetching-speed is several times faster than that of the traditional crawler; a keen tag based extraction algorithm which can extract the data rich content with minimal manual effort and which also allows data to be classified as important or not important so that the crawler can revisit and update important data; a modified MapReduce improved by estimating the execution time of each subtask, which is proven to be able to reduce the number of the unusual tasks and shorten the whole job execution time. © 2012 - IOS Press and the authors. All rights reserved.

关键词distributed news retrieval Intelligent system MapReduce web crawler
DOI10.3233/KES-2011-0237
URL查看来源
语种英语English
引用统计
被引频次:1[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/4730
专题个人在本单位外知识产出
作者单位
Department of Computing, Polytechnic University of Hong Kong, Hung Hom, Kowloon, Hong Kong
推荐引用方式
GB/T 7714
Liu, James Nga Kwok,Choi, K. C.,Chai, Junyi. Development of an intelligent distributed news retrieval system[J]. International Journal of Knowledge-Based and Intelligent Engineering Systems, 2012, 16(2): 129-140.
APA Liu, James Nga Kwok, Choi, K. C., & Chai, Junyi. (2012). Development of an intelligent distributed news retrieval system. International Journal of Knowledge-Based and Intelligent Engineering Systems, 16(2), 129-140.
MLA Liu, James Nga Kwok,et al."Development of an intelligent distributed news retrieval system". International Journal of Knowledge-Based and Intelligent Engineering Systems 16.2(2012): 129-140.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Liu, James Nga Kwok]的文章
[Choi, K. C.]的文章
[Chai, Junyi]的文章
百度学术
百度学术中相似的文章
[Liu, James Nga Kwok]的文章
[Choi, K. C.]的文章
[Chai, Junyi]的文章
必应学术
必应学术中相似的文章
[Liu, James Nga Kwok]的文章
[Choi, K. C.]的文章
[Chai, Junyi]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。