Publication status | Published |
Title | Development of an intelligent distributed news retrieval system |
Authors | Liu, James Nga Kwok; Choi, K. C.; Chai, Junyi |
Publication date | 2012 |
Journal | International Journal of Knowledge-Based and Intelligent Engineering Systems |
ISSN/eISSN | 1327-2314 |
Volume | 16, Issue: 2, Pages: 129-140 |
Abstract | Currently available web news retrieval systems face a number of problems, because web-based news retrieval requires the ability to quickly and accurately process and update a very large amount of data that is constantly changing. In this paper, we present the development of an intelligent distributed web news retrieval system, the goal of which is to accurately retrieve and organize web news information. It includes: a novel optimized crawler algorithm whose fetching speed is several times faster than that of the traditional crawler; a keen tag-based extraction algorithm which can extract the data-rich content with minimal manual effort and which also allows data to be classified as important or not important so that the crawler can revisit and update important data; and a modified MapReduce, improved by estimating the execution time of each subtask, which is proven to be able to reduce the number of unusual tasks and shorten the whole job execution time. © 2012 - IOS Press and the authors. All rights reserved. |
Keywords | distributed news retrieval; intelligent system; MapReduce; web crawler |
DOI | 10.3233/KES-2011-0237 |
Language | English |
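Document type | Journal article |
Item identifier | https://repository.uic.edu.cn/handle/39GCC9TT/4730 |
Collection | Individual research output produced outside this institution |
Author affiliation | Department of Computing, Polytechnic University of Hong Kong, Hung Hom, Kowloon, Hong Kong |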
Recommended citation GB/T 7714 | Liu, James Nga Kwok, Choi, K. C., Chai, Junyi. Development of an intelligent distributed news retrieval system[J]. International Journal of Knowledge-Based and Intelligent Engineering Systems, 2012, 16(2): 129-140. |
APA | Liu, James Nga Kwok, Choi, K. C., & Chai, Junyi. (2012). Development of an intelligent distributed news retrieval system. International Journal of Knowledge-Based and Intelligent Engineering Systems, 16(2), 129-140. |
MLA | Liu, James Nga Kwok, et al. "Development of an intelligent distributed news retrieval system." International Journal of Knowledge-Based and Intelligent Engineering Systems 16.2 (2012): 129-140. |
Files in this item | There are no files associated with this item. |