Status | 已发表Published |
Title | Development of an intelligent distributed news retrieval system |
Creator | |
Date Issued | 2012 |
Source Publication | International Journal of Knowledge-Based and Intelligent Engineering Systems
![]() |
ISSN | 1327-2314 |
Volume | 16Issue:2Pages:129-140 |
Abstract | Currently available web news retrieval systems face a number of problems in that web-based news retrieval requires the ability to quickly and accurately process and update a very large amount of data which are constantly being updated. In this paper, we present the development of an intelligent distributed web news retrieval system the goal of which is to accurately retrieve and organize the web news information. It includes: a novel optimized crawler algorithm whose fetching-speed is several times faster than that of the traditional crawler; a keen tag based extraction algorithm which can extract the data rich content with minimal manual effort and which also allows data to be classified as important or not important so that the crawler can revisit and update important data; a modified MapReduce improved by estimating the execution time of each subtask, which is proven to be able to reduce the number of the unusual tasks and shorten the whole job execution time. © 2012 - IOS Press and the authors. All rights reserved. |
Keyword | distributed news retrieval Intelligent system MapReduce web crawler |
DOI | 10.3233/KES-2011-0237 |
URL | View source |
Language | 英语English |
Citation statistics | |
Document Type | Journal article |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/4730 |
Collection | Research outside affiliated institution |
Affiliation | Department of Computing, Polytechnic University of Hong Kong, Hung Hom, Kowloon, Hong Kong |
Recommended Citation GB/T 7714 | Liu, James Nga Kwok,Choi, K. C.,Chai, Junyi. Development of an intelligent distributed news retrieval system[J]. International Journal of Knowledge-Based and Intelligent Engineering Systems, 2012, 16(2): 129-140. |
APA | Liu, James Nga Kwok, Choi, K. C., & Chai, Junyi. (2012). Development of an intelligent distributed news retrieval system. International Journal of Knowledge-Based and Intelligent Engineering Systems, 16(2), 129-140. |
MLA | Liu, James Nga Kwok,et al."Development of an intelligent distributed news retrieval system". International Journal of Knowledge-Based and Intelligent Engineering Systems 16.2(2012): 129-140. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment