Details of Research Outputs

Title: Automatic hierarchical classification of structured deep web databases
Creator: Su, Weifeng; Wang, Jiying; Lochovsky, Frederick
Date Issued: 2006
Conference Name: 7th International Conference on Web Information Systems Engineering
Source Publication: Web Information Systems – WISE 2006
Editor: Karl Aberer, Zhiyong Peng, Elke A. Rundensteiner, Yanchun Zhang, Xuhui Li
ISBN: 3540481052
ISSN: 0302-9743
Volume: Lecture Notes in Computer Science, vol. 4255
Pages: 210-221
Conference Date: October 23-26, 2006
Conference Place: Wuhan, China
Publication Place: Berlin
Publisher: Springer
Abstract

We present a method that automatically classifies structured deep Web databases according to a pre-defined topic hierarchy. We assume that there are some manually classified databases, i.e., training databases, in every node of the topic hierarchy. Each training database is probed using queries constructed from the node titles of the topic hierarchy and the query result counts reported by the database are used to represent the content of the database. Hence, when adding a new database it can be probed by the same set of queries and classified to a node whose training databases are most similar to the new one. Specifically, a support vector machine classifier is trained on each internal node of the topic hierarchy with these training databases and the new database can be classified into the hierarchy top-down level by level. A feature extension method is proposed to create discriminant features. Experiments run on real structured Web databases collected from the Internet show that this classification method is quite accurate. © Springer-Verlag Berlin Heidelberg 2006.
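
The top-down procedure described in the abstract can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the paper: the three-level hierarchy, the probe queries, the result counts, and the helper names (`normalize`, `centroid`, `classify`) are all hypothetical. To stay dependency-free, the sketch also swaps in a nearest-centroid classifier with cosine similarity in place of the support vector machine the authors train at each internal node, and it does not model their feature extension method.

```python
import math

# Hypothetical topic hierarchy: internal node -> child nodes.
HIERARCHY = {
    "Root": ["Books", "Travel"],
    "Travel": ["Flights", "Hotels"],
}

# Assumed probe queries, one per node title. Position i of every
# count vector below is the result count reported for QUERIES[i].
QUERIES = ["Books", "Travel", "Flights", "Hotels"]

# Made-up training databases: leaf node -> one count vector per database.
TRAINING = {
    "Books": [[900, 20, 5, 10], [700, 40, 0, 5]],
    "Flights": [[10, 400, 800, 30], [5, 300, 900, 60]],
    "Hotels": [[20, 500, 40, 900], [0, 350, 70, 700]],
}


def normalize(counts):
    """Scale raw result counts to fractions so database size does not dominate."""
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(counts)


def leaves_under(node):
    """Return every leaf topic in the subtree rooted at node."""
    children = HIERARCHY.get(node)
    if not children:
        return [node]
    return [leaf for child in children for leaf in leaves_under(child)]


def centroid(node):
    """Average the normalized vectors of all training databases below node."""
    vectors = [normalize(v) for leaf in leaves_under(node) for v in TRAINING[leaf]]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(counts, node="Root"):
    """Descend level by level, picking the most similar child at each internal node."""
    vector = normalize(counts)
    path = [node]
    while node in HIERARCHY:
        node = max(HIERARCHY[node], key=lambda child: cosine(vector, centroid(child)))
        path.append(node)
    return path


if __name__ == "__main__":
    # A new database whose counts are highest for the "Hotels" probe.
    print(classify([15, 420, 60, 850]))
```

With these made-up counts, the new database is routed from `Root` to `Travel` and then to `Hotels`. Replacing the `max(...)` step with a trained per-node classifier would bring the sketch closer to the approach the abstract describes.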

DOI: 10.1007/11912873_23
Indexed By: SCIE; CPCI-S
Language: English
WOS Research Area: Computer Science
WOS Subject: Computer Science, Artificial Intelligence; Computer Science, Information Systems; Computer Science, Theory & Methods
WOS ID: WOS:000241624200020
Scopus ID: 2-s2.0-33845241508
Citation statistics
Cited Times: 6 [WOS]
Document Type: Conference paper
Identifier: http://repository.uic.edu.cn/handle/39GCC9TT/6843
Collection: Research outside affiliated institution
Affiliation
1. Hong Kong University of Science and Technology, Hong Kong
2. City University, Hong Kong
Recommended Citation
GB/T 7714
Su, Weifeng, Wang, Jiying, Lochovsky, Frederick. Automatic hierarchical classification of structured deep web databases[C]//Karl Aberer, Zhiyong Peng, Elke A. Rundensteiner, Yanchun Zhang, Xuhui Li. Web Information Systems – WISE 2006. Berlin: Springer, 2006: 210-221.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.