Title | Continuous topically related queries grouping and its application on interest identification |
Creator | Zhao, Pengfei; Leung, Kenneth Wai Ting; Lee, Dik Lun |
Date Issued | 2013 |
Conference Name | The 18th International Conference on Database Systems for Advanced Applications (DASFAA 2013) |
Source Publication | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
ISBN | 9783642374876 |
ISSN | 0302-9743 |
Volume | 7825 LNCS |
Issue | PART 1 |
Pages | 224-238 |
Conference Date | April 22-25, 2013 |
Conference Place | Wuhan, China |
Abstract | When a user performs a search on a search engine, the query reflects a particular interest of the user. The interest may span a short session of a few minutes or a long period of months or years. In the latter case, the user may search on a particular interest from time to time, so the queries related to that interest are sporadically distributed in the search log. Identifying these topically related queries is valuable because it helps the search engine better understand the user's interest and hence deliver better results to the user. In this paper, we propose a method to aggregate topically related queries into interests regardless of where the queries appear in the search log. It first identifies sets of continuous topically related queries, called CTQs, and then clusters similar CTQs together to form interests. To identify the CTQs accurately, we propose the Pattern-Concept-Time-Based (PCTB) method, which uses query reformulation patterns, the concepts behind the queries, and the user's continuous searching behavior to compute the similarity between two queries. To evaluate the effectiveness of our approach, we employ the AOL search log as our test dataset and develop a search middleware on top of Google for extracting concepts related to the queries. Experimental results show that our method achieves high precision and recall in identifying CTQs, which in turn improves the performance of interest identification. © Springer-Verlag 2013. |
DOI | 10.1007/978-3-642-37487-6_19 |
Language | English |
Scopus ID | 2-s2.0-84892886456 |
Citation statistics | Cited Times [WOS]: 0 |
Document Type | Conference paper |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/10092 |
Collection | Research outside affiliated institution |
Affiliation | Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, China |
Recommended Citation GB/T 7714 | Zhao, Pengfei,Leung, Kenneth Wai Ting,Lee, Dik Lun. Continuous topically related queries grouping and its application on interest identification[C], 2013: 224-238. |