科研成果详情

题名Topic sequence kernel
作者
发表日期2012
会议名称8th Asia Information Retrieval Societies Conference, AIRS 2012
会议录名称Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN9783642353406
ISSN1611-3349
卷号7675 LNCS
页码457-466
会议日期17-19 December 2012
会议地点Tianjin
摘要

This paper addresses the problem of classifying documents using the kernel approaches based on topic sequences. Previously, the string kernel uses the ordered subsequence of characters as features and the word sequence kernel is proposed to use words as the subsequences. However, they both face the problem of computational complexity because of the large amount of symbols (characters or words). This paper, therefore, proposes to use sequences of topics rather than characters or words to reduce the number of symbols, thus increasing the computational efficiency. Documents that exhibit similar posterior topic proportions are expected to have similar topic sequence and then should be classified into the same category. Experiments conducted on the Reuters-21578 datasets have proven this hypothesis. © Springer-Verlag 2012.

关键词Classification String kernel Topic sequence
DOI10.1007/978-3-642-35341-3_41
URL查看来源
语种英语English
引用统计
被引频次[WOS]:0   [WOS记录]     [WOS相关记录]
文献类型会议论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/4682
专题个人在本单位外知识产出
作者单位
Department of Computing, Hong Kong Polytechnic University, Hong Kong
推荐引用方式
GB/T 7714
Xu, Jian,Lu, Qin,Liu, Zhengzhonget al. Topic sequence kernel[C], 2012: 457-466.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Xu, Jian]的文章
[Lu, Qin]的文章
[Liu, Zhengzhong]的文章
百度学术
百度学术中相似的文章
[Xu, Jian]的文章
[Lu, Qin]的文章
[Liu, Zhengzhong]的文章
必应学术
必应学术中相似的文章
[Xu, Jian]的文章
[Lu, Qin]的文章
[Liu, Zhengzhong]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。