Status | 已发表Published |
Title | Unsupervised image categorization based on deep generative models with disentangled representations and von Mises-Fisher distributions |
Creator | |
Date Issued | 2025 |
Source Publication | International Journal of Machine Learning and Cybernetics
![]() |
ISSN | 1868-8071 |
Volume | 16Issue:1Pages:611-623 |
Abstract | Variational autoencoders (VAEs) have emerged as powerful deep generative models for learning abstract representations in the latent space, making them highly applicable across diverse domains. This paper presents a novel image categorization approach that leverages VAEs with disentangled representations. In VAE-based clustering models, the latent representations learned by encoders often combine both generation and clustering information. To address this concern, our proposed model disentangles the acquired latent representations into dedicated clustering and generation modules, thereby enhancing the performance and efficiency of clustering tasks. Specifically, we introduce an extension of the Kullback–Leibler (KL) divergence to promote independence between these two modules. Additionally, we incorporate the von Mises-Fisher (vMF) distribution to improve the clustering model's ability to capture cluster characteristics within the generation module. Extensive experimental evaluations confirm the effectiveness of our model in clustering tasks, notably without the requirement for pre-training. Furthermore, when compared to various deep generative clustering models requiring pre-training, our model is able to achieve either comparable or superior performance across multiple datasets. |
Keyword | Clustering Disentangling module Variational autoencoder (VAE) vMF distribution |
DOI | 10.1007/s13042-024-02265-6 |
URL | View source |
Indexed By | SCIE |
Language | 英语English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001252533900002 |
Scopus ID | 2-s2.0-85196663068 |
Citation statistics | |
Document Type | Journal article |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/12550 |
Collection | Beijing Normal-Hong Kong Baptist University |
Corresponding Author | Fan, Wentao |
Affiliation | 1.Guangdong Provincial Key Laboratory IRADS and Department of Computer Science,Beijing Normal University-Hong Kong Baptist University United International College (UIC),Zhuhai,Guangdong,China 2.Department of Computer Science and Technology,Huaqiao University,Xiamen,China |
First Author Affilication | Beijing Normal-Hong Kong Baptist University |
Corresponding Author Affilication | Beijing Normal-Hong Kong Baptist University |
Recommended Citation GB/T 7714 | Fan, Wentao,Xu, Kunxiong. Unsupervised image categorization based on deep generative models with disentangled representations and von Mises-Fisher distributions[J]. International Journal of Machine Learning and Cybernetics, 2025, 16(1): 611-623. |
APA | Fan, Wentao, & Xu, Kunxiong. (2025). Unsupervised image categorization based on deep generative models with disentangled representations and von Mises-Fisher distributions. International Journal of Machine Learning and Cybernetics, 16(1), 611-623. |
MLA | Fan, Wentao,et al."Unsupervised image categorization based on deep generative models with disentangled representations and von Mises-Fisher distributions". International Journal of Machine Learning and Cybernetics 16.1(2025): 611-623. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment