Details of Research Outputs

Title: SVNet: Supervoxel Network for Video Oversegmentation
Creator
Date Issued: 2023-10-20
Source Publication: ACM International Conference Proceeding Series
Pages: 368-376
Abstract: Supervoxel segmentation for video is a video pre-processing technique that groups voxels with similar spatiotemporal features into supervoxels, effectively reducing the number of elemental voxels for downstream computer vision applications, or applications in other fields, e.g., hardware design or video visualization. Existing methods for video supervoxel generation primarily rely on traditional techniques, such as graph theory, mean shift, and clustering, which have yielded promising results. Recent deep learning-based methods have mainly addressed object segmentation or semantic segmentation of videos, paying less attention to video oversegmentation. However, the quality of supervoxels directly affects the results of subsequent tasks. In this paper, we introduce a novel approach, SVNet, which enables direct end-to-end segmentation of voxels into supervoxels using a deep iterative clustering network. The process begins by utilizing the spatiotemporal features learned by the deep neural network to construct a soft association map between voxels and supervoxels. Subsequently, through an iterative update process, the features of supervoxels and the soft association map between voxels and supervoxels are continually refined to enhance the accuracy of segmenting voxels into supervoxels, with a supervised loss computed from the reconstructed video features and labels. We evaluate our method and representative supervoxel algorithms on their video segmentation performance. Experiments show that our SVNet excels particularly in terms of the BRD metric, and its accuracy is roughly on par with the compared methods.
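The iterative refinement described in the abstract can be illustrated with a minimal sketch of soft clustering. This is not the SVNet implementation: the features are hand-written numbers rather than features learned by a network, there is no supervised loss, and the function names (`soft_associations`, `update_centers`, `soft_cluster`) and the softmax over negative squared distances are assumptions chosen for illustration only.

```python
import math


def soft_associations(features, centers):
    """Return one row per voxel: softmax over negative squared distances to each center."""
    assoc = []
    for f in features:
        neg_d2 = [-sum((a - b) ** 2 for a, b in zip(f, c)) for c in centers]
        m = max(neg_d2)  # subtract the max before exponentiating, for numerical stability
        w = [math.exp(v - m) for v in neg_d2]
        total = sum(w)
        assoc.append([x / total for x in w])
    return assoc


def update_centers(features, assoc, n_centers):
    """Recompute each center as the association-weighted mean of the voxel features."""
    dim = len(features[0])
    centers = []
    for j in range(n_centers):
        # Assumption: every center keeps some non-zero mass; a real system would guard this.
        mass = sum(row[j] for row in assoc)
        centers.append([
            sum(row[j] * f[d] for row, f in zip(assoc, features)) / mass
            for d in range(dim)
        ])
    return centers


def soft_cluster(features, init_centers, n_iters=10):
    """Alternate between updating the soft association map and the center features."""
    centers = [list(c) for c in init_centers]
    for _ in range(n_iters):
        assoc = soft_associations(features, centers)
        centers = update_centers(features, assoc, len(centers))
    assoc = soft_associations(features, centers)
    # Hard label per voxel: index of the center with the largest association.
    labels = [max(range(len(row)), key=row.__getitem__) for row in assoc]
    return assoc, centers, labels


if __name__ == "__main__":
    # Hypothetical 2-D "voxel features" forming two well-separated groups.
    feats = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
    _, _, labels = soft_cluster(feats, [(0.0, 0.0), (5.0, 5.0)])
    print(labels)
```

Because the association map stays soft until the final argmax, every step is a smooth function of the features, which is the property that lets a clustering of this kind sit inside an end-to-end trained network.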
Keyword: Clustering; Spatio-temporal learning; Supervoxels; Video segmentation
DOI: 10.1145/3650400.3650460
Language: English
Scopus ID: 2-s2.0-85191488597
Document Type: Conference paper
Identifier: http://repository.uic.edu.cn/handle/39GCC9TT/11560
Collection: Beijing Normal-Hong Kong Baptist University
Corresponding Author: Yang, Baorong
Affiliation
1. College of Computer Engineering, Jimei University, Xiamen, 361021, China
2. School of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou, 311300, China
3. Guangdong Provincial Key Laboratory IRADS, BNU-HKBU United International College, Zhuhai, 519087, China
Recommended Citation
GB/T 7714
Qi, Yijie, Yang, Baorong, Zhang, Wenjing, et al. SVNet: Supervoxel Network for Video Oversegmentation[C], 2023: 368-376.
Files in This Item:
There are no files associated with this item.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.