科研成果详情

题名SVNet: Supervoxel Network for Video Oversegmentation
作者
发表日期2023-10-20
会议录名称ACM International Conference Proceeding Series
页码368-376
摘要Supervoxel segmentation for video is a video pre-processing technique that groups voxels with similar spatiotemporal features into supervoxels, effectively reducing the number of elemental voxels for downstream computer vision applications, or applications in other fields, e.g., hardware design or video visualization. Existing methods for video supervoxel generation primarily rely on traditional techniques, such as graph theory, mean shift, and clustering, which have yielded promising results. Recent deep learning-based methods mainly worked on object segmentation or semantic segmentation from videos, paying less attention to video oversegmentation. However, the quality of supervoxels directly affects the results of subsequent tasks. In this paper, we introduce a novel approach SVNet which enables direct end-to-end segmentation of voxels into supervoxels from a deep iterative clustering network. The process begins by utilizing the spatiotemporal features learned by the deep neural network to construct a soft association map between voxels and supervoxels. Subsequently, through an iterative update process, the features of supervoxels and the soft association map between voxels and supervoxels are continually refined to enhance the accuracy of voxels segmentation into supervoxels with the supervised loss using the reconstructed video features and labels. We evaluate our method and the representative supervoxel algorithms for their capability on the performance of video segmentation. Experiments show that our SVNet excels particularly in terms of the BRD metric, and its accuracy is roughly on par with the compared methods.
关键词Clustering Spatio-temporal learning Supervoxels Video segmentation
DOI10.1145/3650400.3650460
URL查看来源
语种英语English
Scopus入藏号2-s2.0-85191488597
引用统计
文献类型会议论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/11560
专题北师香港浸会大学
通讯作者Yang,Baorong
作者单位
1.College of Computer Engineering,Jimei University,Xiamen,361021,China
2.School of Mathematics and Computer Science,Zhejiang A&f University,Hangzhou,311300,China
3.Guangdong Provincial Key Laboratory Irads,BNU-HKBU United International College,Zhuhai,519087,China
推荐引用方式
GB/T 7714
Qi,Yijie,Yang,Baorong,Zhang,Wenjinget al. SVNet: Supervoxel Network for Video Oversegmentation[C], 2023: 368-376.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Qi,Yijie]的文章
[Yang,Baorong]的文章
[Zhang,Wenjing]的文章
百度学术
百度学术中相似的文章
[Qi,Yijie]的文章
[Yang,Baorong]的文章
[Zhang,Wenjing]的文章
必应学术
必应学术中相似的文章
[Qi,Yijie]的文章
[Yang,Baorong]的文章
[Zhang,Wenjing]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。