科研成果详情

题名Infant Cry Classification Based-On Feature Fusion and Mel-Spectrogram Decomposition with CNNs
作者
发表日期2022
会议名称11th International Conference on Artificial Intelligence and Mobile Services, AIMS 2022 held as Part of the Services Conference Federation, SCF 2022
会议录名称Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN0302-9743
卷号13729 LNCS
页码126-134
会议日期10-14 December 2022
会议地点Honolulu
摘要

We propose a novel method of using feature fusion and model fusion to improve infant cry classification performance. Spectrogram features extracted from transfer learning convolutional neural network model and mel-spectrogram features extracted from mel-spectrogram decomposition model are fused and fed into a multiple layer perception for better classification accuracy. The mel-spectrogram decomposition method feeds band-wise crops of the mel-spectrograms into multiple CNNs followed by a merged global classifier to capture more enhanced discriminative features. Feature fusion brings higher dimensional detailed information and characteristics more in line with human hearing perception together to achieve better performance on CNNs. The evaluation of the approach is conducted on Baby Chillanto database and Baby2020 database. Our approach yields a significant reduction of 4.72% absolute classification error rate compared with the result using single mel-spectrogram images with CNN model on Baby Chillanto database and our testing accuracy reaches 99.26%, which outperforms all other methods with this five-category classification task. The gender classification experiment on Baby2020 database also shows 3.87% accuracy improvement compared with the CNN model using single spectrograms.

关键词Convolutional neural networks Feature fusion Infant cry classification Mel-spectrogram decomposition
DOI10.1007/978-3-031-23504-7_10
URL查看来源
语种英语English
Scopus入藏号2-s2.0-85144822927
引用统计
文献类型会议论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/13012
专题理工科技学院
通讯作者Pan, Yi
作者单位
1.Computer Science Department,BNU-HKBU United International College,Zhuhai,China
2.Shenzhen Institute of Advanced Technology,Chinese Academy of Sciences,Shenzhen,China
3.College of Information Science and Engineering,Hunan Normal University,Changsha,China
第一作者单位北师香港浸会大学
推荐引用方式
GB/T 7714
Ji, Chunyan,Jiao, Yang,Chen, Minget al. Infant Cry Classification Based-On Feature Fusion and Mel-Spectrogram Decomposition with CNNs[C], 2022: 126-134.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Ji, Chunyan]的文章
[Jiao, Yang]的文章
[Chen, Ming]的文章
百度学术
百度学术中相似的文章
[Ji, Chunyan]的文章
[Jiao, Yang]的文章
[Chen, Ming]的文章
必应学术
必应学术中相似的文章
[Ji, Chunyan]的文章
[Jiao, Yang]的文章
[Chen, Ming]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。