题名 | Infant Cry Classification Based-On Feature Fusion and Mel-Spectrogram Decomposition with CNNs |
作者 | |
发表日期 | 2022 |
会议名称 | 11th International Conference on Artificial Intelligence and Mobile Services, AIMS 2022 held as Part of the Services Conference Federation, SCF 2022 |
会议录名称 | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
![]() |
ISSN | 0302-9743 |
卷号 | 13729 LNCS |
页码 | 126-134 |
会议日期 | 10-14 December 2022 |
会议地点 | Honolulu |
摘要 | We propose a novel method of using feature fusion and model fusion to improve infant cry classification performance. Spectrogram features extracted from transfer learning convolutional neural network model and mel-spectrogram features extracted from mel-spectrogram decomposition model are fused and fed into a multiple layer perception for better classification accuracy. The mel-spectrogram decomposition method feeds band-wise crops of the mel-spectrograms into multiple CNNs followed by a merged global classifier to capture more enhanced discriminative features. Feature fusion brings higher dimensional detailed information and characteristics more in line with human hearing perception together to achieve better performance on CNNs. The evaluation of the approach is conducted on Baby Chillanto database and Baby2020 database. Our approach yields a significant reduction of 4.72% absolute classification error rate compared with the result using single mel-spectrogram images with CNN model on Baby Chillanto database and our testing accuracy reaches 99.26%, which outperforms all other methods with this five-category classification task. The gender classification experiment on Baby2020 database also shows 3.87% accuracy improvement compared with the CNN model using single spectrograms. |
关键词 | Convolutional neural networks Feature fusion Infant cry classification Mel-spectrogram decomposition |
DOI | 10.1007/978-3-031-23504-7_10 |
URL | 查看来源 |
语种 | 英语English |
Scopus入藏号 | 2-s2.0-85144822927 |
引用统计 | |
文献类型 | 会议论文 |
条目标识符 | https://repository.uic.edu.cn/handle/39GCC9TT/13012 |
专题 | 理工科技学院 |
通讯作者 | Pan, Yi |
作者单位 | 1.Computer Science Department,BNU-HKBU United International College,Zhuhai,China 2.Shenzhen Institute of Advanced Technology,Chinese Academy of Sciences,Shenzhen,China 3.College of Information Science and Engineering,Hunan Normal University,Changsha,China |
第一作者单位 | 北师香港浸会大学 |
推荐引用方式 GB/T 7714 | Ji, Chunyan,Jiao, Yang,Chen, Minget al. Infant Cry Classification Based-On Feature Fusion and Mel-Spectrogram Decomposition with CNNs[C], 2022: 126-134. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论