...
首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >Acoustic Scene Classification and Visualization of Beehive Sounds Using Machine Learning Algorithms and Grad-CAM
【24h】

Acoustic Scene Classification and Visualization of Beehive Sounds Using Machine Learning Algorithms and Grad-CAM

机译:使用机器学习算法和毕业凸轮的蜂箱声音的声学场景分类和可视化

获取原文
           

摘要

Honeybees play a crucial role in the agriculture industry because they pollinate approximately 75% of all flowering crops. However, every year, the number of honeybees continues to decrease. Consequently, numerous researchers in various fields have persistently attempted to solve this problem. Acoustic scene classification, using sounds recorded from beehives, is an approach that can be applied to detect changes inside beehives. This method can be used to determine intervals that threaten a beehive. Currently, studies on sound analysis, using deep learning algorithms integrated with various data preprocessing methods that extract features from sound signals, continue to be conducted. However, there is little insight into how deep learning algorithms recognize audio scenes, as demonstrated by studies on image recognition. Therefore, in this study, we used a mel spectrogram, mel-frequency cepstral coefficients (MFCCs), and a constant-Q transform to compare the performance of conventional machine learning models to that of convolutional neural network (CNN) models. We used the support vector machine, random forest, extreme gradient boosting, shallow CNN, and VGG-13 models. Using gradient-weighted class activation mapping (Grad-CAM), we conducted an analysis to determine how the best-performing CNN model recognized audio scenes. The results showed that the VGG-13 model, using MFCCs as input data, demonstrated the best accuracy (91.93%). Additionally, based on the precision, recall, and F1-score for each class, we established that sounds other than those from bees were effectively recognized. Further, we conducted an analysis to determine the MFCCs that are important for classification through the visualizations obtained by applying Grad-CAM to the VGG-13 model. We believe that our findings can be used to develop a monitoring system that can consistently detect abnormal conditions in beehives early by classifying the sounds inside beehives.
机译:蜜蜂在农业产业中发挥着至关重要的作用,因为它们授予所有开花作物的75%。但是,每年,蜜蜂的数量继续下降。因此,各种领域的许多研究人员持续尝试解决这个问题。声场分类,使用从蜂箱中记录的声音,是一种方法可以应用于检测蜂箱内的变化。这种方法可用于确定威胁蜂箱的间隔。目前,使用深度学习算法与各种数据预处理方法集成的深度学习算法继续进行,继续进行。然而,正如图像识别研究所证明的那样,对深度学习算法如何认识到音频场景,几乎没有了解。因此,在本研究中,我们使用了MEL谱图,熔融频率谱系数(MFCC)和恒定Q变换,以将传统机器学习模型的性能与卷积神经网络(CNN)模型的性能进行比较。我们使用支持向量机,随机森林,极端梯度升压,浅CNN和VGG-13型号。使用梯度加权类激活映射(Grad-Cam),我们进行了一个分析,以确定如何最佳的CNN模型识别的音频场景。结果表明,使用MFCCS作为输入数据的VGG-13模型显示了最佳精度(91.93%)。此外,基于每个类的精度,召回和F1分数,我们建立了除了来自蜜蜂之外的声音的声音得到了有效认可。此外,我们进行了一个分析,以确定通过通过将渐变凸轮施加到VGG-13模型而获得的可视化来分类的MFCC。我们认为,我们的调查结果可用于开发一个监测系统,可以通过对蜂箱内的声音进行分类,始终可以在早期检测蜂箱中的异常情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号