An Ensembled Convolutional Recurrent Neural Network approach for Automated Classroom Sound Classification

Publication Name

Proceedings - 2024 IEEE Conference on Artificial Intelligence, CAI 2024

Abstract

The paper explores automated classification techniques for classroom sounds to capture diverse learning and teaching activities' sequences. Manual labeling of all recordings, especially for long durations like multiple lessons, poses practical challenges. This study investigates an automated approach employing scalogram acoustic features as input into the ensembled Convolutional Neural Network (CNN) and Bidirectional Gated Recurrent Unit (BiGRU) hybridized with Extreme Gradient Boost (XGBoost) classifier for automatic classification of classroom sounds. The research involves analyzing real classroom recordings to identify distinct sound segments encompassing teacher's voice, student voices, babble noise, classroom noise, and silence. A sound event classifier utilizing scalogram features in an XGBoost framework is proposed. Comparative evaluations with various other machine learning and neural network methodologies demonstrate that the proposed hybrid model achieves the most accurate classification performance of 95.38%.

Open Access Status

This publication is not available as open access

First Page

183

Last Page

188

Share

COinS
 

Link to publisher version (DOI)

http://dx.doi.org/10.1109/CAI59869.2024.00041