An Ensembled Convolutional Recurrent Neural Network approach for Automated Classroom Sound Classification
Publication Name
Proceedings - 2024 IEEE Conference on Artificial Intelligence, CAI 2024
Abstract
The paper explores automated classification techniques for classroom sounds to capture diverse learning and teaching activities' sequences. Manual labeling of all recordings, especially for long durations like multiple lessons, poses practical challenges. This study investigates an automated approach employing scalogram acoustic features as input into the ensembled Convolutional Neural Network (CNN) and Bidirectional Gated Recurrent Unit (BiGRU) hybridized with Extreme Gradient Boost (XGBoost) classifier for automatic classification of classroom sounds. The research involves analyzing real classroom recordings to identify distinct sound segments encompassing teacher's voice, student voices, babble noise, classroom noise, and silence. A sound event classifier utilizing scalogram features in an XGBoost framework is proposed. Comparative evaluations with various other machine learning and neural network methodologies demonstrate that the proposed hybrid model achieves the most accurate classification performance of 95.38%.
Open Access Status
This publication is not available as open access
First Page
183
Last Page
188