Convolutional Neural Networks (CNNs) are widely used for audio recognition and classification, where they often yield strong results. Motivated by the success of this approach and by the lack of practical methodologies for monitoring construction sites with audio data, we developed an application that classifies different types and brands of construction vehicles and tools from the audio they emit, using a stack of convolutional layers. The proposed architecture operates on the mel-spectrogram representation of the input audio frames and proves effective for environmental sound classification (ESC), achieving high accuracy. In summary, our contribution shows that techniques employed for general ESC can also be successfully adapted to a more specific environmental sound classification task, such as event recognition in construction sites.
2020, Progresses in Artificial Intelligence and Neural Systems, Pages 371-381
A CNN approach for audio classification in construction sites (04b Conference paper in volume)
Maccagno Alessandro, Mastropietro Andrea, Mazziotta Umberto, Scarpiniti Michele, Lee Yong-Cheol, Uncini Aurelio
ISBN: 978-981-15-5092-8; 978-981-15-5093-5
Research group: Algorithms and Data Science