Frontiers of Data and Computing ›› 2021, Vol. 3 ›› Issue (5): 130-140.

doi: 10.11871/jfdc.issn.2096-742X.2021.05.010

• Technology and Applicaton • Previous Articles     Next Articles

Bird Audio Data Preprocessing Method

ZHANG Meng1,2,*(),LI Jian1()   

  1. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
    2. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2021-03-08 Online:2021-10-20 Published:2021-11-24
  • Contact: ZHANG Meng E-mail:zhangmeng@cnic.cn;lijian@cnic.cn

Abstract:

[Objective] The accuracy of bird species classification can be improved by noise spectrogram filtering and removing from the sample set of original bird audio spectrograms. [Methods] Based on the convolutional neural network, this paper extracts the feature vector from the spectrogram, calculates the distance matrix of the feature vector with the Faiss algorithm library, and then uses the DBSCAN (Density-Based Spatial Clustering of Applications with Noise) clustering algorithm to filter out the noise spectrogram. Finally, the filtered spectrogram sample set is input into the classification model for bird species classification. [Results]Through this method, a large number of noise spectrograms are removed from the spectrogram sample set so that the accuracy of subsequent bird species classification has been improved. [Limitations] Because the clustering effect of the DBSCAN algorithm is greatly affected by the neighborhood threshold (ε) and density threshold (MinPts) parameters, we should explore adaptive methods to obtain parameter values in the future. [Conclusions] This paper combines the convolutional neural network and the density clustering algorithm in data mining and proposes a bird audio data preprocessing method for automatically noise spectrogram filtering, which provides a high-quality spectrogram sample set for subsequent bird species identification.

Key words: bird audio, spectrogram, data filtering, convolutional neural network, clustering