This paper introduces a novel method for speaker verification using Convolutional Neural Networks (CNNs). Unlike traditional approaches that rely solely on spectrogram and waveform images, the proposed method, termed ‘Deep- ConvVectors’, dynamically captures speaker-specific features from speech signals. By transforming segments of speech into specialized CNN filters, Deep-ConvVectors were created, which encapsulate essential speaker characteristics. The experiments carried out on the THUYG-20 SRE dataset demonstrated the superior performance of the proposed method in comparison with the established methods, with an average Equal Error Rate (EER) of just 0.99%. This approach offers a dynamic solution for precise speaker identification, showcasing the transformative potential of CNNs in the context of ASV.
Speaker verification, CNN, RBM, DBN, DNN.
Soufiane HOURRI, "Empowering Speaker Verification with Deep Convolutional Neural Network Vectors", Studies in Informatics and Control, ISSN 1220-1766, vol. 33(2), pp. 97-107, 2024. https://doi.org/10.24846/v33i2y202409