Paper
23 May 2023 Speech recognition of speaker identity based on convolutional neural networks
Hangdong An
Author Affiliations +
Proceedings Volume 12604, International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2022); 126042K (2023) https://doi.org/10.1117/12.2674577
Event: 2nd International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2022), 2022, Guangzhou, China
Abstract
Recent advances in the field of speaker recognition (SR) have shown very accurate and high-performance algorithms. When the amount of data is small, its performance drops dramatically [1]. Today, when testing and training involve only small amounts of speech data, identifying the speaker remains a key consideration, as real-life applications can only access speech data for a limited period. Speech recognition-based security systems are one of the main areas of research. In this paper, we will use a graphical CNN algorithm for speaker recognition and compare the effect of different times of speech on the accuracy of the model. The results show that better results can be obtained by using the speech feature data with a deep learning CNN model, with an average accuracy of 86.3%. In addition, the accuracy of speech recognition can be better improved by comparing multiple sets of speech segments with smaller results in this study.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hangdong An "Speech recognition of speaker identity based on convolutional neural networks", Proc. SPIE 12604, International Conference on Computer Graphics, Artificial Intelligence, and Data Processing (ICCAID 2022), 126042K (23 May 2023); https://doi.org/10.1117/12.2674577
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Matrices

Speech recognition

Speaker recognition

Convolutional neural networks

Feature extraction

Education and training

Back to Top