Paper
6 July 2015 Continuous speech recognition based on convolutional neural network
Qing-qing Zhang, Yong Liu, Jie-lin Pan, Yong-hong Yan
Author Affiliations +
Proceedings Volume 9631, Seventh International Conference on Digital Image Processing (ICDIP 2015); 963121 (2015) https://doi.org/10.1117/12.2197152
Event: Seventh International Conference on Digital Image Processing (ICDIP15), 2015, Los Angeles, United States
Abstract
Convolutional Neural Networks (CNNs), which showed success in achieving translation invariance for many image processing tasks, are investigated for continuous speech recognitions in the paper. Compared to Deep Neural Networks (DNNs), which have been proven to be successful in many speech recognition tasks nowadays, CNNs can reduce the NN model sizes significantly, and at the same time achieve even better recognition accuracies. Experiments on standard speech corpus TIMIT showed that CNNs outperformed DNNs in the term of the accuracy when CNNs had even smaller model size.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qing-qing Zhang, Yong Liu, Jie-lin Pan, and Yong-hong Yan "Continuous speech recognition based on convolutional neural network", Proc. SPIE 9631, Seventh International Conference on Digital Image Processing (ICDIP 2015), 963121 (6 July 2015); https://doi.org/10.1117/12.2197152
Lens.org Logo
CITATIONS
Cited by 5 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Speech recognition

Convolutional neural networks

Acoustics

Principal component analysis

Image processing

Neural networks

Back to Top