Paper
12 October 2020 Speech emotion recognition based on data enhancement in time-frequency domain
Proceedings Volume 11574, International Symposium on Artificial Intelligence and Robotics 2020; 115740R (2020) https://doi.org/10.1117/12.2579205
Event: International Symposium on Artificial Intelligence and Robotics (ISAIR), 2020, Kitakyushu, Japan
Abstract
Speech emotion recognition currently suffers from a shortage of voice samples, which leads to poor recognition rates and over-fitting. Motivated by this, we propose a speech emotion recognition method based on data enhancement. The Berlin Emotional Corpus is enhanced in two directions: the time domain and the frequency domain. Features are extracted from the samples and used to train the classifiers. We then analyze the recognition rates of two classifiers: K-Nearest Neighbor and Support Vector Machine. Experiments show that recognition results are better after data enhancement.
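The abstract does not say which transformations were applied in each domain, so the following is only a minimal sketch of what time-domain and frequency-domain enhancement of a waveform could look like. The specific operations (circular time shift, additive white noise at a target SNR, and zeroing a band of FFT bins), the function names, and the synthetic two-tone clip standing in for a corpus utterance are all assumptions chosen for illustration, not the authors' method.

```python
import numpy as np


def time_shift(signal, shift):
    """Time-domain example (assumed): circularly shift the waveform by `shift` samples."""
    return np.roll(signal, shift)


def add_noise(signal, snr_db, rng):
    """Time-domain example (assumed): add white Gaussian noise at a target SNR in dB."""
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10 ** (snr_db / 10))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def frequency_mask(signal, low_bin, high_bin):
    """Frequency-domain example (assumed): zero FFT bins [low_bin, high_bin) and invert."""
    spectrum = np.fft.rfft(signal)
    spectrum[low_bin:high_bin] = 0
    return np.fft.irfft(spectrum, n=len(signal))


# Synthetic one-second clip (hypothetical stand-in for a corpus utterance).
rng = np.random.default_rng(0)
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
clip = np.sin(2 * np.pi * 220 * t) + np.sin(2 * np.pi * 440 * t)

# Each original sample yields several enhanced variants for training.
augmented = [
    time_shift(clip, 800),
    add_noise(clip, 20, rng),
    frequency_mask(clip, 430, 450),
]
```

Every variant keeps the emotion label of the clip it was derived from, so the training set grows without new recordings. The enlarged set could then be passed through feature extraction and on to the K-Nearest Neighbor and Support Vector Machine classifiers named in the abstract.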
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Qianqian Li, Fuji Ren, Xiaoyan Shen, and Xin Kang "Speech emotion recognition based on data enhancement in time-frequency domain", Proc. SPIE 11574, International Symposium on Artificial Intelligence and Robotics 2020, 115740R (12 October 2020); https://doi.org/10.1117/12.2579205