Paper
26 May 2023 Research on news text classification based on TextCNN
Xie Bei, Liu Su Ping, Zhong Mu En
Author Affiliations +
Proceedings Volume 12700, International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023); 127002B (2023) https://doi.org/10.1117/12.2682270
Event: International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023), 2023, Nanchang, China
Abstract
In the era of information explosion, manual processing and classification on huge amount of text data is time-consuming. It also has many operational difficulties. Besides, the accuracy of manual text classification is easily affected by human factors. A classification algorithm for news text is designed in this paper. Firstly, a large amount of news data is obtained by crawlers. They are classified into nine categories and other categories. The text features are extracted by Skip-Gram model and Word2Vec method. Then, the classification model is trained by TextCNN. The experiment shows that the algorithm in this paper can get good classification effect, the highest F1 value can reach 95.07%. For the problem of difficult classification of other category, four different classification methods are compared. The experiments show that reconstructing the data of other category can get the best classification effect.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xie Bei, Liu Su Ping, and Zhong Mu En "Research on news text classification based on TextCNN", Proc. SPIE 12700, International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023), 127002B (26 May 2023); https://doi.org/10.1117/12.2682270
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Classification systems

Semantics

Feature extraction

Binary data

Machine learning

Convolutional neural networks

Data processing

Back to Top