7 March 2022 The research and implementation of keyword extraction algorithm based on LDA
Chengxia Liu
Proceedings Volume 12167, Third International Conference on Electronics and Communication; Network and Computer Technology (ECNCT 2021); 1216728 (2022) https://doi.org/10.1117/12.2628424
Event: 2021 Third International Conference on Electronics and Communication, Network and Computer Technology, 2021, Harbin, China
Abstract
This paper studies the basic principle of the LDA (Latent Dirichlet Allocation) algorithm and derives the posterior probability distribution of topics in a document, θ_m, and the posterior probability distribution of words in a topic, φ_{z_{m,n}}, from Dirichlet distributions with parameters α and β. The LDA model parameters θ and φ are then estimated from the Gibbs sampling results. The accuracy of the final topic words is compared across different numbers of iterations, and the complete LDA-based keyword extraction process is implemented in Python. Finally, the results are displayed visually as a keyword cloud.
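As an illustration of the procedure summarized above, the following is a minimal sketch of collapsed Gibbs sampling for LDA. It is not the paper's implementation: the function names `lda_gibbs` and `top_keywords`, the default values of `alpha`, `beta`, and `n_iter`, and the representation of documents as lists of integer word ids are all assumptions made for this example.

```python
import numpy as np


def lda_gibbs(docs, n_topics, vocab_size, alpha=0.1, beta=0.01, n_iter=200, seed=0):
    """Collapsed Gibbs sampling for LDA (illustrative sketch, hypothetical API).

    docs is a list of documents, each a list of integer word ids in
    [0, vocab_size). Returns (theta, phi): document-topic and topic-word
    distributions estimated from the final sample.
    """
    rng = np.random.default_rng(seed)
    n_dk = np.zeros((len(docs), n_topics))   # tokens in document d assigned to topic k
    n_kw = np.zeros((n_topics, vocab_size))  # times word w is assigned to topic k
    n_k = np.zeros(n_topics)                 # total tokens assigned to topic k

    # Random initial topic assignment for every token.
    z = []
    for d, doc in enumerate(docs):
        z_d = rng.integers(n_topics, size=len(doc))
        z.append(z_d)
        for w, k in zip(doc, z_d):
            n_dk[d, k] += 1
            n_kw[k, w] += 1
            n_k[k] += 1

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current token from the counts.
                k = z[d][i]
                n_dk[d, k] -= 1
                n_kw[k, w] -= 1
                n_k[k] -= 1
                # Full conditional p(z = k | all other assignments), unnormalized.
                p = (n_dk[d] + alpha) * (n_kw[:, w] + beta) / (n_k + vocab_size * beta)
                k = rng.choice(n_topics, p=p / p.sum())
                # Record the new assignment and restore the counts.
                z[d][i] = k
                n_dk[d, k] += 1
                n_kw[k, w] += 1
                n_k[k] += 1

    # Point estimates of theta and phi from the final counts.
    theta = (n_dk + alpha) / (n_dk.sum(axis=1, keepdims=True) + n_topics * alpha)
    phi = (n_kw + beta) / (n_k[:, None] + vocab_size * beta)
    return theta, phi


def top_keywords(phi, vocab, n=3):
    """Return the n highest-probability words of each topic."""
    return [[vocab[i] for i in np.argsort(row)[::-1][:n]] for row in phi]
```

Under these assumptions, calling `lda_gibbs(docs, n_topics=2, vocab_size=len(vocab))` on a tokenized corpus and passing the returned `phi` to `top_keywords` would give candidate keywords per topic. Varying `n_iter` is one way to reproduce the kind of iteration-count comparison the abstract mentions.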
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chengxia Liu "The research and implementation of keyword extraction algorithm based on LDA", Proc. SPIE 12167, Third International Conference on Electronics and Communication; Network and Computer Technology (ECNCT 2021), 1216728 (7 March 2022); https://doi.org/10.1117/12.2628424
KEYWORDS
Machine learning
Statistical modeling
Algorithm development