Increasing Text Filtering Accuracy with Improved LSTM

Wei Dang

School of Automation University of Electronic Science and Technology of China Chengdu 610054, China
Ligao Cai

School of Automation University of Electronic Science and Technology of China Chengdu 610054, China
Mingzhe Liu

School of Data Science and Artificial Intelligence Wenzhou University of Technology Wenzhou 325000, China & College of Computer Science and Cyber Security Chengdu University of Technology Chengdu 610054, China
Xiaolu Li

School of Geographical Sciences Southwest University Chongqing 400715, China
Zhengtong Yin

College of Resource and Environment Engineering Guizhou University Guiyang, Guizhou 550025, China
Xuan Liu

School of Public Affairs and Administration University of Electronic Science and Technology of China Chengdu 611731, China
Lirong Yin

Department of Geography and Anthropology Louisiana State University Baton Rouge, LA 70803, USA
Wenfeng Zheng

School of Automation University of Electronic Science and Technology of China Chengdu 610054, China

Increasing Text Filtering Accuracy with Improved LSTM

keywords: Texting filtering, LSTM, word vector, CBOW, dropout, CNN, DAN

How to eliminate useless information in the vast network information and retain effective information is a problem that needs to be continuously explored in the field of deep learning. This paper conducts text classification on the network evaluation frequently encountered in daily life, mainly to screen out the meaningless comments published by Internet users, to have access to more useful information. In this paper, a text filtering model was constructed based on word vector and Long Short-Term Memory (LSTM) and improved by adding Deep Averaging Networks (DAN) and convolutional neural network (CNN). The major improvement of the LSTM & DAN model was to retain the original word vector information and to improve the accuracy of the text classification model without increasing hyperparameter and model structure complexity. The LSTM & CNN model mainly combines the advantages of convolutional neural network in exploring the deep information of text, which was an improvement over the original LSTM. It was proved by experiments that this improvement is meaningful. Compared with the shallow neural network, the accuracy has been greatly improved.

reference: Vol. 42, 2023, No. 6, pp. 1491–1517

doi: 10.31577/cai_2023_6_1491

Computing and Informatics

formerly Computers and Artificial Intelligence

Increasing Text Filtering Accuracy with Improved LSTM