VNSED: Vietnamese spam email detection using multi deep learning models
Author affiliations
DOI:
https://doi.org/10.15625/1813-9663/22392Keywords:
Spam email detection, convolutional neural network, bidirectional long short-term memory, PhoBERT.Abstract
Email is one of the most popular communication methods today. However, a high percentage of spam emails are used for various purposes. Therefore, detecting spam emails and proposing solutions to limit spam are necessary. There are many current studies related to spam detection. Deep learning models have been utilized in numerous related studies to detect spam and achieve high accuracy. However, these deep learning models are mostly trained on English datasets. Adding test datasets for Vietnamese spam emails is essential to build spam detection models not only in English but also in Vietnamese. This study presents the construction of a Vietnamese spam email dataset and the proposed system named VNSED, which uses deep learning models including CNN (Convolutional Neural Network), BiLSTM (Bidirectional Long Short-Term Memory), and PhoBERT to detect Vietnamese spam email. Experimental results show that these deep learning models all achieve high accuracy in detecting Vietnamese spam emails. Specifically, the accuracy of the models are CNN: 88.42%, BiLSTM: 83.03%, and PhoBERT: 86.47%, respectively.
Downloads
Published
How to Cite
Issue
Section
License
1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.

