Spam can be defined as unsolicited bulk email. In an effort to evade textbased filters, spammers sometimes embed spam text in an image, which is referred to as image spam. In this research, we consider the problem of image spam detection, based on image analysis. We apply convolutional neural networks (CNN) to this problem, we compare the results obtained using CNNs to other machine learning techniques, and we compare our results to previous related work. We consider both real-world image spam and challenging image spam-like datasets. Our results improve on previous work by employing CNNs based on a novel feature set consisting of a combination of the raw image and Canny edges.
Deep Learning for Image Spam Detection by Tazmina Sharmin Spam can be defined as unsolicited bulk email. In an effort to evade text-based spam filters, spammers can embed their spam text in an image, which is referred to as image spam. In this research, we consider the problem of image spam detection, based on image analysis. We apply various machine learning and deep learning techniques to real-world image spam datasets, and to a challenge image spam-like dataset. We obtain results comparable to previous work for the real-world datasets, while our deep learning approach yields the best results to date for the challenge dataset. ACKNOWLEDGMENTS I would like to express my sincere gratitude to my advisor, Dr. Mark Stamp, for his extraordinary support, patience, and continuous guidance throughout my project and graduate studies as well. I would like to thank my committe members, Dr. Katerina Potika and Fabio Di Troia for being very helpful and their valuable time. My parents, Md. Gofranul Hoque and Ferdousi Rezwan, are my constant source of inspiration. I am extremely grateful for their endless support and love throughout all these years. I would like to thank my husband, Jane Alam Jan, for his gracious support and constant encouragement which made it possible. Last, but not the least, I am thankful to my daughter, Ahona, for her understanding and caring in her little own way during the past few years.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.