Applied Mathematics & Information Sciences




Email is one of the most ubiquitous applications, used daily by millions of people worldwide. It has become a powerful communication tool because it saves considerable time and cost. However, owing to social networks and advertisers, most emails contain unwanted content called spam: unsolicited commercial email, also known as junk email. Spam is a serious worldwide problem. It affects not only ordinary internet users but also companies and organizations, for which it causes large costs in lost productivity, wasted user time, and wasted network bandwidth. Researchers have recently presented several email spam classification techniques. A spam classifier filters spam out of the inbox and moves it to the junk email folder, automatically labeling each message as spam or ham (legitimate mail) based on its content and social features. Eliminating spam completely is very difficult because spammers change their techniques frequently. In this paper, we propose an efficient technique for classifying email spam using an ensemble method. Gradient Boost classification, an ensemble of weak decision trees, is used; weighted majority voting combines the decision trees, and Naive Bayes classification is also used. The system consists of two phases, a training phase and a testing phase. Precision, recall, and accuracy are used as performance metrics for evaluation.
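As an illustration of the voting and evaluation steps named above, the following minimal Python sketch combines the labels of several base classifiers by weighted majority vote and then computes precision, recall, and accuracy. It is not the implementation evaluated in this work: the function names `weighted_majority_vote` and `evaluate`, the classifier weights, and the toy predictions are all hypothetical and chosen only for demonstration.

```python
from collections import defaultdict


def weighted_majority_vote(predictions, weights):
    """Combine per-classifier label lists into one label list.

    predictions: one list of labels per base classifier, all the same length.
    weights: one non-negative weight per base classifier (assumed values).
    """
    n_samples = len(predictions[0])
    combined = []
    for i in range(n_samples):
        score = defaultdict(float)
        for clf_preds, weight in zip(predictions, weights):
            score[clf_preds[i]] += weight
        # Pick the label with the largest total weight; iterating labels in
        # sorted order makes tie-breaking deterministic (alphabetical).
        combined.append(max(sorted(score), key=lambda label: score[label]))
    return combined


def evaluate(y_true, y_pred, positive="spam"):
    """Return precision, recall, and accuracy, treating `positive` as spam."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    correct = sum(t == p for t, p in pairs)
    return {
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "accuracy": correct / len(pairs),
    }


# Hypothetical outputs of three base classifiers on five test emails.
y_true = ["spam", "ham", "spam", "ham", "spam"]
tree_a = ["spam", "ham", "ham", "ham", "spam"]   # assumed decision tree
tree_b = ["spam", "spam", "ham", "ham", "spam"]  # assumed decision tree
naive_bayes = ["spam", "ham", "spam", "spam", "ham"]
weights = [0.5, 0.2, 0.4]  # illustrative weights, not tuned values

y_pred = weighted_majority_vote([tree_a, tree_b, naive_bayes], weights)
metrics = evaluate(y_true, y_pred)
print(y_pred)
print(metrics)
```

In practice the weights would typically come from the training phase, for example from each classifier's validation accuracy, while the metrics would be computed on the held-out data of the testing phase.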
