Spam filter Problem Statement

Published on May 2016 | Categories: Documents | Downloads: 51 | Comments: 0 | Views: 1361
of 1
Download PDF   Embed   Report

problem statement of spam filter

Comments

Content

PROBLEM STATEMENT (SPAM FILTERING SOFTWARE)
Being tired of getting so much SPAM in the email inbox of the official email address catering to the business of training and placement cell, GGSIPU wants you to develop a SPAM filter for it that will detect and delete SPAM emails but let the nonSPAM emails through. Your task is to design a SPAM filter that would determine whether or not each message is SPAM or a legitimate message by examining the subject line of incoming email messages. As you design the SPAM filter, keep in mind that 1) Different people have different criteria for what constitutes SPAM 2) a SPAM filter will be specific to a given individual 3) The SPAM filter you develop needs to work only for Training and Placement Cell activities. In order to develop the SPAM filter, you will be given a data set that contains subject lines from a set of SPAM email messages and subject lines from a set of non-SPAM email messages that are predicted intelligently. Use the data to develop the SPAM filter. Also SPAM filters may use a variety of information such as: 1. Email address of sender 2. Reply to email address 3. Subject line 4. Body of the text. You will be given the two databases of the above mentioned parameters: the one consisting of spam keywords and the other one ham(legitimate) mail keywords. Therefore, your task is to develop a set of rules that will identify SPAM and NONSPAM emails using these parameters which will increase its efficiency manifolds.

Sponsor Documents

Or use your account on DocShare.tips

Hide

Forgot your password?

Or register your new account on DocShare.tips

Hide

Lost your password? Please enter your email address. You will receive a link to create a new password.

Back to log-in

Close