Email Spam Detection using Machine Learning

Project Overview

This project focuses on building a machine learning model to classify emails as Spam or Not Spam.
Using Natural Language Processing (NLP) techniques and machine learning algorithms, the model learns patterns from email text and predicts whether a given email is legitimate or spam.

Tech Stack :

Programming Language: Python
Libraries:
- scikit-learn (Machine Learning models)
- pandas, numpy (Data handling)
- matplotlib, seaborn (Visualization)
Algorithms Used:
- Logistic Regression
- Naive Bayes (MultinomialNB)

Project Workflow :

Data Preprocessing :
- Clean and prepare email text
- Remove stopwords, punctuations, and apply stemming
Feature Extraction :
- Convert text into numerical vectors using CountVectorizer / TF-IDF
Model Training :
- Train multiple ML models (Naive Bayes, Logistic Regression)
Evaluation :
- Accuracy, Precision, Recall, F1-Score
- Confusion Matrix
- ROC Curve & AUC Score

Results :

Logistic Regression Accuracy: ~97%
Naive Bayes Accuracy: ~97%

Both models performed well, with Logistic Regression showing slightly better performance.

Dataset :

The dataset contains labeled email messages as Spam (1) or Not Spam (0).

Text data is preprocessed (stopword removal, stemming, punctuation removal).
Features extracted using Bag of Words / TF-IDF techniques.

Dataset Source : https://www.kaggle.com/datasets/suraj452/mail-data

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Email Spam Detection.ipynb		Email Spam Detection.ipynb
Email Spam model.pkl		Email Spam model.pkl
README.md		README.md
feature_extraction.pkl		feature_extraction.pkl
mail_data.csv		mail_data.csv
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Email Spam Detection using Machine Learning

Project Overview

Tech Stack :

Project Workflow :

Results :

Dataset :

About

Uh oh!

Releases

Packages

Languages

nikhil-kumarrr/Email-Spam-Detection

Folders and files

Latest commit

History

Repository files navigation

Email Spam Detection using Machine Learning

Project Overview

Tech Stack :

Project Workflow :

Results :

Dataset :

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages