🛡️ Phishing Website Detection System using Machine Learning and MLOps

This project involves developing a machine learning system to predict whether a website is phishing or not. The system is designed to enhance cybersecurity by identifying and blocking malicious websites. The project integrates machine learning techniques with MLOps tools to create a scalable, automated, and efficient solution.

🚀 Project Overview

🎯 Objective: Predict phishing websites using machine learning to improve cybersecurity.
🛠️ Key Technologies: Python, Docker, AWS, MongoDB, FastAPI, GitHub Actions, MLflow, Airflow, Terraform.

🌟 Features

🔍 Machine Learning Model: A model trained to detect phishing websites based on various indicators.
🔄 MLOps Pipeline: Automated deployment pipeline using GitHub Actions, MLflow, Airflow, and Terraform.
⚙️ Scalable Deployment: The system is deployed on AWS using Docker, and MongoDB, ensuring scalability and efficient management.
🌐 API Integration: FastAPI is used to build and deploy APIs for interacting with the model.
💾 Database Integration: MongoDB is used for storing and managing website data during model training and deployment.

🛠️ Installation and Setup

Prerequisites

🐍 Python 3.8+
🐳 Docker
☁️ AWS account with necessary permissions
🍃 MongoDB

🖥️ Setup Virtual Environment

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

⚙️ Configure AWS

Ensure your AWS CLI is configured with the correct credentials.
Update the terraform/variables.tf file with your AWS settings.

🚀 Deploy the System

🐳 Build Docker Image:
```
docker build -t phishing-detection:v1 .
```
📤 Push to AWS ECR: Follow AWS ECR instructions to push your Docker image.

🛠️ Run Terraform:

cd terraform
terraform init
terraform apply

🧑‍💻 Usage

📊 Training the Model:
- Prepare your dataset and upload it to the specified MongoDB collection.
🔍 Prediction:
- After deployment, the system will automatically predict whether new websites are phishing and store the results in MongoDB.
- Interact with the model via APIs built with FastAPI.

📈 Results and Evaluation

The model's performance metrics, such as accuracy, precision, and recall, are tracked using MLflow.

🤝 Contributing

If you'd like to contribute to this project, please fork the repository and use a feature branch. Pull requests are welcome.

📧 Contact

For any questions or feedback, please reach out to Krishna Shukla.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.vscode		.vscode
Network_Data		Network_Data
Notebook		Notebook
airflow/dags		airflow/dags
data_schema		data_schema
infrastructure		infrastructure
networksecurity		networksecurity
saved_models		saved_models
templates		templates
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENCE		LICENCE
README.md		README.md
docker-compose.yaml		docker-compose.yaml
get_data.py		get_data.py
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py
start.sh		start.sh
start_training.py		start_training.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ Phishing Website Detection System using Machine Learning and MLOps

🚀 Project Overview

🌟 Features

🛠️ Installation and Setup

Prerequisites

🖥️ Setup Virtual Environment

⚙️ Configure AWS

🚀 Deploy the System

🧑‍💻 Usage

📈 Results and Evaluation

🤝 Contributing

📧 Contact

About

Releases

Packages

Languages

License

kriishukla/PhishingURLPrediction

Folders and files

Latest commit

History

Repository files navigation

🛡️ Phishing Website Detection System using Machine Learning and MLOps

🚀 Project Overview

🌟 Features

🛠️ Installation and Setup

Prerequisites

🖥️ Setup Virtual Environment

⚙️ Configure AWS

🚀 Deploy the System

🧑‍💻 Usage

📈 Results and Evaluation

🤝 Contributing

📧 Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages