This project analyzes a mini-database of clinical and professional data from patients to categorize their sleep disorders. The workflow includes:
- Feature Engineering: Transforming raw data into a format consumable by machine learning models.
- Model Training and Testing: Implementing and comparing the performance of four classification algorithms:
- K-Nearest Neighbors (KNN)
- Gaussian Naive Bayes
- Random Forest
- Decision Tree
Ensure you have Python 3.8+ installed on your system.
-
Clone the Repository
git clone https://github.com/yourusername/sleep-disorder-classification.git cd sleep-disorder-classification
-
Create a Virtual Environment (Optional but Recommended)
python -m venv env source env/bin/activate # Linux/Mac env\Scripts\activate # Windows
-
Install Dependencies
Install the necessary Python libraries from therequirements.txt
file:pip install -r requirements.txt
-
Feature Engineering
- Transform clinical data into features suitable for machine learning models.
- Includes handling missing values, encoding categorical data, and scaling numerical data.
-
Model Implementation
- Train and test four classifiers:
- K-Nearest Neighbors (KNN)
- Gaussian Naive Bayes
- Random Forest
- Decision Tree
- Compare performance metrics (accuracy, precision, recall, F1-score).
- Train and test four classifiers:
-
Evaluation
- Analyze model predictions to identify the best-performing algorithm.