Concentration Difficulty Prediction Model

Project Overview

This project aims to develop a machine learning model to predict whether individuals can concentrate on their studies based on their social media usage. The goal is to identify key factors that contribute to concentration difficulties and provide insights that could help mitigate these issues.

Introduction

In today's digital age, social media usage has become ubiquitous, especially among students. While social media offers various benefits, excessive use can lead to distractions and affect academic performance. This project utilizes machine learning techniques to predict concentration difficulties based on social media usage and demographic information.

Dataset

The dataset used in this project was sourced from Kaggle, a popular platform for data science and machine learning datasets. The dataset contains the following types of information.

Features and Target Variables

Features

Demographic Information

Age: The age of the participant.
Sex: The gender of the participant (e.g., Male, Female, Other).
Relationship: Current relationship status (e.g., Single, In a Relationship, Married).
Occupation: Current occupation status (e.g., Student, Employed, Unemployed).
Affiliations: Type of organizations the participant is affiliated with (e.g., educational institutions, workplaces).

Social Media Usage Information

Social Media User?: Whether the participant uses social media (Yes/No).
Platforms Used: Common social media platforms used by the participant.
Time Spent: Average time spent on social media every day.
Q1: How often the participant uses social media without a specific purpose.
Q2: How often the participant gets distracted by social media when busy.
Q3: Whether the participant feels restless if they haven't used social media in a while.
Q4: How often the participant seeks validation from features of social media.
Q5: On a scale of 1 to 5, how much the participant is bothered by worries.
Q6: How often the participant feels depressed or down.
Q7: On a scale of 1-5, how often the participant compares themselves to other successful people through social media.
Q8: General feelings about these comparisons.

Target Variables

Target 1: On a scale of 1 to 5, how easily the participant is distracted.
Target 2: Whether the participant finds it difficult to concentrate on things.
Target 3: On a scale of 1 to 5, how frequently the participant's interest in daily activities fluctuates.

Modeling Approach

Data Collection: The data was sourced from Kaggle and consists of survey responses.
Data Preprocessing: Cleaning and preparing the data for analysis.
Feature Engineering: Creating and selecting relevant features for the model.
Model Training: Training various machine learning models (e.g., Logistic Regression, Decision Trees, Random Forest, SVM).
Model Evaluation: Evaluating model performance using metrics such as accuracy, precision, recall, and F1 score.
Model Selection: Choosing the best-performing model based on evaluation metrics.
Deployment: Deploying the model for real-world use.

Contributing

We welcome contributions to improve this project. If you have any ideas, suggestions, or bug reports, please open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 116 Commits
MLP		MLP
Preprocessing_pipeline		Preprocessing_pipeline
SVM		SVM
docs		docs
mlruns		mlruns
.DS_Store		.DS_Store
.gitignore		.gitignore
2.png		2.png
4.png		4.png
5.png		5.png
Logistic_Regression_Performance_Improvement.ipynb		Logistic_Regression_Performance_Improvement.ipynb
README.md		README.md
classification_report.txt		classification_report.txt
confusion_matrix.png		confusion_matrix.png
data_set.csv		data_set.csv
eda_data.csv		eda_data.csv
mlp_new.ipynb		mlp_new.ipynb
model.ipynb		model.ipynb
random_forest.ipynb		random_forest.ipynb
random_forest_gridsearch.ipynb		random_forest_gridsearch.ipynb
random_forest_old.ipynb		random_forest_old.ipynb
random_forest_test.ipynb		random_forest_test.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Concentration Difficulty Prediction Model

Project Overview

Introduction

Dataset

Features and Target Variables

Features

Demographic Information

Social Media Usage Information

Target Variables

Modeling Approach

Contributing

License

Links

About

Releases

Packages

Contributors 5

Languages

cepdnaclk/e19-co544-Concentration-Difficulty-Prediction-Model

Folders and files

Latest commit

History

Repository files navigation

Concentration Difficulty Prediction Model

Project Overview

Introduction

Dataset

Features and Target Variables

Features

Demographic Information

Social Media Usage Information

Target Variables

Modeling Approach

Contributing

License

Links

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages