This project involves gender classification based on Twitter user data. It includes data cleaning, feature extraction using text data, and classification using machine learning algorithms.
The dataset used in this project is the "gender-classifier-DFE-791531.csv" file, which contains information about Twitter users.
- pandas
- numpy
- matplotlib
- scikit-learn
- nltk
- Clone the repository:
git clone https://github.com/aakcay5656/Natural-Language-Process-Tutorial.git
- Install dependencies:
pip install -r requirements.txt
(if you have a requirements file)