Skip to content

Latest commit

 

History

History
66 lines (54 loc) · 4.41 KB

README.md

File metadata and controls

66 lines (54 loc) · 4.41 KB

Facial Key Points and Emotion Detector

TensorFlow Keras Scikit-Learn

Description

This project aims to develop two convolutional neural network (CNN) models to analyze facial features and emotions in images, respectively. The first model analyzes 6400+ images and their associated 15 facial key points, while the second model analyzes 24500+ images with their associated emotions. The goal is to combine the output of the two models to predict an image's emotion and facial key points.

Getting Started

To run this project, follow these steps:

  1. Download the data by clicking this link, it will take you to the google drive folder where the files are stored for this project. The data is stored on Google Cloud because the file sizes are too large.
  2. Run the Emotion_Detector_AI.ipynb notebook to execute the algorithms. Make sure you have the following dependencies installed: Keras, TensorFlow, Seaborn, Pandas, NumPy, cv2, PIL, and MatPlotLib.

Datasets

The main datasets used in this project are data.csv and icml_face_data.csv. data.csv is a table that's 31 columns by 2140 rows, 30 of those columns being x and y components of facial key points where the last column is the image data that's preprocessed to be individual arrays of 96x96 pixels that gets normalized for the convolutional neural networks. The icml_face_data.csv is a table that's 2 columns by 24568 rows. The first column represents the encoded emotion, and the second column represents the image data that gets reshaped into a 96x96 pixel array and normalized for the CNN model.

There are also several .hdf5 and .json files included in this project:

  • FacialKeyPoints_weights.hdf5 : saved weights from the first CNN model taking the 30x2140 dataset.
  • FacialExpression-model.json : saved weights for the second CNN model taking the 2x24568 dataset.
  • detection.json : pre-saved weights model that had better parameters for more accurate data on key point detection.
  • emotion.json : pre-ran weights of the model that detects emotion.

CNN Architecture

The convolutional neural network models used a "ResNet" so that the "vanishing gradient" problem was avoided, and so that we are able to enable 152 layers of training with virtually no gradient issues. The layers of the final model, in order, are as follows:

  1. Zeropadding
  2. Conv2D
  3. BatchNorm/Relu
  4. MaxPool2D
  5. RES-block (the RES-Blocks obtain a convolution block, and two identity blocks)
  6. AveragePooling2D
  7. Flatten()
  8. Dense Layer/Relu/Dropout
  9. Dense Layer/Relu/Dropout
  10. Dense Layer/Relu
  11. Loading Pre-trained Model Weights
  12. To load the pre-trained model weights, run the Notebook with the code, Emotion_Detector_AI.ipynb.

Evaluation Metrics

To evaluate the CNN models, we compiled the results from the pre-trained weights with the "Adam" optimizer from the Keras library that measured the loss with "mean_squared_error" and "accuracy" as its metrics.

We also saved several visuals in the repository, including:

  • Num_of_Emotion_training.jpg : A bar chart for the number of images with their respective emotions to train our model with. IMAGE
  • Accuracy_Loss.png : Visual of Accuracy Loss over the Epochs of training for our model. IMAGE
  • Confusion_Matrix.png : Confusion Matrix for the predicted emotions vs actual emotions. IMAGE
  • Face_Board.png : A board of several images of faces with their predicted keypoints and predicted vs actual emotion. IMAGE

Contributing

We welcome contributions from the community! If you have any suggestions or ideas, feel free to open an issue or submit a pull request.

Acknowledgements

We would like to acknowledge the following resources and libraries that were used in this project:

  • Keras
  • TensorFlow
  • Seaborn
  • Pandas
  • NumPy
  • cv2
  • PIL
  • MatPlotLib