This repo treats about 3 themes:
- Audio Recognition
- Video Recognition
- Image Retrieval
Several different notebook were devoloped with respect to the project phases. In particular you will find:
-Media Collection: script for audio and faces collection. The faces are cropped before saving the images.
-Recordings Recognition: Feed-forward Neural Net for speaker and word recognition.
-Face Recognition: VGG16 CNN fine tuning for Face Recognition.
-Image Retrieval: Most similar VIPs faces are returned from one picture.
-Demo Live: Live Demo with OpenCV2.