
RaffaeleAns/Audio-Video-Recognition-and-Retrieval


Digital Signals Processing and Retrieval

Machine Learning tools for Audio & Face Recognition and Retrieval

This repo covers three themes:

  1. Audio Recognition
  2. Video Recognition
  3. Image Retrieval

Several notebooks were developed, one for each phase of the project. In particular, you will find:

- Media Collection: script for collecting audio recordings and face images. The faces are cropped before the images are saved.

- Recordings Recognition: feed-forward neural network for speaker and word recognition.

- Face Recognition: fine-tuning of a VGG16 CNN for face recognition.

- Image Retrieval: given one picture, the most similar VIP faces are returned.

- Demo Live: live demo with OpenCV.
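
The retrieval step listed above can be illustrated with a minimal sketch. It assumes each face has already been turned into a numeric feature vector (for instance by a CNN such as the fine-tuned VGG16) and ranks a gallery by cosine similarity to the query. The similarity measure, the function names `cosine_similarity` and `most_similar`, and the toy 4-dimensional vectors are all assumptions made for illustration; the notebooks may use a different approach.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, so treat it as dissimilar
    return dot / (norm_a * norm_b)


def most_similar(query, gallery, top_k=3):
    """Return the top_k (name, score) pairs from gallery, best match first."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in gallery.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical embeddings; real ones would be much longer and come from a CNN.
gallery = {
    "vip_a": [0.9, 0.1, 0.0, 0.2],
    "vip_b": [0.1, 0.8, 0.3, 0.0],
    "vip_c": [0.7, 0.2, 0.1, 0.3],
}
query = [0.8, 0.1, 0.0, 0.25]

results = most_similar(query, gallery, top_k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

Cosine similarity ignores vector length and compares only direction, which is a common choice when embeddings are not normalized. For a large gallery, a vectorized or indexed search would likely be preferable to this linear scan.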
