Skip to content
#

maxpool2d

Here are 14 public repositories matching this topic...

Lip reading using TensorFlow, OpenCV, and Keras involves training a deep learning model to recognize spoken words by analyzing lip movements from video frames. The process starts with OpenCV for capturing and preprocessing video frames, focusing on the speaker’s lips. These frames are then fed into a neural network built using Keras and TensorFlow.

  • Updated Sep 18, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the maxpool2d topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the maxpool2d topic, visit your repo's landing page and select "manage topics."

Learn more