Skip to content

Video content description model for generating descriptions for unconstrained videos

License

Notifications You must be signed in to change notification settings

Abdelrhman-Yasser/video-content-description

Repository files navigation

Video Content Description

Increasing trend in the research community for video processing using artificial intelligence. Trending Tasks:

  • Video classification.
  • Video content description.
  • Video question answering (VQA).

Main Idea

The main idea is to generate descritptions for unconstrained videos, which can be used in video retrieval, blind navigation, and video subtitling.

Examples

Watch the video

Dataset

We use the Microsoft Research Video to Text (MSVD) dataset.

Extracted Visual Feature

We extracted the visual features of the data set using :

Architecture

Here is the our architecture.

Checkpoints

We have trained the model using different techniques.

Results

From the results obtained in the explained experiments, we found out that the best results obtained are from using attention and drop out. Our model outperforms the original paper model in all used metrics as shown in the following table:

Authors

Contribute

Contributions are always welcome!

Please read the contribution guidelines first.

License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details

About

Video content description model for generating descriptions for unconstrained videos

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published