This is a Django app that takes a text excerpt as input and produces a narrated film of that text with relevant images. It uses Named Entity Recognition (NER) to extract entities from the given text, downloads images of those entities from Google, generates audio of the text with Google's Tacotron model, and then combines the images, text, and audio into the final video.
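As a rough illustration of the final assembly step, here is a minimal sketch that stitches the downloaded entity images and the generated narration into a video. It assumes `moviepy` is available; the app's actual video-generation code may differ.

```python
# Minimal sketch of the video-assembly step, assuming moviepy;
# the app's actual implementation may differ.
from moviepy.editor import AudioFileClip, ImageClip, concatenate_videoclips

def build_video(image_paths, audio_path, out_path="film.mp4"):
    """Show each downloaded entity image for an equal slice of the narration."""
    narration = AudioFileClip(audio_path)
    per_image = narration.duration / max(len(image_paths), 1)
    clips = [ImageClip(p).set_duration(per_image) for p in image_paths]
    video = concatenate_videoclips(clips, method="compose").set_audio(narration)
    video.write_videofile(out_path, fps=24)
    return out_path
```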
The dataset used is the Annotated Corpus for Named Entity Recognition (CoNLL 2002), annotated with IOB and POS tags. It consists of 47,959 sentences containing 1,048,575 labelled words. Each word carries a POS tag (part of speech) and an entity Tag (Name, Organisation, Location, etc.).
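As a quick illustration of the annotation scheme, each token gets a POS tag and an IOB-prefixed entity tag. The sentence below is made up, not drawn from the corpus, and the exact tag names depend on the corpus.

```python
# Illustrative IOB/POS annotation (made-up sentence, not from the dataset).
# B- marks the first token of an entity, I- a continuation, O a non-entity token.
words    = ["Steve", "Jobs",  "founded", "Apple", "in", "California", "."]
pos_tags = ["NNP",   "NNP",   "VBD",     "NNP",   "IN", "NNP",        "."]
ner_tags = ["B-PER", "I-PER", "O",       "B-ORG", "O",  "B-LOC",      "O"]
```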
The NER model is trained on the CoNLL-2003 dataset with GloVe embeddings. We have tried two models; the training code is included in the training folder. Both share the following layers:
- input_1 (InputLayer)
- Embedding_1 (Embedding)
- Dropout_1 (Dropout Layer)
- Bidirectional_1 (Bidirectional LSTM)
- Time_distributed_1 (TimeDistributed LSTM)
- Crf_1 (CRF)
Model 1 uses word embeddings created from our own dictionary.
Model 2 uses pretrained GloVe embeddings in addition to the embeddings created from our dictionary.
Model 2 is used in the final app.
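A minimal Keras sketch of this architecture is shown below. All hyperparameters (sequence length, vocabulary size, LSTM units, tag count) are illustrative, the CRF layer is assumed to come from keras-contrib, and the TimeDistributed step is sketched as a Dense wrapper, the usual choice before a CRF; the actual trained models may differ.

```python
# Sketch of the BiLSTM-CRF NER model (Model 2 variant, with the pretrained GloVe
# weights passed in as `embedding_matrix`); all sizes are illustrative.
from keras.models import Model
from keras.layers import Input, Embedding, Dropout, Bidirectional, LSTM, TimeDistributed, Dense
from keras_contrib.layers import CRF  # CRF layer from keras-contrib (assumed)

MAX_LEN, VOCAB_SIZE, EMB_DIM, N_TAGS = 75, 35000, 100, 17  # illustrative sizes

def build_ner_model(embedding_matrix=None):
    inp = Input(shape=(MAX_LEN,))                               # input_1
    emb = Embedding(
        VOCAB_SIZE, EMB_DIM,
        weights=[embedding_matrix] if embedding_matrix is not None else None,
    )(inp)                                                      # embedding_1
    x = Dropout(0.1)(emb)                                       # dropout_1
    x = Bidirectional(LSTM(100, return_sequences=True,
                           recurrent_dropout=0.1))(x)           # bidirectional_1
    x = TimeDistributed(Dense(50, activation="relu"))(x)        # time_distributed_1
    crf = CRF(N_TAGS)                                           # crf_1
    out = crf(x)
    model = Model(inp, out)
    model.compile(optimizer="rmsprop", loss=crf.loss_function, metrics=[crf.accuracy])
    return model
```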
To view the image properly, download it and zoom in.

- Clone the repository and install the requirements from requirement.txt.
- Run 'python manage.py runserver' to start the app on localhost.