
time for pretraining #3

Open
Tortoise17 opened this issue May 7, 2021 · 4 comments

Comments

@Tortoise17

May I ask whether you have any figures on how long pretraining takes, and on which GPU and which dataset? Please guide me if you can.

@harshraj32

Were you able to download the dataset?

@ammesatyajit
Owner

For the dataset, I used HowTo100M and filtered it down to the cooking videos. The ids for the cooking videos only are listed in VideoBERT/data/ids.txt. Here are the steps I took to download the dataset:

1. Go to this link: https://www.di.ens.fr/willow/research/howto100m/
2. At the bottom of the page there should be a download section. To get access to the private server so you can download the dataset easily, you will need to fill out the form linked there. The creators of the dataset should respond within 2-3 days with a login for the server. Using that login, you can download the videos following the instructions provided, together with the ids.txt file.
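Once you have ids.txt, the filtering step above amounts to reading one video id per line. A minimal sketch (the `load_ids` helper and default path are my own illustration, not part of the repo's code):

```python
from pathlib import Path

def load_ids(ids_file="VideoBERT/data/ids.txt"):
    """Return the list of video ids, one per line, skipping blank lines."""
    return [line.strip()
            for line in Path(ids_file).read_text().splitlines()
            if line.strip()]

if __name__ == "__main__":
    ids = load_ids()
    print(f"{len(ids)} cooking-video ids loaded")
```

The actual download commands depend on the server instructions the HowTo100M authors send with your login, so they are not reproduced here.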

@Tortoise17 Unfortunately I do not have pretraining figures for this model, and I do not recall which GPU I was using at the time since I was on a VM. However, this should run on a Google Colab-equivalent GPU if the model size is decreased a bit. Hope that helps.

@harshraj32

Yeah, there are over 47k ids. What would be the total size of all 47k videos?

@ammesatyajit
Owner

It was about 400-500 GB in total.
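As a quick sanity check on that figure, 400-500 GB spread over roughly 47k videos works out to about 10 MB per clip (the midpoint of 450 GB is my own assumption for the arithmetic):

```python
# Back-of-the-envelope average video size for the filtered HowTo100M subset.
total_gb = 450        # midpoint of the 400-500 GB estimate above
num_videos = 47_000   # "over 47k ids" per the thread

avg_mb = total_gb * 1000 / num_videos
print(f"~{avg_mb:.1f} MB per video")  # → ~9.6 MB per video
```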
