This program lets you create a .CSV file with information about every TED Talk, including:
- Title
- Description
- Views (Number of Views)
- Author
- Date
- Tags
- Link to the talk's page on TED.com
This project is built in Go, using the Colly library for the first time, to scrape all talks from the TED website. I hope to improve the quality and quantity of the generated file to create a better dataset and maybe, one day, upload it to Kaggle.
This isn't perfect, so here is a list of things to improve or add:
- Retrieve the thumbnail link and add it to the dataset
- Retrieve the duration of the video
- Add (maybe in another file) the descriptions of the authors
- Publish on Kaggle
MIT