Skip to content

Latest commit

 

History

History
36 lines (30 loc) · 1.46 KB

README.md

File metadata and controls

36 lines (30 loc) · 1.46 KB

TMDB API: Movie information retrieval

This project consists of two Python scripts with the purpose of retrieving all the information possible from movies in the TMBD API. Its goal is to create a comprehensive dataset for data analysis and/or machine learning.

Requirements

  • TMDB API KEY: You must have a developer account in TMDB. It's free.
  • Virtual environment: (Optional)
python3 -m venv venv
source venv/bin/activate # Unix/macOS
venv\Scripts\activate.bat # Windows
  • Python 3:   pip install -r requirements.txt

Usage

Very easy. We have 2 modes of retrieving:

  • Brute force search of valid IDs in the API by incrementing an index.
  • A more targeted alternative that consists in having a file loaded of TMDB IDs or IMDB IDs to perform the retrieval of information of those.
# Brute force search
python main.py

# Retrieve from file
python main.py /path/to/file.csv

Dataset

This project includes a dataset with 7000+ movies with various information such as budget, revenue, cast, crew, etc. If you need a larger dataset you can run the script using brute force search or edit the script to start searching for movies where I left off.

Important note

Although there is no request rate limit (more info here) be extremely cautious and don't over abuse it. I claim NO RESPONSIBILITY for what you might cause.