ETL pipeline built with Python's Requests library, using the GitHub API to extract and store data on the languages used in the repositories of several companies.

millenagena/python-apis-conhecendo-a-biblioteca-requests


ETL Project with Python's Requests library and the GitHub API

This project develops an ETL (Extract, Transform, Load) pipeline that fetches data about the programming languages used by major companies, including Amazon, Spotify, Netflix, and Apple. The pipeline extracts the data, shapes it, and stores the relevant information for later analysis.

The pipeline is implemented with Python's Requests library and the GitHub API, which exposes information about the programming languages used in these companies' repositories.
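As a rough illustration of the extraction step, the sketch below requests one page of repositories for an account. It is not taken from the project's notebook: the helper names (`build_headers`, `repos_url`, `fetch_repos_page`) are hypothetical, and the token is assumed to be a personal access token you supply yourself.

```python
import requests

API_BASE_URL = "https://api.github.com"


def build_headers(token=None):
    """Return request headers; the token is optional for public data."""
    headers = {
        "Accept": "application/vnd.github+json",
        "X-GitHub-Api-Version": "2022-11-28",
    }
    if token:
        headers["Authorization"] = f"Bearer {token}"
    return headers


def repos_url(owner):
    """URL that lists the repositories of a GitHub account."""
    return f"{API_BASE_URL}/users/{owner}/repos"


def fetch_repos_page(owner, token=None, page=1, timeout=10):
    """Fetch one page of repositories and return the decoded JSON list."""
    response = requests.get(
        repos_url(owner),
        headers=build_headers(token),
        params={"page": page},
        timeout=timeout,
    )
    # Raise an exception on 4xx/5xx instead of parsing an error body.
    response.raise_for_status()
    return response.json()
```

Calling `fetch_repos_page("some-account")` would return a list of dictionaries, one per repository. Each entry is expected to carry, among other fields, the repository `name` and its primary `language`.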

The notebook containing all the code is in the "notebook" folder. The "classes" folder holds the same code, restructured into Python classes so that it can be reused.

Key features and functionalities of the project include:

  1. Data extraction with the GitHub API
  2. Status code handling
  3. Authentication
  4. Pagination
  5. Object-Oriented Programming (OOP)
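The features listed above can be combined in a single class. The following is a minimal sketch under stated assumptions, not the code from the "classes" folder: the class name `GitHubReposClient`, its `fetch_all` method, and the `max_pages` safety limit are all hypothetical. It sends an optional token for authentication, checks the status code of every response, and walks through the pages until an empty one is returned.

```python
import requests


class GitHubReposClient:
    """Hypothetical client; the project's own classes may be organized differently."""

    def __init__(self, owner, token=None, session=None):
        self.owner = owner
        self.url = f"https://api.github.com/users/{owner}/repos"
        self.headers = {"Accept": "application/vnd.github+json"}
        if token:
            # Authentication: authenticated requests get a higher rate limit.
            self.headers["Authorization"] = f"Bearer {token}"
        # An injected session keeps the class testable without network access.
        self.session = session if session is not None else requests.Session()

    def fetch_all(self, per_page=100, max_pages=50):
        """Collect repositories page by page until an empty page is returned."""
        repos = []
        for page in range(1, max_pages + 1):
            response = self.session.get(
                self.url,
                headers=self.headers,
                params={"page": page, "per_page": per_page},
                timeout=10,
            )
            # Status code handling: stop early on anything other than 200 OK.
            if response.status_code != 200:
                raise RuntimeError(
                    f"GitHub API returned {response.status_code} on page {page}"
                )
            batch = response.json()
            if not batch:
                break  # an empty list means there are no more pages
            repos.extend(batch)
        return repos
```

A call such as `GitHubReposClient("some-account", token="...").fetch_all()` would return every repository as one flat list. Passing a session object in the constructor is a design choice made here for testing, since any object with a compatible `get` method can stand in for the real one.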

This project was developed for a course I taught at Alura. You can access it by clicking on the link: Course's link

Technologies used

  • Python;
  • Requests library;
  • GitHub API;
  • Pandas library.
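To show where Pandas might fit in the transform and load steps, here is a small sketch. It assumes each repository is a dictionary with `name` and `language` keys, as in the GitHub API's repository listing. The function names, column names, and sample data are invented for illustration and are not taken from the project.

```python
import pandas as pd


def repos_to_dataframe(repos):
    """Transform: keep only the repository name and its primary language."""
    rows = [
        {"repository_name": repo.get("name"), "language": repo.get("language")}
        for repo in repos
    ]
    return pd.DataFrame(rows, columns=["repository_name", "language"])


def save_languages(df, path):
    """Load: write the table to a CSV file without the index column."""
    df.to_csv(path, index=False)


# Made-up sample data; a repository may have no detected language.
sample = [
    {"name": "repo-a", "language": "Python", "stargazers_count": 10},
    {"name": "repo-b", "language": None},
]
languages_df = repos_to_dataframe(sample)
```

From here, something like `languages_df["language"].value_counts()` would count repositories per language, and `save_languages(languages_df, "languages.csv")` would store the result for later analysis.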

Contact

Email: millenagena@gmail.com
