Skip to content

Latest commit

 

History

History
12 lines (7 loc) · 382 Bytes

README.md

File metadata and controls

12 lines (7 loc) · 382 Bytes

Answering Queries using PySpark Framework

Usage:

python3 pyspark_no.py <Output CSV File Name> <Number of CPUs>

where no = 1, 2, or 3 for Question 1, 2 and 3 respectively.

Instructions:

  • Refer Assignment-5.pdf for a detailed description of the queries a user can ask.
  • Refer Dataset directory for details on the dataset which is used to answer the user's queries.