Piazza Scraper

Modified to dump everything requested to JSON instead of a sqlite database

Removed the updating mechanisms -- this is meant to be used once, after my class is over, to export everything from the course

Installation

pip install git+https://github.com/purarue/piazza-scraper

How to Run

Set the PIAZZA_USERNAME and PIAZZA_PASSWORD environment variables

python3 -m piazza_scraper scrape courseid

To parse:

python3 -m piazza_scraper parse ./courseid.json

HPI

This is used in HPI with the my.piazza.scraper module -- it locates, infers my user in the export and returns my posts, allowing me to summarize/query easily:

$ hpi doctor -S my.piazza.scraper
✅ OK  : my.piazza.scraper
✅     - stats: {'posts': {'count': 22}}
# e.g. extract text from one of the posts
$ hpi query my.piazza.scraper.posts --order-key created --reverse | jq '.[6].text'
"bash doesn't exist where the script assumes it is [which is /usr/bin/bash most linux systems]. You need to run the grading script on the server or change the shebang value at the top"

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
piazza_scraper		piazza_scraper
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Piazza Scraper

Installation

How to Run

HPI

About

Languages

License

purarue/piazza-scraper

Folders and files

Latest commit

History

Repository files navigation

Piazza Scraper

Installation

How to Run

HPI

About

Topics

Resources

License

Stars

Watchers

Forks

Languages