This repository is a collection of scraping scripts written for the Programming for Cultural Heritage course at Pratt Institute.

GenevieveMilliken/pfch_New_Deal_in_New_Orleans

New Deal in New Orleans - Web Scraping Images

This collection of Python scripts was written for a final project in the Programming for Cultural Heritage course at Pratt Institute. The scripts scrape two websites: Yale's Photogrammar and the New Orleans Public Library WPA Photography Collection. The scraped images were used to create The New Deal in New Orleans, an Omeka site showcasing images taken during the New Deal.

About

(1) Yale's Photogrammar

Running FSA_Photo_1_JSON_Write.py creates a JSON file that contains each photograph's metadata and image URL.
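As a rough illustration of the kind of JSON file this step produces, here is a minimal sketch. It is not the code from FSA_Photo_1_JSON_Write.py: the function name `write_metadata`, the field names (`title`, `photographer`, `image_url`), and the example URL are all assumptions made for illustration.

```python
import json


def write_metadata(records, json_path):
    """Write a list of photo records (plain dicts) to a JSON file."""
    with open(json_path, "w", encoding="utf-8") as f:
        json.dump(records, f, indent=2, ensure_ascii=False)


# Hypothetical record: the real scripts may use different field names.
sample_records = [
    {
        "title": "Example photograph",
        "photographer": "Unknown",
        "image_url": "https://example.org/images/photo_001.jpg",
    },
]
```

Calling `write_metadata(sample_records, "photos.json")` would write one record per photograph, keeping the image URL alongside its metadata so a later script can read it back.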

Running FSA_Photo_2_Download.py reads the image URLs from the JSON file created by the first script and downloads the images to a local folder named FSA_Images. Create this folder in your local directory before running the second script.
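The download step described above can be sketched roughly as follows. This is an assumption-laden illustration, not the contents of FSA_Photo_2_Download.py: the function names `plan_downloads` and `download_all`, and the `image_url` field, are hypothetical. The sketch only uses standard-library calls and refuses to run if the target folder is missing, mirroring the requirement that the folder exist beforehand.

```python
import json
import os
import urllib.request
from urllib.parse import urlparse


def plan_downloads(json_path, folder="FSA_Images"):
    """Return (url, destination_path) pairs for every record in the JSON file."""
    if not os.path.isdir(folder):
        raise FileNotFoundError(f"Create the folder {folder!r} before downloading")
    with open(json_path, encoding="utf-8") as f:
        records = json.load(f)
    pairs = []
    for record in records:
        url = record["image_url"]  # hypothetical field name
        # Use the last segment of the URL path as the local file name.
        filename = os.path.basename(urlparse(url).path)
        pairs.append((url, os.path.join(folder, filename)))
    return pairs


def download_all(pairs):
    """Fetch each URL and save it at its destination path."""
    for url, dest in pairs:
        urllib.request.urlretrieve(url, dest)
```

Splitting the planning from the fetching makes it possible to inspect the list of destinations before any network request is made, for example with `download_all(plan_downloads("photos.json"))`.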

(2) New Orleans Public Library's WPA Collection

Running WPA_COLLECTION_NAME_1_JSON_Write.py creates a JSON file that contains each photograph's metadata and image URL.

Running WPA_COLLECTION_NAME_2_Download.py reads the image URLs from the JSON file created by the first script and downloads the images to a local folder named WPA_COLLECTION_NAME_Images. Create a folder named for the collection (e.g. WPA_Lib_Images, WPA_Rec_Images) in your local directory before downloading. The folder name must match the file path used in the second script.
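One way to avoid a mismatch between the folder name and the path in the download script is to derive both from the same collection name. The sketch below is a suggestion rather than part of the original scripts, which expect the folder to be created by hand; the helper names `images_folder` and `ensure_folder` are hypothetical.

```python
import os


def images_folder(collection):
    """Build the folder name for a collection, e.g. 'Lib' -> 'WPA_Lib_Images'."""
    return f"WPA_{collection}_Images"


def ensure_folder(collection, base_dir="."):
    """Create the collection's image folder if it is missing and return its path."""
    path = os.path.join(base_dir, images_folder(collection))
    os.makedirs(path, exist_ok=True)  # no error if the folder already exists
    return path
```

For example, `ensure_folder("Rec")` creates WPA_Rec_Images in the current directory if it is not already there.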

Scripts are available for the following WPA collections:

If you have any questions, feel free to reach out at genevieve [dot] milliken [at] gmail [dot] com.
