recipeProj

EECS 337 Team project by Xinyi Wu, Group 22

-----------------------------Programming language version---------------------
Python 3.6
-----------------------------Package list----------------------------------------
Standard Python packages: json, copy, collections, re, request

External Python packages: nltk 3.2.4, beautifulsoup4 4.6.0

All external packages can be installed with pip or Anaconda.

-----------------------------------------Relative path import----------------------------
Some of the code is imported by relative path. PyCharm may report a no-module-named error even when the imported file is in the same directory. If that happens, ignore the notice and run the code directly.

-------------------------------------File specification-----------------------------------
The txt files named "output-XXX-prep" are all preparation files (or reference files) for creating the transformation dictionary.

All runnable code is in the EECS337 folder.

-supporting code:
>>-SL.py :Saves the code to disk and arranges it into a more readable format.
>>-scrapeurl.py :Scrapes selected URLs for description extraction (e.g. scrapes 100 Chinese recipe URLs from Allrecipes.com).
>>-dict.py :Outputs the "output-XXX-prep" txt files, which list ingredient descriptions for different recipe styles by frequency.
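
The frequency listing that dict.py is described as producing could be sketched roughly as follows. This is only an illustration of counting descriptor words with `collections.Counter`; the function name `descriptor_frequencies` and the sample ingredient lines are assumptions, not taken from the project code.

```python
import re
from collections import Counter


def descriptor_frequencies(ingredient_lines):
    """Count how often each lowercase word appears across ingredient lines.

    Hypothetical helper: the real dict.py may tokenize differently.
    """
    counts = Counter()
    for line in ingredient_lines:
        # Keep alphabetic tokens only, so quantities such as "2" are dropped.
        counts.update(re.findall(r"[a-z]+", line.lower()))
    return counts


# Made-up sample data standing in for scraped ingredient descriptions.
sample = ["2 tablespoons soy sauce", "1 cup soy milk", "fresh ginger, minced"]
freq = descriptor_frequencies(sample)

# Print words from most to least frequent, as a prep file might list them.
for word, n in freq.most_common():
    print(word, n)
```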

-recipe extraction code:

-recipegenerator.py :Scrapes recipe information from the given URL and arranges it into a readable format.
-recipeNT.py :Outputs a single recipe.

-transformation code:

-to(style).py :Transforms the original recipe into a specific style and outputs both the original and the transformed version.
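
As a rough illustration of a dictionary-based transformation that returns both versions, consider the sketch below. The substitution table `TO_VEGETARIAN`, the recipe layout, and the function `transform` are all hypothetical; the actual to(style).py scripts may work quite differently.

```python
import copy

# Hypothetical substitution table; the project's real transformation
# dictionary is built from the "output-XXX-prep" files.
TO_VEGETARIAN = {"chicken": "tofu", "beef broth": "vegetable broth"}


def transform(recipe, substitutions):
    """Return (original, changed), leaving the original recipe untouched."""
    changed = copy.deepcopy(recipe)
    changed["ingredients"] = [
        substitutions.get(name, name) for name in recipe["ingredients"]
    ]
    return recipe, changed


# Made-up recipe used only to exercise the sketch.
recipe = {"name": "stir fry", "ingredients": ["chicken", "ginger", "beef broth"]}
original, changed = transform(recipe, TO_VEGETARIAN)
print("original:", original["ingredients"])
print("changed: ", changed["ingredients"])
```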

--------------------------------------------How to run------------------------------------------------
All the transformation scripts, as well as recipeNT.py, can be run directly.

Each script prompts you for a URL. Type a space after the URL before pressing Enter (without the trailing space you will be redirected to that web page, at least in PyCharm). If you press Enter without typing anything, the program runs with a default URL.
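
The prompt behaviour described above might look something like this sketch. `DEFAULT_URL` is a placeholder and `resolve_url` is a hypothetical helper; neither name comes from the project.

```python
# Placeholder only; the project's scripts define their own default URL.
DEFAULT_URL = "https://example.com/recipe"


def resolve_url(raw, default=DEFAULT_URL):
    """Strip surrounding whitespace and fall back to the default if empty."""
    url = raw.strip()
    return url if url else default


# In a script this would typically be called as:
#     url = resolve_url(input("Recipe URL: "))
print(resolve_url("https://example.com/some-recipe "))
print(resolve_url(""))
```

Stripping the input means the trailing space typed to avoid the PyCharm redirect does not end up in the URL that is requested.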

--------------------------------------------Server not responding------------------------------------------
Because the program extracts information from the website at run time, HTTP connection errors or an unresponsive server can cause failures. If that happens, just rerun the code.