Update README.md
JoanGi authored Jan 4, 2024
1 parent 4053a11 commit b1d3931
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion README.md
@@ -12,6 +12,10 @@ The *FullAnalysisResults.xlsx* contains the full data, the charts, the topic ana

3 - **Code**: The code used to extract the data, with one script for each journal. This helps in replicating the experiment.

*SDataExtractor.py* and *DBriefExtractor.py* contain the code to perform the extraction. Note that you will need an OpenAI API key and a running GROBID service to execute the notebooks.
The *dataPaperScrapping.ipynb* notebook contains the code used to filter all papers of the data-paper type from both journals and get their PDFs (when possible). If you want to reproduce the experiment, start with this notebook.

Once you have all the PDFs from the journals, *SDataExtractor.py* and *DBriefExtractor.py* contain the code to perform the extraction for each journal. Note that you will need an OpenAI API key and a running GROBID service to execute the notebooks.
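
Before running the extractors, it may help to confirm that both prerequisites are in place. The following is a minimal sketch, not part of this repository. It assumes the key is exposed through the `OPENAI_API_KEY` environment variable and that GROBID listens on its default address `http://localhost:8070`. The extractor scripts may read their configuration differently, and the helper names `has_openai_key` and `grobid_is_alive` are hypothetical.

```python
import os
import urllib.error
import urllib.request


def has_openai_key(env=None):
    """Return True if a non-empty OPENAI_API_KEY is present.

    The variable name is an assumption; adjust it to however the
    extractor scripts actually load the key.
    """
    env = os.environ if env is None else env
    return bool(env.get("OPENAI_API_KEY", "").strip())


def grobid_is_alive(base_url="http://localhost:8070", timeout=5):
    """Return True if GROBID answers its /api/isalive health check.

    base_url is an assumed default; change it if the service runs elsewhere.
    """
    try:
        with urllib.request.urlopen(f"{base_url}/api/isalive", timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError, ValueError):
        # Unreachable host, refused connection, timeout, or malformed URL.
        return False


if __name__ == "__main__":
    print("OpenAI API key set:", has_openai_key())
    print("GROBID reachable:", grobid_is_alive())
```

If either check reports `False`, fix that prerequisite before starting the extraction.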


