Skip to content

getpapers

ShweataNHegde edited this page Sep 9, 2020 · 5 revisions

### # # TESTER 10: Shweata N. Hegde


1. OS: Windows 7

SOURCE: https://github.com/petermr/tigr2ess/blob/master/installation/windows/INSTALLATION.md

a. DOWNLOADING Node

b. INSTALLATION of Node

  • Run the following command at the command prompt
nvm install 7
nvm use 7.10.1

Installation successful without any problems.

c. INSTALLATION OF getpapers

  • Run the following command at the command prompt. 11 warnings but no errors.
npm install --global getapapers

d. USAGE OF getpapers

  1. getpapers was first used to find out how many open access papers were available on a specific topic.
getpapers -q <query> -n
  • -q: search query
  • -n: non-executable

Eg. Run the following syntax on the command prompt

getpapers -q viral epidemics -n

image 2: first non-executable run

It is found that 312120 open access results were available for our search.

  1. getpapers was now used to download limited number of xml and pdf of open access paper on a specified topic.
getpapers -q <query> -k <int> -o <path> -x -p
  • -q: search query
  • -k: limits the number of hits and downloads <int>: integer
  • -o: output directory, will be created if not found
  • -p: downloads if full text pdf is available
  • -x: downloads if full text xml is available

In our case, we run the following syntax

getpapers -q viral epidemics -k 100 -o test -x -p

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/17146218-749f-4d9c-9d39-0f8feaa5feb2/first_search_run_eight_three_downloadable.png

image 3: 83 xml files downloaded

17 warnings. (14 of which said that the articles were not open access and 3 of them said xml files were unavailable.)

image 5: 86 pdfs downloaded

A new output directory called test was created. It had 97 folders each containing a JSON file(most of which also had xml and pdfs file). 3 folders had only pdfs version of the paper and 1 folder had 3 JSON file corresponding to those 3 pdf. In addition, 1 html text file containing all the URLs of the papers downloaded and 1 JSON file with results were created.

(83 XML files and 86 pdfs were downloaded.)

Google drive link for the mini corpus.

https://drive.google.com/drive/folders/1HB8lCK2cOfvCrV3ihh4-Zx5i1IZnJHLW?usp=sharing


2. OPERATING SYSTEM: Windows 10

Same installation process as mentioned above. No problems reported in downloading node.

  • Tried installing getpapers but below errors(image 6) showed up. Yet to solve the issue.

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/c1cef8a5-adc8-45a4-a30d-7117fe25c592/successfully_installed_node_but_cannot_install_getpapers_Error1.jpg

image 6: Error messages that appeared during the installation of getpapers