Get a paper's information from arXiv.org by its ID, take notes on the paper, and save them to a MongoDB database.
A model to analyze the papers' data is in progress.
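As a rough illustration of the lookup step, the sketch below builds a query URL for the public arXiv API and turns the Atom response into a dictionary that could be stored as a MongoDB document. The function names `arxiv_query_url` and `parse_arxiv_entry` and the document fields (`title`, `abstract`, `authors`, `notes`) are assumptions made for this example, not necessarily what the application uses.

```python
import xml.etree.ElementTree as ET

# XML namespace used by the Atom feed that the arXiv API returns.
ATOM = "{http://www.w3.org/2005/Atom}"


def arxiv_query_url(arxiv_id):
    """Build the arXiv API query URL for a single paper ID."""
    return f"http://export.arxiv.org/api/query?id_list={arxiv_id}"


def parse_arxiv_entry(atom_xml):
    """Parse an Atom response and return a dict for the first entry.

    Returns None when the feed contains no entry. The field names are
    hypothetical; adapt them to the schema you store in MongoDB.
    """
    root = ET.fromstring(atom_xml)
    entry = root.find(f"{ATOM}entry")
    if entry is None:
        return None

    def text(tag):
        # Collapse line breaks and repeated spaces inside the element text.
        node = entry.find(f"{ATOM}{tag}")
        return " ".join(node.text.split()) if node is not None and node.text else ""

    authors = [
        " ".join(author.findtext(f"{ATOM}name", "").split())
        for author in entry.findall(f"{ATOM}author")
    ]
    return {
        "title": text("title"),
        "abstract": text("summary"),
        "authors": authors,
        "notes": "",  # placeholder for the user's own notes
    }
```

The response body can be fetched from the URL with any HTTP client, and the resulting dictionary could then be saved with, for example, pymongo's `collection.insert_one(doc)`.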
Build
docker-compose build
Run
docker-compose up
Navigate to http://127.0.0.1:5000/library
To reset the DB
docker-compose down --volumes
- arxiv-downloader.py: Script to download PDF papers from arXiv.
- pdf-image.py: Script to convert all the PDFs in the source folder into images.
- split-data.py: Script to split the data from a source folder into a training set and a validation set.
- Analyze the text and extract other sections in addition to the abstract
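
For reference, the kind of training/validation split that split-data.py performs can be sketched as follows. This is a minimal example under assumed defaults: the function name `split_files`, the 80/20 ratio, and the fixed seed are illustrative choices, not taken from the script itself.

```python
import random


def split_files(filenames, val_fraction=0.2, seed=0):
    """Split file names into (training, validation) lists.

    val_fraction and seed are hypothetical defaults. Sorting first and
    shuffling with a seeded generator makes the split reproducible.
    """
    files = sorted(filenames)
    random.Random(seed).shuffle(files)
    n_val = int(len(files) * val_fraction)
    # The first n_val shuffled files form the validation set.
    return files[n_val:], files[:n_val]
```

Calling `split_files(names)` on the file names found in the source folder returns two lists, which can then be copied or moved into separate training and validation folders.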