Replication code for the paper "Newswire: A Large-Scale Structured Database of a Century of Historical News"
Repo structure -
├── entity
│ └── pipeline.py
├── georeferencing
│ └── georef.py
├── README.md
├── topic_models
│ └── train_topics.py
└── utils
└── utils.py
All relevant functions are in entity/pipeline.py
All relevant functions in georeferncing/georef.py
Code to train the models and example Hyperparameters are in topic_models/train_topics.py