crawl-for-parallel-corpora

simple bs4 based web crawl for a corpus in need of statistical machine translation

This Project collects Bible Dataset for Ethiopian languages and English respective transalatioin:

How to Run to get Data for Four languages

This is a NLP Data Collection Effort for to increase NLP data in Under-resourced languages.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
index.py		index.py
merge.py		merge.py
organize.py		organize.py