Skip to content

Commit

Permalink
docs(readme.md): update radme
Browse files Browse the repository at this point in the history
  • Loading branch information
euberdeveloper authored Dec 5, 2020
1 parent ca5c32e commit b1c4721
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ Import all the tsv wikimedia history dump to mongodb

The purpose of this repo is to import all the italian **tsv wikimedia history dump** in a **mongodb database**. The reference to the dump is [here](https://dumps.wikimedia.org/other/mediawiki_history/readme.html).

All the data in the tsv is preserved, but separated in **three collections** in base of the event_type: revisions, pages and users.
All the data in the tsv is preserved, but separated in **three collections** in base of the event_type: revisions, pages and users. The types are correctly parsed before inserting to mongodb, so the timestamps become dates, the comma-separated lists become arrays of strings, ecc. ecc.

## How was it made

Expand Down

0 comments on commit b1c4721

Please sign in to comment.