Collecting data of kanji usage frequencies from Twitter Streaming API.
See POST statuses/filter
for details.
git clone https://github.com/THIS/REPO.git
npm install
cp tokens.json.dist tokens.json
- Fill in your tokens in
tokens.json
(create a new app) node collect-data.js
- a new stream is started right away, but data is not being saved automatically!- Enter
help
to see a list of available commands - Enter
save
to store data in a file (kanji.json
in root directory)
{ state: 'RUNNING',
percentFull: 0,
uniqueKanji: 3666,
totalKanji: 2300350 }
state
- if'PAUSED'
, data is not being collectedpercentFull
- how full is a stream queue, see description ofstall_warnings
parameteruniqueKanji
- how many distinct kanji were seen so fartotalKanji
- how many kanji were seen in total, including repetitions