Skip to content

Releases: oroszgy/hunlp-resources

Webcorpuswiki word2vec model

26 Apr 08:07
Compare
Choose a tag to compare
Pre-release

Word2vec model trained on the Hungarian Webcorpus and on the Hungarian Wikipedia dump (as of 2017-04-21).

Parameters:

  • 300 dimensions
  • cbow model
  • minimium word frequency is set to 10

Webcorpuswiki frequencies

26 Apr 08:11
Compare
Choose a tag to compare
Pre-release

Term and document frequency list of words generated from union of the Hungarian Webcorpus and the Hungarian Wikipedia dump (as of 2017-04-21).

Webcorpuswiki Brown clusters

02 May 14:37
Compare
Choose a tag to compare
Pre-release

Brown clusters (2^6 clusters) got from the Hungarian Webcorpus and on the Hungarian Wikipedia dump (as of 2017-04-21).