Skip to content

Releases: giacbrd/ShallowLearn

Online learning and hashing trick

30 Dec 17:09
Compare
Choose a tag to compare
Pre-release
  • Online learning and better pre-training in GensimFastTex:
    • Hashing trick for building the vocabulary, similar to the original fastText approach
    • It is possible to pre-fit word embeddings from a dataset with word2vec
    • True online earning with partial_fit, the vocabulary is incrementally updated
  • New version of fastText.py: 0.8.2
  • New version of Gensim: 0.13.4
  • Fixed predict_proba output format

Faster prediction and persistence

06 Nov 17:04
Compare
Choose a tag to compare
Pre-release
  • Faster prediction for multiple labels with one predict call
  • Better persistence with save and load methods
  • Fixed parameter names convention

Quasi-stable models

27 Oct 23:50
Compare
Choose a tag to compare
Quasi-stable models Pre-release
Pre-release
  • FastText classifier based on version 0.8.0 of https://github.com/salestock/fastText.py
  • GensimFastText has now:
    • negative sampling
    • softmax as alternative output function
    • almost complete LabeledWord2Vec as subclass of Gensim's Word2Vec
  • More tests

Fast Cython code

13 Oct 22:51
Compare
Choose a tag to compare
Fast Cython code Pre-release
Pre-release

this is broken, valid commit for this release is 263bfb1

  • Cython code for fastText in Gensim
  • Script for benchmarks

First working model

11 Oct 00:21
Compare
Choose a tag to compare
First working model Pre-release
Pre-release
  • GensimFTClassifier can train and predict