Skip to content

v0.4.0

Compare
Choose a tag to compare
@mattdangerw mattdangerw released this 28 Dec 00:10
· 76 commits to r0.4 since this release
729815b

The 0.4 release adds support for pretrained models to the library via keras_nlp.models. You can read an
introduction to the new API in our Getting Started Guide.

If you encounter any problems or have questions, please open an issue!

Breaking Changes

  • Renamed keras_nlp.layers.MLMHead -> keras_nlp.layers.MaskedLMHead.
  • Renamed keras_nlp.layers.MLMMaskGenerator -> keras_nlp.layers.MaskedLMMaskGenerator.
  • Renamed keras_nlp.layers.UnicodeCharacterTokenizer -> keras_nlp.layers.UnicodeCodepointTokenizer.
  • Switched the default of lowercase in keras_nlp.tokenizers.WordPieceTokenizer from True to False.
  • Renamed the token id output of MaskedLMMaskGenerator from "tokens" to "tokens_ids".

Summary

  • Added the keras_nlp.models API.
    • Added support for BERT, DistilBERT, RoBERTa, and XLM-RoBERTa models and pretrained checkpoints.
    • See our Getting Started Guide for more details.
  • Added new metrics.
    • keras_nlp.metrics.Bleu and keras_nlp.metrics.EditDistance.
  • Added new vocabulary training utilities.
    • keras_nlp.tokenizers.compute_word_piece_vocabulary and keras_nlp.tokenizers.compute_sentence_piece_proto.
  • Added new preprocessing layers.
    • keras_nlp.layers.RandomSwap and keras_nlp.layers.RandomDeletion.

What's Changed

New Contributors

Full Changelog: v0.3.0...v0.4.0