Skip to content

Releases: clipperhouse/uax29.net

v3.0.0

23 Jul 01:36
Compare
Choose a tag to compare

What's Changed

Add Options.OmitWhitespace

Add optional extension methods using UAX29.Extensions

Tokenizer.GetWordsSplit.Words

Full Changelog: v2.2.0...v3.0.0

v2.2.0

09 Jul 15:44
a0a52d7
Compare
Choose a tag to compare

Added a new Position property, to know where in the original string/stream the current token is. It’s a count of bytes (UTF-8) or chars (UTF-16).

What's Changed

Full Changelog: v2.1.0...v2.2.0

v2.1.0

07 Jul 04:52
ab1d0ab
Compare
Choose a tag to compare

Added a new Ranges property on the tokenizer, an enumerator of the token boundaries.

Several optimizations and cleaner code.

What's Changed

Full Changelog: v2.0.3...v2.1.0

v2.0.3

21 Jun 19:10
Compare
Choose a tag to compare

Added ToList and ToArray methods, refactors and renames.

If you are using v1.x of this package, v2 has been renamed:

dotnet add package uax29.netdotnet add package UAX29

using uax29using UAX29

We now use methods:

Tokenizer.Create(input)Tokenizer.GetWords(input)

Tokenizer.Create(input, TokenType.Graphemes)Tokenizer.GetGraphemes(input)

v1.1.0

15 Jun 18:07
Compare
Choose a tag to compare

Add support for Stream (of UTF-8 bytes) and TextReader / StreamReader of char.

1.0.0

06 Jun 12:46
Compare
Choose a tag to compare

First implementation. Unicode tests are passing and perf looks good.

Install via Nuget dotnet add package uax29.net --version 1.0.0