line-breaking-algorithms

An investigation of the line-breaking algorithms demonstrated by Juraj Sukop at https://xxyxyz.org/line-breaking/.

Algorithms are discussed on that page in the following order:

Performance Analysis

For the longest texts and line lengths (the 750+ word Bleak House excerpt), the Divide & Conquer algorithm tends to be fastest, with Shortest Path the runner-up.

For shorter texts and line lengths (the alphabet sample or the single-line Gilbert & Sullivan sample), the Shortest Path algorithm is fastest.

The full Gilbert & Sullivan sample and the Preamble to the US Constitution seem to sit on the cusp between those two cases, with Shortest Path and Divide & Conquer each offering trade-offs.
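As a point of reference for what is being timed, here is a minimal sketch of the underlying problem. It is not code from this repository or from Sukop's page. It is a plain quadratic dynamic-programming version, and it assumes the cost of a layout is the sum of squared unused columns on every line, including the last. The name `wrap_min_raggedness` is hypothetical.

```python
def wrap_min_raggedness(text, width):
    """Split text into lines of at most `width` characters, minimizing
    the sum of squared unused columns over all lines (assumed cost)."""
    words = text.split()
    n = len(words)
    inf = float("inf")

    # best[j]: minimum cost of laying out words[:j]
    # prev[j]: index of the first word on the last line of that layout
    best = [0] + [inf] * n
    prev = [0] * (n + 1)

    for j in range(1, n + 1):
        length = -1  # length of words[i:j] joined by single spaces
        for i in range(j - 1, -1, -1):
            length += len(words[i]) + 1
            if length > width:
                break  # adding earlier words only makes the line longer
            cost = best[i] + (width - length) ** 2
            if cost < best[j]:
                best[j] = cost
                prev[j] = i

    if best[n] == inf:
        raise ValueError("a word is longer than the line width")

    # Walk the back-pointers from the end to recover the lines.
    lines = []
    j = n
    while j > 0:
        i = prev[j]
        lines.append(" ".join(words[i:j]))
        j = i
    lines.reverse()
    return lines


print(wrap_min_raggedness("aaa bb cc ddddd", 6))
# → ['aaa', 'bb cc', 'ddddd']
```

A greedy wrapper would put `aaa bb` on the first line and leave `cc` alone on the second. Under the assumed cost, the layout above is cheaper (9 + 1 + 1 = 11 versus 0 + 16 + 1 = 17).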

Testing

Performance and correctness tests are handled separately.

$ python -m pytest -m correctness
$ python -m pytest -m performance --benchmark-group-by=func

Performance tests can also be run for only a specific test input. Markers are defined in pytest.ini:

$ python -m pytest -m preamble

When comparing performance across different inputs, note that pytest-benchmark uses a different scale for each input if you either group results by input or test only a single input.
