Word Break Problem

The words.txt file contains a sorted list of approximately 173,000 words. The words are listed one word per line, do not contain spaces, and are all lowercase.

What we have to find

the longest concatenated word (that is, the longest word that is comprised entirely of shorter words in the file)
the 2nd longest concatenated word
the total count of all the concatenated words in the file

HashSet as perfect data structure or Why Am I Not Using Tries

In this project I used HashSet as a data structure to contain a list of words because according to Big O complexity table

It has O(1) lookup time while trees have O(m) where m depends on the length of the string we are looking up;
In some cases tries require much more space than hashset because memory can be allocated for each string character while in hashset it is a single chunk of memory for the whole string entry;
Imagine that the alphabet is not 26 english characters but 136,690 Unicode symbols. That means that each node of tree will represent 136,690 symbols. Here we are moving to the conclusion that complexity grows from O(m) to O(alphabet_size x key_length x N) where N is number of keys in Trie. Sounds not that nice, uh? Tries just can't work fast on big input alphabets.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gradle		.gradle
.idea		.idea
build		build
gradle/wrapper		gradle/wrapper
out/production/classes/wbpsolution		out/production/classes/wbpsolution
src		src
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
build.gradle		build.gradle
circle.yml		circle.yml
codecov.yml		codecov.yml
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle		settings.gradle
test.txt		test.txt
words.txt		words.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Word Break Problem

What we have to find

HashSet as perfect data structure or Why Am I Not Using Tries

Gathered metrics & results

About

Releases

Packages

Languages

License

olesiakissa/word-break-problem

Folders and files

Latest commit

History

Repository files navigation

Word Break Problem

What we have to find

HashSet as perfect data structure or Why Am I Not Using Tries

Gathered metrics & results

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages