seq-unscrambler

Recently, I came across a post that was an image of a piece of text in which each word had been partially scrambled. The text was as follows, "fi yuo cna raed tihs, yuo hvae a sgtrane mnid too. I cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdanieg. The phaonmneal pweor of the hmuan mind! ...". Interestingly, I could unscramble the words in my head without much effort. But this got me asking, if computers can be made do the same? This fun project is my attempt at answering the question.

I used a masked language model to reveal the original text after the letters in it's words are scrambled.

Installation

poetry install

To run

PYTHONPATH=. python src/main.py -t "if you can read this you have a strange mind too" -k 1

args:
    -t text
    -k top k candidates

Output

Input text: if you can read this you have a strange mind too
Scrambled text: fi uoy acn drae hits ouy ehva a eagrnst idmn oot

Top 1 inferred input texts with log probability scores:

sentence: if you can read this you have a strange mind too
score: -40.09420431405306

Streamlit

PYTHONPATH=. streamlit run streamlit_demo.py

Tests

PYTHONPATH=. poetry run pytest --cov src

References

Salazar J. et al., 2020, Masked Language Model Scoring. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 2699–2712July 5 - 10, 2020

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
resources		resources
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
streamlit_demo.py		streamlit_demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

seq-unscrambler

Installation

To run

Output

Streamlit

Tests

References

About

Releases

Packages

Languages

License

skomban/seq-unscrambler

Folders and files

Latest commit

History

Repository files navigation

seq-unscrambler

Installation

To run

Output

Streamlit

Tests

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages