Example NLP Project: HMM Noun-phrase Chunking

Noun-phrase chunking does a shallow syntactic parse of a text. The text is preprocessed into suitable units, such as words or parts-of-speech. The chunker then decides whether each unit is inside or outside of a noun phrase.

This chunker is a hidden Markov model using Viterbi to find the most likely sequence of states (inside or outside), given observations of part-of-speech tags. I built it in Scheme, while learning Scheme, without libraries, in order to understand as thoroughly as possible how it worked.

The problem and data are from the CoNLL-2000 shared task. The model and states were given. I designed and implemented several modifications aimed at improving performance.

See results.pdf for the full description and results.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
counts-to-hmm.scm		counts-to-hmm.scm
counts.scm		counts.scm
hmm-to-pred.scm		hmm-to-pred.scm
hmm.scm		hmm.scm
lib.scm		lib.scm
pred-to-scores.scm		pred-to-scores.scm
pred.scm		pred.scm
results.pdf		results.pdf
scores.txt		scores.txt
testing-enc.scm		testing-enc.scm
testing-raw.txt		testing-raw.txt
testing.scm		testing.scm
training-enc.scm		training-enc.scm
training-raw.txt		training-raw.txt
training-to-counts.scm		training-to-counts.scm
training.scm		training.scm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Example NLP Project: HMM Noun-phrase Chunking

About

Releases

Packages

Languages

rachelbasse/hmm-np-chunker

Folders and files

Latest commit

History

Repository files navigation

Example NLP Project: HMM Noun-phrase Chunking

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages