An OpenMP based solution for computing K-most frequent words in a corpus.
- C++11 or higher
- OpenMP
- Build the project
make
-
Add path to the corpus to a file (see
exp.txt
). In case you want to try this on multiple corpus, add each of their top level paths in a different line inexp.txt
-
The executable is
bin/exec
, which you can run like so:
time bin/exec -config exp.txt -k 2 -nt 2
GNU GPL v3