Adds MS Word .doc support to the llm-dataset-converter library.
-
antiword available on
PATH
- Debian/Ubuntu:
sudo apt install antiword
- Windows: Softpedia
- Debian/Ubuntu:
pip install git+https://github.com/waikato-llm/llm-dataset-converter.git
pip install git+https://github.com/waikato-llm/ldc-doc.git
See here for an overview of all plugins.