Showcasing notebooks and code examples of how to use Spark NLP in Python and Scala.
To run the Python notebooks locally, make sure Java 8 is available and create a conda environment with Spark NLP and PySpark:

```bash
$ java -version
# should be Java 8 (Oracle or OpenJDK)
$ conda create -n sparknlp python=3.6 -y
$ conda activate sparknlp
$ pip install spark-nlp==2.5.0 pyspark==2.4.4
```
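After the installation, a minimal Python sketch like the following shows how to start Spark NLP and annotate a sentence with a pretrained pipeline (here the `explain_document_dl` pipeline, which is downloaded on first use, so an internet connection is assumed); the notebooks in this repository walk through many more examples.

```python
import sparknlp
from sparknlp.pretrained import PretrainedPipeline

# Start a Spark session with Spark NLP on the classpath
spark = sparknlp.start()

# Download a pretrained pipeline (cached under ~/cache_pretrained on first use)
pipeline = PretrainedPipeline("explain_document_dl", lang="en")

# Annotate a sentence and inspect the resulting annotations
result = pipeline.annotate("Spark NLP is an open-source text processing library for Apache Spark.")
print(result.keys())        # available annotation types, e.g. token, lemma, pos, entities
print(result["entities"])   # named entities recognized by the pipeline
```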
If you want to try out Spark NLP and run the Jupyter examples without installing anything, you can simply use our Docker image:

1- Pull the Docker image for spark-nlp-workshop:

```bash
docker pull johnsnowlabs/spark-nlp-workshop
```

2- Run the image locally with port binding:

```bash
docker run -it --rm -p 8888:8888 -p 4040:4040 johnsnowlabs/spark-nlp-workshop
```

3- Open the Jupyter notebooks in your browser at http://localhost:8888/ by using the token printed on the console.
- The password to the Jupyter notebook is `sparknlp`.
- The size of the image grows every time you download a pretrained model or a pretrained pipeline. You can clean up `~/cache_pretrained` if you no longer need them.
- This Docker image is only meant for testing/learning purposes and should not be used in production environments. Please install Spark NLP natively instead: https://github.com/JohnSnowLabs/spark-nlp
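Once a notebook is open (inside the container or in a native install), a short sanity-check cell like this sketch confirms that Spark NLP and Apache Spark are available; the exact versions reported may differ from the ones pinned above.

```python
import sparknlp

# Create (or attach to) a Spark session with Spark NLP loaded
spark = sparknlp.start()

# Report the library and Spark versions available in this environment
print("Spark NLP version:", sparknlp.version())
print("Apache Spark version:", spark.version)
```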
Take a look at the official Spark NLP page, http://nlp.johnsnowlabs.com/, for user documentation and examples.
If you find any example that is no longer working, please create an issue.
Apache License 2.0