-
Notifications
You must be signed in to change notification settings - Fork 719
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #258 from JohnSnowLabs/161-release-candidate
1.6.1 release candidate
- Loading branch information
Showing
16 changed files
with
144 additions
and
244 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -14,18 +14,18 @@ Questions? Feedback? Request access sending an email to [email protected] | |
|
||
This library has been uploaded to the spark-packages repository https://spark-packages.org/package/JohnSnowLabs/spark-nlp . | ||
|
||
To use the most recent version just add the `--packages JohnSnowLabs:spark-nlp:1.6.0` to you spark command | ||
To use the most recent version just add the `--packages JohnSnowLabs:spark-nlp:1.6.1` to you spark command | ||
|
||
```sh | ||
spark-shell --packages JohnSnowLabs:spark-nlp:1.6.0 | ||
spark-shell --packages JohnSnowLabs:spark-nlp:1.6.1 | ||
``` | ||
|
||
```sh | ||
pyspark --packages JohnSnowLabs:spark-nlp:1.6.0 | ||
pyspark --packages JohnSnowLabs:spark-nlp:1.6.1 | ||
``` | ||
|
||
```sh | ||
spark-submit --packages JohnSnowLabs:spark-nlp:1.6.0 | ||
spark-submit --packages JohnSnowLabs:spark-nlp:1.6.1 | ||
``` | ||
|
||
## Jupyter Notebook | ||
|
@@ -35,23 +35,23 @@ export SPARK_HOME=/path/to/your/spark/folder | |
export PYSPARK_DRIVER_PYTHON=jupyter | ||
export PYSPARK_DRIVER_PYTHON_OPTS=notebook | ||
pyspark --packages JohnSnowLabs:spark-nlp:1.6.0 | ||
pyspark --packages JohnSnowLabs:spark-nlp:1.6.1 | ||
``` | ||
|
||
## Apache Zeppelin | ||
This way will work for both Scala and Python | ||
``` | ||
export SPARK_SUBMIT_OPTIONS="--packages JohnSnowLabs:spark-nlp:1.6.0" | ||
export SPARK_SUBMIT_OPTIONS="--packages JohnSnowLabs:spark-nlp:1.6.1" | ||
``` | ||
Alternatively, add the following Maven Coordinates to the interpreter's library list | ||
``` | ||
com.johnsnowlabs.nlp:spark-nlp_2.11:1.6.0 | ||
com.johnsnowlabs.nlp:spark-nlp_2.11:1.6.1 | ||
``` | ||
|
||
## Python without explicit Spark installation | ||
If you installed pyspark through pip, you can now install sparknlp through pip | ||
``` | ||
pip install --index-url https://test.pypi.org/simple/ spark-nlp==1.6.0 | ||
pip install --index-url https://test.pypi.org/simple/ spark-nlp==1.6.1 | ||
``` | ||
Then you'll have to create a SparkSession manually, for example: | ||
``` | ||
|
@@ -67,11 +67,11 @@ spark = SparkSession.builder \ | |
|
||
## Pre-compiled Spark-NLP and Spark-NLP-OCR | ||
You may download fat-jar from here: | ||
[Spark-NLP 1.6.0 FAT-JAR](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/spark-nlp-assembly-1.6.0.jar) | ||
[Spark-NLP 1.6.1 FAT-JAR](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/spark-nlp-assembly-1.6.1.jar) | ||
or non-fat from here | ||
[Spark-NLP 1.6.0 PKG JAR](http://repo1.maven.org/maven2/com/johnsnowlabs/nlp/spark-nlp_2.11/1.6.0/spark-nlp_2.11-1.6.0.jar) | ||
[Spark-NLP 1.6.1 PKG JAR](http://repo1.maven.org/maven2/com/johnsnowlabs/nlp/spark-nlp_2.11/1.6.1/spark-nlp_2.11-1.6.1.jar) | ||
Spark-NLP-OCR Module (Requires native Tesseract 4.x+ for image based OCR. Does not require Spark-NLP to work but highly suggested) | ||
[Spark-NLP-OCR 1.6.0 FAT-JAR](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/spark-nlp-ocr-assembly-1.6.0.jar) | ||
[Spark-NLP-OCR 1.6.1 FAT-JAR](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/spark-nlp-ocr-assembly-1.6.1.jar) | ||
|
||
## Maven central | ||
|
||
|
@@ -83,19 +83,19 @@ Our package is deployed to maven central. In order to add this package as a depe | |
<dependency> | ||
<groupId>com.johnsnowlabs.nlp</groupId> | ||
<artifactId>spark-nlp_2.11</artifactId> | ||
<version>1.6.0</version> | ||
<version>1.6.1</version> | ||
</dependency> | ||
``` | ||
|
||
#### SBT | ||
```sbtshell | ||
libraryDependencies += "com.johnsnowlabs.nlp" % "spark-nlp_2.11" % "1.6.0" | ||
libraryDependencies += "com.johnsnowlabs.nlp" % "spark-nlp_2.11" % "1.6.1" | ||
``` | ||
|
||
If you are using `scala 2.11` | ||
|
||
```sbtshell | ||
libraryDependencies += "com.johnsnowlabs.nlp" %% "spark-nlp" % "1.6.0" | ||
libraryDependencies += "com.johnsnowlabs.nlp" %% "spark-nlp" % "1.6.1" | ||
``` | ||
|
||
## Using the jar manually | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -78,8 +78,8 @@ <h2 class="title">High Performance NLP with Apache Spark </h2> | |
</p> | ||
<a class="btn btn-info btn-cta" style="float: center;margin-top: 10px;" href="mailto:[email protected]?subject=SparkNLP%20Slack%20access" target="_blank"> Questions? Join our Slack</a> | ||
<b/><p/><p/> | ||
<p><span class="label label-warning">2018 Jul 7th - Update!</span> 1.6.0 Released! OCR PDF to Spark-NLP capabilities, new Chunker annotator, fixed AWS compatibility, better performance and much more. | ||
Learn changes <a href="https://github.com/JohnSnowLabs/spark-nlp/blob/1.6.0/CHANGELOG">HERE</a> and check out for updated documentation below</p> | ||
<p><span class="label label-warning">2018 Aug 9th - Update!</span> 1.6.1 Released! Fixed S3-based clusters support, new CHUNK type annotation and more! | ||
Learn changes <a href="https://github.com/JohnSnowLabs/spark-nlp/blob/1.6.1/CHANGELOG">HERE</a> and check out for updated documentation below</p> | ||
</div> | ||
<div id="cards-wrapper" class="cards-wrapper row"> | ||
<div class="item item-green col-md-4 col-sm-6 col-xs-6"> | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.