From 300c9b673e9892dc9a58124abaaf32dbdbcd6f62 Mon Sep 17 00:00:00 2001
From: Carmine DiMascio
Date: Thu, 29 Nov 2018 09:36:10 -0500
Subject: [PATCH 1/6] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 097f19c..de5334d 100644
--- a/README.md
+++ b/README.md
@@ -9,7 +9,7 @@ Given an HTML document, **essence** automatically extracts the main text content
 [Try out the demo](https://essence.mybluemix.net/index.html) - _a simple webapp to demonstrate essence_

-
+

From a6dfdd43f82236426f3b22770734dfa89d911872 Mon Sep 17 00:00:00 2001
From: Carmine DiMascio
Date: Thu, 29 Nov 2018 09:37:13 -0500
Subject: [PATCH 2/6] Update README.md

---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index de5334d..6082d21 100644
--- a/README.md
+++ b/README.md
@@ -12,6 +12,7 @@ Given an HTML document, **essence** automatically extracts the main text content

+The essence web project lives [here](https://github.com/cdimascio/essence-web)
 _This library is heavily influenced by [node-unfluff](https://github.com/ageitgey/node-unfluff) and its [lineage](#credits)_

From 4b5c4f8c8336e53c8c877eca976caef9b257a234 Mon Sep 17 00:00:00 2001
From: Carmine DiMascio
Date: Thu, 29 Nov 2018 09:37:32 -0500
Subject: [PATCH 3/6] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 6082d21..448fd2e 100644
--- a/README.md
+++ b/README.md
@@ -12,8 +12,6 @@ Given an HTML document, **essence** automatically extracts the main text content

-The essence web project lives [here](https://github.com/cdimascio/essence-web)
-
 _This library is heavily influenced by [node-unfluff](https://github.com/ageitgey/node-unfluff) and its [lineage](#credits)_

 ## Usage
@@ -62,6 +60,8 @@ compile 'io.github.cdimascio:essence:0.10.11'
 ![](https://raw.githubusercontent.com/cdimascio/essence/master/assets/example.png)

+The essence web project lives [here](https://github.com/cdimascio/essence-web)
+
 ## Extracted data elements

 **essence** attempts to extract the following content:

From e6cdbf218c1946549d876dee09324255e1efa333 Mon Sep 17 00:00:00 2001
From: Carmine DiMascio
Date: Thu, 29 Nov 2018 09:40:23 -0500
Subject: [PATCH 4/6] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 448fd2e..e0f61bd 100644
--- a/README.md
+++ b/README.md
@@ -43,7 +43,7 @@ See [Extracted data elements](#extracted-data-elements) for additional extracted
 io.github.cdimascio
 essence
- 0.10.11
+ 0.12.6
 pom
 ```
@@ -51,7 +51,7 @@ See [Extracted data elements](#extracted-data-elements) for additional extracted
 **Gradle**

 ```groovy
-compile 'io.github.cdimascio:essence:0.10.11'
+compile 'io.github.cdimascio:essence:0.12.6'
 ```

 ## Try the Essence web demo

From ce9de76464611c8fc38149066460ed8467a4a298 Mon Sep 17 00:00:00 2001
From: Carmine DiMascio
Date: Thu, 29 Nov 2018 10:14:43 -0500
Subject: [PATCH 5/6] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index e0f61bd..18bfc46 100644
--- a/README.md
+++ b/README.md
@@ -6,7 +6,7 @@ An automatic web page content extractor for _Kotlin_ and _Java_.
 Given an HTML document, **essence** automatically extracts the main text content (and much more).

-[Try out the demo](https://essence.mybluemix.net/index.html) - _a simple webapp to demonstrate essence_
+[Try out the demo](https://essence.mybluemix.net) - _a simple webapp to demonstrate essence_

@@ -56,7 +56,7 @@ compile 'io.github.cdimascio:essence:0.12.6'
 ## Try the Essence web demo

-[Essence web](https://essence.mybluemix.net/index.html) is a simple web page that fetches content at a given url and passes the HTML to this essence library.
+[Essence web](https://essence.mybluemix.net) is a simple web page that fetches content at a given url and passes the HTML to this essence library.

 ![](https://raw.githubusercontent.com/cdimascio/essence/master/assets/example.png)

From fe1e7d212d02e6ede9adb50d3c54ec95326c6021 Mon Sep 17 00:00:00 2001
From: Carmine DiMascio
Date: Sat, 1 Dec 2018 15:26:04 -0500
Subject: [PATCH 6/6] Update README.md

---
 README.md | 1 -
 1 file changed, 1 deletion(-)

diff --git a/README.md b/README.md
index 18bfc46..acea0b5 100644
--- a/README.md
+++ b/README.md
@@ -30,7 +30,6 @@ System.out.println(data.getText());
 ```Kotlin
 val data = Essence.extract(html)
 println(data.text)
-// ...
 ```

 See [Extracted data elements](#extracted-data-elements) for additional extracted metadata.