Releases: myui/hivemall
Hivemall v0.3.2-3 (maintenance release)
Hivemall v0.3.2-2 (maintenance release)
Hivemall v0.3.2-1 (maintenance release)
This is just a maintenance release of Hivemall v0.3.2.
Only minor changes have been applied since the last release as follows:
The release version of Hivemall v0.3.2
This is the stable release version of Hivemall v0.3.2.
A major enhancement in this release is support for anomaly detection using LOF and polynomial features that is useful for non-linear regression/classification.
Changes since v0.3.1 are summarized as follows:
- Major Enhancement
- Supported Local Outlier Factor (LOF) [0d32487]
- Added
polynomial_feature
andpowered_feature
UDFs [95c60f0][cf1aacc]
- Minor Enhancement
- Modified to accept double values to avoid annoying casts in
rescale
/zscore
[1485d7e] - Added
to_string_array
UDF that convert array() to array() [8b910a6] - Added
euclid_distance
function [21d4d9a] - Modified to accept various numeric types for userid and itemid in Matrix Factorization [078cd85]
- Added
cosine_distance
UDF [1f8f127] - Added
angular_similarity
andangular_distance
UDFs [00ff531] - Added
vectorize_features
UDF [3cedd66][f165d49]
- Modified to accept double values to avoid annoying casts in
- Minor Changes
- Added alias
cosine_similarity
tocosine_sim
[3f615e5] - Modified to return null instead of throwing UDFArgumentException for a null argument in
mhash
[181b369] - Moved the package of similarity UDFs from
hivemall.knn.distance
tohivemall.knn.similarity
and changed the function signature ofconsine_similarity
function. [2a0f1e7] - Changed argument signatures from INT to DOUBLE in TF-IDF macro [61eeca2]
- Added alias
- Bugfix
- Fixed a bug in
java_min
[a48c367]
- Fixed a bug in
The release version of Hivemall v0.3.1
This is the stable release version of Hivemall v0.3.1. We have changed the license of Hivemall from LGPL v2 to Apache License v2 from this release.
From this release, stable releases of Hivemall are also released on Maven Central.
Changes since v0.3 are summarized as follows:
- Major Enhancement
- Supported Hivemall on Pig (on Pig v0.15 or later) [8aac879]
Thanks Daniel for the contribution! - Released Hivemall on Maven Central
- Supported Hivemall on Pig (on Pig v0.15 or later) [8aac879]
<dependency>
<groupId>io.github.myui</groupId>
<artifactId>hivemall</artifactId>
<version>0.3.1</version>
</dependency>
-- assign indices to dense features
select add_feature_index(array(3,4.0,5)) from dual;
> ["1:3.0","2:4.0","3:5.0"]
- Major Changes
- Changed the license from LGPL v2 to Apache License v2 [cc8be7e]
- Minor Changes
- Modifed extract_weight to accept a categorical feature representation [f5a3c28]
- Modified to accept various types in addition to INT for label (classification) [c352670]
- Modified to accept both float/double in label/target (regression) [35bedf1]
- Feature parsing scheme has been refactored to be more efficient [e37c47f]
- Changed the implementation of array_avg from ArrayAvgUDAF to ArrayAvgGenericUDAF [3a21853]
-- accept categorical variables (e.g., "weight") in Hivemall v0.3.1 in addition to quantitive variables (e.g., "weight:55.0") in extract_weight()
select extract_weight("weight"), extract_weight("weight:55.0") from dual;
> 1.0 | 55.0
select
-- logress(addBias(features), CAST(label as FLOAT)) as (feature, weight) -- Hivemall v0.3 (need to cast labels)
logress(addBias(features), label) as (feature, weight) -- Hivemall v0.3.1 or later (no need to cast labels)
- Bugfixes
- Fixed a bug in jaccard() assuming the result of dividing integer expressions as float [4115913]
The release version of Hivemall v0.3.0
This is the first stable release version of Hivemall v0.3.0.
A major enhancement within this release is support for matrix factorization.
Note that hivemall v0.3 or later supports Hive v0.11 or later.
Changes since v0.3_beta3 are summarized as follows:
- Major Enhancement
- Supported Matrix Factorization [433ddc8]
- Minor Enhancement
- Major Changes
- Bugfix
Beta version #3 of v0.3
A major change in this maintenance release is the support for TF-IDF computation.
- Major changes
- Added a support for TF-IDF computation [876b335]
- Changed the default eta0 (initial learning rate) to 0.2 [aa92317]
- Added a DDL to create hivemall UDFs as permanent functions [5cc77c9]
- Minor changes
- Added a server throughput monitoring functionality using JMX [744bc8d]
- Fixed MixServerTest to pass a TravisCI's build
Beta version #2 of v0.3
A major change in this maintenance release is the support for AdaGrad/AdaGradRDA/AdaDelta.
For the usage, see the following examples:
The initial beta release of v0.3 (beta1)
We are pleased to announce the initial beta release (beta1) of Hivemall v0.3.
The major enhancement in this release is the support for model mixing.
See this page to know how to use the new feature.
You can find a brief explanation of the internal design of MIX protocol in this slide.
Note this is a pre-release and for alpha users. Any feedback is welcome!
The stable release of Hivemall v0.2
This is the stable release of Hivemall v0.2, recommended for production uses.
From this release, Hive v0.11 or later is required.
Changes applied since v0.2-alpha4 is small is as follows: