Christopher Rost edited this page Mar 29, 2021 · 19 revisions

Version 0.6.0 (Current Stable)

  • Changed Flink version to 1.9.3
  • Temporal Pattern Matching using Temporal-GDL added
  • Temporal verify operator added
  • More Gelly-based algorithms are available for TemporalGraph
  • Min and Max aggregations can now be applied to date and datetime types
  • Duration aggregations now have an optional unit parameter to define a time unit (e.g., Minutes)
  • Indexed CSV Source/Sink added (also for temporal graphs)
  • Some bugfixes in MinimalCSVImporter

Version 0.5.2

  • Improved testing and stability
  • Improved GDL error handling
  • Added basic vertex deduplication
  • Implemented distributed graph layouting
  • Moved CAPF into new repository gradoop-capf
  • Improved temporal graph handling

Version 0.5.1

  • gradoop-temporal module added
    • Temporal Property Graph Model (TPGM) implementation
    • diff and snapshot operators for TPGM
    • some commonly used temporal predicates
    • examples for temporal operator usages
    • equals and print support
    • support for generified operators
  • Fixed bug in Cypher constructor patterns
  • Reorganized HBase dependency
    • Removed HBase dependencies from gradoop-flink
  • Fixed incorrectly escaped chars in DOT output
  • Added overwrite parameter for CSV output
  • Added new grouping operator using key functions (KeyedGrouping)
    • Added key functions for grouping on temporal information

Version 0.5.0

  • API Changes - Interfaces EPGMEdge, EPGMVertex, EPGMGraphHead renamed to Edge, Vertex and GraphHead
  • API Changes - Instance classes Edge, Vertex and GraphHead renamed to EPGMEdge, EPGMVertex, EPGMGraphHead
  • Logical Graph Operators generalized
  • Graph Collection operators generalized
  • VerifyGraph(s)Containment operator added
  • gradoop-examples module refactored
  • ElementFactories are now part of the BaseFactory (not part of config)
  • Enhanced comparison of property values
  • Deprecated JSON source and sink removed
  • Support for Maven 3.6

Version 0.4.5

  • Upgrade Flink dependency from 1.7.0 to 1.7.2
  • Integration of Cypher on Apache Flink (CAPF) to support Cypher queries
  • Improvement of CSVDataSource
  • Added Graph ETL operators to module gradoop-data-integration
  • Added Degree Centrality operator
  • Added Random Walk operator
  • Added Verify operator to remove dangling edges
  • Refactored PropertyValue class according to strategy pattern
  • Added overwrite support for HBaseDataSink

Version 0.4.4

  • Upgrade Flink dependency from 1.6.0 to 1.7.0
  • Generalized Subgraph, Transformation and Property Transformation operators
  • Generalized MetaData construct
  • Refactored GradoopId to no longer use MongoDB
  • Added RollUp example
  • Added support to construct GraphCollection from multiple LogicalGraphs
  • Added simple importers for CSV and JSON files

Version 0.4.3

  • Upgrade Flink dependency from 1.5.0 to 1.6.0
  • gradoop-data-integration module added
  • Fixed bugs in IndexedCSVDataSink
  • Sampling Benchmarks added
  • Datagen-package removed from Gradoop
  • Clustering coefficient computation added
  • Triangle counting added
  • Rollup-Operator added
  • Unified aggregate functions, now applicable to both the aggregate and grouping operators

Location of LogicalGraph and GraphCollection changed! Due to a refactoring, the classes LogicalGraph and GraphCollection have moved from package org.gradoop.flink.model.api.epgm to org.gradoop.flink.model.impl.epgm.

Version 0.4.2

  • PageRank sampling added
  • Mockito added for testing support
  • Multiple aggregation functions in a single aggregation operator
  • GraphCollection support for CSV Source/Sink
  • LGTM code quality badge
  • PropertyTransformation operator added
  • Weakly Connected Components runner added
  • CSV delimiter escaping

CSV format changed! Because of the newly added graph collection support in CSVDataSource/Sink and IndexedCSVDataSource/Sink, graphs written by these sinks in a previous release must be converted to the new representation. A GradoopFormatConverter is available in the gradoop-examples package for this conversion. Use parameter lgcsv (for graphs created by the CSVDataSink) or lgindexed (for graphs created by the IndexedCSVDataSink) as the input format and csv or indexed as the output format. The outdated sources and sinks have been renamed to LogicalGraphCSVDataSource and LogicalGraphCSVDataSink and will be removed in version 0.5.0.

Version 0.4.1

  • Cypher, Subgraph and SNA Benchmarks added
  • Predicate pushdown for HBase available
  • Split property type and value in HBase table layout
  • NULL and Set support in PropertyValues
  • Code coverage report
  • Support graphs without edges
  • .print() added for LogicalGraph and GraphCollection
  • Aggregation example added
  • Sampling algorithms extended, including graph density computation

Version 0.4.0

  • Upgrade Flink from 1.3.1 to 1.5.0
  • JSONDataSource and JSONDataSink are now marked as deprecated
  • CSVDataSource and CSVDataSink now support all available property value types
  • IndexedCSVDataSink added
  • Added Gradoop-Store module with Apache Accumulo and HBase support
  • Added Store API with manual predicate pushdown (for Accumulo DataSource)
  • First Sampling Algorithms

Version 0.3.3

--

Version 0.3.2

--

Version 0.3.1

Bug fixes and support for more Gelly algorithms.

Version 0.2.0

Added Pattern Matching and Frequent Subgraph Mining algorithms.

Version 0.1

Major refactoring of internal EPGM representation (e.g., ID and property handling), Equality Operators, and GDL-based unit testing.

Version 0.0.3

Apache Flink replaces MapReduce and Giraph as operator implementation layer and distributed execution engine.

Version 0.0.2

Added support for HBase as distributed graph storage.

Version 0.0.1

First prototype using Hadoop MapReduce and Apache Giraph for operator processing.