Update kafka consumer #14
base: master
Conversation
The current client is several major versions old and uses deprecated functionality (zookeeper offsets) that doesn't play nice with other clients. This PR changes the project to use the new kafka consumer API. Note that existing configurations are incompatible, since the new kafka client does not use the same config names as the old one.
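For illustration, here is a rough sketch of what the new consumer's configuration looks like. The property names are the standard Kafka client ones; the concrete values (broker address, group id, topic, deserializers) are placeholders, not the plugin's actual config keys:

```java
import java.util.Properties;
import org.apache.kafka.clients.consumer.KafkaConsumer;

class NewConsumerConfig {
    // Property names are the standard Kafka client ones; the concrete values
    // (broker address, group id, deserializers) are placeholders.
    static KafkaConsumer<String, String> create() {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // replaces zookeeper.connect
        props.put("group.id", "tsdb-consumer");           // hypothetical group id
        props.put("enable.auto.commit", "false");         // was auto.commit.enable
        props.put("auto.offset.reset", "latest");         // was smallest/largest
        props.put("key.deserializer",
            "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
            "org.apache.kafka.common.serialization.StringDeserializer");
        return new KafkaConsumer<>(props);
    }
}
```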
This looks great. Is it worth updating the config documentation in the readme, too?
-    public static final String AUTO_COMMIT_ENABLE_DEFAULT = "true";
-    public static final String AUTO_OFFSET_RESET_DEFAULT = "smallest";
+    public static final String AUTO_COMMIT_ENABLE_DEFAULT = "false";
+    public static final String AUTO_OFFSET_RESET_DEFAULT = "latest";
To keep the changeset as small as possible and avoid surprises for users, I'd lean toward keeping the old defaults in place. Is there a particular reason these two defaults have changed?
Auto-commit is off by default because this code does manual committing after we insert a batch of records.
For offset_reset, kafka renamed the values: "earliest" and "latest" are the new names for "smallest" and "largest".
}
nanoCtr = System.nanoTime();
consumer.commitSync(); // manual offset commit after the batch is written
Offset committing is here.
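For context, a minimal sketch of the poll/write/commit loop this implements, assuming a consumer configured as in the description above; the topic name and writeDatapoint are hypothetical stand-ins for the plugin's actual write path:

```java
import java.util.Collections;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

class WriteThenCommitLoop {
    // Poll a batch, write it, then commit synchronously so nothing is marked
    // consumed until the write has succeeded.
    static void run(KafkaConsumer<String, String> consumer) {
        consumer.subscribe(Collections.singletonList("tsdb-metrics")); // hypothetical topic
        while (true) {
            ConsumerRecords<String, String> records = consumer.poll(1000); // timeout in ms
            for (ConsumerRecord<String, String> record : records) {
                writeDatapoint(record.value()); // hypothetical TSDB insert
            }
            if (!records.isEmpty()) {
                consumer.commitSync(); // blocks until the offsets are durable
            }
        }
    }

    static void writeDatapoint(String value) {
        // stand-in for the plugin's actual write path
    }
}
```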
Fwiw, we cherry-picked these changes onto the version we're running and found that committing manually significantly reduces throughput. The synchronous call was the bottleneck, and since we prioritise latency, we switched back to auto-commits. It's not a problem if a couple of points are written multiple times when a commit fails, as long as these failures are rare.
In our setup, the node running this version of the plugin managed to write up to 9,000 datapoints per second; after switching to auto-commits every 5000 ms, we reached more than 40,000.
Whether it's necessary to commit after every poll may be worth reconsidering for this PR. As a middle way, we have seen good results with making the commit call async.
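For reference, the auto-commit setup described above corresponds to `enable.auto.commit=true` with `auto.commit.interval.ms=5000`. The async middle way would look roughly like this, a sketch assuming the consumer from the loop above:

```java
import java.util.Map;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.consumer.OffsetAndMetadata;
import org.apache.kafka.common.TopicPartition;

class AsyncCommitSketch {
    // Commit asynchronously after each batch instead of blocking on commitSync().
    // A rare failed commit just means a few points may be written twice after a
    // restart; the next successful commit supersedes it.
    static void commitBatch(KafkaConsumer<String, String> consumer) {
        consumer.commitAsync((Map<TopicPartition, OffsetAndMetadata> offsets, Exception e) -> {
            if (e != null) {
                System.err.println("Async offset commit failed: " + e.getMessage());
            }
        });
    }
}
```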
Yeah I was concerned about data durability, hence doing the commits manually and waiting for them to finish. However, if submitting the same metric multiple times just overwrites the same data, it shouldn't be a problem.