
Improve http output handling #966

Merged

Conversation

@aleksmaus (Contributor) commented on Aug 12, 2024

What type of PR is this?

/kind bug
/kind cleanup

Any specific area of the project related to this PR?

/area config
/area outputs

What this PR does / why we need it:

Multiple improvements to HTTP output handling.

Addressing the following areas:

  1. There is no limit on the number of outgoing HTTP requests, which, combined with the lack of HTTP client reuse, creates thousands of connections to the host (HTTP client reuse is addressed in the previous PR Reuse http client #962). The exception is the cases where some auth-related HTTP headers are set under a mutex lock, which limits the requests to one at a time competing for the mutex.
    For example, the Elasticsearch output is limited to one request at a time when basic auth is used, but will create an unlimited number of requests at a time when custom headers are used to pass basic auth or API key headers: https://github.com/falcosecurity/falcosidekick/blob/master/outputs/elasticsearch.go#L60. This leads to unpredictable runtime characteristics in terms of connection use, etc.
  2. There is an existing FIXME in the HTTP client code about how the auth headers are handled; it is addressed by the refactor described below.
  3. The config.go default settings are prone to mistakes (judging by previous PRs): the same prefix has to be retyped every time, and some defaults end up being set twice, for example
    v.SetDefault("Grafana.MutualTls", false)

    and
    v.SetDefault("Grafana.MutualTls", false)

    It also requires multiple lines of default definitions whenever a new setting is added for an output.

Here is the list of changes addressing the points listed above:

  1. Add a MaxConcurrentRequests configuration per output in order to limit
    the number of requests/connections. I decided to go with 1 as the default, because it is already one request at a time in multiple cases due to the use of the mutex for the auth headers. This is item number 2 listed in the "short term" section of Improve Outputs throughput handling #963.
    I'm using a semaphore in order to limit the number of requests (a minimal sketch of this approach is shown after this list). I left one TODO item there: eventually we want to make the HTTP requests and the semaphore acquisition cancellable, to make the HTTP client implementation more robust.

  2. Refactor HTTP auth headers handling, addressing the FIXME mentioned above.

    This eliminates the situation where the same output has an unexpected limit on the number of requests in some cases and no limit at all in other cases.

  3. Extract common configuration for HTTP outputs, and refactor NewClient to avoid
    adding one more parameter for MaxConcurrentRequests.
    Now the outputs can update request properties, including headers, before the request is executed; this also allows reusing the SetBasicAuth method that already exists on Go's http.Request (see the request-option sketch after this list).

  4. Refactor the default configuration definitions in order to avoid typos in the
    repetitive output name prefix and avoid setting the same default twice (see the defaults-helper sketch after this list). This item is not strictly necessary, but I think it makes it easier to manage and add defaults for new outputs and to avoid errors. Some of the configurations, like AWS, GCP, and the non-output configuration, are still left inlined for now, but could be extracted later if needed.
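
A minimal sketch of the semaphore idea from item 1 (illustrative only, not the actual falcosidekick code; the limitedClient type and helper names are hypothetical), capping in-flight requests with golang.org/x/sync/semaphore:

    package sketch

    import (
        "bytes"
        "context"
        "fmt"
        "net/http"

        "golang.org/x/sync/semaphore"
    )

    // limitedClient caps the number of in-flight requests at maxConcurrentRequests.
    type limitedClient struct {
        httpClient *http.Client
        sem        *semaphore.Weighted
    }

    func newLimitedClient(maxConcurrentRequests int64) *limitedClient {
        return &limitedClient{
            httpClient: &http.Client{},
            sem:        semaphore.NewWeighted(maxConcurrentRequests),
        }
    }

    func (c *limitedClient) post(ctx context.Context, url string, body []byte) error {
        // Block until a slot is free, so at most maxConcurrentRequests requests run at once.
        if err := c.sem.Acquire(ctx, 1); err != nil {
            return err
        }
        defer c.sem.Release(1)

        req, err := http.NewRequestWithContext(ctx, http.MethodPost, url, bytes.NewReader(body))
        if err != nil {
            return err
        }
        req.Header.Set("Content-Type", "application/json")
        resp, err := c.httpClient.Do(req)
        if err != nil {
            return err
        }
        defer resp.Body.Close()
        if resp.StatusCode >= http.StatusBadRequest {
            return fmt.Errorf("unexpected response status: %s", resp.Status)
        }
        return nil
    }

Passing a context to Acquire is also what makes the TODO about cancellable requests straightforward later: cancelling the context unblocks both the semaphore wait and the request itself.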
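
For item 3, a sketch of the per-request customization pattern (RequestOptionFunc and postWithOpts are hypothetical names used for illustration): per-request option functions replace a shared, mutex-guarded header map, and basic auth reuses the standard http.Request.SetBasicAuth helper:

    package sketch

    import (
        "bytes"
        "context"
        "net/http"
    )

    // RequestOptionFunc lets an output mutate the request before it is executed.
    type RequestOptionFunc func(req *http.Request)

    // Per-request basic auth via the standard library helper.
    func withBasicAuth(user, password string) RequestOptionFunc {
        return func(req *http.Request) { req.SetBasicAuth(user, password) }
    }

    // Per-request custom header, e.g. an API key header for Elasticsearch.
    func withHeader(key, value string) RequestOptionFunc {
        return func(req *http.Request) { req.Header.Set(key, value) }
    }

    func postWithOpts(ctx context.Context, client *http.Client, url string, body []byte, opts ...RequestOptionFunc) (*http.Response, error) {
        req, err := http.NewRequestWithContext(ctx, http.MethodPost, url, bytes.NewReader(body))
        if err != nil {
            return nil, err
        }
        req.Header.Set("Content-Type", "application/json")
        // Each output applies its own auth headers here; there is no shared state to lock.
        for _, opt := range opts {
            opt(req)
        }
        return client.Do(req)
    }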
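
For item 4, a minimal sketch of the defaults-helper idea (setOutputDefaults and initDefaults are hypothetical names, and the setting keys are only examples): the output name prefix is typed once and each output keeps its defaults in a single map, so duplicates like the Grafana.MutualTls example above become easy to spot:

    package sketch

    import "github.com/spf13/viper"

    // setOutputDefaults registers all defaults for one output under a single prefix.
    func setOutputDefaults(v *viper.Viper, output string, defaults map[string]interface{}) {
        for key, value := range defaults {
            v.SetDefault(output+"."+key, value)
        }
    }

    func initDefaults(v *viper.Viper) {
        setOutputDefaults(v, "Grafana", map[string]interface{}{
            "MutualTls":             false,
            "CheckCert":             true,
            "MinimumPriority":       "",
            "MaxConcurrentRequests": 1,
        })
        // ... one map per output instead of one v.SetDefault line per setting.
    }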

Example of the throughput improvement for the Elasticsearch output:

As I mentioned above, due to the current use of a mutex lock on the basic auth header, the requests are executed one after another (one request at a time) when the user authenticates with a username and password, because of this:
https://github.com/falcosecurity/falcosidekick/blob/master/outputs/elasticsearch.go#L60

So the current throughput numbers with an Elasticsearch cloud instance are about 4 docs/sec.

Now, with the addition of the MaxConcurrentRequests configuration, the user can adjust the number of concurrent requests:

maxconcurrentrequests | throughput
--------------------- | -----------
1                     | 4 docs/sec
2                     | 8 docs/sec
10                    | 40 docs/sec

Now, if we combine this PR with the HTTP client reuse from #962, we get about 10 docs/sec with the default of 1 request at a time:

maxconcurrentrequests | throughput
--------------------- | -----------
1                     | 10 docs/sec
2                     | 20 docs/sec
10                    | 100 docs/sec

This throughput could be improved further with the addition of batching, which I'll submit shortly.

Which issue(s) this PR fixes:

Related to #963
First steps.

Special notes for your reviewer:

* Add MaxConcurrentRequests configuration per output in order to limit
  the number of requests/connections.
* Refactor HTTP auth headers handling; eliminate the mutex on that code
  path.
* Extract common HTTP configuration; refactor NewClient to avoid
  adding one more parameter.
* Refactor the default configuration definitions to avoid typos in the
  repetitive output name prefix and avoid duplicated defaults.

Signed-off-by: Aleksandr Maus <aleksandr.maus@elastic.co>
@aleksmaus force-pushed the feature/cleanup_requests_handling branch from 261eb9d to 0f9a5ea on August 21, 2024 at 11:49
@aleksmaus
Contributor Author

Didn't realize that merge commits are not allowed in this repo; redid it with a rebase.

Signed-off-by: Aleksandr Maus <aleksandr.maus@elastic.co>
Signed-off-by: Aleksandr Maus <aleksandr.maus@elastic.co>
@poiana added the lgtm label on Aug 22, 2024
@poiana

poiana commented Aug 22, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: aleksmaus, Issif

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@poiana

poiana commented Aug 22, 2024

LGTM label has been added.

Git tree hash: 058531bc27dec5657f2c1f1dcfc330c397394126

@poiana merged commit c6e0752 into falcosecurity:master on Aug 22, 2024
5 checks passed