
metricbeat/module/mongodb: Improve logic to calculate oplog info and window #42224

Open
wants to merge 19 commits into main
Conversation

@shmsr (Member) commented Jan 6, 2025

Proposed commit message

This change further improves the implementation done here some time back. The previous changes helped a lot, but several users have still been reporting high CPU usage, so this PR aims to improve things further. We now follow the approach recommended by MongoDB itself to calculate the oplog window (i.e., lastTs - firstTs of the oplog). The change again leverages the natural order of the oplog, along with a Limit (to restrict the result to a single document) and a Projection (to return only the ts field). The only expensive operation left is sorting the oplog in reverse natural order, i.e., $natural: -1. We have also upgraded the client library to further reduce any client-side issues related to queries, etc.

Please see #42224 (comment) for a detailed comparison. Also, please read the inline comments in the code itself to understand the implemented logic; I've documented it thoroughly for future reference.
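For illustration only, here is a minimal sketch of the approach described above. It assumes the oplog lives at local.oplog.rs and uses the mongo-go-driver 1.x API; the function name oplogWindowSeconds is hypothetical and this is not the exact code in this PR.

import (
	"context"
	"fmt"

	"go.mongodb.org/mongo-driver/bson"
	"go.mongodb.org/mongo-driver/bson/primitive"
	"go.mongodb.org/mongo-driver/mongo"
	"go.mongodb.org/mongo-driver/mongo/options"
)

// oplogWindowSeconds approximates the oplog window as ts(newest) - ts(oldest),
// relying on natural order, a projection on the "ts" field only, and FindOne
// (i.e., limit 1) per query.
func oplogWindowSeconds(ctx context.Context, client *mongo.Client) (uint32, error) {
	col := client.Database("local").Collection("oplog.rs")

	var first, last struct {
		Ts primitive.Timestamp `bson:"ts"`
	}

	// Oldest entry: default (forward) natural order, only the "ts" field.
	oldest := options.FindOne().SetProjection(bson.D{{Key: "ts", Value: 1}})
	if err := col.FindOne(ctx, bson.D{}, oldest).Decode(&first); err != nil {
		return 0, fmt.Errorf("reading oldest oplog entry: %w", err)
	}

	// Newest entry: reverse natural order ($natural: -1), only the "ts" field.
	newest := options.FindOne().
		SetProjection(bson.D{{Key: "ts", Value: 1}}).
		SetSort(bson.D{{Key: "$natural", Value: -1}})
	if err := col.FindOne(ctx, bson.D{}, newest).Decode(&last); err != nil {
		return 0, fmt.Errorf("reading newest oplog entry: %w", err)
	}

	return last.Ts.T - first.Ts.T, nil
}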

Checklist

  • My code follows the style guidelines of this project
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have made corresponding changes to the default configuration files
  • I have added tests that prove my fix is effective or that my feature works
  • I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

Disruptive User Impact

Author's Checklist

  • Match the implementation with other oplog window calculators
  • Test the source
  • Benchmark

How to test this PR locally

  • Tested with MongoDB replicaSet

Related issues

@botelastic botelastic bot added the needs_team Indicates that the issue/PR needs a Team:* label label Jan 6, 2025
@mergify mergify bot assigned shmsr Jan 6, 2025
mergify bot (Contributor) commented Jan 6, 2025

This pull request does not have a backport label.
If this is a bug or security fix, could you label this PR @shmsr? 🙏.
For such, you'll need to label your PR with:

  • The upcoming major version of the Elastic Stack
  • The upcoming minor version of the Elastic Stack (if you're not pushing a breaking change)

To fixup this pull request, you need to add the backport labels for the needed
branches, such as:

  • backport-8./d is the label to automatically backport to the 8./d branch. /d is the digit

mergify bot (Contributor) commented Jan 6, 2025

backport-8.x has been added to help with the transition to the new branch 8.x.
If you don't need it please use backport-skip label and remove the backport-8.x label.

@mergify mergify bot added the backport-8.x Automated backport to the 8.x branch with mergify label Jan 6, 2025
@shmsr shmsr changed the title [DRAFT/ DO NOT REVIEW]: metricbeat/module/mongodb: Improve logic metricbeat/module/mongodb: Improve logic to calculate oplog info Jan 8, 2025
@shmsr shmsr changed the title metricbeat/module/mongodb: Improve logic to calculate oplog info metricbeat/module/mongodb: Improve logic to calculate oplog info and window Jan 8, 2025
@shmsr shmsr added the Team:Obs-InfraObs Label for the Observability Infrastructure Monitoring team label Jan 8, 2025
@botelastic botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label Jan 8, 2025
@shmsr shmsr marked this pull request as ready for review January 8, 2025 06:48
@shmsr shmsr requested review from a team as code owners January 8, 2025 06:48
@shmsr (Member, Author) commented Jan 9, 2025

To test the CPU and memory consumption of the changes made to calculate the oplog info/window:

Let's consider the 3 cases that we want to compare:

  • Case 1: the original implementation
  • Case 2: the previous improvement, which introduced an aggregation pipeline
  • Case 3: the changes in this PR

We will use the following script to track docker stats, i.e., CPU and memory usage over time, for the MongoDB replica set setup where 3 nodes are running:

#!/bin/bash

COMPOSE_PROJECT_DIR="$1"
OUTPUT_FILE="$2"
INTERVAL=1

# Set default values
if [ -z "$COMPOSE_PROJECT_DIR" ]; then
    COMPOSE_PROJECT_DIR="."
fi

if [ -z "$OUTPUT_FILE" ]; then
    OUTPUT_FILE="cpu_memory_usage.log"
fi

cd "$COMPOSE_PROJECT_DIR"

# Clear previous logs
> "$OUTPUT_FILE"

while true; do
    echo "----------------------------------------"
    echo "Docker Container Usage - $(date)"
    echo "----------------------------------------"

    docker compose ps -q | while read -r container_id; do
        container_name=$(docker inspect --format '{{.Name}}' "$container_id" | sed 's/\///')
        stats=$(docker stats --no-stream --format "{{.CPUPerc}}\t{{.MemPerc}}" "$container_id")
        cpu_usage=$(echo "$stats" | cut -f1)
        mem_usage=$(echo "$stats" | cut -f2)
        timestamp=$(date +%s)

        echo "$timestamp,$container_name,$cpu_usage,$mem_usage" >> "$OUTPUT_FILE"
    done

    sleep $INTERVAL
done
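
A hypothetical invocation, assuming the script is saved as track_stats.sh next to the compose project:

./track_stats.sh ./mongodb-replicaset cpu_memory_usage.log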

We will use a custom benchmarking suite:

func runBenchmarkModeXXX(client *mongo.Client) {
	start := time.Now()
	iterations := 10000
	workers := 10

	var wg sync.WaitGroup
	errChan := make(chan error, iterations)

	batchSize := iterations / workers

	for w := 0; w < workers; w++ {
		wg.Add(1)
		go func() {
			defer wg.Done() // idiomatic equivalent of wg.Add(-1)
			for i := 0; i < batchSize; i++ {
				if _, err := getReplicationInfoXXX(client); err != nil {
					errChan <- fmt.Errorf("iteration failed: %w", err)
				}
			}
		}()
	}

	go func() {
		wg.Wait()
		close(errChan)
	}()

	// Process errors
	for err := range errChan {
		log.Printf("Error: %v", err)
	}

	duration := time.Since(start)
	fmt.Printf("Benchmark Results:\n")
	fmt.Printf("Total iterations: %d\n", iterations)
	fmt.Printf("Total time: %v\n", duration)
	fmt.Printf("Average time per operation: %v\n", duration/time.Duration(iterations))
	fmt.Printf("Operations per second: %.2f\n", float64(iterations)/duration.Seconds())
}

To run the benchmark and observe the resource usage, I took the logic out of all 3 cases and made them standalone programs so that we benchmark each change in isolation.
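
For completeness, a rough sketch of how such a standalone program might connect to the replica set and invoke one of the benchmark variants; the connection URI and replica-set name are placeholders, and it assumes mongo-go-driver 1.x with the "context", "log", mongo, and options packages imported:

func main() {
	ctx := context.Background()

	// Placeholder URI for a local replica set.
	uri := "mongodb://localhost:27017/?replicaSet=rs0"
	client, err := mongo.Connect(ctx, options.Client().ApplyURI(uri))
	if err != nil {
		log.Fatalf("connect: %v", err)
	}
	defer client.Disconnect(ctx)

	// Run one of the three benchmark variants under comparison.
	runBenchmarkModeXXX(client)
}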


Case 1:
[container_usage1: CPU/memory usage chart]

Notice that CPU peaks at 800%+ and memory peaks at 80%+, which is bad; this is why we received so many issues claiming that this calculation causes memory spikes.


Case 2:
[container_usage2: CPU/memory usage chart]

This is a big improvement, but only for memory consumption. The introduction of the aggregation pipeline did a good job there: memory now peaks at 1.6%, but CPU still peaks at around 800%. Hence, we stopped getting issues reported for memory spikes but started getting issues reported for CPU spikes.


Case 3:

[container_usage3: CPU/memory usage chart]

Here, CPU peaks at 50%, and only for a short while, and memory peaks at 0.36%: a considerable improvement on both fronts.

Please also note that for Case 1 and Case 2 the benchmark had not even completed after a minute in my setup, whereas Case 3 took <10s and reported the final stats.

From this, we can safely say that Case 3 (the current PR changes) is a massive improvement in calculating the oplog window.

@stefans-elastic (Contributor) left a comment

Just one small comment.
Also, I'm not sure how I feel about such verbose code comments: on one hand they are useful, but on the other hand they can very easily get out of sync with the code in the future, which would have a negative effect.

metricbeat/module/mongodb/replstatus/info.go (review thread: outdated, resolved)
@shmsr shmsr added the backport-8.16 Automated backport with mergify label Jan 21, 2025
metricbeat/module/mongodb/replstatus/info.go (review thread: outdated, resolved)
// only need the timestamp (ts) field. FindOne() is used to retrieve a single
// document from the collection (limit: 1).

ctx := context.TODO()
A Contributor commented:

Use a proper context with timeout?

@shmsr (Member, Author) replied Jan 22, 2025:

We don't know how much time it can take; it varies from cluster to cluster. That said, I asked our users with a big production cluster to try this, and for them it took just a few seconds, whereas the previous implementation sometimes took 10 minutes.

That's why I did not put a timeout here: I don't have a number to use, and it will vary from user to user depending on how big their oplog is.
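
For reference, the pattern being suggested would look roughly like the following, where queryTimeout is a hypothetical, user-configurable duration rather than a hardcoded value:

// Hypothetical: bound the oplog queries with a configurable timeout.
ctx, cancel := context.WithTimeout(context.Background(), queryTimeout)
defer cancel()
// ...then pass ctx to FindOne / RunCommand instead of context.TODO()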

// https://www.mongodb.com/docs/manual/reference/method/db.collection.stats/#mongodb-method-db.collection.stats
// or use this: https://github.com/percona/mongodb_exporter/blob/95d1865e34940d0d610bb1fbff9745bc66ddbc73/exporter/collstats_collector.go#L100
res := db.RunCommand(context.Background(), bson.D{
{Key: "collStats", Value: oplogCol},
A Contributor commented:

This would work for v6.2 onwards as well?

A Contributor added:

Or do we need a version-based implementation of db.collection.stats() here, as mentioned in your comment?

@shmsr (Member, Author) replied:

No, it works; mongosh uses this very same thing for the oplog window.

metricbeat/module/mongodb/replstatus/info.go (review thread: resolved)
metricbeat/module/mongodb/replstatus/info.go (review thread: outdated, resolved)
metricbeat/module/mongodb/replstatus/info.go (review thread: resolved)
@ishleenk17 (Contributor) commented:

@shmsr: since we are updating the mongo driver here, do we need to test the other functionalities of the module to ensure the latest driver has no deprecated features that we might be using?

@shmsr (Member, Author) commented Jan 22, 2025

> @shmsr: since we are updating the mongo driver here, do we need to test the other functionalities of the module to ensure the latest driver has no deprecated features that we might be using?

No, not really. I did test this with MongoDB 4.x as well, which is pretty old, and it worked. But yes, since we know the driver is not the cause of this particular problem, we can revert that change too. I updated it because the driver was too old; 1.17 is the latest of 1.x (the latest overall is 2.x).

But for sure, we can revert the driver update and take it up as a separate task where we test everything and then upgrade. WDYT?

@shmsr shmsr added the backport-8.17 Automated backport with mergify label Jan 22, 2025
@ishleenk17 (Contributor) replied:

> @shmsr: since we are updating the mongo driver here, do we need to test the other functionalities of the module to ensure the latest driver has no deprecated features that we might be using?
>
> No, not really. I did test this with MongoDB 4.x as well, which is pretty old, and it worked. But yes, since we know the driver is not the cause of this particular problem, we can revert that change too. I updated it because the driver was too old; 1.17 is the latest of 1.x (the latest overall is 2.x).
>
> But for sure, we can revert the driver update and take it up as a separate task where we test everything and then upgrade. WDYT?

I think that would be a better approach. If this perf gain is not dependent on the driver, we can remove that change and take a failsafe approach for the whole module.

Labels
  • backport-8.x Automated backport to the 8.x branch with mergify
  • backport-8.16 Automated backport with mergify
  • backport-8.17 Automated backport with mergify
  • enhancement
  • Team:Obs-InfraObs Label for the Observability Infrastructure Monitoring team

4 participants