Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOCS] Document Metricbeat limitation related to some metrics #37556

Open
lucabelluccini opened this issue Jan 5, 2024 · 2 comments
Open

[DOCS] Document Metricbeat limitation related to some metrics #37556

lucabelluccini opened this issue Jan 5, 2024 · 2 comments
Labels
docs Metricbeat Metricbeat Team:Docs Label for the Observability docs team

Comments

@lucabelluccini
Copy link
Contributor

The missing field is when the process is not around long enough to be included in two periods of metric collection. Because it is the first time that the process exists in the period it does not include the field.

In order to calculate percentages, the process needs to appear at least twice to calculate a performance delta. We can expect that the first time a process is reported from the metricset that it does not have for example a system.process.cpu.total.norm.pct value. The second time it is report it then will have a system.process.cpu.total.norm.pct value. In this case process is too short lived then it will only ever be reported once having no value in system.process.cpu.total.norm.pct.

Please note that versions earlier than 8 were reporting the field with 0, when we shouldn't have been reporting anything. Versions on 8.x are not populating the field.

Why do we need 2 values?
For example, the CPU usage in percent is calculated based on a time delta using the raw tick value (usually representing a millisecond) that indicates the total amount of CPU time a process has used. For example, if we collect the tick value for a process in 5 second intervals, and and the difference between the first tick value and the second is 1000 ticks, then we know the process has used 1 second of time in a 5 second interval, or 1/5, or 20%, of CPU time. In order to do this, we need three numbers: The first count of total cpu time, the second count, and the time between the counts. Hence why we need two values.

@lucabelluccini lucabelluccini added docs Metricbeat Metricbeat Team:Docs Label for the Observability docs team labels Jan 5, 2024
@botelastic
Copy link

botelastic bot commented Jan 4, 2025

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

@botelastic botelastic bot added the Stalled label Jan 4, 2025
@lucabelluccini
Copy link
Contributor Author

bump

@botelastic botelastic bot removed the Stalled label Jan 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs Metricbeat Metricbeat Team:Docs Label for the Observability docs team
Projects
None yet
Development

No branches or pull requests

1 participant