-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
#301 session/active time #305
Conversation
This commit only gives the opportunity to write PG14 specific queries.
Hi, IMHO it's a good idea to mix instant and cumulated counters in the same service. I'm also unsure of how useful it's, so it's another argument to be a in a dedicated service. Also, the "global_" prefix is quite misleading, maybe using "instance_" instead? |
Hi Julien, I also have a patch for a dedicated service, but @ioguix told me it would be better to include these new metrics in a existing one. I'll let the two of you debate this ;-) And yes, the "_instance" prefix would certainly be better (the code was very much modelled on the commit_ratio service). |
Was it a private discussion or did I miss some activity? In any case, agreed let's wait for him to chime in. |
Yes it was a private discussion (sorry for the confusion). I've been given time to work on check_pgactivity, a few weeks spread into this year, and I asked him what I could do to get a foothold in this project. Other suggestions are welcomed :) |
Well, there are already a few bugs opened (https://github.com/OPMDG/check_pgactivity/issues?q=is%3Aopen+is%3Aissue+label%3Abug) or otherwise quite a lot of other enhancements ideas in the bug tracker. Or simply improve the tap tests :) |
Hi, We reviewed some existing issues and spot the #301 that leads to the internal short discussion, and this PR. When discussing it, I couldn't find any interesting alerting use case based on these metrics. Creating a service only for two little metrics seemed overkill as well. On the other hand, we usually fetch more metrics than needed in other services for pure metrics. Adding two metrics to an existing service which is already related to the backends numbers was easier, seemed logical, coherent with other services and allow to add them without setting anything on the monitoring side. Moreover, the metrics have different units and are not supposed to be on the same graph anyway. Now, I really have no strong feelings and didn't review the patch yet. |
Well, as I said in my first answer I don't really see how helpful those metrics are. It becomes even worse on environment where you have connection pooling. |
I guess it could be helpful to decide whether or not you should set up a connection pooling. And maybe, once it has been set up, how efficient it is. If there's no activity at night, but pgBouncer maintains a pool of server connections, then a graph of these metrics over time could be useful (good active ratio during workday, bad average). |
I'm not sure it's really that helpful. Using a pooler is already a solution for specific problems that can already be identified without those info.
I'm also doubtful. It depends on your configuration as e.g. your pooler could aggresively drop idle connection or not. If you really want to integrate it I still thing that having a dedicate check with a bit more thinking on what could be done with the info would be better. For instance, we could detect which how the busy time is distributed among the databases. |
What are the other means are you thinking about? From check_pga perspective, I don't remember there's much, out of the One of the reason you might want to add a pooler is to lower the number of sessions per second and we have nothing about this currently (neither in current code base or in this PR).
Thinking about the other new
Moreover, I believe the So, indeed, this might qualify for a dedicate check if you agree with these certainly not exhausted use case. |
Nothing in check_pgactivity indeed. But the need for a pooler is something really tied to your architecture, I don't think that's the job of monitoring to tell you that you need a pooler, as it's not its job to tell you that you need to do backups or setup replicas. Now if we can have a new service that gives some useful information and can also help you notice that a pooler would be a good idea that's great, but it's not something that should really change, unlike other properties discussed here.
Huh? This field shows controlled kill, like pg_terminate_backend() or a postmaster clean stop. If there's a crash that kills everything, you will lose all pg_stat_* data anyway.
Agreed, especially idle_in_transaction_time. This one would deserve threshold.
Agreed. |
OK I will submit a new PR during my second week of check_pgactivity dedicated week (mid-april). |
PR #308 now exists, so we can close this one and move on. |
No description provided.