"targets": [ "10.123.175.30:9100","10.125.150.14:9100"], "labels": { "env": "dev", "job": "node", "group": "developer" } for t
I have a few IIS sites I would like to monitor using Prometheus. Specifically monitor and alert on outages. I cannot figure out how to grab a metric when a site
tl;dr I have a label in prometheus called "ignore" with value "yes": metric_test{label1="label1",ignore="yes"} 1 I want to disable alerts for any metrics with
In the Prometheus documentation it describes the holt_winters() function that can be used generate a smoothed curve. However the documentation states it should
I am running a diskspace used query in Prometheus and would like to return only the top 5 or 10 entries from the search result. Is there anyway I can achieve th
So I'm aware of some percentile functions in PromQL like histogram_quantile which is used in a case like this: // Over the past 5 minutes, what's the maximum ht
We are using Prometheus-Grafana. Now we want to set alert for CPU load average of 5 minutes. We have 60 servers which have different CPU core like few machine
As per prometheus storage.md , the recommendation is not to use nfs storage as persistent volume for prometheus. But solutions like prometheus operator and ope
I'm running Prometheus in a kubernetes cluster. All is running find and my UI pods are counting visitors. Please ignore the title, what you see here is the
We have many use-cases when we want to calculate success rate but there were not tasks that succeeded, we would expect that the success rate will be 0, but it's
I have a counter metric in prometheus. I want to add lables to it dynamically for example if my request comes http://abc123.com/{p1} ,I want my custom_metric_na
I'd like to get the 0.95 percentile memory usage of my pods from the last x time. However this query start to take too long if I use a 'big' (7 / 10d) range. T