'Cumulative sum of AWS Cloudwatch Metric
AWS Cloudwatch receives a count of 1 every time I start an image download. I am downloading 1,000s of images (on a cluster of EC2 instances) and would like to track the total progress.
I can't find any documentation on how to plot the cumulative sum of a metric. The AWS Cloudwatch Math Expressions looked promising, but they do not have an integrate function.
Currently, I can plot the sum of the started image downloads but only for periods, as seen below. Ideally, I'd like to plot the integral of this plot:
Solution 1:[1]
You are correct. All Amazon CloudWatch metrics are for a defined period.
The maximum period for a metric is one day, so this is not suitable for a cumulative counter that you wish to continue beyond one day.
You would need to find an alternate method of storing the count, such as an Amazon DynamoDB table. Use an atomic counter via UpdateItem to increment the count.
Solution 2:[2]
You can get a cumulative sum over the current range by using the SUM()
function that is operated over the original range containing only the number One (1). Remember, you're looking for a single number in the end, so it's not much of a graph, but you need to turn the single value sum back into a time-series.
- Define
m1
as your metric. This is the metric you will want to useSUM()
on. - Define an expression
e1
asm1/m1
. This results in a time-series with every value equal to 1. This is what will allow you convert that SUM back to a time-series. - Define an expression
e2
asSUM(m1) / e1
. This is, effectively, the cumulative sum ofm1
divided by one for every data-point in the original time-series. It will be a horizontal line on the graph, which will have every point on that horizontal line being the cumulative sum of metricm1
. This is required because Cloudwatch can only plot a time-series on the chart, not a single value. - Make
m1
ande1
invisible. You need them, but you don't need to see them. - Finally, change the chart type from
Line
toNumber
, since you only wanted the cumulative sum anyway.
The reason you can't use SUM()
directly is because it is a single value. By dividing by a time-series containing all 1's, the entire graph is the result of the SUM()
. Then, changing the chart to a Number effectively hides all the math and presents only the "final result".
Solution 3:[3]
Looks like RUNNING_SUM() has been added that does what your need:
You can find RUNNING_SUM() under "Add math"->"All functions"
Solution 4:[4]
You can also use a very long period.
Change your stat to SUM
, and set your metric's period to 7 days
. You'll get a time series of 1 point with the cumulative sum of all the downloads.
If you give each download a unique dimension value, you can keep your queries separate.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|---|
Solution 1 | John Rotenstein |
Solution 2 | User51 |
Solution 3 | Kim Rydhof Thor Hansen |
Solution 4 | Heath Borders |