collectd CloudHealth memory plugin for AWS

Upload EC2 memory metrics to CloudHealth for better cost optimization!

Scope

We are aware that there is an official agent to upload metrics. However, the agent did not fit our environment for various reasons:

Runs its own embedded collectd daemon (in addition to the one we're already using)
Misc. compliance requirements (where each component needs to be validated):
- The installation process fetches many resources from the outside world via wget (as opposed to installing a single package from a signed repository)
- Agent code runs in embedded Ruby

Within AWS, CloudHealth already has knowledge about CPU and network metrics, so we just need to add the memory metrics on our own by using the API.

CloudHealth API Limitation

You can only post CPU, memory, and file system metrics.
You can only post up to 8 days of historical metrics data.
Metrics must have an hourly resolution.
An active AWS Instance associated with the metrics must already be present and active in the CloudHealth Platform and not be Chef-managed.
Metric retrieval is for individual assets only, that is, for AWS EC2 Instances or file systems of AWS EC2 Instances.
The payload can contain a max of 1000 data points. If there are more than 1000 data points, the entire request is rejected with a 422 response.
When posting to file systems, the associated instance must be present and active. However, if a file system object does not currently exist, a new one is automatically created and linked to the instance.
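The 8-day and 1000-data-point limits above can be enforced on the client side before posting. The following is a minimal sketch, not the plugin's actual code: `prepare_batches` is a hypothetical helper, and it assumes that each row counts as one data point and that timestamps are timezone-aware datetimes.

```python
from datetime import datetime, timedelta, timezone

MAX_POINTS_PER_REQUEST = 1000    # limit stated in the list above
MAX_HISTORY = timedelta(days=8)  # limit stated in the list above


def prepare_batches(rows, now=None):
    """Drop rows older than 8 days and split the rest into batches.

    `rows` is a list of (timestamp, value) tuples. Treating one row as
    one data point is an assumption made for this sketch.
    """
    now = now or datetime.now(timezone.utc)
    # Keep only rows that are still inside the accepted history window.
    recent = [row for row in rows if now - row[0] <= MAX_HISTORY]
    # Slice into chunks so that no single request exceeds the limit.
    return [
        recent[i:i + MAX_POINTS_PER_REQUEST]
        for i in range(0, len(recent), MAX_POINTS_PER_REQUEST)
    ]
```

Each returned batch would then be sent as its own request, so a single oversized payload cannot cause the whole upload to be rejected with a 422 response.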

In addition to the limitations documented by CloudHealth, the API expects each "hourly resolution" timestamp to be truncated to the full hour (e.g. 2020-12-04T17:00:00). Otherwise, the API responds with:

{
  "errors": [],
  "succeeded": 0,
  "failed": 1,
  "datasets": [
    {
      "errors": [],
      "succeeded": 0,
      "failures": [
        {
          "error": "Date/time value '2020-12-04T17:43:35' cannot have a non-zero minute value.",
          "row": [
            "<region>:<aws-account-id>:<instance-id>",
            "2020-12-04T17:43:35",
            37.088733582900176,
            52.81394681853567,
            33.08149301429918
          ]
        }
      ]
    }
  ]
}
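To avoid the error shown above, a timestamp can be truncated to the full hour before it is sent. This is a small sketch using only the Python standard library; `floor_to_hour` is a hypothetical helper name, not a function taken from the plugin.

```python
from datetime import datetime


def floor_to_hour(ts):
    """Return `ts` with minute, second and microsecond set to zero."""
    return ts.replace(minute=0, second=0, microsecond=0)


# The rejected timestamp from the error response above:
print(floor_to_hour(datetime(2020, 12, 4, 17, 43, 35)).isoformat())
# prints 2020-12-04T17:00:00
```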

Design

Runs as a plain collectd Python plugin that receives only memory.used.percent metrics from a filter chain.
Collects the following metrics for a period of 1 hour (see CloudHealth API Limitation):
- memory:used:percent.avg
- memory:used:percent.max
- memory:used:percent.min
Stores metrics in memory by default (see cloudhealthmemory.conf for persistence)
Uses a background thread for uploading the metrics after 1 hour
- including retry on connection issues
- if an upload fails, the next upload cycle includes all metrics that are still missing
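The hourly aggregation described above could look roughly like the following. This is an illustrative sketch and not the code in cloudhealthmemory.py: the class `HourlyMemoryStats` and its methods are hypothetical names, and the upload thread and persistence are left out. Buckets are only discarded once the caller confirms a successful upload, so rows from a failed cycle are returned again in the next one.

```python
from collections import OrderedDict
from datetime import datetime


class HourlyMemoryStats:
    """Running avg/max/min of memory.used.percent, keyed by full hour."""

    def __init__(self):
        # hour (datetime) -> [sum, count, max, min]
        self._buckets = OrderedDict()

    def add(self, ts, percent):
        """Record one sample in the bucket of the hour it belongs to."""
        hour = ts.replace(minute=0, second=0, microsecond=0)
        bucket = self._buckets.get(hour)
        if bucket is None:
            self._buckets[hour] = [percent, 1, percent, percent]
        else:
            bucket[0] += percent
            bucket[1] += 1
            bucket[2] = max(bucket[2], percent)
            bucket[3] = min(bucket[3], percent)

    def completed_rows(self, now):
        """Return (hour, avg, max, min) for every hour that has fully elapsed."""
        current = now.replace(minute=0, second=0, microsecond=0)
        return [
            (hour, total / count, high, low)
            for hour, (total, count, high, low) in self._buckets.items()
            if hour < current
        ]

    def discard(self, hours):
        """Forget buckets after a successful upload; others stay queued."""
        for hour in hours:
            self._buckets.pop(hour, None)
```

A caller would feed each received value into `add`, ask for `completed_rows` once per upload cycle, and call `discard` only for the hours that were accepted.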

Requirements

collectd-core
python-requests
python-yaml

Installation

Install requirements
Copy cloudhealthmemory.py into your collectd plugin path (most probably /var/lib/collectd)
Copy cloudhealthmemory.conf into your collectd configuration directory (e.g. /etc/collectd/collectd.conf.d/)
Make sure the config files are being loaded by collectd
Restart collectd daemon

Example:

apt-get install -y collectd-core python libpython2.7 python-yaml python-requests
mkdir -p /etc/collectd/collectd.conf.d
curl -Lo /var/lib/collectd/cloudhealthmemory.py https://raw.githubusercontent.com/root360/collectd-cloudhealth-memory-aws-plugin/master/cloudhealthmemory.py
curl -Lo /etc/collectd/collectd.conf.d/cloudhealthmemory.conf https://raw.githubusercontent.com/root360/collectd-cloudhealth-memory-aws-plugin/master/cloudhealthmemory.conf
grep "collectd.conf.d/\*\.conf" /etc/collectd/collectd.conf || echo 'Include "/etc/collectd/collectd.conf.d/*.conf"' >> /etc/collectd/collectd.conf
systemctl restart collectd
