Datadog Concepts Monitors and Alerting

From NovaOrdis Knowledge Base

Revision as of 21:44, 25 May 2022

External

Internal

Overview

A monitor is a Datadog feature that actively checks metrics, integration availability, network endpoints, etc. and sends a notification when an alerting condition occurs. Each monitor has a query and alert conditions. There are several monitor types.
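To make the "query plus alert conditions" structure concrete, the following is a minimal sketch of a monitor definition expressed as a Python dictionary. It assumes the payload shape accepted by the Datadog v1 Monitors API (name, type, query, message, options.thresholds). The helper build_metric_monitor, the monitor name, the metric system.cpu.user and the threshold of 90 are illustrative choices, not taken from this page. Nothing is sent over the network.

```python
import json


def build_metric_monitor(name, query, critical, message):
    """Return a dict shaped like a metric monitor definition.

    Hypothetical helper for illustration only: it builds the payload
    and does not call the Datadog API.
    """
    return {
        "name": name,
        "type": "metric alert",   # monitor type
        "query": query,           # what to evaluate
        "message": message,       # notification text
        # Alert condition: assumed to mirror the threshold used in the query.
        "options": {"thresholds": {"critical": critical}},
    }


monitor = build_metric_monitor(
    name="High CPU (example)",
    query="avg(last_5m):avg:system.cpu.user{*} > 90",
    critical=90,
    message="CPU usage is above 90%.",
)
print(json.dumps(monitor, indent=2))
```

The query string carries both the evaluation window (last_5m) and the comparison, while the thresholds entry repeats the critical value as the alert condition.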

Monitor Types

Metric

https://docs.datadoghq.com/monitors/create/types/metric/

Metric monitors watch a continuous stream of data. The metrics are collected via the Datadog Agent or the API and can be alerted upon if they cross a threshold over a given period of time.
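The idea of "crossing a threshold over a given period of time" can be sketched as follows. This is only an illustration of the concept, not how Datadog evaluates monitors internally. The function name crosses_threshold, the choice of the average as the aggregation and the sample data are all assumptions.

```python
from statistics import mean


def crosses_threshold(points, now, window_seconds, threshold):
    """Return True if the average value inside the window exceeds threshold.

    points is a list of (timestamp_seconds, value) pairs. Only points with
    now - window_seconds <= timestamp <= now are considered. An empty
    window never triggers in this sketch.
    """
    recent = [v for t, v in points if now - window_seconds <= t <= now]
    if not recent:
        return False
    return mean(recent) > threshold


# Hypothetical CPU samples, one per minute.
points = [(0, 50.0), (60, 95.0), (120, 97.0), (180, 99.0)]

# Last two minutes: average of 95, 97, 99 is above 90.
short_window = crosses_threshold(points, now=180, window_seconds=120, threshold=90.0)

# Last three minutes: the 50.0 sample pulls the average below 90.
long_window = crosses_threshold(points, now=180, window_seconds=180, threshold=90.0)

print(short_window, long_window)
```

The two calls show why the evaluation period matters: the same data stream triggers with a short window and stays quiet with a longer one.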

Host

https://docs.datadoghq.com/monitors/create/types/host

A host monitor listens to the Datadog Agent heartbeats and notifies on the status of the heartbeat. This indicates whether the hosts the Agents run on are responsive. Every Datadog Agent reports a service check called datadog.agent.up with the status OK. The Host monitor has two kinds of alert conditions: Check Alert and Cluster Alert.
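Heartbeat-based host monitoring can be pictured with a small sketch: a host is treated as unresponsive when its last heartbeat is older than a tolerated silence period. This is a simplified illustration under stated assumptions, not Datadog's implementation. The function unresponsive_hosts, the host names and the 120-second limit are hypothetical.

```python
def unresponsive_hosts(last_heartbeat, now, max_silence_seconds):
    """Return the sorted names of hosts whose last heartbeat is too old.

    last_heartbeat maps a host name to the timestamp (in seconds) of the
    most recent heartbeat received from the Agent on that host.
    """
    return sorted(
        host
        for host, ts in last_heartbeat.items()
        if now - ts > max_silence_seconds
    )


# Hypothetical heartbeat timestamps per host.
last_heartbeat = {"web-1": 1000, "web-2": 880, "db-1": 400}

silent = unresponsive_hosts(last_heartbeat, now=1000, max_silence_seconds=120)
print(silent)
```

In this sample only db-1 is reported, because web-2 has been silent for exactly the tolerated 120 seconds and the comparison is strict.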

Triggered Monitor

Downtime

Incident

SLO

Alert

Alert Conditions

Check Alert

Cluster Alert

Notification

Operations