Availability events

This page provides information about supported availability events, along with the logic behind raising them.

Unexpected low traffic

UNEXPECTED_LOW_LOAD

Dynatrace collects a multidimensional baseline for application and user action traffic and therefore learns the typical traffic pattern of all your applications and user actions. Alerting on abnormally low application traffic is enabled by default as those situations can indicate full application outages! Refer to application anomaly detection settings at Settings > Anomaly detection > Applications to adapt the sensitivity of low-traffic alerting. When enabled, Dynatrace follows a time-interval comparison pattern (daily or weekly) in alerting on abnormal traffic situations. The actual monitored application traffic is compared with the previous interval and alerts if the comparison reports unusually low traffic.

A typical example of an unexpected low traffic event is shown below.

Unexpected low traffic event

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • Service
  • Application
  • Mobile application

Host or monitoring unavailable

OSI_UNEXPECTEDLY_UNAVAILABLE

This event is detected when a host is abruptly shut down or Dynatrace loses the network connection to the host’s OneAgent. If the operating system is shut down regularly, Dynatrace won't open a problem. Dynatrace will show the host in an unavailable state.

Condition:

  • Network connection to the monitored host is lost unexpectedly while OneAgent and host are still running. The connection must be lost for more than 5 minutes, before OneAgent starts sending signals again and cached metric data fills in the missing chart data.
  • OneAgent isn't able to catch and send the regular operating system shutdown message.
  • Decommissioning of virtual hosts (for example, in AWS auto-scaling groups) when the operating system isn't shut down and therefore Dynatrace detects this as a connection-lost event (event delayed up to 10 minutes after connection is lost).

Closing conditions:

  • Event is resolved when the host is available again
  • Timeout of event occurs after 12 hours (previous timeouts were 5 days for hosts and 3 days for virtualized hosts).

An example host unavailable event is shown below.

Host unavailable

Process unavailable

PROCESS_UNAVAILABLE, PROCESS_GROUP_LOW_INSTANCE_COUNT

Dynatrace OneAgent automatically detects each running process on hosts and reports the availability state of all processes. As many hosts run volatile processes that are restarted on a regular schedule, Dynatrace doesn't open a problem for each process shutdown by default. However this event is shown as soon as any other triggered event is raised. Alerting on process unavailability is an opt-in setting available via process-group configuration. Process-group configuration allows you to choose between three options:

  1. If service requests are impacted: Open a process unavailable event only if Dynatrace detects active client requests hitting the selected process.
  2. If any process becomes unavailable: Open the event if any of the processes within the process group become unavailable.
  3. If the minimum threshold isn't met: Open the event if a minimum number of running processes within that process group isn't met within an observation period of at least 2 minutes.

Process group availability setting

A typical example of a process unavailable event is shown below.

Process unavailable event

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • Process group (PROCESS_GROUP_LOW_INSTANCE_COUNT)
  • Process group instance (PROCESS_UNAVAILABLE)

Synthetic monitor global outage

WEB_CHECK_GLOBAL_OUTAGE

Synthetic monitors check your websites from multiple geographic regions on a regular schedule. Dynatrace raises a synthetic global outage event if your website stops responding from all configured geographic regions.

A typical example of a synthetic monitor global outage event is shown below.

Synthetic monitors global outage

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • Synthetic monitor

Synthetic monitor local outage

WEB_CHECK_LOCAL_OUTAGE

Synthetic monitors regularly check your websites from multiple geographic regions. Dynatrace raises a synthetic local outage event in case that your Website is not responding from at least one configured geographic region. Once your Web application is not responding from all configured regions, a problem will be elevated to report a global outage.

A typical example of a synthetic monitor local outage event is shown below.

Synthetic monitors local outage event

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • Synthetic monitor

Synthetic monitor outage

SYNTHETIC_AVAILABILITY

The Dynatrace API allows you to receive synthetic test results and synthetic events from Dynatrace Synthetic Classic. This availability event is raised if the Synthetic Classic monitor reports the unavailability of a system under test. The lifespan of Synthetic Classic events is managed through the Classic monitor. Refer to your alerting settings within Synthetic Classic.

A typical example of a synthetic monitor outage event is shown below.

Synthetic Classic outage

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • External synthetic monitor

Availability log pattern found

HOST_LOG_AVAILABILITY

Dynatrace Log Analytics allows you to define log patterns that indicate availability related problems on a host. All detected pattern appearances are registered and if the number of detected pattern exceeds your critical threshold an availability log pattern event is raised.

A typical example of an availability log pattern found event is shown below.

Host log availability event

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • Host

Availability log pattern found

PROCESS_LOG_AVAILABILITY

Dynatrace Log Analytics allows you to define log pattern that indicate availability related problems on a process level. Every appearance of a configured pattern is counted over time and if the number of detected pattern exceeds your critical threshold an availability log pattern event is raised.

A typical example of an availability log pattern found event is shown below.

Process log availability event

Applicable Dynatrace entities

The following Dynatrace entities apply to this event:

  • Process group instance

Custom availability event

AVAILABILITY_EVENT

This generic availability event can be used by monitoring plugins or through the Dynatrace REST API to raise a customized availability event with a user defined title.

An example of a custom availability event with a user defined title Batch process schedule outage is shown below.

Custom availability event

Applicable Dynatrace entities

All Dynatrace entities apply to this event.