AWS IoT

Dynatrace ingests metrics for multiple preselected namespaces, including AWS IoT. You can view metrics for each service instance, split metrics into multiple dimensions, and create custom charts that you can pin to your dashboards.

Prerequisites

To enable monitoring for this service, you need

  • An Environment or Cluster ActiveGate version 1.181+
  • Dynatrace version 1.182+
  • An updated AWS monitoring policy to include the additional AWS services.
    To update the AWS IAM policy, use the JSON below, containing the monitoring policy (permissions) for all supporting services.

If you don't want to add permissions to all services, and just select permissions for certain services, consult the table below. The table contains a set of permissions that are required for all services (All monitored Amazon services) and, for each supporting service, a list of optional permissions specific to that service.

Example of JSON policy for one single service.

In this example, from the complete list of permissions you need to select

  • "apigateway:GET" for Amazon API Gateway
  • "cloudwatch:GetMetricData", "cloudwatch:GetMetricStatistics", "cloudwatch:ListMetrics", "sts:GetCallerIdentity", "tag:GetResources", "tag:GetTagKeys", and "ec2:DescribeAvailabilityZones" for All monitored Amazon services.

Add the service to monitoring

In order to view the service metrics, you must add the service to monitoring in your Dynatrace environment.

Note: Once AWS supporting services are added to monitoring, you might have to wait 15-20 minutes before the metric values are displayed.

Configure service metrics

Once you add a service, Dynatrace starts automatically collecting a suite of metrics for this particular service. These are recommended metrics. Apart from the recommended metrics, most services have the possibility of enabling optional metrics. You can remove or edit any of the existing metrics or any of their dimensions, where there are multiple dimensions available. Metrics consisting of only one dimension can't be edited. They can only be removed or added.

Service-wide metrics are metrics for the whole service across all regions. Typically, these metrics include dimensions containing Region in their name. If selected, these metrics are displayed on a separate chart when viewing your AWS deployment in Dynatrace. Keep in mind that available dimensions differ among services.

To change a metric's statistics, you have to recreate that metric by choosing different statistics. You can choose among the following statistics: Sum, Minimum, Maximum, Average, and Sample count. The Average + Minimum + Maximum statistics enable you to collect all three statistics as one metric instead of one statistic for three metrics separately. This can reduce your expenses for retrieving metrics from your AWS deployment.

To be able to save a newly added metric, you need to select at least one statistic and one dimension.

Note: Once AWS supporting services are configured, you might have to wait 15-20 minutes before the metric values are displayed.

View service metrics

Once you add the service to monitoring, you can view the service metrics in your Dynatrace environment either on your dashboard page or on the custom device overview page.

Available metrics

Name Description Statistics Unit Dimensions Recommended
CanceledJobExecutionCount The number of job executions whose status has changed to CANCELED within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
CanceledJobExecutionTotalCount The total number of job executions whose status is CANCELED for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
ClientError The number of client errors generated while executing the job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
Connect.AuthError The number of connection requests that could not be authorized by the message broker. The Protocol dimension contains the protocol used to send the CONNECT message. Count Sum Region, Protocol
Connect.ClientError The number of connection requests rejected because the MQTT message did not meet the requirements defined in AWS IoT quotas. The Protocol dimension contains the protocol used to send the CONNECT message. Count Sum Region, Protocol
Connect.ServerError The number of connection requests that failed because an internal error occurred. The Protocol dimension contains the protocol used to send the CONNECT message. Count Sum Region, Protocol
Connect.Success The number of successful connections to the message broker. The Protocol dimension contains the protocol used to send the CONNECT message. Count Sum Region, Protocol ✔️
Connect.Throttle The number of connection requests that were throttled because the account exceeded the allowed connect request rate. The Protocol dimension contains the protocol used to send the CONNECT message. Count Sum Region, Protocol
DeleteThingShadow.Accepted The number of DeleteThingShadow requests processed successfully. The Protocol dimension contains the protocol used to make the request. Count Sum Region, Protocol
FailedJobExecutionCount The number of job executions whose status has changed to FAILED within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
FailedJobExecutionTotalCount The total number of job executions whose status is FAILED for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
Failure The number of failed rule action invocations. The RuleName dimension contains the name of the rule that specifies the action. The ActionType dimension contains the type of action that was invoked. Count Sum Region, RuleName, ActionType
GetThingShadow.Accepted The number of GetThingShadow requests processed successfully. The Protocol dimension contains the protocol used to make the request. Count Sum Region, Protocol
InProgressJobExecutionCount The number of job executions whose status has changed to IN_PROGRESS within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
InProgressJobExecutionTotalCount The total number of job executions whose status is IN_PROGRESS for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
NumLogBatchesFailedToPublishThrottled The singular batch of log events that has failed to publish due to throttling errors Count Sum Region
NumLogEventsFailedToPublishThrottled The number of log events within the batch that have failed to publish due to throttling errors Count Sum Region
ParseError The number of JSON parse errors that occurred in messages published on a topic on which a rule is listening. The RuleName dimension contains the name of the rule. Count Sum Region, RuleName
Ping.Success The number of ping messages received by the message broker. The Protocol dimension contains the protocol used to send the ping message. Count Sum Region, Protocol
PublishIn.AuthError The number of publish requests the message broker was unable to authorize. The Protocol dimension contains the protocol used to publish the message. Count Sum Region, Protocol
PublishIn.ClientError The number of publish requests rejected by the message broker because the message did not meet the requirements defined in AWS IoT quotas. The Protocol dimension contains the protocol used to publish the message. Count Sum Region, Protocol
PublishIn.ServerError The number of publish requests the message broker failed to process because an internal error occurred. The Protocol dimension contains the protocol used to send the PUBLISH message. Count Sum Region, Protocol
PublishIn.Success The number of publish requests successfully processed by the message broker. The Protocol dimension contains the protocol used to send the PUBLISH message. Count Sum Region, Protocol ✔️
PublishIn.Throttle The number of publish request that were throttled because the client exceeded the allowed inbound message rate. The Protocol dimension contains the protocol used to send the PUBLISH message. Count Sum Region, Protocol
PublishOut.AuthError The number of publish requests made by the message broker that could not be authorized by AWS IoT. The Protocol dimension contains the protocol used to send the PUBLISH message. Count Sum Region, Protocol
PublishOut.ClientError The number of publish requests made by the message broker that were rejected because the message did not meet the requirements defined in AWS IoT quotas. The Protocol dimension contains the protocol used to send the PUBLISH message. Count Sum Region, Protocol
PublishOut.Success The number of publish requests successfully made by the message broker. The Protocol dimension contains the protocol used to send the PUBLISH message. Count Sum Region, Protocol
QueuedJobExecutionCount The number of job executions whose status has changed to QUEUED within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
QueuedJobExecutionTotalCount The total number of job executions whose status is QUEUED for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
RejectedJobExecutionCount The number of job executions whose status has changed to REJECTED within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
RejectedJobExecutionTotalCount The total number of job executions whose status is REJECTED for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
RemovedJobExecutionCount The number of job executions whose status has changed to REMOVED within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
RemovedJobExecutionTotalCount The total number of job executions whose status is REMOVED for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
RuleMessageThrottled The number of messages throttled by the rules engine because of malicious behavior or because the number of messages exceeds the rules engine's throttle limit. The RuleName dimension contains the name of the rule to be triggered. Count Sum Region, RuleName
RuleNotFound The rule to be triggered could not be found. The RuleName dimension contains the name of the rule. Count Sum Region, RuleName
RulesExecuted The number of AWS IoT rules executed Count Sum Region ✔️
ServerError The number of connection requests that failed because an internal error occurred. The Protocol dimension contains the protocol used to send the CONNECT message. Count Sum Region, JobId
Subscribe.AuthError The number of subscription requests made by a client that could not be authorized. The Protocol dimension contains the protocol used to send the SUBSCRIBE message. Count Sum Region, Protocol
Subscribe.ClientError The number of subscribe requests that were rejected because the SUBSCRIBE message did not meet the requirements defined in AWS IoT quotas. The Protocol dimension contains the protocol used to send the SUBSCRIBE message. Count Sum Region, Protocol
Subscribe.ServerError The number of subscribe requests that were rejected because an internal error occurred. The Protocol dimension contains the protocol used to send the SUBSCRIBE message. Count Sum Region, Protocol
Subscribe.Success The number of subscribe requests that were successfully processed by the message broker. The Protocol dimension contains the protocol used to send the SUBSCRIBE message. Count Sum Region, Protocol ✔️
Subscribe.Throttle The number of subscribe requests that were throttled because the client exceeded the allowed subscribe request rate. The Protocol dimension contains the protocol used to send the SUBSCRIBE message. Count Sum Region, Protocol
SuccededJobExecutionCount The number of job executions whose status has changed to SUCCESS within a time period that is determined by CloudWatch. The JobId dimension contains the ID of the job. Count Sum Region, JobId
SuccededJobExecutionTotalCount The total number of job executions whose status is SUCCESS for the given job. The JobId dimension contains the ID of the job. Count Sum Region, JobId
Success The number of successful rule action invocations. The RuleName dimension contains the name of the rule that specifies the action. The ActionType dimension contains the type of action that was invoked. Count Sum Region, RuleName, ActionType
TopicMatch The number of incoming messages published on a topic on which a rule is listening. The RuleName dimension contains the name of the rule. Count Sum Region, RuleName
Unsubscribe.ClientError The number of unsubscribe requests that were rejected because the UNSUBSCRIBE message did not meet the requirements defined in AWS IoT quotas. The Protocol dimension contains the protocol used to send the UNSUBSCRIBE message. Count Sum Region, Protocol
Unsubscribe.ServerError The number of unsubscribe requests that were rejected because an internal error occurred. The Protocol dimension contains the protocol used to send the UNSUBSCRIBE message. Count Sum Region, Protocol
Unsubscribe.Success The number of unsubscribe requests that were successfully processed by the message broker. The Protocol dimension contains the protocol used to send the UNSUBSCRIBE message. Count Sum Region, Protocol
Unsubscribe.Throttle The number of unsubscribe requests that were rejected because the client exceeded the allowed unsubscribe request rate. The Protocol dimension contains the protocol used to send the UNSUBSCRIBE message. Count Sum Region, Protocol
UpdateThingShadow.Accepted The number of UpdateThingShadow requests processed successfully. The Protocol dimension contains the protocol used to make the request. Count Sum Region, Protocol