Azure Batch monitoring

The Azure Batch overview page gives you a comprehensive view of how many jobs and tasks were completed over a period of time. You can also track nodes in different states, such as running, idle, or offline.

Prerequisites

  • Dynatrace version 1.196+
  • Environment ActiveGate version 1.195+

Enable monitoring

To enable monitoring for Azure Batch, you first need to set up integration with Azure Monitor.

Add the service to monitoring

To view the service metrics, you must add the service to monitoring in your Dynatrace environment.

Monitor resources based on tags

You can choose to monitor resources based on existing Azure tags, as Dynatrace automatically imports them from service instances.

To monitor resources based on tags

  1. Go to Settings > Cloud and virtualization > Azure and select the Azure instance.
  2. For Resource monitoring method, select Monitor resources based on tags.
  3. Enter the Key and Value.
  4. Select Save.

Note: To import the Azure tags automatically into Dynatrace, enable Capture Azure tags automatically.
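The effect of the Key and Value entered in the steps above can be pictured as a filter over tagged resources. The following is a minimal sketch, not Dynatrace's implementation: the AzureResource class, the select_monitored helper, the resource names, and the exact, case-sensitive comparison are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class AzureResource:
    """Hypothetical stand-in for an Azure service instance and its tags."""
    name: str
    tags: Dict[str, str] = field(default_factory=dict)


def matches_tag(resource: AzureResource, key: str, value: str) -> bool:
    """Return True when the resource carries the given key/value tag pair.

    Assumption: an exact, case-sensitive comparison of key and value.
    """
    return resource.tags.get(key) == value


def select_monitored(resources: List[AzureResource], key: str, value: str) -> List[AzureResource]:
    """Keep only the resources whose tags match the configured pair."""
    return [r for r in resources if matches_tag(r, key, value)]


# Hypothetical inventory: only the first resource has the matching tag.
resources = [
    AzureResource("batch-prod", {"environment": "production"}),
    AzureResource("batch-dev", {"environment": "development"}),
    AzureResource("batch-untagged"),
]
monitored = select_monitored(resources, "environment", "production")
print([r.name for r in monitored])  # ['batch-prod']
```

In this sketch, untagged resources are never selected, which is why importing the Azure tags matters when you rely on tag-based monitoring.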

Configure service metrics

Once you add a service, Dynatrace automatically starts collecting a set of metrics for it. These are the recommended metrics.

Recommended metrics:

  • Are enabled by default
  • Can't be disabled
  • Can have recommended dimensions (enabled by default, can't be disabled)
  • Can have optional dimensions (disabled by default, can be enabled)

In addition to the recommended metrics, most services let you enable optional metrics.

Optional metrics:

  • Can be added and configured manually
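The rules above amount to a simple set relationship: the collected metrics are the recommended ones plus whichever optional ones you enable. The sketch below illustrates that under stated assumptions. The collected_metrics helper is hypothetical, and the two sets hold only a few of the Azure Batch metric names from this page, not the full list.

```python
from typing import FrozenSet, Iterable

# A few metric names from this page, split as in the "Recommended" column.
RECOMMENDED: FrozenSet[str] = frozenset(
    {"RunningNodeCount", "IdleNodeCount", "TaskFailEvent"}
)
OPTIONAL: FrozenSet[str] = frozenset(
    {"OfflineNodeCount", "UnusableNodeCount", "PoolCreateEvent"}
)


def collected_metrics(enabled_optional: Iterable[str]) -> FrozenSet[str]:
    """Return the metrics collected for a service.

    Recommended metrics are always included because they can't be disabled.
    Optional metrics are included only when explicitly enabled.
    """
    enabled = frozenset(enabled_optional)
    unknown = enabled - OPTIONAL
    if unknown:
        raise ValueError(f"not an optional metric: {sorted(unknown)}")
    return RECOMMENDED | enabled


print(sorted(collected_metrics(["OfflineNodeCount"])))
```

Passing an empty list returns only the recommended set, which matches the default state right after a service is added.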

View service metrics

You can view the service metrics in your Dynatrace environment either on the custom device overview page or on your Dashboards page.

View metrics on the custom device overview page

To access the custom device overview page

  1. Go to Technologies on the Dynatrace navigation menu.
  2. Filter by service name and select the relevant custom device group.
  3. The custom device group overview page opens and lists all instances (custom devices) that belong to the group.
  4. Select an instance to view its custom device overview page.

View metrics on your dashboard

Once you add a service to monitoring, a preset dashboard for the respective service containing all recommended metrics is automatically created on your Dashboards page. You can look for specific dashboards by filtering by Preset and then by Name.

Note: For existing monitored services, you might need to resave your credentials for the preset dashboard to appear on the Dashboards page. To resave your credentials, go to Settings > Cloud and virtualization > Azure, select the desired Azure instance, then select Save.

You can't make changes on a preset dashboard directly, but you can clone and edit it. To clone a dashboard, open the browse menu (...) and select Clone.

To remove a dashboard from the dashboards list, you can hide it. To hide a dashboard, open the browse menu (...) and select Hide.

Note: Hiding a dashboard doesn't affect other users.

Available metrics

| Name | Description | Dimensions | Unit | Recommended |
|---|---|---|---|---|
| CoreCount | Total number of dedicated cores in the batch account | None | Count | ✔️ |
| CreatingNodeCount | Number of nodes being created | None | Count | |
| IdleNodeCount | Number of idle nodes | None | Count | ✔️ |
| JobDeleteCompleteEvent | Total number of jobs that have been successfully deleted | jobId | Count | |
| JobDeleteStartEvent | Total number of jobs that have been requested to be deleted | jobId | Count | |
| JobDisableCompleteEvent | Total number of jobs that have been successfully disabled | jobId | Count | |
| JobDisableStartEvent | Total number of jobs that have been requested to be disabled | jobId | Count | |
| JobStartEvent | Total number of jobs that have been successfully started | jobId | Count | ✔️ |
| JobTerminateCompleteEvent | Total number of jobs that have been successfully terminated | jobId | Count | |
| JobTerminateStartEvent | Total number of jobs that have been requested to be terminated | jobId | Count | |
| LeavingPoolNodeCount | Number of nodes leaving the pool | None | Count | |
| LowPriorityCoreCount | Total number of low-priority cores in the batch account | None | Count | ✔️ |
| LowPriorityNodeCount | Total number of low-priority nodes in the batch account | None | Count | ✔️ |
| OfflineNodeCount | Number of offline nodes | None | Count | |
| PoolCreateEvent | Total number of pools that have been created | poolId | Count | |
| PoolDeleteCompleteEvent | Total number of pool deletes that have completed | poolId | Count | |
| PoolDeleteStartEvent | Total number of pool deletes that have started | poolId | Count | |
| PoolResizeCompleteEvent | Total number of pool resizes that have completed | poolId | Count | |
| PoolResizeStartEvent | Total number of pool resizes that have started | poolId | Count | |
| PreemptedNodeCount | Number of preempted nodes | None | Count | |
| RebootingNodeCount | Number of rebooting nodes | None | Count | ✔️ |
| ReimagingNodeCount | Number of reimaging nodes | None | Count | |
| RunningNodeCount | Number of running nodes | None | Count | ✔️ |
| StartTaskFailedNodeCount | Number of nodes where the Start Task has failed | None | Count | |
| StartingNodeCount | Number of nodes starting | None | Count | ✔️ |
| TaskCompleteEvent | Total number of tasks that have completed | poolId, jobId | Count | ✔️ |
| TaskFailEvent | Total number of tasks that have completed in a failed state | poolId, jobId | Count | ✔️ |
| TaskStartEvent | Total number of tasks that have started | poolId, jobId | Count | ✔️ |
| TotalNodeCount | Total number of dedicated nodes in the batch account | None | Count | ✔️ |
| UnusableNodeCount | Number of unusable nodes | None | Count | |
| WaitingForStartTaskNodeCount | Number of nodes waiting for the Start Task to complete | None | Count | |
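
As one way to combine these metrics, the sketch below derives a task failure ratio per jobId from TaskCompleteEvent and TaskFailEvent. It assumes, based on the descriptions above, that failed tasks are also counted as completed. The task_failure_ratio helper, the job IDs, and the counts are hypothetical and are not taken from a real Batch account.

```python
from typing import Dict, Tuple


def task_failure_ratio(completed: int, failed: int) -> float:
    """Fraction of completed tasks that ended in a failed state.

    Assumption: TaskCompleteEvent includes the tasks counted by TaskFailEvent.
    """
    if completed < 0 or failed < 0:
        raise ValueError("event counts must be non-negative")
    if completed == 0:
        # No completed tasks in the period, so there is nothing to rate.
        return 0.0
    return failed / completed


# Hypothetical counts per jobId: (TaskCompleteEvent, TaskFailEvent).
samples: Dict[str, Tuple[int, int]] = {
    "job-a": (200, 5),
    "job-b": (0, 0),
}
ratios = {job_id: task_failure_ratio(c, f) for job_id, (c, f) in samples.items()}
print(ratios)  # {'job-a': 0.025, 'job-b': 0.0}
```

Both event metrics are recommended and share the poolId and jobId dimensions, so a ratio like this can be computed per job or per pool without enabling anything extra.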