Get insights into Google AI Platform service metrics collected from the Google Operations API to help ensure the health of your cloud infrastructure.
This Dynatrace extension uses data collected from the Google Operations API to continuously monitor the health and performance of Google AI Platform services. It combines all relevant data into pre-configured dashboards and provides alerting and event tracking.
This is intended for users who want to:
This enables you to:
View and analyze 18 metrics that are specific to Google AI Platform, like Accelerator memory utilization, Accelerator utilization, CPU utilization, and more.
Build custom dashboards for your cloud infrastructure.
Analyze Google AI Platform logs.
Set custom alerts that trigger remediation workflows.
Google AI Platform metric and log ingestion requires advanced GCP integration.
Compatibility requirements
This extension package contains:
To provide correlation and causation analysis, all ingested metrics and logs are analyzed by the Dynatrace Davis AI engine. This analysis consumes DDUs.
To add this extension to your environment:
Following GCP integration and Google AI Platform configuration:
Below is a complete list of the feature sets provided in this version. To ensure a good fit for your needs, individual feature sets can be activated and deactivated by your administrator during configuration.
Metric name | Metric key | Description | Unit |
---|---|---|---|
Accelerator memory utilization | cloud.gcp.ml_googleapis_com.training.accelerator.memory.utilization | - | Percent |
Accelerator utilization | cloud.gcp.ml_googleapis_com.training.accelerator.utilization | - | Percent |
CPU utilization | cloud.gcp.ml_googleapis_com.training.cpu.utilization | - | Percent |
Memory utilization | cloud.gcp.ml_googleapis_com.training.memory.utilization | - | Percent |
Network bytes received | cloud.gcp.ml_googleapis_com.training.network.received_bytes_count | - | Byte |
Network bytes sent | cloud.gcp.ml_googleapis_com.training.network.sent_bytes_count | - | Byte |
Error count | cloud.gcp.ml_googleapis_com.prediction.error_count | - | Count |
Latency | cloud.gcp.ml_googleapis_com.prediction.latencies | - | MicroSecond |
Accelerator duty cycle | cloud.gcp.ml_googleapis_com.prediction.online.accelerator.duty_cycle | - | Percent |
Accelerator memory usage | cloud.gcp.ml_googleapis_com.prediction.online.accelerator.memory.bytes_used | - | Byte |
CPU usage | cloud.gcp.ml_googleapis_com.prediction.online.cpu.utilization | - | Percent |
Memory usage | cloud.gcp.ml_googleapis_com.prediction.online.memory.bytes_used | - | Byte |
Network bytes received | cloud.gcp.ml_googleapis_com.prediction.online.network.bytes_received.count | - | Byte |
Network bytes sent | cloud.gcp.ml_googleapis_com.prediction.online.network.bytes_sent.count | - | Byte |
Replica count | cloud.gcp.ml_googleapis_com.prediction.online.replicas | - | Count |
Replica target | cloud.gcp.ml_googleapis_com.prediction.online.target_replicas | - | Count |
Prediction count | cloud.gcp.ml_googleapis_com.prediction.prediction_count | - | Count |
Response count | cloud.gcp.ml_googleapis_com.prediction.response_count | - | Count |
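
As an illustration of how the metric keys above might be used once ingestion is running, the following Python sketch builds a request URL for the Dynatrace Metrics API v2 query endpoint (`/api/v2/metrics/query`). The environment URL and token are placeholders, the helper names are hypothetical, and the timeframe and resolution are arbitrary choices; the sketch only constructs the request and does not send it.

```python
from urllib.parse import urlencode


def build_metric_query_url(environment_url, metric_key,
                           time_from="now-2h", resolution="5m"):
    """Return a Metrics API v2 query URL for a single metric key.

    environment_url is a placeholder for your own Dynatrace environment;
    time_from and resolution are illustrative defaults, not recommendations.
    """
    params = {
        "metricSelector": metric_key,
        "from": time_from,
        "resolution": resolution,
    }
    base = environment_url.rstrip("/")
    return f"{base}/api/v2/metrics/query?{urlencode(params)}"


def auth_headers(api_token):
    """Build the Authorization header expected for API token access."""
    return {"Authorization": f"Api-Token {api_token}"}


def microseconds_to_milliseconds(value_us):
    """Convert a latency value reported in microseconds to milliseconds."""
    return value_us / 1000.0


# Hypothetical environment URL; replace with your own.
url = build_metric_query_url(
    "https://example.live.dynatrace.com",
    "cloud.gcp.ml_googleapis_com.prediction.latencies",
)
print(url)
```

Because the Latency metric is listed in microseconds, a helper such as `microseconds_to_milliseconds` can make returned values easier to read on custom dashboards or in reports.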
Support for GCP overview
No release notes