| Accelerator duty cycle | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.accelerator.duty_cycle | - | Percent |
| Accelerator memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.accelerator.memory.bytes_used | - | Byte |
| CPU utilization | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.cpu.utilization | - | Percent |
| Memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.memory.bytes_used | - | Byte |
| Network bytes received | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.network.received_bytes_count | - | Byte |
| Network bytes sent | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.network.sent_bytes_count | - | Byte |
| Replica count | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.replicas | - | Count |
| Replica target | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.target_replicas | - | Count |
| Accelerator duty cycle | cloud.gcp.aiplatform_googleapis_com.prediction.online.accelerator.duty_cycle | - | Percent |
| Accelerator memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.accelerator.memory.bytes_used | - | Byte |
| CPU utilization | cloud.gcp.aiplatform_googleapis_com.prediction.online.cpu.utilization | - | Percent |
| Number of online prediction errors | cloud.gcp.aiplatform_googleapis_com.prediction.online.error_count | - | Count |
| Memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.memory.bytes_used | - | Byte |
| Network bytes received | cloud.gcp.aiplatform_googleapis_com.prediction.online.network.received_bytes_count | - | Byte |
| Network bytes sent | cloud.gcp.aiplatform_googleapis_com.prediction.online.network.sent_bytes_count | - | Byte |
| Number of online predictions | cloud.gcp.aiplatform_googleapis_com.prediction.online.prediction_count | - | Count |
| Prediction latencies | cloud.gcp.aiplatform_googleapis_com.prediction.online.prediction_latencies | - | MilliSecond |
| Private endpoint prediction latencies | cloud.gcp.aiplatform_googleapis_com.prediction.online.private.prediction_latencies | - | MilliSecond |
| Private endpoint response count | cloud.gcp.aiplatform_googleapis_com.prediction.online.private.response_count | - | Count |
| Replica count | cloud.gcp.aiplatform_googleapis_com.prediction.online.replicas | - | Count |
| Response count | cloud.gcp.aiplatform_googleapis_com.prediction.online.response_count | - | Count |
| Replica target | cloud.gcp.aiplatform_googleapis_com.prediction.online.target_replicas | - | Count |
| Executing PipelineJobs | cloud.gcp.aiplatform_googleapis_com.executing_vertexai_pipeline_jobs | - | Count |
| Executing PipelineTasks | cloud.gcp.aiplatform_googleapis_com.executing_vertexai_pipeline_tasks | - | Count |
| Generate content requests per minute per project per base model | cloud.gcp.aiplatform_googleapis_com.generate_content_requests_per_minute_per_project_per_base_model | - | Count |
| Online prediction dedicated requests per base model version | cloud.gcp.aiplatform_googleapis_com.online_prediction_dedicated_requests_per_base_model_version | - | Count |
| Online prediction dedicated tokens per minute per base model version | cloud.gcp.aiplatform_googleapis_com.online_prediction_dedicated_tokens_per_base_model_version | - | Count |
| Online prediction requests per base model | cloud.gcp.aiplatform_googleapis_com.online_prediction_requests_per_base_model | - | Count |
| Online prediction tokens per minute per base model | cloud.gcp.aiplatform_googleapis_com.online_prediction_tokens_per_minute_per_base_model | - | Count |
| Generate content requests per minute per project per base model quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.generate_content_requests_per_minute_per_project_per_base_model.exceeded | - | Count |
| Generate content requests per minute per project per base model quota limit | cloud.gcp.aiplatform_googleapis_com.quota.generate_content_requests_per_minute_per_project_per_base_model.limit | - | Count |
| Generate content requests per minute per project per base model quota usage | cloud.gcp.aiplatform_googleapis_com.quota.generate_content_requests_per_minute_per_project_per_base_model.usage | - | Count |
| Online prediction dedicated requests per base model version quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_requests_per_base_model_version.exceeded | - | Count |
| Online prediction dedicated requests per base model version quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_requests_per_base_model_version.limit | - | Count |
| Online prediction dedicated requests per base model version quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_requests_per_base_model_version.usage | - | Count |
| Online prediction dedicated tokens per minute per base model version quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_tokens_per_base_model_version.exceeded | - | Count |
| Online prediction dedicated tokens per minute per base model version quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_tokens_per_base_model_version.limit | - | Count |
| Online prediction dedicated tokens per minute per base model version quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_tokens_per_base_model_version.usage | - | Count |
| Online prediction requests per base model quota exceeded | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_requests_per_base_model.exceeded | - | Count |
| Online prediction requests per base model quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_requests_per_base_model.limit | - | Count |
| Online prediction requests per base model quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_requests_per_base_model.usage | - | Count |
| Online prediction tokens per minute per base model quota exceeded | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_tokens_per_minute_per_base_model.exceeded | - | Count |
| Online prediction tokens per minute per base model quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_tokens_per_minute_per_base_model.limit | - | Count |
| Online prediction tokens per minute per base model quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_tokens_per_minute_per_base_model.usage | - | Count |
| Character count | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.character_count | - | Count |
| Characters | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.characters | - | Count |
| Character Throughput | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.consumed_throughput.count | - | Count |
| First token latencies | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.first_token_latencies | - | MilliSecond |
| Model invocation count | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.model_invocation_count | - | Count |
| Model invocation latencies | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.model_invocation_latencies | - | MilliSecond |
| Token count | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.token_count | - | Count |
| Tokens | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.tokens | - | Count |