Accelerator duty cycle | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.accelerator.duty_cycle | - | Percent |
Accelerator memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.accelerator.memory.bytes_used | - | Byte |
CPU utilization | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.cpu.utilization | - | Percent |
Memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.memory.bytes_used | - | Byte |
Network bytes received | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.network.received_bytes_count | - | Byte |
Network bytes sent | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.network.sent_bytes_count | - | Byte |
Replica count | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.replicas | - | Count |
Replica target | cloud.gcp.aiplatform_googleapis_com.prediction.online.deployment_resource_pool.target_replicas | - | Count |
Accelerator duty cycle | cloud.gcp.aiplatform_googleapis_com.prediction.online.accelerator.duty_cycle | - | Percent |
Accelerator memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.accelerator.memory.bytes_used | - | Byte |
CPU utilization | cloud.gcp.aiplatform_googleapis_com.prediction.online.cpu.utilization | - | Percent |
Number of online prediction errors | cloud.gcp.aiplatform_googleapis_com.prediction.online.error_count | - | Count |
Memory usage | cloud.gcp.aiplatform_googleapis_com.prediction.online.memory.bytes_used | - | Byte |
Network bytes received | cloud.gcp.aiplatform_googleapis_com.prediction.online.network.received_bytes_count | - | Byte |
Network bytes sent | cloud.gcp.aiplatform_googleapis_com.prediction.online.network.sent_bytes_count | - | Byte |
Number of online predictions | cloud.gcp.aiplatform_googleapis_com.prediction.online.prediction_count | - | Count |
Prediction latencies | cloud.gcp.aiplatform_googleapis_com.prediction.online.prediction_latencies | - | MilliSecond |
Private endpoint prediction latencies | cloud.gcp.aiplatform_googleapis_com.prediction.online.private.prediction_latencies | - | MilliSecond |
Private endpoint response count | cloud.gcp.aiplatform_googleapis_com.prediction.online.private.response_count | - | Count |
Replica count | cloud.gcp.aiplatform_googleapis_com.prediction.online.replicas | - | Count |
Response count | cloud.gcp.aiplatform_googleapis_com.prediction.online.response_count | - | Count |
Replica target | cloud.gcp.aiplatform_googleapis_com.prediction.online.target_replicas | - | Count |
Executing PipelineJobs | cloud.gcp.aiplatform_googleapis_com.executing_vertexai_pipeline_jobs | - | Count |
Executing PipelineTasks | cloud.gcp.aiplatform_googleapis_com.executing_vertexai_pipeline_tasks | - | Count |
Generate content requests per minute per project per base model | cloud.gcp.aiplatform_googleapis_com.generate_content_requests_per_minute_per_project_per_base_model | - | Count |
Online prediction dedicated requests per base model version | cloud.gcp.aiplatform_googleapis_com.online_prediction_dedicated_requests_per_base_model_version | - | Count |
Online prediction dedicated tokens per minute per base model version | cloud.gcp.aiplatform_googleapis_com.online_prediction_dedicated_tokens_per_base_model_version | - | Count |
Online prediction requests per base model | cloud.gcp.aiplatform_googleapis_com.online_prediction_requests_per_base_model | - | Count |
Online prediction tokens per minute per base model | cloud.gcp.aiplatform_googleapis_com.online_prediction_tokens_per_minute_per_base_model | - | Count |
Generate content requests per minute per project per base model quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.generate_content_requests_per_minute_per_project_per_base_model.exceeded | - | Count |
Generate content requests per minute per project per base model quota limit | cloud.gcp.aiplatform_googleapis_com.quota.generate_content_requests_per_minute_per_project_per_base_model.limit | - | Count |
Generate content requests per minute per project per base model quota usage | cloud.gcp.aiplatform_googleapis_com.quota.generate_content_requests_per_minute_per_project_per_base_model.usage | - | Count |
Online prediction dedicated requests per base model version quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_requests_per_base_model_version.exceeded | - | Count |
Online prediction dedicated requests per base model version quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_requests_per_base_model_version.limit | - | Count |
Online prediction dedicated requests per base model version quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_requests_per_base_model_version.usage | - | Count |
Online prediction dedicated tokens per minute per base model version quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_tokens_per_base_model_version.exceeded | - | Count |
Online prediction dedicated tokens per minute per base model version quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_tokens_per_base_model_version.limit | - | Count |
Online prediction dedicated tokens per minute per base model version quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_dedicated_tokens_per_base_model_version.usage | - | Count |
Online prediction requests per base model quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_requests_per_base_model.exceeded | - | Count |
Online prediction requests per base model quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_requests_per_base_model.limit | - | Count |
Online prediction requests per base model quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_requests_per_base_model.usage | - | Count |
Online prediction tokens per minute per base model quota exceeded error | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_tokens_per_minute_per_base_model.exceeded | - | Count |
Online prediction tokens per minute per base model quota limit | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_tokens_per_minute_per_base_model.limit | - | Count |
Online prediction tokens per minute per base model quota usage | cloud.gcp.aiplatform_googleapis_com.quota.online_prediction_tokens_per_minute_per_base_model.usage | - | Count |
Character count | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.character_count | - | Count |
Characters | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.characters | - | Count |
Character Throughput | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.consumed_throughput.count | - | Count |
First token latencies | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.first_token_latencies | - | MilliSecond |
Model invocation count | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.model_invocation_count | - | Count |
Model invocation latencies | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.model_invocation_latencies | - | MilliSecond |
Token count | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.token_count | - | Count |
Tokens | cloud.gcp.aiplatform_googleapis_com.publisher.online_serving.tokens | - | Count |