Skip to technology filters Skip to main content
Dynatrace Hub

Extend the platform,
empower your team.

Popular searches:
Home hero bg
MilvusMilvus
Milvus

Milvus

Gain insights about vector database resource utilization and cache behavior

Technology
Free trialDocumentation
  • Product information

Overview

Vector databases, exemplified by Milvus, play a crucial role as semantic caches within contemporary Large Language Model (LLM) service frameworks.

Semantic caches are instrumental in mitigating latency for familiar and frequently accessed user prompts, concurrently optimizing the overall expenditure associated with cloud-based pre-trained model services.

Vigilant monitoring of cache efficiency and memory utilization is imperative for optimal resource allocation, while the cache's adaptability to dynamic contexts serves as a metric for its ability to accurately respond to evolving conversation dynamics. Furthermore, considerations of cache warm-up times contribute to expediting the availability of cached information. In the realm of vector databases, the performance of queries and indexing speed emerges as pivotal indicators directly influencing the system's efficacy in handling similarity searches.

Key factors such as scalability, accuracy of vector representations, and storage efficiency assume critical roles in proficiently managing expanding datasets. Additionally, the performance metrics related to updates, deletions, and query throughput further impact the overall effectiveness of these systems in delivering real-time and accurate responses in natural language processing and similarity search applications.

Striking an optimal balance across these Key Performance Indicators (KPIs) ensures that both semantic LLM caches and vector databases, like Milvus, achieve peak performance across diverse use cases.

To summarize, the overarching goal of vector databases, exemplified by Milvus, is to address performance-related challenges, enhance operational efficiency, and contribute to a more seamless and responsive experience in various natural language processing applications.

Get started

Setting up full stack observability for your GenAI applications is possible with Traceloop's OpenLLMetry, with OpenTelemetry under the hood, which can seamlessly provide comprehensive end-to-end insights into your production environments.

To set up OpenLLMetry with Dynatrace, see Dynatrace Documentation.

Details

Milvus exposes Prometheus-compatible metrics for monitoring at port 9091 under the path /metrics. A standard Prometheus setup can be used to visualize metrics on various dashboards in your Dynatrace environment.

Milvus metrics are then used to measure request latencies, import speed, time spent on vector vs object storage, memory usage, application usage, and more.

Beside other measurements, Milvus exposes following metrics that allow users to observe the health and performance of their vectorized index.

You can use Dynatrace to display, analyze and alert on many different Milvus telemetry metrics that can be categorized into the following three main areas:

• Milvus Performance Metrics • System Performance Metrics: Metrics relating to CPU/GPU usage, network traffic, and disk read speed. • Hardware Storage Metrics: Metrics relating to data size, data files, and storage capacity.

Milvus Performance Metrics

  • Insert per Second: Number of vectors that are inserted in a second. (Real-time display)
  • Queries per Minute: "Queries Per Minute" (QPM) is a performance metric employed in technical documentation to gauge a system's efficiency in processing search queries within a designated timeframe, typically a minute. This metric is derived by dividing the total number of executed queries by the duration of the observed time period. For example, if a system completes 100 search queries in 5 minutes, the QPM would be calculated as 20 QPM. QPM is instrumental in assessing the responsiveness and effectiveness of systems, particularly in real-time applications where timely query processing is critical. It serves as a key indicator for developers to optimize algorithms and resources, identifying potential bottlenecks and improving overall system performance. Interpretation of QPM is often complemented by considering other metrics like query elapsed time and resource utilization, providing a comprehensive view of a system's efficiency in handling search queries.
  • Query Time per Vector: Average time to query one vector. Divide the query elapsed time by the number of queried vectors.
  • Query Service Level: Query service level = n_queries_completed_within_threshold1 / n_queries. Generally, it is recommended to set 3 time periods - threshold1, threshold2, and threshold3, to track the query service level.
  • Uptime

Milvus System Performance Metrics

-GPU Utilization: GPU utilization ratio (%).

  • GPU Memory Usage: GPU memory (GB) currently consumed by Milvus.
  • CPU Utilization: CPU utilization ratio (%). Divide the time that the server is busy by the total elapsed time.
  • Memory Usage: Memory (GB) currently consumed by Milvus.
  • Cache Utilization: Cache utilization ratio (%).

-Network IO: Network IO read/write speed (GB/s).

  • Disk Read Speed: Disk read speed (GB/s).
  • Disk Write Speed: Disk write speed (GB/s).

Milvus Hardware storage metrics

-Data Size: Total amount (GB) of data stored in Milvus. Total File: Number of data files currently stored in Milvus.

Dynatrace
Documentation
By Dynatrace
Dynatrace support center
Copy to clipboard
Dynatrace Hub
Hub HomeGet data into DynatraceBuild your own app
Log Management and AnalyticsKubernetesAI and LLM ObservabilityInfrastructure ObservabilitySoftware DeliveryApplication ObservabilityApplication SecurityDigital ExperienceBusiness Observability
Filter
Type
Built and maintained by
Deployment model
SaaS
  • SaaS
  • Managed
Partner FinderBecome a partnerDynatrace Developer

All

769 Results filtered by:

Chroma logo

Chroma

Gain insights into the health of your vector and embedding databases from Chroma

Technology
Amazon Kinesis Video Streams logo

Amazon Kinesis Video Streams

Provides you SDKs to install on your devices to make it easy to stream media.

Technology
Fluentd logo

Fluentd

Stream log data to Dynatrace via Fluentd for analysis.

Technology
Palo Alto firewalls logo

Palo Alto firewalls

Palo Alto extension for problems detection

Extension
AWS Elastic Beanstalk logo

AWS Elastic Beanstalk

Quickly deploy and manage applications in the AWS Cloud without having to learn about the infrastructure beneath.

Technology
Workflows logo

Workflows

Automate tasks in your IT landscape, remediate problems, and visualize processes

App
Security Posture Management logo

Security Posture Management

Detect, prioritize, and remediate security and compliance findings with SPM.

App
Amazon Route 53 logo

Amazon Route 53

Effectively connects user requests to infrastructure running in AWS.

Technology
Amazon Inspector logo

Amazon Inspector

Tests the network accessibility of your EC2 instances and the security state of your applications that run on those instances.

Technology
NGINX Plus logo

NGINX Plus

All‑in‑one web server, load balancer, content cache, and media streaming.

Technology
Databases logo

Databases

Easily monitor, troubleshoot, and optimize your entire database fleets.

App
Amazon DocumentDB logo

Amazon DocumentDB

Amazon DocumentDB (with MongoDB compatibility) is a fast, reliable, and fully managed database service.

Technology
Dynatrace MCP Server logo

Dynatrace MCP Server

Fuel your AI agents with high quality data and real-time production insights

Technology
IBM IMS SOAP Gateway logo

IBM IMS SOAP Gateway

Intelligently monitor your transactions end-to-end and analyze the performance of your IMS SOAP Gateway.

Technology
Amazon Keyspaces for Apache Cassandra logo

Amazon Keyspaces for Apache Cassandra

Scalable, highly available, and managed Apache Cassandra–compatible database service.

Technology
Apache Camel logo

Apache Camel

Intelligently monitor, analyze, and optimize your integration framework and all applications deployed in your stack.

Technology
Amazon Elastic Kubernetes Service (EKS) logo

Amazon Elastic Kubernetes Service (EKS)

Harness automation and AI to simplify Kubernetes observability at scale.

Technology
WordPress logo

WordPress

Content management system written in PHP and paired with a MySQL or MariaDB database.

Technology
Azure Cosmos DB logo

Azure Cosmos DB

Fully managed and serverless distributed database supporting open-source PostgreSQL, MongoDB, and Apache Cassandra.

Technology
AWS Glue logo

AWS Glue

Fully managed extract, transform, and load (ETL) service.

Technology
Azure Event Hub Clusters logo

Azure Event Hub Clusters

Big data streaming platform and event ingestion service. It can receive and process millions of events per second.

Technology
Confluent Cloud (Kafka) logo

Confluent Cloud (Kafka)

Remotely monitor your Confluent Cloud Kafka Clusters and other resources!

Extension
Kong - Prometheus logo

Kong - Prometheus

Monitor Prometheus metrics exposed by Kong and proxied upstream services

Extension
BOSH bpm logo

BOSH bpm

Automated monitoring of platform processes running in BOSH bpm containers.

Technology
Prometheus in Kubernetes logo

Prometheus in Kubernetes

Collect metrics from Prometheus exporters in Kubernetes for Dynatrace analytics

Technology
Nutanix Clusters logo

Nutanix Clusters

Monitor Nutanix clusters' performance, usage and availability, with Nutanix API.

Extension
Azure Service-Fabric logo

Azure Service-Fabric

Intelligent end-to-end observability for serverless and hybrid environments using Azure Service Fabric.

Technology
Luna Network HSM Device logo

Luna Network HSM Device

Monitor your Luna Network Hardware Security Module (HSM) Devices through SNMP.

Extension
Consul Service Mesh (StatsD) logo

Consul Service Mesh (StatsD)

Extend visibility into your Consul Service Mesh instances to monitor health and improve performance.

Extension
Azure Relays logo

Azure Relays

Enables you to securely expose services that run in your corporate network to the public cloud.

Technology
Microsoft IIS logo

Microsoft IIS

Flexible and secure web server for hosting with Windows Server.

Extension
Kubernetes Monitoring Statistics logo

Kubernetes Monitoring Statistics

Troubleshoot your Dynatrace Kubernetes monitoring and Prometheus integration.

Extension
Google Cloud logo

Google Cloud

Meet your business challenges head on with cloud computing services from Google.

Technology
Snyk logo

Snyk

Ingest Snyk vulnerability findings, scans, and audit logs.

Extension
Amazon EventBridge logo

Amazon EventBridge

Serverless event bus to connect applications together using data from your own applications, integrated SaaS applications, and AWS services.

Technology
Ruby logo

Ruby

Dynatrace monitors your Ruby applications and services on the process level.

Technology
Apache OpenEJB logo

Apache OpenEJB

Automatically and intelligently monitor, analyze and optimize your applications based on Apache OpenEJB.

Technology
Log ingestion API logo

Log ingestion API

Stream your logs and log records to Dynatrace via the log ingest API

Technology
Citrix DaaS & Virtual Apps and Desktops logo

Citrix DaaS & Virtual Apps and Desktops

Gain insight into your Citrix DaaS & Virtual Apps and Desktops environments

Extension
Azure Device Provisioning Service logo

Azure Device Provisioning Service

Helper service for IoT Hub that enables just-in-time provisioning to the right IoT hub without human intervention.

Technology
Milvus logo

Milvus

Gain insights about vector database resource utilization and cache behavior

Technology
Google Memorystore logo

Google Memorystore

Get insights into Google Memorystore service metrics collected from the Google Operations API to ensure health of your cloud infrastructure.

Extension
Azul Platform Core (Zulu) logo

Azul Platform Core (Zulu)

Automatically and intelligently monitor, analyze, and optimize the performance of your virtual machine.

Technology
VirusTotal logo

VirusTotal

Enrich observables with threat intelligence from VirusTotal.

App
AWS Outposts logo

AWS Outposts

Fully managed service extending AWS infrastructure, services, APIs and more.

Technology
Databricks Workspace logo

Databricks Workspace

Remotely monitor your Databricks Workspaces!

Extension
LlamaIndex logo

LlamaIndex

Monitor your LLM-powered agents and workflows built with LlamaIndex framework.

Technology
UPS Device logo

UPS Device

Monitor your Uninterruptible Power Supplies (UPS) over SNMP

Extension
TensorFlow Keras logo

TensorFlow Keras

Observe the training progress of TensorFlow Keras AI models

Technology
Amazon Elastic Container Service (ECS) logo

Amazon Elastic Container Service (ECS)

Fully managed container orchestration service.

Technology