Azure Virtual Machines

Metrics, Dashboards, Alerts and more for Azure Virtual Machines Integration in Sysdig Monitor.
Azure Virtual Machines

This integration can be enabled via the Azure Cloud Metrics.

This integration has 24 metrics.

List of Alerts

AlertDescriptionFormat
[Azure Virtual Machines] VM Not AvailableVM Not Available.Prometheus
[Azure Virtual Machines] CPU Usage is HighThe CPU usage of the VM is high.Prometheus
[Azure Virtual Machines] Low Available MemoryThe Available Memory of the VM is low.Prometheus
[Azure Virtual Machines] OS Disk Queue Is HighThe OS Disk queue depth is high. It doesn’t include Data Disks.Prometheus
[Azure Virtual Machines] Data Disk Queue Is HighThe Data Disk queue depth is high. It doesn’t include OS Disks.Prometheus
[Azure Virtual Machines] Premium OS Disk Read Cache Miss Is HighThe Premium OS Disk read cache miss is high. It doesn’t include Premium Data Disks.Prometheus
[Azure Virtual Machines] Premium Data Disk Read Cache Miss Is HighThe Premium Data Disk read cache miss is high. It doesn’t include Premium OS Disks.Prometheus
[Azure Virtual Machines] VM Cached IOPS Consumed Are HighThe VM cached IOPS consumed are high.Prometheus
[Azure Virtual Machines] VM Uncached IOPS Consumed Are HighThe VM uncached IOPS consumed are high.Prometheus
[Azure Virtual Machines] Low CPU Credits RemainingThere are not many CPU credits remaining. Only available on B-series burstable VMs.Prometheus
[Azure Virtual Machines] High OS Disk Used Burst Bandwidth CreditsThe percentage of OS Disk used burst bandwidth (bytes per second) credits is near 100%. It doesn’t include Data Disks or non-burstable OS Disks.Prometheus
[Azure Virtual Machines] High OS Disk Used Burst IO CreditsThe percentage of OS Disk used burst IO credits is near 100%. It doesn’t include Data Disks or non-burstable OS Disks.Prometheus
[Azure Virtual Machines] High Data Disk Used Burst Bandwidth CreditsThe percentage of Data Disk used burst bandwidth (bytes per second) credits is near 100%. It doesn’t include OS Disks or non-burstable Data Disks.Prometheus
[Azure Virtual Machines] High Data Disk Used Burst IO CreditsThe percentage of Data Disk used burst IO credits is near 100%. It doesn’t include OS Disks or non-burstable Data Disks.Prometheus

List of Dashboards

Azure Virtual Machines

The dashboard provides information on the Azure Virtual Machines integration. Azure Virtual Machines

List of Metrics

Metric name
azure_compute_virtualmachines_available_memory_bytes
azure_compute_virtualmachines_cpu_credits_remaining
azure_compute_virtualmachines_data_disk_queue_depth
azure_compute_virtualmachines_data_disk_read_bytes_sec
azure_compute_virtualmachines_data_disk_read_operations_sec
azure_compute_virtualmachines_data_disk_used_burst_bps_credits_percentage
azure_compute_virtualmachines_data_disk_used_burst_io_credits_percentage
azure_compute_virtualmachines_data_disk_write_bytes_sec
azure_compute_virtualmachines_data_disk_write_operations_sec
azure_compute_virtualmachines_network_in_total
azure_compute_virtualmachines_network_out_total
azure_compute_virtualmachines_os_disk_queue_depth
azure_compute_virtualmachines_os_disk_read_bytes_sec
azure_compute_virtualmachines_os_disk_read_operations_sec
azure_compute_virtualmachines_os_disk_used_burst_bps_credits_percentage
azure_compute_virtualmachines_os_disk_used_burst_io_credits_percentage
azure_compute_virtualmachines_os_disk_write_bytes_sec
azure_compute_virtualmachines_os_disk_write_operations_sec
azure_compute_virtualmachines_percentage_cpu
azure_compute_virtualmachines_premium_data_disk_cache_read_miss
azure_compute_virtualmachines_premium_os_disk_cache_read_miss
azure_compute_virtualmachines_vm_availability_metric_avg
azure_compute_virtualmachines_vm_cached_iops_consumed_percentage
azure_compute_virtualmachines_vm_uncached_iops_consumed_percentage

Monitoring and Troubleshooting Azure Virtual Machines

This document describes important metrics and queries that you can use to monitor and troubleshoot Azure Virtual Machines.

The metrics covered in this document are applicable only to Azure Virtual Machines. Sysdig offers a different integration for Azure Virtual Machine Scale Sets.

Resources

VM Status and Resources

Use the following query to check if the VMs are available. This metric is currently available to only a small set of users.

azure_compute_virtualmachines_vm_availability_metric_avg != 1

A return value other than 1 indicates a problem.

Percentage CPU Used

Use the following query to get the percentage of CPU used by the VMs:

azure_compute_virtualmachines_percentage_cpu

Available Memory

Use the following query to get the available memory of the VMs (in bytes):

azure_compute_virtualmachines_available_memory_bytes

Unfortunately, no metric is available for used memory or maximum memory. Therefore, you cannot calculate the percentage of available memory.

Having 100MB left in a VM with 500MB max and having 100MB left in a VM with 2000MB max are totally different memory calculations.

Network

Network Bytes In

The following query returns the number of bytes received on all the network interfaces by the Virtual Machine (the incoming traffic):

azure_compute_virtualmachines_network_in_total

Network Bytes Out

The following query returns the number of bytes out on all the network interfaces by the Virtual Machine (the outgoing traffic):

azure_compute_virtualmachines_network_out_total

Disks

The Virtual Machines have an ‘OS’ disk, but they can also have attached Data Disks. The Data Disks have a Logical Unit Number (LUN) which specifies the slot in which the data drive appears when mounted. The valid LUN values are 0 through 15. The ’lun’ label available in the data disk metrics is useful to differentiate which metric comes from which data disk.

OS Disk Byte Rate

Use the following query to get the amount of read and write bytes per second in the OS disk:

azure_compute_virtualmachines_os_disk_read_bytes_sec

azure_compute_virtualmachines_os_disk_write_bytes_sec

OS Disk IOPS

Use the following query to get the amount of read and write bytes per second for IOPS:

azure_compute_virtualmachines_os_disk_read_operations_sec

azure_compute_virtualmachines_os_disk_write_operations_sec

OS Disk Queue

Use the following query to get the amount of read and write bytes per second for queue depth:

azure_compute_virtualmachines_os_disk_queue_depth

Data Disk Byte Rate

In a similar fashion, use the following query to get the read and write bytes per second in the data disk:

azure_compute_virtualmachines_data_disk_read_bytes_sec

azure_compute_virtualmachines_data_disk_write_bytes_sec

Data Disk IOPS

se the following query to get the read and write bytes per second in the data disk for IOPS:

azure_compute_virtualmachines_data_disk_read_operations_sec

azure_compute_virtualmachines_data_disk_write_operations_sec

Data Disk Queue

se the following query to get the read and write bytes per second for queue depth:

azure_compute_virtualmachines_data_disk_queue_depth

Premium Disks Cache

Premium disks (both OS and data disks) have metrics for their read cache miss and hit, which return a percentage between 0 and 100. Ideally the hit percentage should be 100%, and the miss percentage should be 0%.

Use the following query to get the values in OS disk:

azure_compute_virtualmachines_premium_os_disk_cache_read_miss

azure_compute_virtualmachines_premium_os_disk_cache_read_hit

Use the following to get the values in the data disks:

azure_compute_virtualmachines_premium_data_disk_cache_read_miss

azure_compute_virtualmachines_premium_data_disk_cache_read_hit

VM Cached and Uncached IOPS Consumed

Use the following query to get the cached and uncached VM IOPS consumed:

azure_compute_virtualmachines_vm_cached_iops_consumed_percentage

azure_compute_virtualmachines_vm_uncached_iops_consumed_percentage

A value near 100% indicates a bottleneck. This occurs when the VM machine IOPS reaches its limit. In that case, the bottleneck issue can be fixed by increasing the size of the VM. Before fixing it you should check the IOPS of the attached disks. If their IOPS are also at 100%, the bottleneck is in the disks.

Credits

CPU Credits

The following metric, only available on B-series burstable VMs, returns the remaining CPU credits available to burst:

azure_compute_virtualmachines_cpu_credits_remaining

Disk Credits

OS Disk Used Burst Bandwidth and IO Credits

Use the following query to get the percentage of used burst bps (bytes per second) credits for the OS disk:

azure_compute_virtualmachines_os_disk_used_burst_bps_credits_percentage

Use the following query to get the percentage of used burst IO credits for the OS disk:

azure_compute_virtualmachines_os_disk_used_burst_io_credits_percentage

Data Disk Used Burst Bandwidth and IO Credits

Use the following query to get the percentage of used burst bps (bytes per second) credits for the data disks:

azure_compute_virtualmachines_data_disk_used_burst_bps_credits_percentage

Use the following query to get the percentage of used burst IO credits in the data disks:

azure_compute_virtualmachines_data_disk_used_burst_io_credits_percentage