• 1:
• 2:
• 2.1:
• 2.2:
• 2.2.1:
• 2.2.2:
• 2.3:
• 2.3.1:
• 2.3.2:
• 2.4:
• 2.5:
• 2.5.1:
• 2.6:
• 2.7:
• 2.8:
• 2.8.1:
• 2.9:
• 2.9.1:
• 2.10:
• 2.10.1:
• 2.11:
• 2.11.1:
• 2.12:
• 2.12.1:
• 2.13:
• 2.13.1:
• 2.14:
• 2.14.1:
• 2.14.1.1:
• 2.14.2:
• 2.14.2.1:
• 2.14.3:
• 2.14.3.1:
• 2.15:
• 2.15.1:
• 2.16:
• 2.17:
• 2.17.1:
• 2.17.1.1:
• 2.17.2:
• 2.17.2.1:
• 2.18:
• 2.18.1:
• 2.19:
• 2.19.1:
• 2.20:
• 2.20.1:
• 2.21:
• 2.21.1:
• 2.22:
• 2.22.1:
• 2.23:
• 2.23.1:
• 2.24:
• 2.24.1:
• 2.25:
• 2.25.1:
• 3:
• 3.1:
• 3.2:
• 4:
• 5:
• 5.1:
• 5.1.1:
• 5.1.2:
• 5.1.3:
• 5.1.4:
• 5.1.5:
• 5.1.6:
• 5.1.7:
• 5.1.8:
• 6:
• 7:
• 8:
• 9:
• 10:
• 10.1:
• 10.2:
• 11:
• 12:
• 13:
• 14:
• 15:
• 16:

# Metrics in Sysdig Legacy Format

The Sysdig legacy metrics dictionary lists the default legacy metrics supported by the Sysdig product suite, as well as kube state and cloud provider metrics.

The metrics listed in this section follows the statsd-compatible Sysdig naming convention. To see a mapping between Prometheus notation and Sysdig notation, see Metrics and Label Mapping.

## Overview

Each metric in the dictionary has several pieces of metadata listed to provide greater context for how the metric can be used within Sysdig products. An example layout is displayed below:

## Metric Name

Metric definition. For some metrics, the equation for how the value is determined is provided.

Definition

Metric Type

Metric type determines whether the metric value is a counter metric or a gauge metric. Sysdig Monitor offers two Metric types:

Counter: The metric whose value keeps on increasing and is reliant on previous values. It helps you record how many times something has happened, for example, a user login.

Gauge: Represents a single numerical value that can arbitrarily fluctuate over time. Each value returns an instantaneous measurement, for example, CPU usage.

Value Type

The type of value the metric can have. The possible values are:

• Percent (%)

• Byte

• Date

• Double

• Integer (int)

• relativeTime

• String

Segment By

The levels within the infrastructure that the metric can be segmented at:

• Host

• Container

• Process

• Kubernetes

• Mesos

• Swarm

• CloudProvider

Default Time Aggregation

The default time aggregation format for the metric.

Available Time Aggregation Formats

The time aggregation formats the metric can be aggregated by:

• Average (Avg)

• Rate

• Sum

• Minimum (Min)

• Maximum (Max)

Default Group Aggregation

The default group aggregation format for the metric.

Available Group Aggregation Formats

The group aggregation formats the metric can be aggregated by:

• Average (Avg)

• Sum

• Minimum (Min)

• Maximum (Max)

# Agent

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## dragent.analyzer

`dragent` is the main process in the agent that collects and collates data from multiple sources, including syscall events from the kernel in order to generate metrics. The `analyzer` module that runs in the `dragent` process does much of the work involved in generating metrics. These internal metrics are used to troubleshoot the health of the analyzer component.

Sysdig Monitor provides the following analyzer metrics:

MetricsTypeMinimum Agent VersionDescription
`dragent.analyzer.processes`gauge0.80.0 or aboveThe number of processes found by the analyzer.
`dragent.analyzer.threads`The number of threads found by the analyzer.
`dragent.analyzer.threads.dropped`counterThe number of threads not reported due to thread limits.
`dragent.analyzer.containers`gaugeThe number of containers found by the analyzer.
`dragent.analyzer.javaprocs`The number of java processes found by the analyzer.
`dragent.analyzer.appchecks`The number of application checks reporting to the analyzer.
`dragent.analyzer.mesos.autodetect`If the agent is configured to autodetect a Mesos environment, value is 1, otherwise is 0.
`dragent.analyzer.mesos.detected`If the agent actually found a Mesos environment, value is 1, otherwise, value is 0
`dragent.analyzer.fp.pct100`The analyzer flush CPU % (0-100)
`dragent.analyzer.fl.ms`The analyzer flush duration (milliseconds)
`dragent.analyzer.sr`The current sampling ratio (1=all events, 2= half of events analyzed, 4=one fourth of events analyzed, and so on.
`dragent.analyzer.n_evts`The number of events processed
`dragent.analyzer.n_drops`The number of events dropped
`dragent.analyzer.n_drops_buffer`The number of events dropped due to the buffer being full.
`dragent.analyzer.n_preemptions`The number of driver preemptions.
`dragent.analyzer.n_command_lines`The number of command lines collected and sent to the collector.
`dragent.analyzer.command_line_cats.n_none`
`dragent.analyzer.n_container_healthcheck_command_lines`0.80.1 or aboveThe number of command lines identified as container health checks. This metric does not change even if healthcheck command lines are not sent to the collector.

# Applications

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

The metrics in this section are collected from either default or customized agent configurations for integrated applications. See also: Integrate Applications (Default App Checks).

# Apache Metrics

## apache.conns_async_closing

The number of asynchronous closing connections.

## apache.conns_async_keep_alive

The number of asynchronous keep-alive connections.

## apache.conns_async_writing

The number of asynchronous write connections.

## apache.conns_total

The total number of connections handled.

## apache.net.bytes

The total number of bytes served.

## apache.net.bytes_per_s

The number of bytes served per second.

## apache.net.hits

The total number of requests performed.

## apache.net.request_per_s

The number of requests performed per second.

## apache.performance.busy_workers

The number of workers currently serving requests.

The percentage of CPU used.

## apache.performance.idle_workers

The number of idle workers in the instance.

## apache.performance.uptime

The amount of time the server has been running in seconds.

# Apache Kafka Consumer Metrics

## kafka.broker_offset

The current message offset value on the broker.

## kafka.consumer_lag

The lag in messages between the consumer and the broker.

## kafka.consumer_offset

The current message offset value on the consumer.

# Apache Kafka JMX Metrics

The `kafka.consumer.*` and `kafka.producer.*` metrics are only available with JMX customization as documented in Integrate JMX Metrics from Java Virtual Machines.

## kafka.consumer.bytes_consumed

The average number of bytes consumed for a specific topic per second.

## kafka.consumer.bytes_in

The rate of bytes coming in to the consumer.

## kafka.consumer.delayed_requests

The number of delayed consumer requests.

## kafka.consumer.expires_per_second

The rate of delayed consumer request expiration.

## kafka.consumer.fetch_rate

The minimum rate at which the consumer sends fetch requests to a broker.

## kafka.consumer.fetch_size_avg

The average number of bytes fetched for a specific topic per request.

## kafka.consumer.fetch_size_max

The maximum number of bytes fetched for a specific topic per request.

## kafka.consumer.kafka_commits

The rate of offset commits to Kafka.

## kafka.consumer.max_lag

The maximum consumer lag.

## kafka.consumer.messages_in

The rate of consumer message consumption.

## kafka.consumer.records_consumed

The average number of records consumed per second for a specific topic.

## kafka.consumer.records_per_request_avg

The average number of records in each request for a specific topic.

## kafka.consumer.zookeeper_commits

The rate of offset commits to ZooKeeper.

## kafka.expires_sec

The rate of delayed producer request expiration.

## kafka.follower.expires_per_second

The rate of request expiration on followers.

## kafka.log.flush_rate

The log flush rate.

## kafka.messages_in

The incoming message rate.

## kafka.net.bytes_in

The incoming byte rate.

## kafka.net.bytes_out

The outgoing byte rate.

## kafka.net.bytes_rejected

The rejected byte rate.

## kafka.producer.available_buffer_bytes

The total amount of buffer memory, including unallocated buffer memory and memory in the free list, that is not being used.

## kafka.producer.batch_size_avg

The average number of bytes sent per partition per-request.

## kafka.producer.batch_size_max

The maximum number of bytes sent per partition per-request.

## kafka.producer.buffer_bytes_total

The maximum amount of buffer memory the client can use.

## kafka.producer.bufferpool_wait_time

The fraction of time an appender waits for space allocation.

## kafka.producer.bytes_out

The rate of bytes going out for the producer.

## kafka.producer.compression_rate

The average compression rate of record batches for a topic.

## kafka.producer.compression_rate_avg

The average compression rate of record batches.

## kafka.producer.delayed_requests

The number of producer requests delayed.

## kafka.producer.expires_per_seconds

The rate of producer request expiration.

## kafka.producer.io_wait

The producer I/O wait time.

## kafka.producer.message_rate

The producer message rate.

The age of the current producer metadata being used, in seconds.

## kafka.producer.record_error_rate

The average number of retried record sends for a topic per second.

## kafka.producer.record_queue_time_avg

The average time that record batches spent in the record accumulator, in milliseconds.

## kafka.producer.record_queue_time_max

The maximum amount of time record batches can spend in the record accumulator, in milliseconds.

## kafka.producer.record_retry_rate

The average number of retried record sends for a topic per second.

## kafka.producer.record_send_rate

The average number of records sent per second for a topic.

## kafka.producer.record_size_avg

The average record size.

## kafka.producer.record_size_max

The maximum record size.

## kafka.producer.records_per_request

The average number of records sent per second.

## kafka.producer.request_latency_avg

The average request latency of the producer.

## kafka.producer.request_latency_max

The maximum request latency in milliseconds.

## kafka.producer.request_rate

The number of producer requests per second.

## kafka.producer.requests_in_flight

The current number of in-flight requests awaiting a response

## kafka.producer.response_rate

The number of producer responses per second.

## kafka.producer.throttle_time_avg

The average time in a request was throttled by a broker, in milliseconds.

## kafka.producer.throttle_time_max

The maximum time in a request was throttled by a broker, in milliseconds.

The number of user threads blocked waiting for buffer memory to enqueue their records.

## kafka.replication.isr_expands

The rate of replicas joining the ISR pool.

## kafka.replication.isr_shrinks

The rate of replicas leaving the ISR pool.

The leader election rate.

The unclean leader election rate.

## kafka.replication.under_replicated_partitions

The number of unreplicated partitions.

## kafka.request.fetch.failed

The number of client fetch request failures.

## kafka.request.fetch.failed_per_second

The rate of client fetch request failures per second.

## kafka.request.fetch.time.99percentile

The time for fetch requests for the 99th percentile.

## kafka.request.fetch.time.avg

The average time per fetch request.

## kafka.request.handler.avg.idle.pct

The average fraction of time the request handler threads are idle.

The time for metadata requests for 99th percentile.

The average time for a metadata request.

## kafka.request.offsets.time.99percentile

The time for offset requests for the 99th percentile.

## kafka.request.offsets.time.avg

The average time for an offset request.

## kafka.request.produce.failed

The number of failed produce requests.

## kafka.request.produce.failed_per_second

The rate of failed produce requests per second.

## kafka.request.produce.time.99percentile

The time for produce requests for the 99th percentile.

## kafka.request.produce.time.avg

The average time for a produce request.

The time for update metadata requests for the 99th percentile

The average time for a request to update metadata.

# Base Consul Metrics

## consul.catalog.nodes_critical

Number of nodes with service status `critical` from those registered.

## consul.catalog.nodes_passing

Number of nodes with service status `passing` from those registered.

## consul.catalog.nodes_up

Number of nodes.

## consul.catalog.nodes_warning

Number of nodes with service status `warning` from those registered.

## consul.catalog.services_critical

Total critical services on nodes.

## consul.catalog.services_passing

Total passing services on nodes.

## consul.catalog.services_up

Total services registered on nodes.

## consul.catalog.services_warning

Total warning services on nodes.

## consul.catalog.total_nodes

Number of nodes registered in the consul cluster.

## consul.net.node.latency.max

Maximum latency from this node to all others.

## consul.net.node.latency.median

Median latency from this node to all others.

## consul.net.node.latency.min

Minimum latency from this node to all others.

## consul.net.node.latency.p25

p25 latency from this node to all others.

## consul.net.node.latency.p75

p75 latency from this node to all others.

## consul.net.node.latency.p90

p90 latency from this node to all others.

## consul.net.node.latency.p95

p95 latency from this node to all others.

## consul.net.node.latency.p99

p99 latency from this node to all others.

## consul.peers

Number of peers in the peer set.

# Consul StatsD Metrics

## consul.memberlist.msg.suspect

Number of times an agent suspects another as failed while probing during gossip protocol.

## consul.raft.apply

Number of raft transactions occurring.

## consul.raft.commitTime.95percentile

The p95 time it takes to commit a new entry to the raft log on the leader.

## consul.raft.commitTime.avg

The average time it takes to commit a new entry to the raft log on the leader.

## consul.raft.commitTime.count

The number of samples of raft.commitTime

## consul.raft.commitTime.max

The max time it takes to commit a new entry to the raft log on the leader.

## consul.raft.commitTime.median

The median time it takes to commit a new entry to the raft log on the leader.

The p95 time it takes for the leader to write log entries to disk.

The average time it takes for the leader to write log entries to disk.

The number of samples of raft.leader.dispatchLog.

The max time it takes for the leader to write log entries to disk.

The median time it takes for the leader to write log entries to disk.

P95 time elapsed since the leader was last able to check its lease with followers.

Average time elapsed since the leader was last able to check its lease with followers.

The number of samples of raft.leader.lastContact.

Max time elapsed since the leader was last able to check its lease with followers.

Median time elapsed since the leader was last able to check its lease with followers.

## consul.raft.state.candidate

The number of initiated leader elections.

Number of completed leader elections.

## consul.runtime.alloc_bytes

Current bytes allocated by the Consul process.

## consul.runtime.free_count

Cumulative count of heap objects freed.

## consul.runtime.heap_objects

Number of objects allocated on the heap.

## consul.runtime.malloc_count

Cumulative count of heap objects allocated.

## consul.runtime.num_goroutines

Number of running goroutines.

## consul.runtime.sys_bytes

Total size of the virtual address space reserved by the Go runtime.

## consul.runtime.total_gc_pause_ns

Cumulative nanoseconds in GC stop-the-world pauses since Consul started.

## consul.runtime.total_gc_runs

Number of completed GC cycles.

## consul.serf.events

Incremented when an agent processes a serf event.

## consul.serf.member.flap

Number of times an agent is marked dead and then quickly recovers.

## consul.serf.member.join

Incremented when an agent processes a join event.

# Couchbase Metrics

## couchbase.by_bucket.avg_bg_wait_time

The average background wait time.

## couchbase.by_bucket.avg_disk_commit_time

The average disk commit time.

## couchbase.by_bucket.avg_disk_update_time

The average disk update time.

## couchbase.by_bucket.bg_wait_total

The total background wait time.

The number of bytes read.

## couchbase.by_bucket.bytes_written

The number of bytes written.

The number of compare and swap bad values.

## couchbase.by_bucket.cas_hits

The number of compare and swap hits.

## couchbase.by_bucket.cas_misses

The number of compare and swap misses.

## couchbase.by_bucket.cmd_get

The number of compare and swap gets.

## couchbase.by_bucket.cmd_set

The number of compare and swap sets.

## couchbase.by_bucket.couch_docs_actual_disk_size

The size of the couchbase docs on disk.

## couchbase.by_bucket.couch_docs_data_size

The data size of the couchbase docs.

## couchbase.by_bucket.couch_docs_disk_size

Couch docs total size in bytes.

## couchbase.by_bucket.couch_docs_fragmentation

The percentage of couchbase docs fragmentation.

## couchbase.by_bucket.couch_spatial_data_size

The size of object data for spatial views.

## couchbase.by_bucket.couch_spatial_disk_size

The amount of disk space occupied by spatial views.

## couchbase.by_bucket.couch_spatial_ops

Spatial operations.

## couchbase.by_bucket.couch_total_disk_size

The total disk size for couchbase.

## couchbase.by_bucket.couch_views_data_size

The size of object data for views.

## couchbase.by_bucket.couch_views_disk_size

The amount of disk space occupied by views.

## couchbase.by_bucket.couch_views_fragmentation

The view fragmentation.

## couchbase.by_bucket.couch_views_ops

View operations.

## couchbase.by_bucket.cpu_idle_ms

CPU idle milliseconds.

## couchbase.by_bucket.cpu_utilization_rate

CPU utilization percentage.

## couchbase.by_bucket.curr_connections

Current bucket connections.

## couchbase.by_bucket.curr_items

Number of active items in memory.

## couchbase.by_bucket.curr_items_tot

Total number of items.

## couchbase.by_bucket.decr_hits

Decrement hits.

## couchbase.by_bucket.decr_misses

Decrement misses.

## couchbase.by_bucket.delete_hits

Delete hits.

## couchbase.by_bucket.delete_misses

Delete misses.

## couchbase.by_bucket.disk_commit_count

Disk commits.

## couchbase.by_bucket.disk_write_queue

Disk write queue depth.

## couchbase.by_bucket.ep_bg_fetched

Disk reads per second.

## couchbase.by_bucket.ep_cache_miss_rate

Cache miss rate.

## couchbase.by_bucket.ep_cache_miss_ratio

Cache miss ratio.

## couchbase.by_bucket.ep_dcp_2i_backoff

Number of backoffs for indexes DCP connections.

## couchbase.by_bucket.ep_dcp_2i_count

Number of indexes DCP connections.

## couchbase.by_bucket.ep_dcp_2i_items_remaining

Number of indexes items remaining to be sent.

## couchbase.by_bucket.ep_dcp_2i_items_sent

Number of indexes items sent.

## couchbase.by_bucket.ep_dcp_2i_producer_count

Number of indexes producers

## couchbase.by_bucket.ep_dcp_2i_total_bytes

Number bytes per second being sent for indexes DCP connections.

## couchbase.by_bucket.ep_dcp_fts_backoff

Number of backoffs for fts DCP connections.

## couchbase.by_bucket.ep_dcp_fts_count

Number of fts DCP connections.

## couchbase.by_bucket.ep_dcp_fts_items_remaining

Number of fts items remaining to be sent.

## couchbase.by_bucket.ep_dcp_fts_items_sent

Number of fts items sent.

## couchbase.by_bucket.ep_dcp_fts_producer_count

Number of fts producers.

## couchbase.by_bucket.ep_dcp_fts_total_bytes

Number bytes per second being sent for fts DCP connections.

## couchbase.by_bucket.ep_dcp_other_backoff

Number of backoffs for other DCP connections.

## couchbase.by_bucket.ep_dcp_other_count

Number of other DCP connections.

## couchbase.by_bucket.ep_dcp_other_items_remaining

Number of other items remaining to be sent.

## couchbase.by_bucket.ep_dcp_other_items_sent

Number of other items sent.

## couchbase.by_bucket.ep_dcp_other_producer_count

Number of other producers.

## couchbase.by_bucket.ep_dcp_other_total_bytes

Number bytes per second being sent for other DCP connections.

## couchbase.by_bucket.ep_dcp_replica_backoff

Number of backoffs for replica DCP connections.

## couchbase.by_bucket.ep_dcp_replica_count

Number of replica DCP connections.

## couchbase.by_bucket.ep_dcp_replica_items_remaining

Number of replica items remaining to be sent.

## couchbase.by_bucket.ep_dcp_replica_items_sent

Number of replica items sent.

## couchbase.by_bucket.ep_dcp_replica_producer_count

Number of replica producers.

## couchbase.by_bucket.ep_dcp_replica_total_bytes

Number bytes per second being sent for replica DCP connections.

## couchbase.by_bucket.ep_dcp_views_backoff

Number of backoffs for views DCP connections.

## couchbase.by_bucket.ep_dcp_views_count

Number of views DCP connections.

## couchbase.by_bucket.ep_dcp_views_items_remaining

Number of views items remaining to be sent.

## couchbase.by_bucket.ep_dcp_views_items_sent

Number of views items sent.

## couchbase.by_bucket.ep_dcp_views_producer_count

Number of views producers.

## couchbase.by_bucket.ep_dcp_views_total_bytes

Number bytes per second being sent for views DCP connections.

## couchbase.by_bucket.ep_dcp_xdcr_backoff

Number of backoffs for xdcr DCP connections.

## couchbase.by_bucket.ep_dcp_xdcr_count

Number of xdcr DCP connections.

## couchbase.by_bucket.ep_dcp_xdcr_items_remaining

Number of xdcr items remaining to be sent.

## couchbase.by_bucket.ep_dcp_xdcr_items_sent

Number of xdcr items sent.

## couchbase.by_bucket.ep_dcp_xdcr_producer_count

Number of xdcr producers.

## couchbase.by_bucket.ep_dcp_xdcr_total_bytes

Number bytes per second being sent for xdcr DCP connections.

## couchbase.by_bucket.ep_diskqueue_drain

Total Drained items on disk queue.

## couchbase.by_bucket.ep_diskqueue_fill

Total enqueued items on disk queue.

## couchbase.by_bucket.ep_diskqueue_items

Total number of items waiting to be written to disk.

## couchbase.by_bucket.ep_flusher_todo

Number of items currently being written.

## couchbase.by_bucket.ep_item_commit_failed

Number of times a transaction failed to commit due to storage errors.

## couchbase.by_bucket.ep_kv_size

Total amount of user data cached in RAM in this bucket.

## couchbase.by_bucket.ep_max_size

The maximum amount of memory this bucket can use.

## couchbase.by_bucket.ep_mem_high_wat

Memory usage high water mark for auto-evictions.

## couchbase.by_bucket.ep_mem_low_wat

Memory usage low water mark for auto-evictions.

## couchbase.by_bucket.ep_meta_data_memory

Total amount of item metadata consuming RAM in this bucket.

## couchbase.by_bucket.ep_num_non_resident

Number of non-resident items.

## couchbase.by_bucket.ep_num_ops_del_meta

Number of delete operations per second for this bucket as the target for XDCR.

## couchbase.by_bucket.ep_num_ops_del_ret_meta

Number of delRetMeta operations per second for this bucket as the target for XDCR.

## couchbase.by_bucket.ep_num_ops_get_meta

Number of read operations per second for this bucket as the target for XDCR.

## couchbase.by_bucket.ep_num_ops_set_meta

Number of set operations per second for this bucket as the target for XDCR.

## couchbase.by_bucket.ep_num_ops_set_ret_meta

Number of setRetMeta operations per second for this bucket as the target for XDCR.

## couchbase.by_bucket.ep_num_value_ejects

Number of times item values got ejected from memory to disk.\

## couchbase.by_bucket.ep_oom_errors

Number of times unrecoverable OOMs happened while processing operations.

## couchbase.by_bucket.ep_ops_create

Create operations.

## couchbase.by_bucket.ep_ops_update

Update operations.

Extra memory used by transient data like persistence queues or checkpoints.

## couchbase.by_bucket.ep_queue_size

Number of items queued for storage.

## couchbase.by_bucket.ep_resident_items_rate

Number of resident items.

## couchbase.by_bucket.ep_tap_replica_queue_drain

Total drained items in the replica queue.

## couchbase.by_bucket.ep_tap_total_queue_drain

Total drained items in the queue.

## couchbase.by_bucket.ep_tap_total_queue_fill

Total enqueued items in the queue.

## couchbase.by_bucket.ep_tap_total_total_backlog_size

Number of remaining items for replication.

## couchbase.by_bucket.ep_tmp_oom_errors

Number of times recoverable OOMs happened while processing operations.

## couchbase.by_bucket.ep_vb_total

Total number of vBuckets for this bucket.

## couchbase.by_bucket.evictions

Number of evictions

## couchbase.by_bucket.get_hits

Number of get hits

## couchbase.by_bucket.get_misses

Number of get misses.

## couchbase.by_bucket.hibernated_requests

Number of streaming requests now idle.

## couchbase.by_bucket.hibernated_waked

Rate of streaming request wakeups.

## couchbase.by_bucket.hit_ratio

Hit ratio.

## couchbase.by_bucket.incr_hits

Number of increment hits.

## couchbase.by_bucket.incr_misses

Number of increment misses.

## couchbase.by_bucket.mem_actual_free

Free memory.

## couchbase.by_bucket.mem_actual_used

Used memory.

## couchbase.by_bucket.mem_free

Free memory.

## couchbase.by_bucket.mem_total

Total available memory.

## couchbase.by_bucket.mem_used (deprecated)

Engine’s total memory usage.

## couchbase.by_bucket.mem_used_sys

System memory usage.

## couchbase.by_bucket.misses

Total number of misses.

## couchbase.by_bucket.ops

Total number of operations.

## couchbase.by_bucket.page_faults

Number of page faults.

## couchbase.by_bucket.rest_requests

Number of HTTP requests.

## couchbase.by_bucket.swap_total

Total amount of swap available.

## couchbase.by_bucket.swap_used

Amount of swap used.

## couchbase.by_bucket.vb_active_eject

Number of items per second being ejected to disk from active vBuckets.

## couchbase.by_bucket.vb_active_itm_memory

Amount of active user data cached in RAM in this bucket.

## couchbase.by_bucket.vb_active_meta_data_memory

Amount of active item metadata consuming RAM in this bucket.

## couchbase.by_bucket.vb_active_num

Number of active items.

## couchbase.by_bucket.vb_active_num_non_resident

Number of non resident vBuckets in the active state for this bucket.

## couchbase.by_bucket.vb_active_ops_create

New items per second being inserted into active vBuckets in this bucket.

## couchbase.by_bucket.vb_active_ops_update

Number of items updated on active vBucket per second for this bucket.

## couchbase.by_bucket.vb_active_queue_age

Sum of disk queue item age in milliseconds.

## couchbase.by_bucket.vb_active_queue_drain

Total drained items in the queue.

## couchbase.by_bucket.vb_active_queue_fill

Number of active items per second being put on the active item disk queue.

## couchbase.by_bucket.vb_active_queue_size

Number of active items in the queue.

## couchbase.by_bucket.vb_active_resident_items_ratio

Number of resident items.

## couchbase.by_bucket.vb_avg_active_queue_age

Average age in seconds of active items in the active item queue.

## couchbase.by_bucket.vb_avg_pending_queue_age

Average age in seconds of pending items in the pending item queue.

## couchbase.by_bucket.vb_avg_replica_queue_age

Average age in seconds of replica items in the replica item queue.

## couchbase.by_bucket.vb_avg_total_queue_age

Average age of items in the queue.

## couchbase.by_bucket.vb_pending_curr_items

Number of items in pending vBuckets.

## couchbase.by_bucket.vb_pending_eject

Number of items per second being ejected to disk from pending vBuckets.

## couchbase.by_bucket.vb_pending_itm_memory

Amount of pending user data cached in RAM in this bucket.

## couchbase.by_bucket.vb_pending_meta_data_memory

Amount of pending item metadata consuming RAM in this bucket.

## couchbase.by_bucket.vb_pending_num

Number of pending items.

## couchbase.by_bucket.vb_pending_num_non_resident

Number of non resident vBuckets in the pending state for this bucket.

## couchbase.by_bucket.vb_pending_ops_create

Number of pending create operations.

## couchbase.by_bucket.vb_pending_ops_update

Number of items updated on pending vBucket per second for this bucket.

## couchbase.by_bucket.vb_pending_queue_age

Sum of disk pending queue item age in milliseconds.

## couchbase.by_bucket.vb_pending_queue_drain

Total drained pending items in the queue.

## couchbase.by_bucket.vb_pending_queue_fill

Total enqueued pending items on disk queue.

## couchbase.by_bucket.vb_pending_queue_size

Number of pending items in the queue.

## couchbase.by_bucket.vb_pending_resident_items_ratio

Number of resident pending items.

## couchbase.by_bucket.vb_replica_curr_items

Number of in memory items.

## couchbase.by_bucket.vb_replica_eject

Number of items per second being ejected to disk from replica vBuckets.

## couchbase.by_bucket.vb_replica_itm_memory

Amount of replica user data cached in RAM in this bucket.

## couchbase.by_bucket.vb_replica_num

Number of replica vBuckets.

## couchbase.by_bucket.vb_replica_num_non_resident

Number of non resident vBuckets in the replica state for this bucket.

## couchbase.by_bucket.vb_replica_ops_create

Number of replica create operations.

## couchbase.by_bucket.vb_replica_ops_update

Number of items updated on replica vBucket per second for this bucket.

## couchbase.by_bucket.vb_replica_queue_age

Sum of disk replica queue item age in milliseconds.

## couchbase.by_bucket.vb_replica_queue_drain

Total drained replica items in the queue.

## couchbase.by_bucket.vb_replica_queue_fill

Total enqueued replica items on disk queue.

## couchbase.by_bucket.vb_replica_queue_size

Replica items in disk queue.

## couchbase.by_bucket.vb_replica_resident_items_ratio

Number of resident replica items.

## couchbase.by_bucket.vb_total_queue_age

Sum of disk queue item age in milliseconds.

## couchbase.by_bucket.xdc_ops

Number of cross-datacenter replication operations.

## couchbase.by_node.couch_docs_actual_disk_size

Couch docs total size on disk in bytes.

## couchbase.by_node.couch_docs_data_size

Couch docs data size in bytes.

## couchbase.by_node.couch_views_actual_disk_size

Couch views total size on disk in bytes.

## couchbase.by_node.couch_views_data_size

Couch views data size on disk in bytes.

## couchbase.by_node.curr_items

Number of active items in memory.

## couchbase.by_node.curr_items_tot

Total number of items.

## couchbase.by_node.vb_replica_curr_items

Number of in memory items.

## couchbase.hdd.free

Free hard disk space.

## couchbase.hdd.quota_total

Hard disk quota.

## couchbase.hdd.total

Total hard disk space.

## couchbase.hdd.used

Used hard disk space.

## couchbase.hdd.used_by_data

Hard disk used for data.

## couchbase.ram.quota_total

RAM quota.

## couchbase.ram.total

The total RAM available.

## couchbase.ram.used

The amount of RAM in use.

## couchbase.ram.used_by_data

The amount of RAM used for data.

# Elasticsearch Metrics

All Elasticsearch metrics have the type gauge.

## elasticsearch.active_primary_shards

The number of active primary shards in the cluster.

## elasticsearch.active_shards

The number of active shards in the cluster.

## elasticsearch.breakers.fielddata.estimated_size_in_bytes

The estimated size in bytes of the field data circuit breaker.

The constant multiplier for byte estimations of the field data circuit breaker.

## elasticsearch.breakers.fielddata.tripped

The number of times the field data circuit breaker has tripped.

## elasticsearch.breakers.parent.estimated_size_in_bytes

The estimated size in bytes of the parent circuit breaker.

The constant multiplier for byte estimations of the parent circuit breaker.

## elasticsearch.breakers.parent.tripped

The number of times the parent circuit breaker has tripped.

## elasticsearch.breakers.request.estimated_size_in_bytes

The estimated size in bytes of the request circuit breaker.

The constant multiplier for byte estimations of the request circuit breaker.

## elasticsearch.breakers.request.tripped

The number of times the request circuit breaker has tripped.

## elasticsearch.breakers.inflight_requests.tripped

The number of times the inflight circuit breaker has tripped.

The constant multiplier for byte estimations of the inflight circuit breaker.

## elasticsearch.breakers.inflight_requests.estimated_size_in_bytes

The estimated size in bytes of the inflight circuit breaker.

## elasticsearch.cache.field.evictions

The total number of evictions from the field data cache.

## elasticsearch.cache.field.size

The size of the field cache.

## elasticsearch.cache.filter.count

The number of items in the filter cache.

## elasticsearch.cache.filter.evictions

The total number of evictions from the filter cache.

## elasticsearch.cache.filter.size

The size of the filter cache.

## elasticsearch.cluster_status

The elasticsearch cluster health as a number: red = 0, yellow = 1, green = 2

## elasticsearch.docs.count

The total number of documents in the cluster across all shards.

## elasticsearch.docs.deleted

The total number of documents deleted from the cluster across all shards.

## elasticsearch.fielddata.evictions

The total number of evictions from the fielddata cache.

## elasticsearch.fielddata.size

The size of the fielddata cache.

## elasticsearch.flush.total

The total number of index flushes to disk since start.

## elasticsearch.flush.total.time

The total time spent flushing the index to disk.

## elasticsearch.fs.total.available_in_bytes

The total number of bytes available to this Java virtual machine on this file store.

## elasticsearch.fs.total.disk_io_op

The total I/O operations on the file store.

## elasticsearch.fs.total.disk_io_size_in_bytes

Total bytes used for all I/O operations on the file store.

The total bytes read from the file store.

The total number of reads from the file store.

## elasticsearch.fs.total.disk_write_size_in_bytes

The total bytes written to the file store.

## elasticsearch.fs.total.disk_writes

The total number of writes to the file store.

## elasticsearch.fs.total.free_in_bytes

The total number of unallocated bytes in the file store.

## elasticsearch.fs.total.total_in_bytes

The total size in bytes of the file store.

## elasticsearch.get.current

The number of get requests currently running.

## elasticsearch.get.exists.time

The total time spent on get requests where the document existed.

## elasticsearch.get.exists.total

The total number of get requests where the document existed.

## elasticsearch.get.missing.time

The total time spent on get requests where the document was missing.

## elasticsearch.get.missing.total

The total number of get requests where the document was missing.

## elasticsearch.get.time

The total time spent on get requests.

## elasticsearch.get.total

The total number of get requests.

## elasticsearch.http.current_open

The number of current open HTTP connections.

## elasticsearch.http.total_opened

The total number of opened HTTP connections.

## elasticsearch.id_cache.size

The size of the id cache

## elasticsearch.indexing.delete.current

The number of documents currently being deleted from an index.

## elasticsearch.indexing.delete.time

The total time spent deleting documents from an index.

## elasticsearch.indexing.delete.total

The total number of documents deleted from an index.

## elasticsearch.indexing.index.current

The number of documents currently being indexed to an index.

## elasticsearch.indexing.index.time

The total time spent indexing documents to an index.

## elasticsearch.indexing.index.total

The total number of documents indexed to an index.

## elasticsearch.indices.count

The number of indices in the cluster.

## elasticsearch.indices.indexing.index_failed

The number of failed indexing operations.

## elasticsearch.indices.indexing.throttle_time

The total time indexing waited due to throttling.

## elasticsearch.indices.query_cache.evictions

The number of query cache evictions.

## elasticsearch.indices.query_cache.hit_count

The number of query cache hits.

## elasticsearch.indices.query_cache.memory_size_in_bytes

The memory used by the query cache.

## elasticsearch.indices.query_cache.miss_count

The number of query cache misses.

## elasticsearch.indices.recovery.current_as_source

The number of ongoing recoveries for which a shard serves as a source.

## elasticsearch.indices.recovery.current_as_target

The number of ongoing recoveries for which a shard serves as a target.

## elasticsearch.indices.recovery.throttle_time

The total time recoveries waited due to throttling.

## elasticsearch.indices.request_cache.evictions

The number of request cache evictions.

## elasticsearch.indices.request_cache.hit_count

The number of request cache hits.

## elasticsearch.indices.request_cache.memory_size_in_bytes

The memory used by the request cache.

## elasticsearch.indices.request_cache.miss_count

The number of request cache misses.

## elasticsearch.indices.segments.count

The number of segments in an index shard.

## elasticsearch.indices.segments.doc_values_memory_in_bytes

The memory used by doc values.

## elasticsearch.indices.segments.fixed_bit_set_memory_in_bytes

The memory used by fixed bit set.

## elasticsearch.indices.segments.index_writer_max_memory_in_bytes

The maximum memory used by the index writer.

## elasticsearch.indices.segments.index_writer_memory_in_bytes

The memory used by the index writer.

## elasticsearch.indices.segments.memory_in_bytes

The memory used by index segments.

## elasticsearch.indices.segments.norms_memory_in_bytes

The memory used by norms.

## elasticsearch.indices.segments.stored_fields_memory_in_bytes

The memory used by stored fields.

## elasticsearch.indices.segments.term_vectors_memory_in_bytes

The memory used by term vectors.

## elasticsearch.indices.segments.terms_memory_in_bytes

The memory used by terms.

## elasticsearch.indices.segments.version_map_memory_in_bytes

The memory used by the segment version map.

## elasticsearch.indices.translog.operations

The number of operations in the transaction log.

## elasticsearch.indices.translog.size_in_bytes

The size of the transaction log.

## elasticsearch.initializing_shards

The number of shards that are currently initializing.

## elasticsearch.merges.current

The number of currently active segment merges.

## elasticsearch.merges.current.docs

The number of documents across segments currently being merged.

## elasticsearch.merges.current.size

The size of the segments currently being merged.

## elasticsearch.merges.total

The total number of segment merges.

## elasticsearch.merges.total.docs

The total number of documents across all merged segments.

## elasticsearch.merges.total.size

The total size of all merged segments.

## elasticsearch.merges.total.time

The total time spent on segment merging.

## elasticsearch.number_of_data_nodes

The number of data nodes in the cluster.

## elasticsearch.number_of_nodes

The total number of nodes in the cluster.

The number of high priority pending tasks.

The number of urgent priority pending tasks.

The average time spent by tasks in the queue.

The total number of pending tasks.

## elasticsearch.process.open_fd

The number of opened file descriptors associated with the current process, or -1 if not supported.

## elasticsearch.refresh.total

The total number of index refreshes.

## elasticsearch.refresh.total.time

The total time spent on index refreshes.

## elasticsearch.relocating_shards

The number of shards that are relocating from one node to another.

## elasticsearch.search.fetch.current

The number of search fetches currently running.

## elasticsearch.search.fetch.open_contexts

The number of active searches.

## elasticsearch.search.fetch.time

The total time spent on the search fetch.

## elasticsearch.search.fetch.total

The total number of search fetches.

## elasticsearch.search.query.current

The number of currently active queries.

## elasticsearch.search.query.time

The total time spent on queries.

## elasticsearch.search.query.total

The total number of queries.

## elasticsearch.store.size

The total size in bytes of the store.

The number of active threads in the bulk pool.

The number of queued threads in the bulk pool.

The total number of threads in the bulk pool.

The number of rejected threads in the bulk pool.

The number of active threads in the fetch shard started pool.

The total number of threads in the fetch shard started pool.

The number of queued threads in the fetch shard started pool.

The number of rejected threads in the fetch shard started pool.

The number of active threads in the fetch shard store pool.

The total number of threads in the fetch shard store pool.

The number of queued threads in the fetch shard store pool.

The number of rejected threads in the fetch shard store pool.

The number of active threads in the flush queue.

The number of queued threads in the flush pool.

The total number of threads in the flush pool.

The number of rejected threads in the flush pool.

The number of active threads for force merge operations.

The total number of threads for force merge operations.

The number of queued threads for force merge operations.

The number of rejected threads for force merge operations.

The number of active threads in the generic pool.

The number of queued threads in the generic pool.

The total number of threads in the generic pool.

The number of rejected threads in the generic pool.

The number of active threads in the get pool.

The number of queued threads in the get pool.

The total number of threads in the get pool.

The number of rejected threads in the get pool.

The number of active threads in the index pool.

The number of queued threads in the index pool.

The total number of threads in the index pool.

The number of rejected threads in the index pool.

The number of active threads in the listener pool.

The number of queued threads in the listener pool.

The total number of threads in the listener pool.

The number of rejected threads in the listener pool.

The number of active threads in the management pool.

The number of queued threads in the management pool.

The total number of threads in the management pool.

The number of rejected threads in the management pool.

The number of active threads in the merge pool.

The number of queued threads in the merge pool.

The total number of threads in the merge pool.

The number of rejected threads in the merge pool.

The number of active threads in the percolate pool.

The number of queued threads in the percolate pool.

The total number of threads in the percolate pool.

The number of rejected threads in the percolate pool.

The number of active threads in the refresh pool.

The number of queued threads in the refresh pool.

The total number of threads in the refresh pool.

The number of rejected threads in the refresh pool.

The number of active threads in the search pool.

The number of queued threads in the search pool.

The total number of threads in the search pool.

The number of rejected threads in the search pool.

The number of active threads in the snapshot pool.

The number of queued threads in the snapshot pool.

The total number of threads in the snapshot pool.

The number of rejected threads in the snapshot pool.

The number of active threads in the write pool.

The number of queued threads in the write pool.

The total number of threads in the write pool.

The number of rejected threads in the write pool.

## elasticsearch.transport.rx_count

The total number of packets received in cluster communication.

## elasticsearch.transport.rx_size

The total size of data received in cluster communication.

## elasticsearch.transport.server_open

The number of connections opened for cluster communication.

## elasticsearch.transport.tx_count

The total number of packets sent in cluster communication.

## elasticsearch.transport.tx_size

The total size of data sent in cluster communication.

## elasticsearch.unassigned_shards

The number of shards that are unassigned to a node.

## elasticsearch.delayed_unassigned_shards

The number of shards whose allocation has been delayed.

## jvm.gc.collection_count

The total number of garbage collections run by the JVM.

## jvm.gc.collection_time

The total time spent on garbage collection in the JVM.

## jvm.gc.collectors.old.collection_time

The total time spent in major GCs in the JVM that collect old generation objects.

## jvm.gc.collectors.old.count

The total count of major GCs in the JVM that collect old generation objects.

## jvm.gc.collectors.young.collection_time

The total time spent in minor GCs in the JVM that collects young generation objects.

## jvm.gc.collectors.young.count

The total count of minor GCs in the JVM that collects young generation objects.

## jvm.gc.concurrent_mark_sweep.collection_time

The total time spent on “concurrent mark & sweep” GCs in the JVM.

## jvm.gc.concurrent_mark_sweep.count

The total count of “concurrent mark & sweep” GCs in the JVM.

## jvm.gc.par_new.collection_time

The total time spent on “parallel new” GCs in the JVM.

## jvm.gc.par_new.count

The total count of “parallel new” GCs in the JVM.

## jvm.mem.heap_committed

The amount of memory guaranteed to be available to the JVM heap.

## jvm.mem.heap_in_use

The amount of memory currently used by the JVM heap as a value between 0 and 1.

## jvm.mem.heap_max

The maximum amount of memory that can be used by the JVM heap.

## jvm.mem.heap_used

The amount of memory in bytes currently used by the JVM heap.

## jvm.mem.non_heap_committed

The amount of memory guaranteed to be available to JVM non-heap.

## jvm.mem.non_heap_used

The amount of memory in bytes currently used by the JVM non-heap.

## jvm.mem.pools.young.used

The amount of memory in bytes currently used by the Young Generation heap region.

## jvm.mem.pools.young.max

The maximum amount of memory that can be used by the Young Generation heap region.

## jvm.mem.pools.old.used

The amount of memory in bytes currently used by the Old Generation heap region.

## jvm.mem.pools.old.max

The maximum amount of memory that can be used by the Old Generation heap region.

## jvm.mem.pools.survivor.used

The amount of memory in bytes currently used by the Survivor Space.

## jvm.mem.pools.survivor.max

The maximum amount of memory that can be used by the Survivor Space.

The number of active threads in the JVM.

The peak number of threads used by the JVM.

## elasticsearch.index.health

The status of the index.

## elasticsearch.index.docs.count

The number of documents in the index.

## elasticsearch.index.docs.deleted

The number of deleted documents in the index.

## elasticsearch.index.primary_shards

The number of primary shards in the index.

## elasticsearch.index.replica_shards

The number of replica shards in the index.

## elasticsearch.index.primary_store_size

The store size of primary shards in the index.

## elasticsearch.index.store_size

The store size of primary and replica shards in the index.

# Elasticsearch Metrics

All Elasticsearch metrics have the type gauge.

## elasticsearch.active_primary_shards

The number of active primary shards in the cluster.

## elasticsearch.active_shards

The number of active shards in the cluster.

## elasticsearch.breakers.fielddata.estimated_size_in_bytes

The estimated size in bytes of the field data circuit breaker.

The constant multiplier for byte estimations of the field data circuit breaker.

## elasticsearch.breakers.fielddata.tripped

The number of times the field data circuit breaker has tripped.

## elasticsearch.breakers.parent.estimated_size_in_bytes

The estimated size in bytes of the parent circuit breaker.

The constant multiplier for byte estimations of the parent circuit breaker.

## elasticsearch.breakers.parent.tripped

The number of times the parent circuit breaker has tripped.

## elasticsearch.breakers.request.estimated_size_in_bytes

The estimated size in bytes of the request circuit breaker.

The constant multiplier for byte estimations of the request circuit breaker.

## elasticsearch.breakers.request.tripped

The number of times the request circuit breaker has tripped.

## elasticsearch.breakers.inflight_requests.tripped

The number of times the inflight circuit breaker has tripped.

The constant multiplier for byte estimations of the inflight circuit breaker.

## elasticsearch.breakers.inflight_requests.estimated_size_in_bytes

The estimated size in bytes of the inflight circuit breaker.

## elasticsearch.cache.field.evictions

The total number of evictions from the field data cache.

## elasticsearch.cache.field.size

The size of the field cache.

## elasticsearch.cache.filter.count

The number of items in the filter cache.

## elasticsearch.cache.filter.evictions

The total number of evictions from the filter cache.

## elasticsearch.cache.filter.size

The size of the filter cache.

## elasticsearch.cluster_status

The elasticsearch cluster health as a number: red = 0, yellow = 1, green = 2

## elasticsearch.docs.count

The total number of documents in the cluster across all shards.

## elasticsearch.docs.deleted

The total number of documents deleted from the cluster across all shards.

## elasticsearch.fielddata.evictions

The total number of evictions from the fielddata cache.

## elasticsearch.fielddata.size

The size of the fielddata cache.

## elasticsearch.flush.total

The total number of index flushes to disk since start.

## elasticsearch.flush.total.time

The total time spent flushing the index to disk.

## elasticsearch.fs.total.available_in_bytes

The total number of bytes available to this Java virtual machine on this file store.

## elasticsearch.fs.total.disk_io_op

The total I/O operations on the file store.

## elasticsearch.fs.total.disk_io_size_in_bytes

Total bytes used for all I/O operations on the file store.

The total bytes read from the file store.

The total number of reads from the file store.

## elasticsearch.fs.total.disk_write_size_in_bytes

The total bytes written to the file store.

## elasticsearch.fs.total.disk_writes

The total number of writes to the file store.

## elasticsearch.fs.total.free_in_bytes

The total number of unallocated bytes in the file store.

## elasticsearch.fs.total.total_in_bytes

The total size in bytes of the file store.

## elasticsearch.get.current

The number of get requests currently running.

## elasticsearch.get.exists.time

The total time spent on get requests where the document existed.

## elasticsearch.get.exists.total

The total number of get requests where the document existed.

## elasticsearch.get.missing.time

The total time spent on get requests where the document was missing.

## elasticsearch.get.missing.total

The total number of get requests where the document was missing.

## elasticsearch.get.time

The total time spent on get requests.

## elasticsearch.get.total

The total number of get requests.

## elasticsearch.http.current_open

The number of current open HTTP connections.

## elasticsearch.http.total_opened

The total number of opened HTTP connections.

## elasticsearch.id_cache.size

The size of the id cache

## elasticsearch.indexing.delete.current

The number of documents currently being deleted from an index.

## elasticsearch.indexing.delete.time

The total time spent deleting documents from an index.

## elasticsearch.indexing.delete.total

The total number of documents deleted from an index.

## elasticsearch.indexing.index.current

The number of documents currently being indexed to an index.

## elasticsearch.indexing.index.time

The total time spent indexing documents to an index.

## elasticsearch.indexing.index.total

The total number of documents indexed to an index.

## elasticsearch.indices.count

The number of indices in the cluster.

## elasticsearch.indices.indexing.index_failed

The number of failed indexing operations.

## elasticsearch.indices.indexing.throttle_time

The total time indexing waited due to throttling.

## elasticsearch.indices.query_cache.evictions

The number of query cache evictions.

## elasticsearch.indices.query_cache.hit_count

The number of query cache hits.

## elasticsearch.indices.query_cache.memory_size_in_bytes

The memory used by the query cache.

## elasticsearch.indices.query_cache.miss_count

The number of query cache misses.

## elasticsearch.indices.recovery.current_as_source

The number of ongoing recoveries for which a shard serves as a source.

## elasticsearch.indices.recovery.current_as_target

The number of ongoing recoveries for which a shard serves as a target.

## elasticsearch.indices.recovery.throttle_time

The total time recoveries waited due to throttling.

## elasticsearch.indices.request_cache.evictions

The number of request cache evictions.

## elasticsearch.indices.request_cache.hit_count

The number of request cache hits.

## elasticsearch.indices.request_cache.memory_size_in_bytes

The memory used by the request cache.

## elasticsearch.indices.request_cache.miss_count

The number of request cache misses.

## elasticsearch.indices.segments.count

The number of segments in an index shard.

## elasticsearch.indices.segments.doc_values_memory_in_bytes

The memory used by doc values.

## elasticsearch.indices.segments.fixed_bit_set_memory_in_bytes

The memory used by fixed bit set.

## elasticsearch.indices.segments.index_writer_max_memory_in_bytes

The maximum memory used by the index writer.

## elasticsearch.indices.segments.index_writer_memory_in_bytes

The memory used by the index writer.

## elasticsearch.indices.segments.memory_in_bytes

The memory used by index segments.

## elasticsearch.indices.segments.norms_memory_in_bytes

The memory used by norms.

## elasticsearch.indices.segments.stored_fields_memory_in_bytes

The memory used by stored fields.

## elasticsearch.indices.segments.term_vectors_memory_in_bytes

The memory used by term vectors.

## elasticsearch.indices.segments.terms_memory_in_bytes

The memory used by terms.

## elasticsearch.indices.segments.version_map_memory_in_bytes

The memory used by the segment version map.

## elasticsearch.indices.translog.operations

The number of operations in the transaction log.

## elasticsearch.indices.translog.size_in_bytes

The size of the transaction log.

## elasticsearch.initializing_shards

The number of shards that are currently initializing.

## elasticsearch.merges.current

The number of currently active segment merges.

## elasticsearch.merges.current.docs

The number of documents across segments currently being merged.

## elasticsearch.merges.current.size

The size of the segments currently being merged.

## elasticsearch.merges.total

The total number of segment merges.

## elasticsearch.merges.total.docs

The total number of documents across all merged segments.

## elasticsearch.merges.total.size

The total size of all merged segments.

## elasticsearch.merges.total.time

The total time spent on segment merging.

## elasticsearch.number_of_data_nodes

The number of data nodes in the cluster.

## elasticsearch.number_of_nodes

The total number of nodes in the cluster.

The number of high priority pending tasks.

The number of urgent priority pending tasks.

The average time spent by tasks in the queue.

The total number of pending tasks.

## elasticsearch.process.open_fd

The number of opened file descriptors associated with the current process, or -1 if not supported.

## elasticsearch.refresh.total

The total number of index refreshes.

## elasticsearch.refresh.total.time

The total time spent on index refreshes.

## elasticsearch.relocating_shards

The number of shards that are relocating from one node to another.

## elasticsearch.search.fetch.current

The number of search fetches currently running.

## elasticsearch.search.fetch.open_contexts

The number of active searches.

## elasticsearch.search.fetch.time

The total time spent on the search fetch.

## elasticsearch.search.fetch.total

The total number of search fetches.

## elasticsearch.search.query.current

The number of currently active queries.

## elasticsearch.search.query.time

The total time spent on queries.

## elasticsearch.search.query.total

The total number of queries.

## elasticsearch.store.size

The total size in bytes of the store.

The number of active threads in the bulk pool.

The number of queued threads in the bulk pool.

The total number of threads in the bulk pool.

The number of rejected threads in the bulk pool.

The number of active threads in the fetch shard started pool.

The total number of threads in the fetch shard started pool.

The number of queued threads in the fetch shard started pool.

The number of rejected threads in the fetch shard started pool.

The number of active threads in the fetch shard store pool.

The total number of threads in the fetch shard store pool.

The number of queued threads in the fetch shard store pool.

The number of rejected threads in the fetch shard store pool.

The number of active threads in the flush queue.

The number of queued threads in the flush pool.

The total number of threads in the flush pool.

The number of rejected threads in the flush pool.

The number of active threads for force merge operations.

The total number of threads for force merge operations.

The number of queued threads for force merge operations.

The number of rejected threads for force merge operations.

The number of active threads in the generic pool.

The number of queued threads in the generic pool.

The total number of threads in the generic pool.

The number of rejected threads in the generic pool.

The number of active threads in the get pool.

The number of queued threads in the get pool.

The total number of threads in the get pool.

The number of rejected threads in the get pool.

The number of active threads in the index pool.

The number of queued threads in the index pool.

The total number of threads in the index pool.

The number of rejected threads in the index pool.

The number of active threads in the listener pool.

The number of queued threads in the listener pool.

The total number of threads in the listener pool.

The number of rejected threads in the listener pool.

The number of active threads in the management pool.

The number of queued threads in the management pool.

The total number of threads in the management pool.

The number of rejected threads in the management pool.

The number of active threads in the merge pool.

The number of queued threads in the merge pool.

The total number of threads in the merge pool.

The number of rejected threads in the merge pool.

The number of active threads in the percolate pool.

The number of queued threads in the percolate pool.

The total number of threads in the percolate pool.

The number of rejected threads in the percolate pool.

The number of active threads in the refresh pool.

The number of queued threads in the refresh pool.

The total number of threads in the refresh pool.

The number of rejected threads in the refresh pool.

The number of active threads in the search pool.

The number of queued threads in the search pool.

The total number of threads in the search pool.

The number of rejected threads in the search pool.

The number of active threads in the snapshot pool.

The number of queued threads in the snapshot pool.

The total number of threads in the snapshot pool.

The number of rejected threads in the snapshot pool.

The number of active threads in the write pool.

The number of queued threads in the write pool.

The total number of threads in the write pool.

The number of rejected threads in the write pool.

## elasticsearch.transport.rx_count

The total number of packets received in cluster communication.

## elasticsearch.transport.rx_size

The total size of data received in cluster communication.

## elasticsearch.transport.server_open

The number of connections opened for cluster communication.

## elasticsearch.transport.tx_count

The total number of packets sent in cluster communication.

## elasticsearch.transport.tx_size

The total size of data sent in cluster communication.

## elasticsearch.unassigned_shards

The number of shards that are unassigned to a node.

## elasticsearch.delayed_unassigned_shards

The number of shards whose allocation has been delayed.

## jvm.gc.collection_count

The total number of garbage collections run by the JVM.

## jvm.gc.collection_time

The total time spent on garbage collection in the JVM.

## jvm.gc.collectors.old.collection_time

The total time spent in major GCs in the JVM that collect old generation objects.

## jvm.gc.collectors.old.count

The total count of major GCs in the JVM that collect old generation objects.

## jvm.gc.collectors.young.collection_time

The total time spent in minor GCs in the JVM that collects young generation objects.

## jvm.gc.collectors.young.count

The total count of minor GCs in the JVM that collects young generation objects.

## jvm.gc.concurrent_mark_sweep.collection_time

The total time spent on “concurrent mark & sweep” GCs in the JVM.

## jvm.gc.concurrent_mark_sweep.count

The total count of “concurrent mark & sweep” GCs in the JVM.

## jvm.gc.par_new.collection_time

The total time spent on “parallel new” GCs in the JVM.

## jvm.gc.par_new.count

The total count of “parallel new” GCs in the JVM.

## jvm.mem.heap_committed

The amount of memory guaranteed to be available to the JVM heap.

## jvm.mem.heap_in_use

The amount of memory currently used by the JVM heap as a value between 0 and 1.

## jvm.mem.heap_max

The maximum amount of memory that can be used by the JVM heap.

## jvm.mem.heap_used

The amount of memory in bytes currently used by the JVM heap.

## jvm.mem.non_heap_committed

The amount of memory guaranteed to be available to JVM non-heap.

## jvm.mem.non_heap_used

The amount of memory in bytes currently used by the JVM non-heap.

## jvm.mem.pools.young.used

The amount of memory in bytes currently used by the Young Generation heap region.

## jvm.mem.pools.young.max

The maximum amount of memory that can be used by the Young Generation heap region.

## jvm.mem.pools.old.used

The amount of memory in bytes currently used by the Old Generation heap region.

## jvm.mem.pools.old.max

The maximum amount of memory that can be used by the Old Generation heap region.

## jvm.mem.pools.survivor.used

The amount of memory in bytes currently used by the Survivor Space.

## jvm.mem.pools.survivor.max

The maximum amount of memory that can be used by the Survivor Space.

The number of active threads in the JVM.

The peak number of threads used by the JVM.

## elasticsearch.index.health

The status of the index.

## elasticsearch.index.docs.count

The number of documents in the index.

## elasticsearch.index.docs.deleted

The number of deleted documents in the index.

## elasticsearch.index.primary_shards

The number of primary shards in the index.

## elasticsearch.index.replica_shards

The number of replica shards in the index.

## elasticsearch.index.primary_store_size

The store size of primary shards in the index.

## elasticsearch.index.store_size

The store size of primary and replica shards in the index.

# etcd Metrics

Rate of failed Raft RPC requests.

Rate of successful Raft RPC requests.

Average latency to each peer in the cluster.

Current latency to each peer in the cluster.

Maximum latency to each peer in the cluster.

Minimum latency to each peer in the cluster.

Standard deviation latency to each peer in the cluster.

## etcd.self.recv.appendrequest.count

Rate of append requests this node has processed.

## etcd.self.recv.bandwidthrate

Rate of bytes received.

## etcd.self.recv.pkgrate

Rate of packets received.

## etcd.self.send.appendrequest.count

Rate of append requests this node has sent.

## etcd.self.send.bandwidthrate

Rate of bytes sent.

## etcd.self.send.pkgrate

Rate of packets sent.

## etcd.store.compareanddelete.fail

Rate of compare and delete requests failure.

## etcd.store.compareanddelete.success

Rate of compare and delete requests success.

## etcd.store.compareandswap.fail

Rate of compare and swap requests failure.

## etcd.store.compareandswap.success

Rate of compare and swap requests success.

## etcd.store.create.fail

Rate of failed create requests.

## etcd.store.create.success

Rate of successful create requests.

## etcd.store.delete.fail

Rate of failed delete requests.

## etcd.store.delete.success

Rate of successful delete requests.

## etcd.store.expire.count

Rate of expired keys.

## etcd.store.gets.fail

Rate of failed get requests.

## etcd.store.gets.success

Rate of successful get requests.

## etcd.store.sets.fail

Rate of failed set requests.

## etcd.store.sets.success

Rate of successful set requests.

## etcd.store.update.fail

Rate of failed update requests.

## etcd.store.update.success

Rate of successful update requests.

## etcd.store.watchers

Rate of watchers.

# fluentd Metrics

## fluentd.buffer_queue_length

The length of the plugin buffer queue for this plugin.

## fluentd.buffer_total_queued_size

The size of the buffer queue for this plugin.

## fluentd.retry_count

The number of retries for this plugin.

# Go Metrics

## go_expvar.memstats.alloc

The number of bytes allocated and not yet freed.

## go_expvar.memstats.frees

The number of free bytes.

## go_expvar.memstats.heap_idle

The number of bytes in idle spans.

## go_expvar.memstats.heap_inuse

The number of bytes in non-idle spans.

## go_expvar.memstats.heap_objects

The total number of allocated objects.

## go_expvar.memstats.heap_released

The number of bytes released to the OS.

## go_expvar.memstats.heap_sys

The number of bytes obtained from the system.

## go_expvar.memstats.lookups

The number of pointer lookups.

## go_expvar.memstats.mallocs

The number of mallocs.

## go_expvar.memstats.num_gc

The number of garbage collections.

## go_expvar.memstats.pause_ns.avg

The average of recent GC pause durations.

## go_expvar.memstats.pause_ns.count

The number of submitted GC pause durations.

## go_expvar.memstats.pause_ns.max

The max GC pause duration.

## go_expvar.memstats.pause_ns.median

The median GC pause duration.

## go_expvar.memstats.pause_total_ns

The total GC pause duration over the lifetime of process.

## go_expvar.memstats.total_alloc

The bytes allocated (even if freed).

# Go Metrics

## go_expvar.memstats.alloc

The number of bytes allocated and not yet freed.

## go_expvar.memstats.frees

The number of free bytes.

## go_expvar.memstats.heap_idle

The number of bytes in idle spans.

## go_expvar.memstats.heap_inuse

The number of bytes in non-idle spans.

## go_expvar.memstats.heap_objects

The total number of allocated objects.

## go_expvar.memstats.heap_released

The number of bytes released to the OS.

## go_expvar.memstats.heap_sys

The number of bytes obtained from the system.

## go_expvar.memstats.lookups

The number of pointer lookups.

## go_expvar.memstats.mallocs

The number of mallocs.

## go_expvar.memstats.num_gc

The number of garbage collections.

## go_expvar.memstats.pause_ns.avg

The average of recent GC pause durations.

## go_expvar.memstats.pause_ns.count

The number of submitted GC pause durations.

## go_expvar.memstats.pause_ns.max

The max GC pause duration.

## go_expvar.memstats.pause_ns.median

The median GC pause duration.

## go_expvar.memstats.pause_total_ns

The total GC pause duration over the lifetime of process.

## go_expvar.memstats.total_alloc

The bytes allocated (even if freed).

# HTTP Metrics

## http.ssl.days_left

The number of days until the SSL certificate expires.

## network.http.response_time

The response time of a HTTP request to a specified URL.

# HTTP Metrics

See HTTP integration.

## http.ssl.days_left

The number of days until the SSL certificate expires.

## network.http.response_time

The response time of a HTTP request to a specified URL.

# HAProxy Metrics

## haproxy.backend_hosts

The number of backend hosts.

## haproxy.backend.bytes.in_rate

The rate of bytes in on backend hosts.

## haproxy.backend.bytes.out_rate

The rate of bytes out on backend hosts.

## haproxy.backend.connect.time

The average connect time over the last 1024 requests.

## haproxy.backend.denied.req_rate

The number of requests denied due to security concerns.

## haproxy.backend.denied.resp_rate

The number of responses denied due to security concerns.

## haproxy.backend.errors.con_rate

The rate of requests that encountered an error trying to connect to a backend server.

## haproxy.backend.errors.resp_rate

The rate of responses aborted due to error.

## haproxy.backend.queue.current

The number of requests without an assigned backend.

## haproxy.backend.queue.time

The average queue time over the last 1024 requests.

## haproxy.backend.response.1xx

The backend HTTP responses with 1xx code.

## haproxy.backend.response.2xx

The backend HTTP responses with 2xx code.

## haproxy.backend.response.3xx

The backend HTTP responses with 3xx code.

## haproxy.backend.response.4xx

The backend HTTP responses with 4xx code.

## haproxy.backend.response.5xx

The backend HTTP responses with 5xx code.

## haproxy.backend.response.other

The backend HTTP responses with another code (protocol error).

## haproxy.backend.response.time

The average response time over the last 1024 requests (0 for TCP).

## haproxy.backend.session.current

The number of active backend sessions.

## haproxy.backend.session.limit

The configured backend session limit.

## haproxy.backend.session.pct

The percentage of sessions in use. The formula used for this metric is `backend.session.current `/` backend.session.limit` * 100.

## haproxy.backend.session.rate

The number of backend sessions created per second.

## haproxy.backend.session.time

The average total session time over the last 1024 requests.

## haproxy.backend.uptime

The number of seconds since the last UP<->DOWN transition.

## haproxy.backend.warnings.redis_rate

The number of times a request was redispatched to another server.

## haproxy.backend.warnings.retr_rate

The number of times a connection to a server was retried.

## haproxy.count_per_status

The number of hosts by status (UP/DOWN/NOLB/MAINT).

## haproxy.frontend.bytes.in_rate

The rate of bytes in on frontend hosts.

## haproxy.frontend.bytes.out_rate

The rate of bytes out on frontend hosts.

## haproxy.frontend.denied.req_rate

The number of requests denied due to security concerns.

## haproxy.frontend.denied.resp_rate

The number of responses denied due to security concerns.

## haproxy.frontend.errors.req_rate

The rate of request errors.

## haproxy.frontend.requests.rate

The number of HTTP requests per second.

## haproxy.frontend.response.1xx

The frontend HTTP responses with 1xx code.

## haproxy.frontend.response.2xx

The frontend HTTP responses with 2xx code.

## haproxy.frontend.response.3xx

The frontend HTTP responses with 3xx code.

## haproxy.frontend.response.4xx

The frontend HTTP responses with 4xx code.

## haproxy.frontend.response.5xx

The frontend HTTP responses with 5xx code.

## haproxy.frontend.response.other

The frontend HTTP responses with another code (protocol error).

## haproxy.frontend.session.current

The number of active frontend sessions.

## haproxy.frontend.session.limit

The configured backend session limit.

## haproxy.frontend.session.pct

The percentage of sessions in use. The formula used for this metric is `frontend.session.current `/ `frontend.session.limit` * 100.

## haproxy.frontend.session.rate

The number of frontend sessions created per second.

## Agent 9.6.0 Additional HAProxy Metrics

• haproxy.backend.requests.tot_rate

Rate of total number of HTTP requests

• haproxy.frontend.connections.rate

Number of connections per second

• haproxy.frontend.connections.tot_rate

Rate of total number of connections

• haproxy.frontend.requests.intercepted

Number of intercepted requests per second

• haproxy.frontend.requests.tot_rate

Rate of total number of HTTP requests

# HAProxy Metrics

## haproxy.backend_hosts

The number of backend hosts.

## haproxy.backend.bytes.in_rate

The rate of bytes in on backend hosts.

## haproxy.backend.bytes.out_rate

The rate of bytes out on backend hosts.

## haproxy.backend.connect.time

The average connect time over the last 1024 requests.

## haproxy.backend.denied.req_rate

The number of requests denied due to security concerns.

## haproxy.backend.denied.resp_rate

The number of responses denied due to security concerns.

## haproxy.backend.errors.con_rate

The rate of requests that encountered an error trying to connect to a backend server.

## haproxy.backend.errors.resp_rate

The rate of responses aborted due to error.

## haproxy.backend.queue.current

The number of requests without an assigned backend.

## haproxy.backend.queue.time

The average queue time over the last 1024 requests.

## haproxy.backend.response.1xx

The backend HTTP responses with 1xx code.

## haproxy.backend.response.2xx

The backend HTTP responses with 2xx code.

## haproxy.backend.response.3xx

The backend HTTP responses with 3xx code.

## haproxy.backend.response.4xx

The backend HTTP responses with 4xx code.

## haproxy.backend.response.5xx

The backend HTTP responses with 5xx code.

## haproxy.backend.response.other

The backend HTTP responses with another code (protocol error).

## haproxy.backend.response.time

The average response time over the last 1024 requests (0 for TCP).

## haproxy.backend.session.current

The number of active backend sessions.

## haproxy.backend.session.limit

The configured backend session limit.

## haproxy.backend.session.pct

The percentage of sessions in use. The formula used for this metric is `backend.session.current `/` backend.session.limit` * 100.

## haproxy.backend.session.rate

The number of backend sessions created per second.

## haproxy.backend.session.time

The average total session time over the last 1024 requests.

## haproxy.backend.uptime

The number of seconds since the last UP<->DOWN transition.

## haproxy.backend.warnings.redis_rate

The number of times a request was redispatched to another server.

## haproxy.backend.warnings.retr_rate

The number of times a connection to a server was retried.

## haproxy.count_per_status

The number of hosts by status (UP/DOWN/NOLB/MAINT).

## haproxy.frontend.bytes.in_rate

The rate of bytes in on frontend hosts.

## haproxy.frontend.bytes.out_rate

The rate of bytes out on frontend hosts.

## haproxy.frontend.denied.req_rate

The number of requests denied due to security concerns.

## haproxy.frontend.denied.resp_rate

The number of responses denied due to security concerns.

## haproxy.frontend.errors.req_rate

The rate of request errors.

## haproxy.frontend.requests.rate

The number of HTTP requests per second.

## haproxy.frontend.response.1xx

The frontend HTTP responses with 1xx code.

## haproxy.frontend.response.2xx

The frontend HTTP responses with 2xx code.

## haproxy.frontend.response.3xx

The frontend HTTP responses with 3xx code.

## haproxy.frontend.response.4xx

The frontend HTTP responses with 4xx code.

## haproxy.frontend.response.5xx

The frontend HTTP responses with 5xx code.

## haproxy.frontend.response.other

The frontend HTTP responses with another code (protocol error).

## haproxy.frontend.session.current

The number of active frontend sessions.

## haproxy.frontend.session.limit

The configured backend session limit.

## haproxy.frontend.session.pct

The percentage of sessions in use. The formula used for this metric is `frontend.session.current `/ `frontend.session.limit` * 100.

## haproxy.frontend.session.rate

The number of frontend sessions created per second.

## Agent 9.6.0 Additional HAProxy Metrics

• haproxy.backend.requests.tot_rate

Rate of total number of HTTP requests

• haproxy.frontend.connections.rate

Number of connections per second

• haproxy.frontend.connections.tot_rate

Rate of total number of connections

• haproxy.frontend.requests.intercepted

Number of intercepted requests per second

• haproxy.frontend.requests.tot_rate

Rate of total number of HTTP requests

# Jenkins Metrics

## jenkins.job.duration

The duration of a job, measured in seconds.

## jenkins.job.success

The status of a successful job.

## jenkins.job.failure

The status of a failed job.

# Jenkins Metrics

## jenkins.job.duration

The duration of a job, measured in seconds.

## jenkins.job.success

The status of a successful job.

## jenkins.job.failure

The status of a failed job.

# Lighttpd Metrics

## lighttpd.net.bytes

The total number of bytes sent and received.

## lighttpd.net.bytes_per_s

The number of bytes sent and received per second.

## lighttpd.net.hits

The total number of hits since the start.

## lighttpd.net.request_per_s

The number of requests per second.

## lighttpd.performance.busy_servers

The number of active connections.

## lighttpd.performance.idle_server

The number of idle connections.

## lighttpd.performance.uptime

The amount of time the server has been up and running.

# Lighttpd Metrics

## lighttpd.net.bytes

The total number of bytes sent and received.

## lighttpd.net.bytes_per_s

The number of bytes sent and received per second.

## lighttpd.net.hits

The total number of hits since the start.

## lighttpd.net.request_per_s

The number of requests per second.

## lighttpd.performance.busy_servers

The number of active connections.

## lighttpd.performance.idle_server

The number of idle connections.

## lighttpd.performance.uptime

The amount of time the server has been up and running.

# Memcached Metrics

## memcache.avg_item_size

The average size of an item.

## memcache.bytes

The current number of bytes used by this server to store items.

The rate of bytes read from the network by this server.

## memcache.bytes_written_rate

The rate of bytes written to the network by this server.

The rate at which keys are compared and swapped where the comparison (original) value did not match the supplied value.

## memcache.cas_hits_rate

The rate at which keys are compared and swapped and found present.

## memcache.cas_misses_rate

The rate at which keys are compared and swapped and not found present.

## memcache.cmd_flush_rate

The rate of `flush_all` commands.

## memcache.cmd_get_rate

The rate of `get` commands.

## memcache.cmd_set_rate

The rate of `set` commands.

## memcache.connection_structures

The number of connection structures allocated by the server.

## memcache.curr_connections

The number of open connections to this server.

## memcache.curr_items

The current number of items stored by the server.

## memcache.delete_hits_rate

The rate at which delete commands result in items being removed.

## memcache.delete_misses_rate

The rate at which delete commands result in no items being removed.

## memcache.evictions_rate

The rate at which valid items are removed from cache to free memory for new items.

## memcache.fill_percent

The amount of memory being used by the server for storing items as a percentage of the max allowed.

## memcache.get_hit_percent

The percentage of requested keys that are found present since the start of the Memcached server.

## memcache.get_hits_rate

The rate at which keys are requested and found present.

## memcache.get_misses_rate

The rate at which keys are requested and not found.

## memcache.items.age

The age of the oldest item in the LRU.

## memcache.items.crawler_reclaimed_rate

The rate at which items freed by the LRU Crawler.

## memcache.items.direct_reclaims_rate

The rate at which worker threads had to directly pull LRU tails to find memory for a new item.

## memcache.items.evicted_nonzero_rate

The rate at which nonzero items which had an explicit expire time set had to be evicted from the LRU before expiring.

## memcache.items.evicted_rate

The rate st which items had to be evicted from the LRU before expiring.

## memcache.items.evicted_time

The number of seconds since the last access for the most recent item evicted from this class.

## memcache.items.evicted_unfetched_rate

The rate at which valid items evicted from the LRU which were never touched after being set.

## memcache.items.expired_unfetched_rate

The rate at which expired items reclaimed from the LRU which were never touched after being set.

## memcache.items.lrutail_reflocked_rate

The rate at which items found to be refcount locked in the LRU tail.

## memcache.items.moves_to_cold_rate

The rate at which items were moved from HOT or WARM into COLD.

## memcache.items.moves_to_warm_rate

The rate at which items were moved from COLD to WARM.

## memcache.items.moves_within_lru_rate

The rate at which active items were bumped within HOT or WARM.

## memcache.items.number

The number of items presently stored in this slab class.

## memcache.items.number_cold

The number of items presently stored in the COLD LRU.

## memcache.items.number_hot

The number of items presently stored in the HOT LRU.

## memcache.items.number_noexp

The number of items presently stored in the NOEXP class.

## memcache.items.number_warm

The number of items presently stored in the WARM LRU.

## memcache.items.outofmemory_rate

The rate at which the underlying slab class was unable to store a new item.

## memcache.items.reclaimed_rate

The rate at which entries were stored using memory from an expired entry.

## memcache.items.tailrepairs_rate

The rate at which Memcached self-healed a slab with a refcount leak.

## memcache.limit_maxbytes

The number of bytes this server is allowed to use for storage.

## memcache.listen_disabled_num_rate

The rate at which the server has reached the max connection limit.

## memcache.pointer_size

The default size of pointers on the host OS (generally 32 or 64).

## memcache.rusage_system_rate

The fraction of user time the CPU spent executing this server process.

## memcache.rusage_user_rate

The fraction of time the CPU spent executing kernel code on behalf of this server process.

## memcache.slabs.active_slabs

The total number of slab classes allocated.

The rate at which CAS commands failed to modify a value due to a bad CAS ID.

## memcache.slabs.cas_hits_rate

The rate at which CAS commands modified this slab class.

## memcache.slabs.chunk_size

The amount of space each chunk uses.

## memcache.slabs.chunks_per_page

The number of chunks that exist within one page.

## memcache.slabs.cmd_set_rate

The rate at which set requests stored data in this slab class.

## memcache.slabs.decr_hits_rate

The rate at which decrs commands modified this slab class.

## memcache.slabs.delete_hits_rate

The rate at which delete commands succeeded in this slab class.

## memcache.slabs.free_chunks

The number of chunks not yet allocated to items or freed via delete.

## memcache.slabs.free_chunks_end

The number of free chunks at the end of the last allocated page.

## memcache.slabs.get_hits_rate

The rate at which get requests were serviced by this slab class.

## memcache.slabs.incr_hits_rate

The rate at which incrs commands modified this slab class.

## memcache.slabs.mem_requested

The number of bytes requested to be stored in this slab.

## memcache.slabs.total_chunks

The total number of chunks allocated to the slab class.

## memcache.slabs.total_malloced

The total amount of memory allocated to slab pages.

## memcache.slabs.total_pages

The total number of pages allocated to the slab class.

## memcache.slabs.touch_hits_rate

The rate of touches serviced by this slab class.

## memcache.slabs.used_chunks

The number of chunks that have been allocated to items.

## memcache.slabs.used_chunks_rate

The rate at which chunks have been allocated to items.

The number of threads used by the current Memcached server process.

## memcache.total_connections_rate

The rate at which connections to this server are opened.

## memcache.total_items

The total number of items stored by this server since it started.

## memcache.uptime

The number of seconds this server has been running.

# Memcached Metrics

## memcache.avg_item_size

The average size of an item.

## memcache.bytes

The current number of bytes used by this server to store items.

The rate of bytes read from the network by this server.

## memcache.bytes_written_rate

The rate of bytes written to the network by this server.

The rate at which keys are compared and swapped where the comparison (original) value did not match the supplied value.

## memcache.cas_hits_rate

The rate at which keys are compared and swapped and found present.

## memcache.cas_misses_rate

The rate at which keys are compared and swapped and not found present.

## memcache.cmd_flush_rate

The rate of `flush_all` commands.

## memcache.cmd_get_rate

The rate of `get` commands.

## memcache.cmd_set_rate

The rate of `set` commands.

## memcache.connection_structures

The number of connection structures allocated by the server.

## memcache.curr_connections

The number of open connections to this server.

## memcache.curr_items

The current number of items stored by the server.

## memcache.delete_hits_rate

The rate at which delete commands result in items being removed.

## memcache.delete_misses_rate

The rate at which delete commands result in no items being removed.

## memcache.evictions_rate

The rate at which valid items are removed from cache to free memory for new items.

## memcache.fill_percent

The amount of memory being used by the server for storing items as a percentage of the max allowed.

## memcache.get_hit_percent

The percentage of requested keys that are found present since the start of the Memcached server.

## memcache.get_hits_rate

The rate at which keys are requested and found present.

## memcache.get_misses_rate

The rate at which keys are requested and not found.

## memcache.items.age

The age of the oldest item in the LRU.

## memcache.items.crawler_reclaimed_rate

The rate at which items freed by the LRU Crawler.

## memcache.items.direct_reclaims_rate

The rate at which worker threads had to directly pull LRU tails to find memory for a new item.

## memcache.items.evicted_nonzero_rate

The rate at which nonzero items which had an explicit expire time set had to be evicted from the LRU before expiring.

## memcache.items.evicted_rate

The rate st which items had to be evicted from the LRU before expiring.

## memcache.items.evicted_time

The number of seconds since the last access for the most recent item evicted from this class.

## memcache.items.evicted_unfetched_rate

The rate at which valid items evicted from the LRU which were never touched after being set.

## memcache.items.expired_unfetched_rate

The rate at which expired items reclaimed from the LRU which were never touched after being set.

## memcache.items.lrutail_reflocked_rate

The rate at which items found to be refcount locked in the LRU tail.

## memcache.items.moves_to_cold_rate

The rate at which items were moved from HOT or WARM into COLD.

## memcache.items.moves_to_warm_rate

The rate at which items were moved from COLD to WARM.

## memcache.items.moves_within_lru_rate

The rate at which active items were bumped within HOT or WARM.

## memcache.items.number

The number of items presently stored in this slab class.

## memcache.items.number_cold

The number of items presently stored in the COLD LRU.

## memcache.items.number_hot

The number of items presently stored in the HOT LRU.

## memcache.items.number_noexp

The number of items presently stored in the NOEXP class.

## memcache.items.number_warm

The number of items presently stored in the WARM LRU.

## memcache.items.outofmemory_rate

The rate at which the underlying slab class was unable to store a new item.

## memcache.items.reclaimed_rate

The rate at which entries were stored using memory from an expired entry.

## memcache.items.tailrepairs_rate

The rate at which Memcached self-healed a slab with a refcount leak.

## memcache.limit_maxbytes

The number of bytes this server is allowed to use for storage.

## memcache.listen_disabled_num_rate

The rate at which the server has reached the max connection limit.

## memcache.pointer_size

The default size of pointers on the host OS (generally 32 or 64).

## memcache.rusage_system_rate

The fraction of user time the CPU spent executing this server process.

## memcache.rusage_user_rate

The fraction of time the CPU spent executing kernel code on behalf of this server process.

## memcache.slabs.active_slabs

The total number of slab classes allocated.

The rate at which CAS commands failed to modify a value due to a bad CAS ID.

## memcache.slabs.cas_hits_rate

The rate at which CAS commands modified this slab class.

## memcache.slabs.chunk_size

The amount of space each chunk uses.

## memcache.slabs.chunks_per_page

The number of chunks that exist within one page.

## memcache.slabs.cmd_set_rate

The rate at which set requests stored data in this slab class.

## memcache.slabs.decr_hits_rate

The rate at which decrs commands modified this slab class.

## memcache.slabs.delete_hits_rate

The rate at which delete commands succeeded in this slab class.

## memcache.slabs.free_chunks

The number of chunks not yet allocated to items or freed via delete.

## memcache.slabs.free_chunks_end

The number of free chunks at the end of the last allocated page.

## memcache.slabs.get_hits_rate

The rate at which get requests were serviced by this slab class.

## memcache.slabs.incr_hits_rate

The rate at which incrs commands modified this slab class.

## memcache.slabs.mem_requested

The number of bytes requested to be stored in this slab.

## memcache.slabs.total_chunks

The total number of chunks allocated to the slab class.

## memcache.slabs.total_malloced

The total amount of memory allocated to slab pages.

## memcache.slabs.total_pages

The total number of pages allocated to the slab class.

## memcache.slabs.touch_hits_rate

The rate of touches serviced by this slab class.

## memcache.slabs.used_chunks

The number of chunks that have been allocated to items.

## memcache.slabs.used_chunks_rate

The rate at which chunks have been allocated to items.

The number of threads used by the current Memcached server process.

## memcache.total_connections_rate

The rate at which connections to this server are opened.

## memcache.total_items

The total number of items stored by this server since it started.

## memcache.uptime

The number of seconds this server has been running.

# Mesos Agent Metrics

## mesos.slave.cpus_percent

The percentage of CPUs allocated to the slave.

## mesos.slave.cpus_total

The total number of CPUs.

## mesos.slave.cpus_used

The number of CPUs allocated to the slave.

## mesos.slave.disk_percent

The percentage of disk space allocated to the slave.

## mesos.slave.disk_total

The total disk space available.

## mesos.slave.disk_used

The amount of disk space allocated to the slave.

## mesos.slave.executors_registering

The number of executors registering.

## mesos.slave.executors_running

The number of executors currently running.

## mesos.slave.executors_terminated

The number of terminated executors.

## mesos.slave.executors_terminating

The number of terminating executors.

## mesos.slave.frameworks_active

The number of active frameworks.

## mesos.slave.invalid_framework_messages

The number of invalid framework messages.

The number of invalid status updates.

## mesos.slave.mem_percent

The percentage of memory allocated to the slave.

## mesos.slave.mem_total

The total memory available.

## mesos.slave.mem_used

The amount of memory allocated to the slave.

## mesos.slave.recovery_errors

The number of errors encountered during slave recovery.

The number of failed tasks.

The number of finished tasks.

The number of killed tasks.

The number of lost tasks.

The number of running tasks.

The number of staging tasks.

The number of starting tasks.

## mesos.slave.valid_framework_messages

The number of valid framework messages.

The number of valid status updates.

The disk space available for the task.

The amount of memory used by the task.

## mesos.stats.registered

Defines whether this slave is registered with a master.

## mesos.stats.system.cpus_total

The total number of CPUs available.

The average load for the last minute.

The average load for the last five minutes.

The average load for the last 15 minutes.

## mesos.stats.system.mem_free_bytes

The amount of free memory.

## mesos.stats.system.mem_total_bytes

The total amount of memory.

## mesos.stats.uptime_secs

The current uptime for the slave.

# Mesos Agent Metrics

## mesos.slave.cpus_percent

The percentage of CPUs allocated to the slave.

## mesos.slave.cpus_total

The total number of CPUs.

## mesos.slave.cpus_used

The number of CPUs allocated to the slave.

## mesos.slave.disk_percent

The percentage of disk space allocated to the slave.

## mesos.slave.disk_total

The total disk space available.

## mesos.slave.disk_used

The amount of disk space allocated to the slave.

## mesos.slave.executors_registering

The number of executors registering.

## mesos.slave.executors_running

The number of executors currently running.

## mesos.slave.executors_terminated

The number of terminated executors.

## mesos.slave.executors_terminating

The number of terminating executors.

## mesos.slave.frameworks_active

The number of active frameworks.

## mesos.slave.invalid_framework_messages

The number of invalid framework messages.

The number of invalid status updates.

## mesos.slave.mem_percent

The percentage of memory allocated to the slave.

## mesos.slave.mem_total

The total memory available.

## mesos.slave.mem_used

The amount of memory allocated to the slave.

## mesos.slave.recovery_errors

The number of errors encountered during slave recovery.

The number of failed tasks.

The number of finished tasks.

The number of killed tasks.

The number of lost tasks.

The number of running tasks.

The number of staging tasks.

The number of starting tasks.

## mesos.slave.valid_framework_messages

The number of valid framework messages.

The number of valid status updates.

The disk space available for the task.

The amount of memory used by the task.

## mesos.stats.registered

Defines whether this slave is registered with a master.

## mesos.stats.system.cpus_total

The total number of CPUs available.

The average load for the last minute.

The average load for the last five minutes.

The average load for the last 15 minutes.

## mesos.stats.system.mem_free_bytes

The amount of free memory.

## mesos.stats.system.mem_total_bytes

The total amount of memory.

## mesos.stats.uptime_secs

The current uptime for the slave.

# Mesos Master Metrics

## mesos.cluster.cpus_percent

The percentage of CPUs allocated to the cluster.

## mesos.cluster.cpus_total

The total number of CPUs.

## mesos.cluster.cpus_used

The number of CPUs used by the cluster.

## mesos.cluster.disk_percent

The percentage of disk space allocated to the cluster.

## mesos.cluster.disk_total

The total amount of disk space.

## mesos.cluster.disk_used

The amount of disk space used by the cluster.

## mesos.cluster.dropped_messages

The number of dropped messages.

## mesos.cluster.event_queue_dispatches

The number of dispatches in the event queue.

## mesos.cluster.event_queue_http_requests

The number of HTTP requests in the event queue.

## mesos.cluster.event_queue_messages

The number of messages in the event queue.

## mesos.cluster.frameworks_active

The number of active frameworks.

## mesos.cluster.frameworks_connected

The number of connected frameworks.

## mesos.cluster.frameworks_disconnected

The number of disconnected frameworks.

## mesos.cluster.frameworks_inactive

The number of inactive frameworks.

## mesos.cluster.gpus_total

The total number of GPUs.

## mesos.cluster.invalid_framework_to_executor_messages

The number of invalid messages between the framework and the executor.

## mesos.cluster.invalid_status_update_acknowledgements

The number of invalid status update acknowledgements.

The number of invalid framework messages.

## mesos.cluster.mem_percent

The percentage of memory allocated to the cluster.

## mesos.cluster.mem_total

The total amount of memory available.

## mesos.cluster.mem_used

The amount of memory the cluster is using.

## mesos.cluster.outstanding_offers

The number of outstanding resource offers.

## mesos.cluster.slave_registrations

The number of slaves able to rejoin the cluster after a disconnect.

## mesos.cluster.slave_removals

The number of slaves that have been removed for any reason, including maintenance.

## mesos.cluster.slave_reregistrations

The number of slaves that have re-registered.

## mesos.cluster.slave_shutdowns_canceled

The number of slave shutdowns processes that have been cancelled.

## mesos.cluster.slave_shutdowns_scheduled

The number of slaves that have failed health checks and are scheduled for removal.

## mesos.cluster.slaves_active

The number of active slaves.

## mesos.cluster.slaves_connected

The number of connected slaves.

## mesos.cluster.slaves_disconnected

The number of disconnected slaves.

## mesos.cluster.slaves_inactive

The number of inactive slaves.

The number of cluster tasks that resulted in an error.

The number of failed cluster tasks.

The number of completed cluster tasks.

The number of killed cluster tasks.

The number of lost cluster tasks.

The number of cluster tasks currently running.

The number of cluster tasks currently staging.

The number of cluster tasks starting.

## mesos.cluster.valid_framework_to_executor_messages

The number of valid framework messages.

## mesos.cluster.valid_status_update_acknowledgements

The number of valid status update acknowledgements.

The number of valid status updates.

## mesos.framework.cpu

The CPU of the Mesos framework.

## mesos.framework.disk

The total disk space of the Mesos framework, measured in mebibytes.

## mesos.framework.mem

The total memory of the Mesos framework, measured in mebibytes.

## mesos.registrar.queued_operations

The number of queued operations.

## mesos.registrar.registry_size_bytes

The size of the Mesos registry in bytes.

## mesos.registrar.state_fetch_ms

The Mesos registry’s read latency, in bytes.

## mesos.registrar.state_store_ms

The Mesos registry’s write latency, in bytes.

## mesos.registrar.state_store_ms.count

The Mesos registry’s write count, in bytes.

## mesos.registrar.state_store_ms.max

The maximum write latency for the registry, in milliseconds.

## mesos.registrar.state_store_ms.min

The minimum write latency for the registry, in miliseconds.

## mesos.registrar.state_store_ms.p50

The median registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p90

The 90th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p95

The 95th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p99

The 99th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p999

The 99.9th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p9999

The 99.99th percentile registry write latency, in milliseconds.

## mesos.role.cpu

The CPU capacity of the configured role.

## mesos.role.disk

The total disk space available to the Mesos role, in mebibytes.

## mesos.role.mem

The total memory available to the Mesos role, in mebibytes.

## mesos.stats.elected

Defines whether this is the elected master or not.

## mesos.stats.system.cpus_total

The total number of CPUs in the system.

The average load for the last minute.

The average load for the last five minutes.

The average load for the last fifteen minutes.

## mesos.stats.system.mem_free_bytes

The total amount of free system memory, in bytes.

## mesos.stats.system.mem_total_bytes

The total cluster memory in bytes.

## mesos.stats.uptime_secs

The current uptime of the cluster.

# Mesos Master Metrics

## mesos.cluster.cpus_percent

The percentage of CPUs allocated to the cluster.

## mesos.cluster.cpus_total

The total number of CPUs.

## mesos.cluster.cpus_used

The number of CPUs used by the cluster.

## mesos.cluster.disk_percent

The percentage of disk space allocated to the cluster.

## mesos.cluster.disk_total

The total amount of disk space.

## mesos.cluster.disk_used

The amount of disk space used by the cluster.

## mesos.cluster.dropped_messages

The number of dropped messages.

## mesos.cluster.event_queue_dispatches

The number of dispatches in the event queue.

## mesos.cluster.event_queue_http_requests

The number of HTTP requests in the event queue.

## mesos.cluster.event_queue_messages

The number of messages in the event queue.

## mesos.cluster.frameworks_active

The number of active frameworks.

## mesos.cluster.frameworks_connected

The number of connected frameworks.

## mesos.cluster.frameworks_disconnected

The number of disconnected frameworks.

## mesos.cluster.frameworks_inactive

The number of inactive frameworks.

## mesos.cluster.gpus_total

The total number of GPUs.

## mesos.cluster.invalid_framework_to_executor_messages

The number of invalid messages between the framework and the executor.

## mesos.cluster.invalid_status_update_acknowledgements

The number of invalid status update acknowledgements.

The number of invalid framework messages.

## mesos.cluster.mem_percent

The percentage of memory allocated to the cluster.

## mesos.cluster.mem_total

The total amount of memory available.

## mesos.cluster.mem_used

The amount of memory the cluster is using.

## mesos.cluster.outstanding_offers

The number of outstanding resource offers.

## mesos.cluster.slave_registrations

The number of slaves able to rejoin the cluster after a disconnect.

## mesos.cluster.slave_removals

The number of slaves that have been removed for any reason, including maintenance.

## mesos.cluster.slave_reregistrations

The number of slaves that have re-registered.

## mesos.cluster.slave_shutdowns_canceled

The number of slave shutdowns processes that have been cancelled.

## mesos.cluster.slave_shutdowns_scheduled

The number of slaves that have failed health checks and are scheduled for removal.

## mesos.cluster.slaves_active

The number of active slaves.

## mesos.cluster.slaves_connected

The number of connected slaves.

## mesos.cluster.slaves_disconnected

The number of disconnected slaves.

## mesos.cluster.slaves_inactive

The number of inactive slaves.

The number of cluster tasks that resulted in an error.

The number of failed cluster tasks.

The number of completed cluster tasks.

The number of killed cluster tasks.

The number of lost cluster tasks.

The number of cluster tasks currently running.

The number of cluster tasks currently staging.

The number of cluster tasks starting.

## mesos.cluster.valid_framework_to_executor_messages

The number of valid framework messages.

## mesos.cluster.valid_status_update_acknowledgements

The number of valid status update acknowledgements.

The number of valid status updates.

## mesos.framework.cpu

The CPU of the Mesos framework.

## mesos.framework.disk

The total disk space of the Mesos framework, measured in mebibytes.

## mesos.framework.mem

The total memory of the Mesos framework, measured in mebibytes.

## mesos.registrar.queued_operations

The number of queued operations.

## mesos.registrar.registry_size_bytes

The size of the Mesos registry in bytes.

## mesos.registrar.state_fetch_ms

The Mesos registry’s read latency, in bytes.

## mesos.registrar.state_store_ms

The Mesos registry’s write latency, in bytes.

## mesos.registrar.state_store_ms.count

The Mesos registry’s write count, in bytes.

## mesos.registrar.state_store_ms.max

The maximum write latency for the registry, in milliseconds.

## mesos.registrar.state_store_ms.min

The minimum write latency for the registry, in miliseconds.

## mesos.registrar.state_store_ms.p50

The median registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p90

The 90th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p95

The 95th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p99

The 99th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p999

The 99.9th percentile registry write latency, in milliseconds.

## mesos.registrar.state_store_ms.p9999

The 99.99th percentile registry write latency, in milliseconds.

## mesos.role.cpu

The CPU capacity of the configured role.

## mesos.role.disk

The total disk space available to the Mesos role, in mebibytes.

## mesos.role.mem

The total memory available to the Mesos role, in mebibytes.

## mesos.stats.elected

Defines whether this is the elected master or not.

## mesos.stats.system.cpus_total

The total number of CPUs in the system.

The average load for the last minute.

The average load for the last five minutes.

The average load for the last fifteen minutes.

## mesos.stats.system.mem_free_bytes

The total amount of free system memory, in bytes.

## mesos.stats.system.mem_total_bytes

The total cluster memory in bytes.

## mesos.stats.uptime_secs

The current uptime of the cluster.

# Marathon Metrics

## marathon.apps

The total number of applications.

## marathon.backoffFactor

The multiplication factor for the delay between each consecutive failed task. This value is multiplied by the value of marathon.backoffSeconds each time the task fails until the maximum delay is reached, or the task succeeds.

## marathon.backoffSeconds

The period of time between attempts to run a failed task. This value is multiplied by marathon.backoffFactor for each consecutive task failure, until either the task succeeds or the maximum delay is reached.

## marathon.cpus

The number of CPUs configured for each application instance.

## marathon.disk

The amount of disk space configured for each application instance.

## marathon.instances

The number of instances of a specific application.

## marathon.mem

The total amount of configured memory for each instance of a specific application.

The number of tasks running for a specific application.

The number of tasks staged for a specific application.

# Marathon Metrics

## marathon.apps

The total number of applications.

## marathon.backoffFactor

The multiplication factor for the delay between each consecutive failed task. This value is multiplied by the value of marathon.backoffSeconds each time the task fails until the maximum delay is reached, or the task succeeds.

## marathon.backoffSeconds

The period of time between attempts to run a failed task. This value is multiplied by marathon.backoffFactor for each consecutive task failure, until either the task succeeds or the maximum delay is reached.

## marathon.cpus

The number of CPUs configured for each application instance.

## marathon.disk

The amount of disk space configured for each application instance.

## marathon.instances

The number of instances of a specific application.

## marathon.mem

The total amount of configured memory for each instance of a specific application.

The number of tasks running for a specific application.

The number of tasks staged for a specific application.

# MongoDB Metrics

## Metrics Introduced with Agent v9.7.0

The following metrics are supported by Sysdig Agent v9.7.0 and above.

Metric NameDescription
mongodb.tcmalloc.generic.current_allocated_bytesThe number of bytes used by the application.
mongodb.tcmalloc.generic.heap_sizeBytes of system memory reserved by TCMalloc.
mongodb.tcmalloc.tcmalloc.aggressive_memory_decommitStatus of aggressive memory de-commit mode.
mongodb.tcmalloc.tcmalloc.central_cache_free_bytesThe number of free bytes in the central cache.
mongodb.tcmalloc.tcmalloc.current_total_thread_cache_bytesThe number of bytes used across all thread caches.
mongodb.tcmalloc.tcmalloc.max_total_thread_cache_bytesThe upper limit on the total number of bytes stored across all per-thread caches.
mongodb.tcmalloc.tcmalloc.pageheap_free_bytesThe number of bytes in free mapped pages in page heap.
mongodb.tcmalloc.tcmalloc.pageheap_unmapped_bytesThe number of bytes in free unmapped pages in page heap.
mongodb.tcmalloc.tcmalloc.spinlock_total_delay_nsGives the spinlock delay time.
mongodb.tcmalloc.tcmalloc.thread_cache_free_bytesThe number of free bytes in thread caches.
mongodb.tcmalloc.tcmalloc.transfer_cache_free_bytesThe number of free bytes that are waiting to be transferred between the central cache and a thread cache.

## mongodb.asserts.msgps

Number of message assertions raised per second.

## mongodb.asserts.regularps

Number of regular assertions raised per second.

## mongodb.asserts.rolloversps

Number of times that the rollover counters roll over per second. The counters rollover to zero every 2^30 assertions.

## mongodb.asserts.userps

Number of user assertions raised per second.

## mongodb.asserts.warningps

Number of warnings raised per second.

## mongodb.backgroundflushing.average_ms

Average time for each flush to disk.

## mongodb.backgroundflushing.flushesps

Number of times the database has flushed all writes to disk.

## mongodb.backgroundflushing.last_ms

Amount of time that the last flush operation took to complete.

## mongodb.backgroundflushing.total_ms

Total number of time that the `mongod` processes have spent writing (i.e. flushing) data to disk.

## mongodb.connections.available

Number of unused available incoming connections the database can provide.

## mongodb.connections.current

Number of connections to the database server from clients.

## mongodb.connections.totalcreated

Total number of connections created.

## mongodb.cursors.timedout

Total number of cursors that have timed out since the server process started.

## mongodb.cursors.totalopen

Number of cursors that MongoDB is maintaining for clients

## mongodb.dbs

Total number of existing databases

## mongodb.dur.commits

Number of transactions written to the journal during the last journal group commit interval.

## mongodb.dur.commitsinwritelock

Count of the commits that occurred while a write lock was held.

## mongodb.dur.compression

Compression ratio of the data written to the journal.

## mongodb.dur.earlycommits

Number of times MongoDB requested a commit before the scheduled journal group commit interval.

## mongodb.dur.journaledmb

Amount of data written to journal during the last journal group commit interval.

## mongodb.dur.timems.commits

Amount of time spent for commits.

## mongodb.dur.timems.commitsinwritelock

Amount of time spent for commits that occurred while a write lock was held.

## mongodb.dur.timems.dt

Amount of time over which MongoDB collected the `dur.timeMS` data.

## mongodb.dur.timems.preplogbuffer

Amount of time spent preparing to write to the journal.

## mongodb.dur.timems.remapprivateview

Amount of time spent remapping copy-on-write memory mapped views.

## mongodb.dur.timems.writetodatafiles

Amount of time spent writing to data files after journaling.

## mongodb.dur.timems.writetojournal

Amount of time spent writing to the journal

## mongodb.dur.writetodatafilesmb

Amount of data written from journal to the data files during the last journal group commit interval.

## mongodb.extra_info.page_faultsps

Number of page faults per second that require disk operations.

## mongodb.fsynclocked

Number of fsynclocked performed on a mongo instance.

Count of the active client connections performing read operations.

## mongodb.globallock.activeclients.total

Total number of active client connections to the database.

## mongodb.globallock.activeclients.writers

Count of active client connections performing write operations.

Number of operations that are currently queued and waiting for the read lock.

## mongodb.globallock.currentqueue.total

Total number of operations queued waiting for the lock.

## mongodb.globallock.currentqueue.writers

Number of operations that are currently queued and waiting for the write lock.

## mongodb.globallock.locktime

Time since the database last started that the globalLock has been held.

## mongodb.globallock.ratio

Ratio of the time that the globalLock has been held to the total time since it was created.

## mongodb.globallock.totaltime

Time since the database last started and created the global lock.

## mongodb.indexcounters.accessesps

Number of times that operations have accessed indexes per second.

## mongodb.indexcounters.hitsps

Number of times per second that an index has been accessed and mongod is able to return the index from memory.

## mongodb.indexcounters.missesps

Number of times per second that an operation attempted to access an index that was not in memory.

## mongodb.indexcounters.missratio

Ratio of index hits to misses.

## mongodb.indexcounters.resetsps

Number of times per second the index counters have been reset.

## mongodb.locks.collection.acquirecount.exclusiveps

Number of times the collection lock type was acquired in the Exclusive (X) mode.

## mongodb.locks.collection.acquirecount.intent_exclusiveps

Number of times the collection lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.collection.acquirecount.intent_sharedps

Number of times the collection lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.collection.acquirecount.sharedps

Number of times the collection lock type was acquired in the Shared (S) mode.

## mongodb.locks.collection.acquirewaitcount.exclusiveps

Number of times the collection lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.collection.acquirewaitcount.sharedps

Number of times the collection lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.collection.timeacquiringmicros.exclusiveps

Wait time for the collection lock type acquisitions in the Exclusive (X) mode.

## mongodb.locks.collection.timeacquiringmicros.sharedps

Wait time for the collection lock type acquisitions in the Shared (S) mode.

## mongodb.locks.database.acquirecount.exclusiveps

Number of times the database lock type was acquired in the Exclusive (X) mode.

## mongodb.locks.database.acquirecount.intent_exclusiveps

Number of times the database lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.database.acquirecount.intent_sharedps

Number of times the database lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.database.acquirecount.sharedps

Number of times the database lock type was acquired in the Shared (S) mode.

## mongodb.locks.database.acquirewaitcount.exclusiveps

Number of times the database lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.acquirewaitcount.intent_exclusiveps

Number of times the database lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.acquirewaitcount.intent_sharedps

Number of times the database lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.acquirewaitcount.sharedps

Number of times the database lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.timeacquiringmicros.exclusiveps

Wait time for the database lock type acquisitions in the Exclusive (X) mode.

## mongodb.locks.database.timeacquiringmicros.intent_exclusiveps

Wait time for the database lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.database.timeacquiringmicros.intent_sharedps

Wait time for the database lock type acquisitions in the Intent Shared (IS) mode.

## mongodb.locks.database.timeacquiringmicros.sharedps

Wait time for the database lock type acquisitions in the Shared (S) mode.

## mongodb.locks.global.acquirecount.exclusiveps

Number of times the global lock type was acquired in the Exclusive (X) mode.

## mongodb.locks.global.acquirecount.intent_exclusiveps

Number of times the global lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.global.acquirecount.intent_sharedps

Number of times the global lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.global.acquirecount.sharedps

Number of times the global lock type was acquired in the Shared (S) mode.

## mongodb.locks.global.acquirewaitcount.exclusiveps

Number of times the global lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.acquirewaitcount.intent_exclusiveps

Number of times the global lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.acquirewaitcount.intent_sharedps

Number of times the global lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.acquirewaitcount.sharedps

Number of times the global lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.timeacquiringmicros.exclusiveps

Wait time for the global lock type acquisitions in the Exclusive (X) mode.

## mongodb.locks.global.timeacquiringmicros.intent_exclusiveps

Wait time for the global lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.global.timeacquiringmicros.intent_sharedps

Wait time for the global lock type acquisitions in the Intent Shared (IS) mode.

## mongodb.locks.global.timeacquiringmicros.sharedps

Wait time for the global lock type acquisitions in the Shared (S) mode.

Number of times the metadata lock type was acquired in the Exclusive (X) mode.

Number of times the metadata lock type was acquired in the Shared (S) mode.

## mongodb.locks.mmapv1journal.acquirecount.intent_exclusiveps

Number of times the MMAPv1 storage engine lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.mmapv1journal.acquirecount.intent_sharedps

Number of times the MMAPv1 storage engine lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.mmapv1journal.acquirewaitcount.intent_exclusiveps

Number of times the MMAPv1 storage engine lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.mmapv1journal.acquirewaitcount.intent_sharedps

Number of times the MMAPv1 storage engine lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.mmapv1journal.timeacquiringmicros.intent_exclusiveps

Wait time for the MMAPv1 storage engine lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.mmapv1journal.timeacquiringmicros.intent_sharedps

Wait time for the MMAPv1 storage engine lock type acquisitions in the Intent Shared (IS) mode.

## mongodb.locks.oplog.acquirecount.intent_exclusiveps

Number of times the oplog lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.oplog.acquirecount.sharedps

Number of times the oplog lock type was acquired in the Shared (S) mode.

## mongodb.locks.oplog.acquirewaitcount.intent_exclusiveps

Number of times the oplog lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.oplog.acquirewaitcount.sharedps

Number of times the oplog lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.oplog.timeacquiringmicros.intent_exclusiveps

Wait time for the oplog lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.oplog.timeacquiringmicros.sharedps

Wait time for the oplog lock type acquisitions in the Shared (S) mode.

## mongodb.mem.bits

Size of the in-memory storage engine.

## mongodb.mem.mapped

Amount of mapped memory by the database.

## mongodb.mem.mappedwithjournal

The amount of mapped memory, including the memory used for journaling.

## mongodb.mem.resident

Amount of memory currently used by the database process.

## mongodb.mem.virtual

Amount of virtual memory used by the mongod process.

## mongodb.metrics.commands.count.failed

Number of times count failed

## mongodb.metrics.commands.count.total

Number of times count executed

## mongodb.metrics.commands.createIndexes.failed

Number of times createIndexes failed

## mongodb.metrics.commands.createIndexes.total

Number of times createIndexes executed

## mongodb.metrics.commands.delete.failed

Number of times delete failed

## mongodb.metrics.commands.delete.total

Number of times delete executed

## mongodb.metrics.commands.eval.failed

Number of times eval failed

## mongodb.metrics.commands.eval.total

Number of times eval executed

## mongodb.metrics.commands.findAndModify.failed

Number of times findAndModify failed

## mongodb.metrics.commands.findAndModify.total

Number of times findAndModify executed

## mongodb.metrics.commands.insert.failed

Number of times insert failed

## mongodb.metrics.commands.insert.total

Number of times insert executed

## mongodb.metrics.commands.update.failed

Number of times update failed

## mongodb.metrics.commands.update.total

Number of times update executed

## mongodb.metrics.cursor.open.notimeout

Number of open cursors with the option `DBQuery.Option.noTimeout` set to prevent timeout after a period of inactivity.

## mongodb.metrics.cursor.open.pinned

Number of pinned open cursors.

## mongodb.metrics.cursor.open.total

Number of cursors that MongoDB is maintaining for clients.

## mongodb.metrics.cursor.timedoutps

Number of cursors that time out, per second.

## mongodb.metrics.document.deletedps

Number of documents deleted per second.

## mongodb.metrics.document.insertedps

Number of documents inserted per second.

## mongodb.metrics.document.returnedps

Number of documents returned by queries per second.

## mongodb.metrics.document.updatedps

Number of documents updated per second.

## mongodb.metrics.getlasterror.wtime.numps

Number of getLastError operations per second with a specified write concern (i.e. w) that wait for one or more members of a replica set to acknowledge the write operation.

## mongodb.metrics.getlasterror.wtime.totalmillisps

Fraction of time (ms/s) that the mongod has spent performing getLastError operations with write concern (i.e. w) that wait for one or more members of a replica set to acknowledge the write operation.

## mongodb.metrics.getlasterror.wtimeoutsps

Number of times per second that write concern operations have timed out as a result of the wtimeout threshold to getLastError

## mongodb.metrics.operation.fastmodps

Number of update operations per second that neither cause documents to grow nor require updates to the index.

## mongodb.metrics.operation.idhackps

Number of queries per second that contain the _id field.

## mongodb.metrics.operation.writeconflictsps

Number of times per second that write concern operations has encounter a conflict.

## mongodb.metrics.operation.scanandorderps

Number of queries per second that return sorted numbers that cannot perform the sort operation using an index.

## mongodb.metrics.queryexecutor.scannedps

Number of index items scanned per second during queries and query-plan evaluation.

## mongodb.metrics.record.movesps

Number of times per second documents move within the on-disk representation of the MongoDB data set.

## mongodb.metrics.repl.apply.batches.numps

Number of batches applied across all databases per second.

## mongodb.metrics.repl.apply.batches.totalmillisps

Fraction of time (ms/s) the mongod has spent applying operations from the oplog.

## mongodb.metrics.repl.apply.opsps

Number of oplog operations applied per second.

## mongodb.metrics.repl.buffer.count

Number of operations in the oplog buffer.

## mongodb.metrics.repl.buffer.maxsizebytes

Maximum size of the buffer.

## mongodb.metrics.repl.buffer.sizebytes

Current size of the contents of the oplog buffer.

## mongodb.metrics.repl.network.bytesps

Amount of data read from the replication sync source per second.

## mongodb.metrics.repl.network.getmores.numps

Number of getmore operations per second.

## mongodb.metrics.repl.network.getmores.totalmillisps

Fraction of time (ms/s) required to collect data from getmore operations.

## mongodb.metrics.repl.network.opsps

Number of operations read from the replication source per second.

Number of oplog query processes created per second.

Number of documents loaded during the pre-fetch stage of replication.

Amount of time spent loading documents as part of the pre-fetch stage of replication.

Number of index entries loaded by members before updating documents as part of the pre-fetch stage of replication.

Amount of time spent loading documents as part of the pre-fetch stage of replication.

## mongodb.metrics.ttl.deleteddocumentsps

Number of documents deleted from collections with a ttl index per second.

## mongodb.metrics.ttl.passesps

Number of times per second the background process removes documents from collections with a ttl index.

## mongodb.network.bytesinps

The number of bytes that reflects the amount of network traffic received by this database.

## mongodb.network.bytesoutps

The number of bytes that reflects the amount of network traffic sent from this database.

## mongodb.network.numrequestsps

Number of distinct requests that the server has received.

## mongodb.opcounters.commandps

Total number of commands per second issued to the database.

## mongodb.opcounters.deleteps

Number of delete operations per second.

## mongodb.opcounters.getmoreps

Number of getmore operations per second.

## mongodb.opcounters.insertps

Number of insert operations per second.

## mongodb.opcounters.queryps

Total number of queries per second.

## mongodb.opcounters.updateps

Number of update operations per second.

## mongodb.opcountersrepl.commandps

Total number of replicated commands issued to the database per second.

## mongodb.opcountersrepl.deleteps

Number of replicated delete operations per second.

## mongodb.opcountersrepl.getmoreps

Number of replicated getmore operations per second.

## mongodb.opcountersrepl.insertps

Number of replicated insert operations per second.

## mongodb.opcountersrepl.queryps

Total number of replicated queries per second.

## mongodb.opcountersrepl.updateps

Number of replicated update operations per second.

## mongodb.oplog.logsizemb

Total size of the oplog.

## mongodb.oplog.timediff

Oplog window: difference between the first and last operation in the oplog.

## mongodb.oplog.usedsizemb

Total amount of space used by the oplog.

## mongodb.replset.health

Member health value of the replica set: conveys if the member is up (i.e. 1) or down (i.e. 0).

## mongodb.replset.replicationlag

Delay between a write operation on the primary and its copy to a secondary.

## mongodb.replset.state

State of a replica that reflects its disposition within the set.

## mongodb.replset.votefraction

Fraction of votes a server will cast in a replica set election.

The number of votes a server will cast in a replica set election.

## mongodb.stats.datasize

Total size of the data held in this database including the padding factor.

## mongodb.stats.indexes

Total number of indexes across all collections in the database.

## mongodb.stats.indexsize

Total size of all indexes created on this database.

## mongodb.stats.objects

Number of objects (documents) in the database across all collections.

## mongodb.stats.storagesize

Total amount of space allocated to collections in this database for document storage.

## mongodb.uptime

Number of seconds that the mongos or mongod process has been active.

## mongodb.wiredtiger.cache.bytes_currently_in_cache

Size of the data currently in cache.

## mongodb.wiredtiger.cache.failed_eviction_of_pages_exceeding_the_in_memory_maximumps

Number of failed eviction of pages that exceeded the in-memory maximum, per second.

## mongodb.wiredtiger.cache.in_memory_page_splits

In-memory page splits.

## mongodb.wiredtiger.cache.maximum_bytes_configured

Maximum cache size.

## mongodb.wiredtiger.cache.maximum_page_size_at_eviction

Maximum page size at eviction.

## mongodb.wiredtiger.cache.modified_pages_evicted

Number of pages, that have been modified, evicted from the cache.

## mongodb.wiredtiger.cache.pages_currently_held_in_cache

Number of pages currently held in the cache.

Number of page evicted by application threads per second.

## mongodb.wiredtiger.cache.pages_evicted_exceeding_the_in_memory_maximumps

Number of pages evicted because they exceeded the cache in-memory maximum, per second.

## mongodb.wiredtiger.cache.tracked_dirty_bytes_in_cache

Size of the dirty data in the cache.

## mongodb.wiredtiger.cache.unmodified_pages_evicted

Number of pages, that were not modified, evicted from the cache.

Number of available read tickets (concurrent transactions) remaining.

Number of read tickets (concurrent transactions) in use.

Total number of read tickets (concurrent transactions) available.

## mongodb.wiredtiger.concurrenttransactions.write.available

Number of available write tickets (concurrent transactions) remaining.

## mongodb.wiredtiger.concurrenttransactions.write.out

Number of write tickets (concurrent transactions) in use.

## mongodb.wiredtiger.concurrenttransactions.write.totaltickets

Total number of write tickets (concurrent transactions) available.

## mongodb.collection.size

The total size in bytes of the data in the collection plus the size of every indexes on the mongodb.collection.

## mongodb.collection.avgObjSize

The size of the average object in the collection in bytes.

## mongodb.collection.count

Total number of objects in the collection.

## mongodb.collection.capped

Whether or not the collection is capped.

## mongodb.collection.max

Maximum number of documents in a capped collection.

## mongodb.collection.maxSize

Maximum size of a capped collection in bytes.

## mongodb.collection.storageSize

Total storage space allocated to this collection for document storage.

## mongodb.collection.nindexes

Total number of indices on the collection.

## mongodb.collection.indexSizes

Size of index in bytes.

## mongodb.collection.indexes.accesses.ops

Number of time the index was used.

## mongodb.usage.commands.countps

Number of commands per second

## mongodb.usage.commands.count

Number of commands since server start (deprecated)

## mongodb.usage.commands.time

Total time spent performing commands in microseconds

## mongodb.usage.getmore.countps

Number of getmore per second

## mongodb.usage.getmore.count

Number of getmore since server start (deprecated)

## mongodb.usage.getmore.time

Total time spent performing getmore in microseconds

## mongodb.usage.insert.countps

Number of inserts per second

## mongodb.usage.insert.count

Number of inserts since server start (deprecated)

## mongodb.usage.insert.time

Total time spent performing inserts in microseconds

## mongodb.usage.queries.countps

Number of queries per second

## mongodb.usage.queries.count

Number of queries since server start (deprecated)

## mongodb.usage.queries.time

Total time spent performing queries in microseconds

Number of read locks per second

Number of read locks since server start (deprecated)

Total time spent performing read locks in microseconds

## mongodb.usage.remove.countps

Number of removes per second

## mongodb.usage.remove.count

Number of removes since server start (deprecated)

## mongodb.usage.remove.time

Total time spent performing removes in microseconds

## mongodb.usage.total.countps

Number of operations per second

## mongodb.usage.total.count

Number of operations since server start (deprecated)

## mongodb.usage.total.time

Total time spent performing operations in microseconds

## mongodb.usage.update.countps

Number of updates per second

## mongodb.usage.update.count

Number of updates since server start (deprecated)

## mongodb.usage.update.time

Total time spent performing updates in microseconds

## mongodb.usage.writeLock.countps

Number of write locks per second

## mongodb.usage.writeLock.count

Number of write locks since server start (deprecated)

## mongodb.usage.writeLock.time

Total time spent performing write locks in microseconds

# MongoDB Metrics

## Metrics Introduced with Agent v9.7.0

The following metrics are supported by Sysdig Agent v9.7.0 and above.

Metric NameDescription
mongodb.tcmalloc.generic.current_allocated_bytesThe number of bytes used by the application.
mongodb.tcmalloc.generic.heap_sizeBytes of system memory reserved by TCMalloc.
mongodb.tcmalloc.tcmalloc.aggressive_memory_decommitStatus of aggressive memory de-commit mode.
mongodb.tcmalloc.tcmalloc.central_cache_free_bytesThe number of free bytes in the central cache.
mongodb.tcmalloc.tcmalloc.current_total_thread_cache_bytesThe number of bytes used across all thread caches.
mongodb.tcmalloc.tcmalloc.max_total_thread_cache_bytesThe upper limit on the total number of bytes stored across all per-thread caches.
mongodb.tcmalloc.tcmalloc.pageheap_free_bytesThe number of bytes in free mapped pages in page heap.
mongodb.tcmalloc.tcmalloc.pageheap_unmapped_bytesThe number of bytes in free unmapped pages in page heap.
mongodb.tcmalloc.tcmalloc.spinlock_total_delay_nsGives the spinlock delay time.
mongodb.tcmalloc.tcmalloc.thread_cache_free_bytesThe number of free bytes in thread caches.
mongodb.tcmalloc.tcmalloc.transfer_cache_free_bytesThe number of free bytes that are waiting to be transferred between the central cache and a thread cache.

## mongodb.asserts.msgps

Number of message assertions raised per second.

## mongodb.asserts.regularps

Number of regular assertions raised per second.

## mongodb.asserts.rolloversps

Number of times that the rollover counters roll over per second. The counters rollover to zero every 2^30 assertions.

## mongodb.asserts.userps

Number of user assertions raised per second.

## mongodb.asserts.warningps

Number of warnings raised per second.

## mongodb.backgroundflushing.average_ms

Average time for each flush to disk.

## mongodb.backgroundflushing.flushesps

Number of times the database has flushed all writes to disk.

## mongodb.backgroundflushing.last_ms

Amount of time that the last flush operation took to complete.

## mongodb.backgroundflushing.total_ms

Total number of time that the `mongod` processes have spent writing (i.e. flushing) data to disk.

## mongodb.connections.available

Number of unused available incoming connections the database can provide.

## mongodb.connections.current

Number of connections to the database server from clients.

## mongodb.connections.totalcreated

Total number of connections created.

## mongodb.cursors.timedout

Total number of cursors that have timed out since the server process started.

## mongodb.cursors.totalopen

Number of cursors that MongoDB is maintaining for clients

## mongodb.dbs

Total number of existing databases

## mongodb.dur.commits

Number of transactions written to the journal during the last journal group commit interval.

## mongodb.dur.commitsinwritelock

Count of the commits that occurred while a write lock was held.

## mongodb.dur.compression

Compression ratio of the data written to the journal.

## mongodb.dur.earlycommits

Number of times MongoDB requested a commit before the scheduled journal group commit interval.

## mongodb.dur.journaledmb

Amount of data written to journal during the last journal group commit interval.

## mongodb.dur.timems.commits

Amount of time spent for commits.

## mongodb.dur.timems.commitsinwritelock

Amount of time spent for commits that occurred while a write lock was held.

## mongodb.dur.timems.dt

Amount of time over which MongoDB collected the `dur.timeMS` data.

## mongodb.dur.timems.preplogbuffer

Amount of time spent preparing to write to the journal.

## mongodb.dur.timems.remapprivateview

Amount of time spent remapping copy-on-write memory mapped views.

## mongodb.dur.timems.writetodatafiles

Amount of time spent writing to data files after journaling.

## mongodb.dur.timems.writetojournal

Amount of time spent writing to the journal

## mongodb.dur.writetodatafilesmb

Amount of data written from journal to the data files during the last journal group commit interval.

## mongodb.extra_info.page_faultsps

Number of page faults per second that require disk operations.

## mongodb.fsynclocked

Number of fsynclocked performed on a mongo instance.

Count of the active client connections performing read operations.

## mongodb.globallock.activeclients.total

Total number of active client connections to the database.

## mongodb.globallock.activeclients.writers

Count of active client connections performing write operations.

Number of operations that are currently queued and waiting for the read lock.

## mongodb.globallock.currentqueue.total

Total number of operations queued waiting for the lock.

## mongodb.globallock.currentqueue.writers

Number of operations that are currently queued and waiting for the write lock.

## mongodb.globallock.locktime

Time since the database last started that the globalLock has been held.

## mongodb.globallock.ratio

Ratio of the time that the globalLock has been held to the total time since it was created.

## mongodb.globallock.totaltime

Time since the database last started and created the global lock.

## mongodb.indexcounters.accessesps

Number of times that operations have accessed indexes per second.

## mongodb.indexcounters.hitsps

Number of times per second that an index has been accessed and mongod is able to return the index from memory.

## mongodb.indexcounters.missesps

Number of times per second that an operation attempted to access an index that was not in memory.

## mongodb.indexcounters.missratio

Ratio of index hits to misses.

## mongodb.indexcounters.resetsps

Number of times per second the index counters have been reset.

## mongodb.locks.collection.acquirecount.exclusiveps

Number of times the collection lock type was acquired in the Exclusive (X) mode.

## mongodb.locks.collection.acquirecount.intent_exclusiveps

Number of times the collection lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.collection.acquirecount.intent_sharedps

Number of times the collection lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.collection.acquirecount.sharedps

Number of times the collection lock type was acquired in the Shared (S) mode.

## mongodb.locks.collection.acquirewaitcount.exclusiveps

Number of times the collection lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.collection.acquirewaitcount.sharedps

Number of times the collection lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.collection.timeacquiringmicros.exclusiveps

Wait time for the collection lock type acquisitions in the Exclusive (X) mode.

## mongodb.locks.collection.timeacquiringmicros.sharedps

Wait time for the collection lock type acquisitions in the Shared (S) mode.

## mongodb.locks.database.acquirecount.exclusiveps

Number of times the database lock type was acquired in the Exclusive (X) mode.

## mongodb.locks.database.acquirecount.intent_exclusiveps

Number of times the database lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.database.acquirecount.intent_sharedps

Number of times the database lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.database.acquirecount.sharedps

Number of times the database lock type was acquired in the Shared (S) mode.

## mongodb.locks.database.acquirewaitcount.exclusiveps

Number of times the database lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.acquirewaitcount.intent_exclusiveps

Number of times the database lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.acquirewaitcount.intent_sharedps

Number of times the database lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.acquirewaitcount.sharedps

Number of times the database lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.database.timeacquiringmicros.exclusiveps

Wait time for the database lock type acquisitions in the Exclusive (X) mode.

## mongodb.locks.database.timeacquiringmicros.intent_exclusiveps

Wait time for the database lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.database.timeacquiringmicros.intent_sharedps

Wait time for the database lock type acquisitions in the Intent Shared (IS) mode.

## mongodb.locks.database.timeacquiringmicros.sharedps

Wait time for the database lock type acquisitions in the Shared (S) mode.

## mongodb.locks.global.acquirecount.exclusiveps

Number of times the global lock type was acquired in the Exclusive (X) mode.

## mongodb.locks.global.acquirecount.intent_exclusiveps

Number of times the global lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.global.acquirecount.intent_sharedps

Number of times the global lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.global.acquirecount.sharedps

Number of times the global lock type was acquired in the Shared (S) mode.

## mongodb.locks.global.acquirewaitcount.exclusiveps

Number of times the global lock type acquisition in the Exclusive (X) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.acquirewaitcount.intent_exclusiveps

Number of times the global lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.acquirewaitcount.intent_sharedps

Number of times the global lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.acquirewaitcount.sharedps

Number of times the global lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.global.timeacquiringmicros.exclusiveps

Wait time for the global lock type acquisitions in the Exclusive (X) mode.

## mongodb.locks.global.timeacquiringmicros.intent_exclusiveps

Wait time for the global lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.global.timeacquiringmicros.intent_sharedps

Wait time for the global lock type acquisitions in the Intent Shared (IS) mode.

## mongodb.locks.global.timeacquiringmicros.sharedps

Wait time for the global lock type acquisitions in the Shared (S) mode.

Number of times the metadata lock type was acquired in the Exclusive (X) mode.

Number of times the metadata lock type was acquired in the Shared (S) mode.

## mongodb.locks.mmapv1journal.acquirecount.intent_exclusiveps

Number of times the MMAPv1 storage engine lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.mmapv1journal.acquirecount.intent_sharedps

Number of times the MMAPv1 storage engine lock type was acquired in the Intent Shared (IS) mode.

## mongodb.locks.mmapv1journal.acquirewaitcount.intent_exclusiveps

Number of times the MMAPv1 storage engine lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.mmapv1journal.acquirewaitcount.intent_sharedps

Number of times the MMAPv1 storage engine lock type acquisition in the Intent Shared (IS) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.mmapv1journal.timeacquiringmicros.intent_exclusiveps

Wait time for the MMAPv1 storage engine lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.mmapv1journal.timeacquiringmicros.intent_sharedps

Wait time for the MMAPv1 storage engine lock type acquisitions in the Intent Shared (IS) mode.

## mongodb.locks.oplog.acquirecount.intent_exclusiveps

Number of times the oplog lock type was acquired in the Intent Exclusive (IX) mode.

## mongodb.locks.oplog.acquirecount.sharedps

Number of times the oplog lock type was acquired in the Shared (S) mode.

## mongodb.locks.oplog.acquirewaitcount.intent_exclusiveps

Number of times the oplog lock type acquisition in the Intent Exclusive (IX) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.oplog.acquirewaitcount.sharedps

Number of times the oplog lock type acquisition in the Shared (S) mode encountered waits because the locks were held in a conflicting mode.

## mongodb.locks.oplog.timeacquiringmicros.intent_exclusiveps

Wait time for the oplog lock type acquisitions in the Intent Exclusive (IX) mode.

## mongodb.locks.oplog.timeacquiringmicros.sharedps

Wait time for the oplog lock type acquisitions in the Shared (S) mode.

## mongodb.mem.bits

Size of the in-memory storage engine.

## mongodb.mem.mapped

Amount of mapped memory by the database.

## mongodb.mem.mappedwithjournal

The amount of mapped memory, including the memory used for journaling.

## mongodb.mem.resident

Amount of memory currently used by the database process.

## mongodb.mem.virtual

Amount of virtual memory used by the mongod process.

## mongodb.metrics.commands.count.failed

Number of times count failed

## mongodb.metrics.commands.count.total

Number of times count executed

## mongodb.metrics.commands.createIndexes.failed

Number of times createIndexes failed

## mongodb.metrics.commands.createIndexes.total

Number of times createIndexes executed

## mongodb.metrics.commands.delete.failed

Number of times delete failed

## mongodb.metrics.commands.delete.total

Number of times delete executed

## mongodb.metrics.commands.eval.failed

Number of times eval failed

## mongodb.metrics.commands.eval.total

Number of times eval executed

## mongodb.metrics.commands.findAndModify.failed

Number of times findAndModify failed

## mongodb.metrics.commands.findAndModify.total

Number of times findAndModify executed

## mongodb.metrics.commands.insert.failed

Number of times insert failed

## mongodb.metrics.commands.insert.total

Number of times insert executed

## mongodb.metrics.commands.update.failed

Number of times update failed

## mongodb.metrics.commands.update.total

Number of times update executed

## mongodb.metrics.cursor.open.notimeout

Number of open cursors with the option `DBQuery.Option.noTimeout` set to prevent timeout after a period of inactivity.

## mongodb.metrics.cursor.open.pinned

Number of pinned open cursors.

## mongodb.metrics.cursor.open.total

Number of cursors that MongoDB is maintaining for clients.

## mongodb.metrics.cursor.timedoutps

Number of cursors that time out, per second.

## mongodb.metrics.document.deletedps

Number of documents deleted per second.

## mongodb.metrics.document.insertedps

Number of documents inserted per second.

## mongodb.metrics.document.returnedps

Number of documents returned by queries per second.

## mongodb.metrics.document.updatedps

Number of documents updated per second.

## mongodb.metrics.getlasterror.wtime.numps

Number of getLastError operations per second with a specified write concern (i.e. w) that wait for one or more members of a replica set to acknowledge the write operation.

## mongodb.metrics.getlasterror.wtime.totalmillisps

Fraction of time (ms/s) that the mongod has spent performing getLastError operations with write concern (i.e. w) that wait for one or more members of a replica set to acknowledge the write operation.

## mongodb.metrics.getlasterror.wtimeoutsps

Number of times per second that write concern operations have timed out as a result of the wtimeout threshold to getLastError

## mongodb.metrics.operation.fastmodps

Number of update operations per second that neither cause documents to grow nor require updates to the index.

## mongodb.metrics.operation.idhackps

Number of queries per second that contain the _id field.

## mongodb.metrics.operation.writeconflictsps

Number of times per second that write concern operations has encounter a conflict.

## mongodb.metrics.operation.scanandorderps

Number of queries per second that return sorted numbers that cannot perform the sort operation using an index.

## mongodb.metrics.queryexecutor.scannedps

Number of index items scanned per second during queries and query-plan evaluation.

## mongodb.metrics.record.movesps

Number of times per second documents move within the on-disk representation of the MongoDB data set.

## mongodb.metrics.repl.apply.batches.numps

Number of batches applied across all databases per second.

## mongodb.metrics.repl.apply.batches.totalmillisps

Fraction of time (ms/s) the mongod has spent applying operations from the oplog.

## mongodb.metrics.repl.apply.opsps

Number of oplog operations applied per second.

## mongodb.metrics.repl.buffer.count

Number of operations in the oplog buffer.

## mongodb.metrics.repl.buffer.maxsizebytes

Maximum size of the buffer.

## mongodb.metrics.repl.buffer.sizebytes

Current size of the contents of the oplog buffer.

## mongodb.metrics.repl.network.bytesps

Amount of data read from the replication sync source per second.

## mongodb.metrics.repl.network.getmores.numps

Number of getmore operations per second.

## mongodb.metrics.repl.network.getmores.totalmillisps

Fraction of time (ms/s) required to collect data from getmore operations.

## mongodb.metrics.repl.network.opsps

Number of operations read from the replication source per second.

Number of oplog query processes created per second.

Number of documents loaded during the pre-fetch stage of replication.

Amount of time spent loading documents as part of the pre-fetch stage of replication.

Number of index entries loaded by members before updating documents as part of the pre-fetch stage of replication.

Amount of time spent loading documents as part of the pre-fetch stage of replication.

## mongodb.metrics.ttl.deleteddocumentsps

Number of documents deleted from collections with a ttl index per second.

## mongodb.metrics.ttl.passesps

Number of times per second the background process removes documents from collections with a ttl index.

## mongodb.network.bytesinps

The number of bytes that reflects the amount of network traffic received by this database.

## mongodb.network.bytesoutps

The number of bytes that reflects the amount of network traffic sent from this database.

## mongodb.network.numrequestsps

Number of distinct requests that the server has received.

## mongodb.opcounters.commandps

Total number of commands per second issued to the database.

## mongodb.opcounters.deleteps

Number of delete operations per second.

## mongodb.opcounters.getmoreps

Number of getmore operations per second.

## mongodb.opcounters.insertps

Number of insert operations per second.

## mongodb.opcounters.queryps

Total number of queries per second.

## mongodb.opcounters.updateps

Number of update operations per second.

## mongodb.opcountersrepl.commandps

Total number of replicated commands issued to the database per second.

## mongodb.opcountersrepl.deleteps

Number of replicated delete operations per second.

## mongodb.opcountersrepl.getmoreps

Number of replicated getmore operations per second.

## mongodb.opcountersrepl.insertps

Number of replicated insert operations per second.

## mongodb.opcountersrepl.queryps

Total number of replicated queries per second.

## mongodb.opcountersrepl.updateps

Number of replicated update operations per second.

## mongodb.oplog.logsizemb

Total size of the oplog.

## mongodb.oplog.timediff

Oplog window: difference between the first and last operation in the oplog.

## mongodb.oplog.usedsizemb

Total amount of space used by the oplog.

## mongodb.replset.health

Member health value of the replica set: conveys if the member is up (i.e. 1) or down (i.e. 0).

## mongodb.replset.replicationlag

Delay between a write operation on the primary and its copy to a secondary.

## mongodb.replset.state

State of a replica that reflects its disposition within the set.

## mongodb.replset.votefraction

Fraction of votes a server will cast in a replica set election.

The number of votes a server will cast in a replica set election.

## mongodb.stats.datasize

Total size of the data held in this database including the padding factor.

## mongodb.stats.indexes

Total number of indexes across all collections in the database.

## mongodb.stats.indexsize

Total size of all indexes created on this database.

## mongodb.stats.objects

Number of objects (documents) in the database across all collections.

## mongodb.stats.storagesize

Total amount of space allocated to collections in this database for document storage.

## mongodb.uptime

Number of seconds that the mongos or mongod process has been active.

## mongodb.wiredtiger.cache.bytes_currently_in_cache

Size of the data currently in cache.

## mongodb.wiredtiger.cache.failed_eviction_of_pages_exceeding_the_in_memory_maximumps

Number of failed eviction of pages that exceeded the in-memory maximum, per second.

## mongodb.wiredtiger.cache.in_memory_page_splits

In-memory page splits.

## mongodb.wiredtiger.cache.maximum_bytes_configured

Maximum cache size.

## mongodb.wiredtiger.cache.maximum_page_size_at_eviction

Maximum page size at eviction.

## mongodb.wiredtiger.cache.modified_pages_evicted

Number of pages, that have been modified, evicted from the cache.

## mongodb.wiredtiger.cache.pages_currently_held_in_cache

Number of pages currently held in the cache.

Number of page evicted by application threads per second.

## mongodb.wiredtiger.cache.pages_evicted_exceeding_the_in_memory_maximumps

Number of pages evicted because they exceeded the cache in-memory maximum, per second.

## mongodb.wiredtiger.cache.tracked_dirty_bytes_in_cache

Size of the dirty data in the cache.

## mongodb.wiredtiger.cache.unmodified_pages_evicted

Number of pages, that were not modified, evicted from the cache.

Number of available read tickets (concurrent transactions) remaining.

Number of read tickets (concurrent transactions) in use.

Total number of read tickets (concurrent transactions) available.

## mongodb.wiredtiger.concurrenttransactions.write.available

Number of available write tickets (concurrent transactions) remaining.

## mongodb.wiredtiger.concurrenttransactions.write.out

Number of write tickets (concurrent transactions) in use.

## mongodb.wiredtiger.concurrenttransactions.write.totaltickets

Total number of write tickets (concurrent transactions) available.

## mongodb.collection.size

The total size in bytes of the data in the collection plus the size of every indexes on the mongodb.collection.

## mongodb.collection.avgObjSize

The size of the average object in the collection in bytes.

## mongodb.collection.count

Total number of objects in the collection.

## mongodb.collection.capped

Whether or not the collection is capped.

## mongodb.collection.max

Maximum number of documents in a capped collection.

## mongodb.collection.maxSize

Maximum size of a capped collection in bytes.

## mongodb.collection.storageSize

Total storage space allocated to this collection for document storage.

## mongodb.collection.nindexes

Total number of indices on the collection.

## mongodb.collection.indexSizes

Size of index in bytes.

## mongodb.collection.indexes.accesses.ops

Number of time the index was used.

## mongodb.usage.commands.countps

Number of commands per second

## mongodb.usage.commands.count

Number of commands since server start (deprecated)

## mongodb.usage.commands.time

Total time spent performing commands in microseconds

## mongodb.usage.getmore.countps

Number of getmore per second

## mongodb.usage.getmore.count

Number of getmore since server start (deprecated)

## mongodb.usage.getmore.time

Total time spent performing getmore in microseconds

## mongodb.usage.insert.countps

Number of inserts per second

## mongodb.usage.insert.count

Number of inserts since server start (deprecated)

## mongodb.usage.insert.time

Total time spent performing inserts in microseconds

## mongodb.usage.queries.countps

Number of queries per second

## mongodb.usage.queries.count

Number of queries since server start (deprecated)

## mongodb.usage.queries.time

Total time spent performing queries in microseconds

Number of read locks per second

Number of read locks since server start (deprecated)

Total time spent performing read locks in microseconds

## mongodb.usage.remove.countps

Number of removes per second

## mongodb.usage.remove.count

Number of removes since server start (deprecated)

## mongodb.usage.remove.time

Total time spent performing removes in microseconds

## mongodb.usage.total.countps

Number of operations per second

## mongodb.usage.total.count

Number of operations since server start (deprecated)

## mongodb.usage.total.time

Total time spent performing operations in microseconds

## mongodb.usage.update.countps

Number of updates per second

## mongodb.usage.update.count

Number of updates since server start (deprecated)

## mongodb.usage.update.time

Total time spent performing updates in microseconds

## mongodb.usage.writeLock.countps

Number of write locks per second

## mongodb.usage.writeLock.count

Number of write locks since server start (deprecated)

## mongodb.usage.writeLock.time

Total time spent performing write locks in microseconds

# MySQL Metrics

## mysql.galera.wsrep_cluster_size

The current number of nodes in the Galera cluster.

## mysql.innodb.buffer_pool_free

The number of free pages in the InnoDB Buffer Pool.

## mysql.innodb.buffer_pool_total

The total number of pages in the InnoDB Buffer Pool.

## mysql.innodb.buffer_pool_used

The number of used pages in the InnoDB Buffer Pool.

## mysql.innodb.buffer_pool_utilization

The utilization of the InnoDB Buffer Pool.

## mysql.innodb.current_row_locks

The number of current row locks.

The rate of data reads.

## mysql.innodb.data_writes

The rate of data writes.

## mysql.innodb.mutex_os_waits

The rate of mutex OS waits.

## mysql.innodb.mutex_spin_rounds

The rate of mutex spin rounds.

## mysql.innodb.mutex_spin_waits

The rate of mutex spin waits.

## mysql.innodb.os_log_fsyncs

The rate of fsync writes to the log file.

## mysql.innodb.row_lock_time

The fraction of time spent (ms/s) acquring row locks.

## mysql.innodb.row_lock_waits

The number of times per second a row lock had to be waited for.

## mysql.net.connections

The rate of connections to the server.

## mysql.net.max_connections

The maximum number of connections that have been in use simultaneously since the server started.

## mysql.performance.com_delete

The rate of delete statements.

## mysql.performance.com_delete_multi

The rate of delete-multi statements.

## mysql.performance.com_insert

The rate of insert statements.

## mysql.performance.com_insert_select

The rate of insert-select statements.

## mysql.performance.com_replace_select

The rate of replace-select statements.

## mysql.performance.com_select

The rate of select statements.

## mysql.performance.com_update

The rate of update statements.

## mysql.performance.com_update_multi

The rate of update-multi.

## mysql.performance.created_tmp_disk_tables

The rate of internal on-disk temporary tables created by second by the server while executing statements.

## mysql.performance.created_tmp_files

The rate of temporary files created by second.

## mysql.performance.created_tmp_tables

The rate of internal temporary tables created by second by the server while executing statements.

## mysql.performance.kernel_time

The percentage of CPU time spent in kernel space by MySQL.

## mysql.performance.key_cache_utilization

The key cache utilization ratio.

## mysql.performance.open_files

The number of open files.

## mysql.performance.open_tables

The number of of tables that are open.

## mysql.performance.qcache_hits

The rate of query cache hits.

## mysql.performance.queries

The rate of queries.

## mysql.performance.questions

The rate of statements executed by the server.

## mysql.performance.slow_queries

The rate of slow queries.

## mysql.performance.table_locks_waited

The total number of times that a request for a table lock could not be granted immediately and a wait was needed.

## mysql.performance.table_locks_waited.gauge

The number of currently open connections.

The number of threads that are not sleeping.

## mysql.performance.user_time

The percentage of CPU time spent in user space by MySQL.

## mysql.replication.seconds_behind_master

The lag in seconds between the master and the slave.

## mysql.replication.slave_running

A boolean showing if this server is a replication slave that is connected to a replication master.

## mysql.replication.slaves_connected

The number of slaves connected to a replication master.

# NGINX Metrics

## nginx.net.conn_dropped_per_s

The rate of connections dropped.

## nginx.net.conn_opened_per_s

The rate of connections opened.

## nginx.net.connections

The total number of active connections.

The number of connections reading client requests.

## nginx.net.request_per_s

The rate of requests processed.

## nginx.net.waiting

The number of keep-alive connections waiting for work.

## nginx.net.writing

The number of connections waiting on upstream responses and/or writing responses back to the client.

# NGINX Metrics

See also: NGINX and NGINX Plus integration information.

## nginx.net.conn_dropped_per_s

The rate of connections dropped.

## nginx.net.conn_opened_per_s

The rate of connections opened.

## nginx.net.connections

The total number of active connections.

The number of connections reading client requests.

## nginx.net.request_per_s

The rate of requests processed.

## nginx.net.waiting

The number of keep-alive connections waiting for work.

## nginx.net.writing

The number of connections waiting on upstream responses and/or writing responses back to the client.

# NGINX Plus Metrics

## nginx.plus.cache.bypass.bytes

The total number of bytes read from the proxied server.

## nginx.plus.cache.bypass.bytes_written

The total number of bytes written to the cache.

## nginx.plus.cache.bypass.responses

The total number of responses from the cache.

## nginx.plus.cache.bypass.responses_written

The total number of responses written to the cache.

## nginx.plus.cache.cold

Boolean. Defines whether the cache loader process is still loading data from the disk into the cache or not.

## nginx.plus.cache.expired.bytes

The total number of bytes read from the proxied server.

## nginx.plus.cache.expired.bytes_written

The total number of bytes written to the cache.

## nginx.plus.cache.expired.responses

The total number of responses not taken from the cache

## nginx.plus.cache.expired.responses_written

The total number of responses written to the cache

## nginx.plus.cache.hit.bytes

The total number of bytes read from the cache

## nginx.plus.cache.hit.responses

The total number of responses read from the cache

## nginx.plus.cache.max_size

The limit on the maximum size of the cache specified in the configuration

## nginx.plus.cache.miss.bytes

The total number of bytes read from the proxied server

## nginx.plus.cache.miss.bytes_written

The total number of bytes written to the cache

## nginx.plus.cache.miss.responses

The total number of responses not taken from the cache

## nginx.plus.cache.miss.responses_written

The total number of responses written to the cache

## nginx.plus.cache.revalidated.bytes

The total number of bytes read from the cache

## nginx.plus.cache.revalidated.response

The total number of responses read from the cache

## nginx.plus.cache.size

The current size of the cache

## nginx.plus.cache.stale.bytes

The total number of bytes read from the cache

## nginx.plus.cache.stale.responses

The total number of responses read from the cache

## nginx.plus.cache.updating.bytes

The total number of bytes read from the cache

## nginx.plus.cache.updating.responses

The total number of responses read from the cache

## nginx.plus.connections.accepted

The total number of accepted client connections.

## nginx.plus.connections.active

The current number of active client connections.

## nginx.plus.connections.dropped

The total number of dropped client connections.

## nginx.plus.connections.idle

The current number of idle client connections.

## nginx.plus.generation

The total number of configuration reloads

Time of the last reload of configuration (time since Epoch).

## nginx.plus.pid

The ID of the worker process that handled status request.

## nginx.plus.plus.upstream.peers.fails

The total number of unsuccessful attempts to communicate with the server.

## nginx.plus.ppid

The ID of the master process that started the worker process

## nginx.plus.processes.respawned

The total number of abnormally terminated and re-spawned child processes.

## nginx.plus.requests.current

The current number of client requests.

## nginx.plus.requests.total

The total number of client requests.

The total number of requests completed without sending a response.

## nginx.plus.server_zone.processing

The number of client requests that are currently being processed.

The total amount of data received from clients.

## nginx.plus.server_zone.requests

The total number of client requests received from clients.

## nginx.plus.server_zone.responses.1xx

The number of responses with 1xx status code.

## nginx.plus.server_zone.responses.2xx

The number of responses with 2xx status code.

## nginx.plus.server_zone.responses.3xx

The number of responses with 3xx status code.

## nginx.plus.server_zone.responses.4xx

The number of responses with 4xx status code.

## nginx.plus.server_zone.responses.5xx

The number of responses with 5xx status code.

## nginx.plus.server_zone.responses.total

The total number of responses sent to clients.

## nginx.plus.server_zone.sent

The total amount of data sent to clients.

## nginx.plus.slab.pages.free

The current number of free memory pages

## nginx.plus.slab.pages.used

The current number of used memory pages

## nginx.plus.slab.slots.fails

The number of unsuccessful attempts to allocate memory of specified size

## nginx.plus.slab.slots.free

The current number of free memory slots

## nginx.plus.slab.slots.reqs

The total number of attempts to allocate memory of specified size

## nginx.plus.slab.slots.used

The current number of used memory slots

## nginx.plus.ssl.handshakes

The total number of successful SSL handshakes.

## nginx.plus.ssl.handshakes_failed

The total number of failed SSL handshakes.

## nginx.plus.ssl.session_reuses

The total number of session reuses during SSL handshake.

## nginx.plus.stream.server_zone.connections

The total number of connections accepted from clients

## nginx.plus.stream.server_zone.connections

The total number of connections accepted from clients

The total number of requests completed without sending a response.

The total number of requests completed without sending a response.

## nginx.plus.stream.server_zone.processing

The number of client requests that are currently being processed.

## nginx.plus.stream.server_zone.processing

The number of client requests that are currently being processed.

The total amount of data received from clients.

The total amount of data received from clients.

## nginx.plus.stream.server_zone.sent

The total amount of data sent to clients.

## nginx.plus.stream.server_zone.sent

The total amount of data sent to clients.

## nginx.plus.stream.server_zone.sessions.1xx

The number of responses with 1xx status code.

## nginx.plus.stream.server_zone.sessions.2xx

The number of responses with 2xx status code.

## nginx.plus.stream.server_zone.sessions.3xx

The number of responses with 3xx status code.

## nginx.plus.stream.server_zone.sessions.4xx

The number of responses with 4xx status code.

## nginx.plus.stream.server_zone.sessions.5xx

The number of responses with 5xx status code.

## nginx.plus.stream.server_zone.sessions.total

The total number of responses sent to clients.

## nginx.plus.stream.upstream.peers.active

The current number of connections

## nginx.plus.stream.upstream.peers.backup

A boolean value indicating whether the server is a backup server.

## nginx.plus.stream.upstream.peers.connections

The total number of client connections forwarded to this server.

## nginx.plus.stream.upstream.peers.downstart

The time (time since Epoch) when the server became “unavail” or “checking” or “unhealthy”

## nginx.plus.stream.upstream.peers.downtime

Total time the server was in the “unavail” or “checking” or “unhealthy” states.

## nginx.plus.stream.upstream.peers.fails

The total number of unsuccessful attempts to communicate with the server.

## nginx.plus.stream.upstream.peers.health_checks.checks

The total number of health check requests made.

## nginx.plus.stream.upstream.peers.health_checks.fails

The number of failed health checks.

## nginx.plus.stream.upstream.peers.health_checks.last_passed

Boolean indicating if the last health check request was successful and passed tests.

## nginx.plus.stream.upstream.peers.health_checks.unhealthy

How many times the server became unhealthy (state “unhealthy”).

## nginx.plus.stream.upstream.peers.id

The ID of the server.

The total number of bytes received from this server.

## nginx.plus.stream.upstream.peers.selected

The time (time since Epoch) when the server was last selected to process a connection.

## nginx.plus.stream.upstream.peers.sent

The total number of bytes sent to this server.

## nginx.plus.stream.upstream.peers.unavail

How many times the server became unavailable for client connections (state “unavail”).

## nginx.plus.stream.upstream.peers.weight

Weight of the server.

## nginx.plus.stream.upstream.zombies

The current number of servers removed from the group but still processing active client connections.

## nginx.plus.timestamp

Current time since Epoch.

## nginx.plus.upstream.keepalive

The current number of idle keepalive connections.

## nginx.plus.upstream.peers.active

The current number of active connections.

## nginx.plus.upstream.peers.backup

A boolean value indicating whether the server is a backup server.

## nginx.plus.upstream.peers.downstart

The time (since Epoch) when the server became “unavail” or “unhealthy”.

## nginx.plus.upstream.peers.downtime

Total time the server was in the “unavail” and “unhealthy” states.

## nginx.plus.upstream.peers.health_checks.checks

The total number of health check requests made.

## nginx.plus.upstream.peers.health_checks.fails

The number of failed health checks.

## nginx.plus.upstream.peers.health_checks.last_passed

Boolean indicating if the last health check request was successful and passed tests.

## nginx.plus.upstream.peers.health_checks.unhealthy

How many times the server became unhealthy (state “unhealthy”).

## nginx.plus.upstream.peers.id

he ID of the server.

The total amount of data received from this server.

## nginx.plus.upstream.peers.requests

The total number of client requests forwarded to this server.

## nginx.plus.upstream.peers.responses.1xx

The number of responses with 1xx status code.

## nginx.plus.upstream.peers.responses.1xx_count

The number of responses with 1xx status code (shown as count).

## nginx.plus.upstream.peers.responses.2xx

The number of responses with 2xx status code.

## nginx.plus.upstream.peers.responses.2xx_count

The number of responses with 2xx status code (shown as count).

## nginx.plus.upstream.peers.responses.3xx

The number of responses with 3xx status code.

## nginx.plus.upstream.peers.responses.3xx_count

The number of responses with 3xx status code (shown as count).

## nginx.plus.upstream.peers.responses.4xx

The number of responses with 4xx status code.

## nginx.plus.upstream.peers.responses.4xx_count

The number of responses with 4xx status code (shown as count).

## nginx.plus.upstream.peers.responses.5xx

The number of responses with 5xx status code.

## nginx.plus.upstream.peers.responses.5xx_count

The number of responses with 5xx status code (shown as count).

## nginx.plus.upstream.peers.responses.total

The total number of responses obtained from this server.

## nginx.plus.upstream.peers.selected

The time (since Epoch) when the server was last selected to process a request (1.7.5).

## nginx.plus.upstream.peers.sent

The total amount of data sent to this server.

## nginx.plus.upstream.peers.unavail

How many times the server became unavailable for client requests (state “unavail”) due to the number of unsuccessful attempts reaching the max_fails threshold.

## nginx.plus.upstream.peers.weight

The weight of the server.

## nginx.plus.version

The NGINX version.

# NGINX Plus Metrics

See also: NGINX and NGINX Plus integration information.

## nginx.plus.cache.bypass.bytes

The total number of bytes read from the proxied server.

## nginx.plus.cache.bypass.bytes_written

The total number of bytes written to the cache.

## nginx.plus.cache.bypass.responses

The total number of responses from the cache.

## nginx.plus.cache.bypass.responses_written

The total number of responses written to the cache.

## nginx.plus.cache.cold

Boolean. Defines whether the cache loader process is still loading data from the disk into the cache or not.

## nginx.plus.cache.expired.bytes

The total number of bytes read from the proxied server.

## nginx.plus.cache.expired.bytes_written

The total number of bytes written to the cache.

## nginx.plus.cache.expired.responses

The total number of responses not taken from the cache

## nginx.plus.cache.expired.responses_written

The total number of responses written to the cache

## nginx.plus.cache.hit.bytes

The total number of bytes read from the cache

## nginx.plus.cache.hit.responses

The total number of responses read from the cache

## nginx.plus.cache.max_size

The limit on the maximum size of the cache specified in the configuration

## nginx.plus.cache.miss.bytes

The total number of bytes read from the proxied server

## nginx.plus.cache.miss.bytes_written

The total number of bytes written to the cache

## nginx.plus.cache.miss.responses

The total number of responses not taken from the cache

## nginx.plus.cache.miss.responses_written

The total number of responses written to the cache

## nginx.plus.cache.revalidated.bytes

The total number of bytes read from the cache

## nginx.plus.cache.revalidated.response

The total number of responses read from the cache

## nginx.plus.cache.size

The current size of the cache

## nginx.plus.cache.stale.bytes

The total number of bytes read from the cache

## nginx.plus.cache.stale.responses

The total number of responses read from the cache

## nginx.plus.cache.updating.bytes

The total number of bytes read from the cache

## nginx.plus.cache.updating.responses

The total number of responses read from the cache

## nginx.plus.connections.accepted

The total number of accepted client connections.

## nginx.plus.connections.active

The current number of active client connections.

## nginx.plus.connections.dropped

The total number of dropped client connections.

## nginx.plus.connections.idle

The current number of idle client connections.

## nginx.plus.generation

The total number of configuration reloads

Time of the last reload of configuration (time since Epoch).

## nginx.plus.pid

The ID of the worker process that handled status request.

## nginx.plus.plus.upstream.peers.fails

The total number of unsuccessful attempts to communicate with the server.

## nginx.plus.ppid

The ID of the master process that started the worker process

## nginx.plus.processes.respawned

The total number of abnormally terminated and re-spawned child processes.

## nginx.plus.requests.current

The current number of client requests.

## nginx.plus.requests.total

The total number of client requests.

The total number of requests completed without sending a response.

## nginx.plus.server_zone.processing

The number of client requests that are currently being processed.

The total amount of data received from clients.

## nginx.plus.server_zone.requests

The total number of client requests received from clients.

## nginx.plus.server_zone.responses.1xx

The number of responses with 1xx status code.

## nginx.plus.server_zone.responses.2xx

The number of responses with 2xx status code.

## nginx.plus.server_zone.responses.3xx

The number of responses with 3xx status code.

## nginx.plus.server_zone.responses.4xx

The number of responses with 4xx status code.

## nginx.plus.server_zone.responses.5xx

The number of responses with 5xx status code.

## nginx.plus.server_zone.responses.total

The total number of responses sent to clients.

## nginx.plus.server_zone.sent

The total amount of data sent to clients.

## nginx.plus.slab.pages.free

The current number of free memory pages

## nginx.plus.slab.pages.used

The current number of used memory pages

## nginx.plus.slab.slots.fails

The number of unsuccessful attempts to allocate memory of specified size

## nginx.plus.slab.slots.free

The current number of free memory slots

## nginx.plus.slab.slots.reqs

The total number of attempts to allocate memory of specified size

## nginx.plus.slab.slots.used

The current number of used memory slots

## nginx.plus.ssl.handshakes

The total number of successful SSL handshakes.

## nginx.plus.ssl.handshakes_failed

The total number of failed SSL handshakes.

## nginx.plus.ssl.session_reuses

The total number of session reuses during SSL handshake.

## nginx.plus.stream.server_zone.connections

The total number of connections accepted from clients

## nginx.plus.stream.server_zone.connections

The total number of connections accepted from clients

The total number of requests completed without sending a response.

The total number of requests completed without sending a response.

## nginx.plus.stream.server_zone.processing

The number of client requests that are currently being processed.

## nginx.plus.stream.server_zone.processing

The number of client requests that are currently being processed.

The total amount of data received from clients.

The total amount of data received from clients.

## nginx.plus.stream.server_zone.sent

The total amount of data sent to clients.

## nginx.plus.stream.server_zone.sent

The total amount of data sent to clients.

## nginx.plus.stream.server_zone.sessions.1xx

The number of responses with 1xx status code.

## nginx.plus.stream.server_zone.sessions.2xx

The number of responses with 2xx status code.

## nginx.plus.stream.server_zone.sessions.3xx

The number of responses with 3xx status code.

## nginx.plus.stream.server_zone.sessions.4xx

The number of responses with 4xx status code.

## nginx.plus.stream.server_zone.sessions.5xx

The number of responses with 5xx status code.

## nginx.plus.stream.server_zone.sessions.total

The total number of responses sent to clients.

## nginx.plus.stream.upstream.peers.active

The current number of connections

## nginx.plus.stream.upstream.peers.backup

A boolean value indicating whether the server is a backup server.

## nginx.plus.stream.upstream.peers.connections

The total number of client connections forwarded to this server.

## nginx.plus.stream.upstream.peers.downstart

The time (time since Epoch) when the server became “unavail” or “checking” or “unhealthy”

## nginx.plus.stream.upstream.peers.downtime

Total time the server was in the “unavail” or “checking” or “unhealthy” states.

## nginx.plus.stream.upstream.peers.fails

The total number of unsuccessful attempts to communicate with the server.

## nginx.plus.stream.upstream.peers.health_checks.checks

The total number of health check requests made.

## nginx.plus.stream.upstream.peers.health_checks.fails

The number of failed health checks.

## nginx.plus.stream.upstream.peers.health_checks.last_passed

Boolean indicating if the last health check request was successful and passed tests.

## nginx.plus.stream.upstream.peers.health_checks.unhealthy

How many times the server became unhealthy (state “unhealthy”).

## nginx.plus.stream.upstream.peers.id

The ID of the server.

The total number of bytes received from this server.

## nginx.plus.stream.upstream.peers.selected

The time (time since Epoch) when the server was last selected to process a connection.

## nginx.plus.stream.upstream.peers.sent

The total number of bytes sent to this server.

## nginx.plus.stream.upstream.peers.unavail

How many times the server became unavailable for client connections (state “unavail”).

## nginx.plus.stream.upstream.peers.weight

Weight of the server.

## nginx.plus.stream.upstream.zombies

The current number of servers removed from the group but still processing active client connections.

## nginx.plus.timestamp

Current time since Epoch.

## nginx.plus.upstream.keepalive

The current number of idle keepalive connections.

## nginx.plus.upstream.peers.active

The current number of active connections.

## nginx.plus.upstream.peers.backup

A boolean value indicating whether the server is a backup server.

## nginx.plus.upstream.peers.downstart

The time (since Epoch) when the server became “unavail” or “unhealthy”.

## nginx.plus.upstream.peers.downtime

Total time the server was in the “unavail” and “unhealthy” states.

## nginx.plus.upstream.peers.health_checks.checks

The total number of health check requests made.

## nginx.plus.upstream.peers.health_checks.fails

The number of failed health checks.

## nginx.plus.upstream.peers.health_checks.last_passed

Boolean indicating if the last health check request was successful and passed tests.

## nginx.plus.upstream.peers.health_checks.unhealthy

How many times the server became unhealthy (state “unhealthy”).

## nginx.plus.upstream.peers.id

he ID of the server.

The total amount of data received from this server.

## nginx.plus.upstream.peers.requests

The total number of client requests forwarded to this server.

## nginx.plus.upstream.peers.responses.1xx

The number of responses with 1xx status code.

## nginx.plus.upstream.peers.responses.1xx_count

The number of responses with 1xx status code (shown as count).

## nginx.plus.upstream.peers.responses.2xx

The number of responses with 2xx status code.

## nginx.plus.upstream.peers.responses.2xx_count

The number of responses with 2xx status code (shown as count).

## nginx.plus.upstream.peers.responses.3xx

The number of responses with 3xx status code.

## nginx.plus.upstream.peers.responses.3xx_count

The number of responses with 3xx status code (shown as count).

## nginx.plus.upstream.peers.responses.4xx

The number of responses with 4xx status code.

## nginx.plus.upstream.peers.responses.4xx_count

The number of responses with 4xx status code (shown as count).

## nginx.plus.upstream.peers.responses.5xx

The number of responses with 5xx status code.

## nginx.plus.upstream.peers.responses.5xx_count

The number of responses with 5xx status code (shown as count).

## nginx.plus.upstream.peers.responses.total

The total number of responses obtained from this server.

## nginx.plus.upstream.peers.selected

The time (since Epoch) when the server was last selected to process a request (1.7.5).

## nginx.plus.upstream.peers.sent

The total amount of data sent to this server.

## nginx.plus.upstream.peers.unavail

How many times the server became unavailable for client requests (state “unavail”) due to the number of unsuccessful attempts reaching the max_fails threshold.

## nginx.plus.upstream.peers.weight

The weight of the server.

## nginx.plus.version

The NGINX version.

# NTP Metrics

## ntp.offset

The time difference between the local clock and the NTP reference clock, in seconds.

# NTP Metrics

## ntp.offset

The time difference between the local clock and the NTP reference clock, in seconds.

# PGBouncer Metrics

## pgbouncer.pools.cl_active

The number of client connections linked to a server connection and able to process queries.

## pgbouncer.pools.cl_waiting

The number of client connections waiting on a server connection.

## pgbouncer.pools.maxwait

The age of the oldest unserved client connection.

## pgbouncer.pools.sv_active

The number of server connections linked to a client connection.

## pgbouncer.pools.sv_idle

The number of server connections idle and ready for a client query.

The number of server connections currently in the process of logging in.

## pgbouncer.pools.sv_tested

The number of server connections currently running either server_reset_query or server_check_query.

## pgbouncer.pools.sv_used

The number of server connections idle more than server_check_delay, needing server_check_query.

## pgbouncer.stats.avg_query

The average query duration.

## pgbouncer.stats.avg_recv

The average amount of client network traffic received.

## pgbouncer.stats.avg_req

The average number of requests per second in the last stat period.

## pgbouncer.stats.avg_sent

The average amount of client network traffic sent.

The total network traffic received.

## pgbouncer.stats.bytes_sent_per_second

The total network traffic sent.

## pgbouncer.stats.requests_per_second

The request rate.

## pgbouncer.stats.total_query_time

The time spent by PgBouncer actively querying PostgreSQL.

# PGBouncer Metrics

## pgbouncer.pools.cl_active

The number of client connections linked to a server connection and able to process queries.

## pgbouncer.pools.cl_waiting

The number of client connections waiting on a server connection.

## pgbouncer.pools.maxwait

The age of the oldest unserved client connection.

## pgbouncer.pools.sv_active

The number of server connections linked to a client connection.

## pgbouncer.pools.sv_idle

The number of server connections idle and ready for a client query.

The number of server connections currently in the process of logging in.

## pgbouncer.pools.sv_tested

The number of server connections currently running either server_reset_query or server_check_query.

## pgbouncer.pools.sv_used

The number of server connections idle more than server_check_delay, needing server_check_query.

## pgbouncer.stats.avg_query

The average query duration.

## pgbouncer.stats.avg_recv

The average amount of client network traffic received.

## pgbouncer.stats.avg_req

The average number of requests per second in the last stat period.

## pgbouncer.stats.avg_sent

The average amount of client network traffic sent.

The total network traffic received.

## pgbouncer.stats.bytes_sent_per_second

The total network traffic sent.

## pgbouncer.stats.requests_per_second

The request rate.

## pgbouncer.stats.total_query_time

The time spent by PgBouncer actively querying PostgreSQL.

# PHP-FPM Metrics

## php_fpm.listen_queue.size

The size of the socket queue of pending connections.

## php_fpm.processes.active

The total number of active processes.

## php_fpm.processes.idle

The total number of idle processes.

## php_fpm.processes.max_reached

The number of times the process limit has been reached.

## php_fpm.processes.total

The total number of processes.

## php_fpm.requests.accepted

The total number of accepted requests.

## php_fpm.requests.slow

The total number of slow requests.

# PHP-FPM Metrics

## php_fpm.listen_queue.size

The size of the socket queue of pending connections.

## php_fpm.processes.active

The total number of active processes.

## php_fpm.processes.idle

The total number of idle processes.

## php_fpm.processes.max_reached

The number of times the process limit has been reached.

## php_fpm.processes.total

The total number of processes.

## php_fpm.requests.accepted

The total number of accepted requests.

## php_fpm.requests.slow

The total number of slow requests.

# PostgreSQL Metrics

Metric NameTypeDescription
`postgresql.seq_scans`gaugeThe number of sequential scans initiated on this table.
`postgresql.index_scans`gaugeThe number of index scans initiated on this table.
`postgresql.index_rows_fetched`gaugeThe number of live rows fetched by index scans.
`postgresql.rows_hot_updated`gaugeThe number of rows HOT updated, meaning no separate index update was needed.
`postgresql.live_rows`gaugeThe estimated number of live rows.
`postgresql.dead_rows`gaugeThe estimated number of dead rows.
`postgresql.index_rows_read`gaugeThe number of index entries returned by scans on this index.
`postgresql.table_size`gaugeThe total disk space used by the specified table. Includes TOAST, free space map, and visibility map. Excludes indexes.
`postgresql.index_size`gaugeThe total disk space used by indexes attached to the specified table.
`postgresql.total_size`gaugeThe total disk space used by the table, including indexes and TOAST data.
`postgresql.heap_blocks_read`gaugeThe number of disk blocks read from this table.
`postgresql.heap_blocks_hit`gaugeThe number of buffer hits in this table.
`postgresql.index_blocks_read`gaugeThe number of disk blocks read from all indexes on this table.
`postgresql.index_blocks_hit`gaugeThe number of buffer hits in all indexes on this table.
`postgresql.toast_blocks_read`gaugeThe number of disk blocks read from this table’s TOAST table.
`postgresql.toast_blocks_hit`gaugeThe number of buffer hits in this table’s TOAST table.
`postgresql.toast_index_blocks_read`gaugeThe number of disk blocks read from this table’s TOAST table index.
`postgresql.toast_index_blocks_hit`gaugeThe number of buffer hits in this table’s TOAST table index.
`postgresql.active_queries`gaugeThe number of active queries in this database.
`postgresql.archiver.archived_count`gaugeThe number of WAL files that have been successfully archived.
`postgresql.archiver.failed_count`gaugeThe number of failed attempts for archiving WAL files.
`postgresql.before_xid_wraparound`gaugeThe number of transactions that can occur until a transaction wraparound.
`postgresql.index_rel_rows_fetched`rateThe number of live rows fetched by index scans.
`postgresql.transactions.idle_in_transaction`gaugeThe number of ‘idle in transaction’ transactions in this database.
`postgresql.transactions.open`gaugeThe number of open transactions in this database.
`postgresql.waiting_queries`gaugeThe number of waiting queries in this database.
`postgresql.waiting_queries`gaugeThe number of buffers allocated
`postgresql.bgwriter.buffers_backend`gaugeThe number of buffers written directly by a backend.
`postgresql.bgwriter.buffers_backend_fsync`gaugeThe of times a backend had to execute its own fsync call instead of the background writer.
`postgresql.bgwriter.buffers_checkpoint`gaugeThe number of buffers written during checkpoints.
`postgresql.bgwriter.buffers_clean`gaugeThe number of buffers written by the background writer.
`postgresql.bgwriter.checkpoints_requested`gaugeThe number of requested checkpoints that were performed.
`postgresql.bgwriter.checkpoints_timed`gaugeThe number of scheduled checkpoints that were performed.
`postgresql.bgwriter.maxwritten_clean`gauge.The number of times the background writer stopped a cleaning scan due to writing too many buffers.
`postgresql.bgwriter.sync_time`gaugeThe total amount of checkpoint processing time spent synchronizing files to disk.
`postgresql.bgwriter.write_time`gaugeThe total amount of checkpoint processing time spent writing files to disk.
`postgresql.buffer_hit`gaugeThe number of times disk blocks were found in the buffer cache, preventing the need to read from the database.
`postgresql.commits`gaugeThe number of transactions that have been committed in this database.
`postgresql.connections`gaugeThe number of active connections to this database.
`postgresql.database_size`gaugeThe disk space used by this database.
`postgresql.deadlocks`gaugeThe number of deadlocks detected in this database
`postgresql.disk_read`gaugeThe number of disk blocks read in this database.
`postgresql.locks`gaugeThe number of locks active for this database.
`postgresql.max_connections`gaugeThe maximum number of client connections allowed to this database.
`postgresql.percent_usage_connections`gaugeThe number of connections to this database as a fraction of the maximum number of allowed connections.
`postgresql.replication_delay`gaugeThe current replication delay in seconds. Only available with PostgreSQL 9.1 and newer.
`postgresql.replication_delay_bytes`gaugeThe current replication delay in bytes. Only available with PostgreSQL 9.2 and newer.
`postgresql.rollbacks`gaugeThe number of transactions that have been rolled back in this database.
`postgresql.rows_deleted`gaugeThe number of rows deleted by queries in this database.
`postgresql.rows_fetched`gaugeThe number of rows fetched by queries in this database.
`postgresql.rows_inserted`gaugeThe number of rows inserted by queries in this database. The metrics can be segmented by ‘db’Â or ’table’Â and can be viewed per-relation.
`postgresql.rows_returned`gaugeThe number of rows returned by queries in this database. The metrics can be segmented by ‘db’Â or ’table’Â and can be viewed per-relation.
`postgresql.rows_updated`gaugeThe number of rows updated by queries in this database.
`postgresql.rows_deleted`gaugeThe number of rows deleted by queries in this database. The metrics can be segmented by ‘db’Â or ’table’Â and can be viewed per-relation.
`postgresql.table.count`gaugeThe number of user tables in this database.
`postgresql.temp_bytes`gaugeThe amount of data written to temporary files by queries in this database.
`postgresql.temp_files`gaugeThe number of temporary files created by queries in this database.
`postgresql.toast_blocks_read`gaugeThe number of disk blocks read from this table’s TOAST table.
`postgresql.transactions.idle_in_transaction`gaugeThe number of ‘idle in transaction’ transactions in this database.
`postgresql.transactions.open`gaugeThe number of open transactions in this database.

# PostgreSQL Metrics

Metric NameTypeDescription
`postgresql.seq_scans`gaugeThe number of sequential scans initiated on this table.
`postgresql.index_scans`gaugeThe number of index scans initiated on this table.
`postgresql.index_rows_fetched`gaugeThe number of live rows fetched by index scans.
`postgresql.rows_hot_updated`gaugeThe number of rows HOT updated, meaning no separate index update was needed.
`postgresql.live_rows`gaugeThe estimated number of live rows.
`postgresql.dead_rows`gaugeThe estimated number of dead rows.
`postgresql.index_rows_read`gaugeThe number of index entries returned by scans on this index.
`postgresql.table_size`gaugeThe total disk space used by the specified table. Includes TOAST, free space map, and visibility map. Excludes indexes.
`postgresql.index_size`gaugeThe total disk space used by indexes attached to the specified table.
`postgresql.total_size`gaugeThe total disk space used by the table, including indexes and TOAST data.
`postgresql.heap_blocks_read`gaugeThe number of disk blocks read from this table.
`postgresql.heap_blocks_hit`gaugeThe number of buffer hits in this table.
`postgresql.index_blocks_read`gaugeThe number of disk blocks read from all indexes on this table.
`postgresql.index_blocks_hit`gaugeThe number of buffer hits in all indexes on this table.
`postgresql.toast_blocks_read`gaugeThe number of disk blocks read from this table’s TOAST table.
`postgresql.toast_blocks_hit`gaugeThe number of buffer hits in this table’s TOAST table.
`postgresql.toast_index_blocks_read`gaugeThe number of disk blocks read from this table’s TOAST table index.
`postgresql.toast_index_blocks_hit`gaugeThe number of buffer hits in this table’s TOAST table index.
`postgresql.active_queries`gaugeThe number of active queries in this database.
`postgresql.archiver.archived_count`gaugeThe number of WAL files that have been successfully archived.
`postgresql.archiver.failed_count`gaugeThe number of failed attempts for archiving WAL files.
`postgresql.before_xid_wraparound`gaugeThe number of transactions that can occur until a transaction wraparound.
`postgresql.index_rel_rows_fetched`rateThe number of live rows fetched by index scans.
`postgresql.transactions.idle_in_transaction`gaugeThe number of ‘idle in transaction’ transactions in this database.
`postgresql.transactions.open`gaugeThe number of open transactions in this database.
`postgresql.waiting_queries`gaugeThe number of waiting queries in this database.
`postgresql.waiting_queries`gaugeThe number of buffers allocated
`postgresql.bgwriter.buffers_backend`gaugeThe number of buffers written directly by a backend.
`postgresql.bgwriter.buffers_backend_fsync`gaugeThe of times a backend had to execute its own fsync call instead of the background writer.
`postgresql.bgwriter.buffers_checkpoint`gaugeThe number of buffers written during checkpoints.
`postgresql.bgwriter.buffers_clean`gaugeThe number of buffers written by the background writer.
`postgresql.bgwriter.checkpoints_requested`gaugeThe number of requested checkpoints that were performed.
`postgresql.bgwriter.checkpoints_timed`gaugeThe number of scheduled checkpoints that were performed.
`postgresql.bgwriter.maxwritten_clean`gauge.The number of times the background writer stopped a cleaning scan due to writing too many buffers.
`postgresql.bgwriter.sync_time`gaugeThe total amount of checkpoint processing time spent synchronizing files to disk.
`postgresql.bgwriter.write_time`gaugeThe total amount of checkpoint processing time spent writing files to disk.
`postgresql.buffer_hit`gaugeThe number of times disk blocks were found in the buffer cache, preventing the need to read from the database.
`postgresql.commits`gaugeThe number of transactions that have been committed in this database.
`postgresql.connections`gaugeThe number of active connections to this database.
`postgresql.database_size`gaugeThe disk space used by this database.
`postgresql.deadlocks`gaugeThe number of deadlocks detected in this database
`postgresql.disk_read`gaugeThe number of disk blocks read in this database.
`postgresql.locks`gaugeThe number of locks active for this database.
`postgresql.max_connections`gaugeThe maximum number of client connections allowed to this database.
`postgresql.percent_usage_connections`gaugeThe number of connections to this database as a fraction of the maximum number of allowed connections.
`postgresql.replication_delay`gaugeThe current replication delay in seconds. Only available with PostgreSQL 9.1 and newer.
`postgresql.replication_delay_bytes`gaugeThe current replication delay in bytes. Only available with PostgreSQL 9.2 and newer.
`postgresql.rollbacks`gaugeThe number of transactions that have been rolled back in this database.
`postgresql.rows_deleted`gaugeThe number of rows deleted by queries in this database.
`postgresql.rows_fetched`gaugeThe number of rows fetched by queries in this database.
`postgresql.rows_inserted`gaugeThe number of rows inserted by queries in this database. The metrics can be segmented by ‘db’Â or ’table’Â and can be viewed per-relation.
`postgresql.rows_returned`gaugeThe number of rows returned by queries in this database. The metrics can be segmented by ‘db’Â or ’table’Â and can be viewed per-relation.
`postgresql.rows_updated`gaugeThe number of rows updated by queries in this database.
`postgresql.rows_deleted`gaugeThe number of rows deleted by queries in this database. The metrics can be segmented by ‘db’Â or ’table’Â and can be viewed per-relation.
`postgresql.table.count`gaugeThe number of user tables in this database.
`postgresql.temp_bytes`gaugeThe amount of data written to temporary files by queries in this database.
`postgresql.temp_files`gaugeThe number of temporary files created by queries in this database.
`postgresql.toast_blocks_read`gaugeThe number of disk blocks read from this table’s TOAST table.
`postgresql.transactions.idle_in_transaction`gaugeThe number of ‘idle in transaction’ transactions in this database.
`postgresql.transactions.open`gaugeThe number of open transactions in this database.

# RabbitMQ Metrics

## rabbitmq.connections

The number of current connections to a given rabbitmq vhost. Each connection is tagged as `rabbitmq_vhost:<vhost_name>`.

## rabbitmq.connections.state

The number of connections in the specified connection state.

## rabbitmq.exchange.messages.ack.count

The number of messages delivered to clients and acknowledged.

## rabbitmq.exchange.messages.ack.rate

The rate of messages delivered to clients and acknowledged per second.

## rabbitmq.exchange.messages.confirm.count

The number of messages confirmed.

## rabbitmq.exchange.messages.confirm.rate

The rate of messages confirmed per second.

## rabbitmq.exchange.messages.deliver_get.count

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.exchange.messages.deliver_get.rate

The rate per second of the sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.exchange.messages.publish_in.count

The number of messages published from channels into this exchange.

## rabbitmq.exchange.messages.publish_in.rate

The amount of messages published from channels into this exchange per second.

## rabbitmq.exchange.messages.publish_out.count

The number of messages published from this exchange into queues.

## rabbitmq.exchange.messages.publish_out.rate

The amount of messages published from this exchange into queues per second.

## rabbitmq.exchange.messages.publish.count

The number of messages published.

## rabbitmq.exchange.messages.publish.rate

The amount of messages published per second.

## rabbitmq.exchange.messages.redeliver.count

The number of subset of messages in `deliver_get` which had the redelivered flag set.

## rabbitmq.exchange.messages.redeliver.rate

The amount of subset of messages in `deliver_get` which had the redelivered flag set per second.

## rabbitmq.exchange.messages.return_unroutable.count

The number of messages returned to the publisher as unroutable.

## rabbitmq.exchange.messages.return_unroutable.rate

The amount of messages returned to publisher as unroutable per second.

## rabbitmq.node.disk_alarm

Defines whether the node has a disk alarm configured.

## rabbitmq.node.disk_free

The current free disk space.

## rabbitmq.node.fd_used

Used file descriptors.

## rabbitmq.node.mem_alarm

Defines whether the node has a memory alarm configured.

## rabbitmq.node.mem_used

The total memory used in bytes.

## rabbitmq.node.partitions

The number of network partitions this node is seeing.

## rabbitmq.node.run_queue

The average number of Erlang processes waiting to run.

## rabbitmq.node.running

Defines whether the node is running or not.

## rabbitmq.node.sockets_used

The number of file descriptors used as sockets.

## rabbitmq.overview.messages.ack.count

The number of messages delivered to clients and acknowledged.

## rabbitmq.overview.messages.ack.rate

The rate of messages delivered to clients and acknowledged per second.

## rabbitmq.overview.messages.confirm.count

The number of messages confirmed.

## rabbitmq.overview.messages.confirm.rate

The rate of messages confirmed per second.

## rabbitmq.overview.messages.deliver_get.count

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.overview.messages.deliver_get.rate

The rate per second of the sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.overview.messages.publish_in.count

The number of messages published from channels into this overview.

## rabbitmq.overview.messages.publish_in.rate

The rate of messages published from channels into this overview per second.

## rabbitmq.overview.messages.publish_out.count

The number of messages published from this overview into queues.

## rabbitmq.overview.messages.publish_out.rate

The rate of messages published from this overview into queues per second.

## rabbitmq.overview.messages.publish.count

The number of messages published.

## rabbitmq.overview.messages.publish.rate

The rate of messages published per second.

## rabbitmq.overview.messages.redeliver.count

The number of subset of messages in `deliver_get` which had the redelivered flag set.

## rabbitmq.overview.messages.redeliver.rate

The rate of subset of messages in `deliver_get` which had the redelivered flag set per second.

## rabbitmq.overview.messages.return_unroutable.count

The number of messages returned to publisher as unroutable.

## rabbitmq.overview.messages.return_unroutable.rate

The rate of messages returned to publisher as unroutable per second.

## rabbitmq.overview.object_totals.channels

The total number of channels.

## rabbitmq.overview.object_totals.connections

The total number of connections.

## rabbitmq.overview.object_totals.consumers

The total number of consumers.

## rabbitmq.overview.object_totals.queues

The total number of queues.

The number of messages ready for delivery.

The rate of messages ready for delivery.

## rabbitmq.overview.queue_totals.messages_unacknowledged.count

The number of unacknowledged messages.

## rabbitmq.overview.queue_totals.messages_unacknowledged.rate

The rate of unacknowledged messages.

## rabbitmq.overview.queue_totals.messages.count

The total number of messages (ready plus unacknowledged).

## rabbitmq.overview.queue_totals.messages.rate

The rate of messages (ready plus unacknowledged).

## rabbitmq.queue.active_consumers

The number of active consumers, consumers that can immediately receive any messages sent to the queue.

## rabbitmq.queue.bindings.count

The number of bindings for a specific queue.

## rabbitmq.queue.consumer_utilisation

The ratio of time that a queue’s consumers can take new messages.

## rabbitmq.queue.consumers

The number of consumers.

## rabbitmq.queue.memory

The number of bytes of memory consumed by the Erlang process associated with the queue, including stack, heap and internal structures.

## rabbitmq.queue.messages

The total number of messages in the queue.

The number of messages ready to be delivered to clients.

The number of messages ready to be delivered to clients per second.

## rabbitmq.queue.messages_unacknowledged

The number of messages delivered to clients but not yet acknowledged.

## rabbitmq.queue.messages_unacknowledged.rate

The number of messages delivered to clients but not yet acknowledged per second.

## rabbitmq.queue.messages.ack.count

The number of messages delivered to clients and acknowledged.

## rabbitmq.queue.messages.ack.rate

The number of messages delivered to clients and acknowledged per second.

## rabbitmq.queue.messages.deliver_get.count

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.queue.messages.deliver_get.rate

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get per second.

## rabbitmq.queue.messages.deliver.count

The number of messages delivered in acknowledgement mode to consumers.

## rabbitmq.queue.messages.deliver.rate

The number of messages delivered in acknowledgement mode to consumers.

## rabbitmq.queue.messages.publish.count

The number of messages published.

## rabbitmq.queue.messages.publish.rate

The rate of messages published per second.

## rabbitmq.queue.messages.rate

The total number of messages in the queue per second.

## rabbitmq.queue.messages.redeliver.count

The number of subset of messages in `deliver_get` which had the redelivered flag set.

## rabbitmq.queue.messages.redeliver.rate

The rate per second of subset of messages in `deliver_get` which had the redelivered flag set.

# RabbitMQ Metrics

## rabbitmq.connections

The number of current connections to a given rabbitmq vhost. Each connection is tagged as `rabbitmq_vhost:<vhost_name>`.

## rabbitmq.connections.state

The number of connections in the specified connection state.

## rabbitmq.exchange.messages.ack.count

The number of messages delivered to clients and acknowledged.

## rabbitmq.exchange.messages.ack.rate

The rate of messages delivered to clients and acknowledged per second.

## rabbitmq.exchange.messages.confirm.count

The number of messages confirmed.

## rabbitmq.exchange.messages.confirm.rate

The rate of messages confirmed per second.

## rabbitmq.exchange.messages.deliver_get.count

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.exchange.messages.deliver_get.rate

The rate per second of the sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.exchange.messages.publish_in.count

The number of messages published from channels into this exchange.

## rabbitmq.exchange.messages.publish_in.rate

The amount of messages published from channels into this exchange per second.

## rabbitmq.exchange.messages.publish_out.count

The number of messages published from this exchange into queues.

## rabbitmq.exchange.messages.publish_out.rate

The amount of messages published from this exchange into queues per second.

## rabbitmq.exchange.messages.publish.count

The number of messages published.

## rabbitmq.exchange.messages.publish.rate

The amount of messages published per second.

## rabbitmq.exchange.messages.redeliver.count

The number of subset of messages in `deliver_get` which had the redelivered flag set.

## rabbitmq.exchange.messages.redeliver.rate

The amount of subset of messages in `deliver_get` which had the redelivered flag set per second.

## rabbitmq.exchange.messages.return_unroutable.count

The number of messages returned to the publisher as unroutable.

## rabbitmq.exchange.messages.return_unroutable.rate

The amount of messages returned to publisher as unroutable per second.

## rabbitmq.node.disk_alarm

Defines whether the node has a disk alarm configured.

## rabbitmq.node.disk_free

The current free disk space.

## rabbitmq.node.fd_used

Used file descriptors.

## rabbitmq.node.mem_alarm

Defines whether the node has a memory alarm configured.

## rabbitmq.node.mem_used

The total memory used in bytes.

## rabbitmq.node.partitions

The number of network partitions this node is seeing.

## rabbitmq.node.run_queue

The average number of Erlang processes waiting to run.

## rabbitmq.node.running

Defines whether the node is running or not.

## rabbitmq.node.sockets_used

The number of file descriptors used as sockets.

## rabbitmq.overview.messages.ack.count

The number of messages delivered to clients and acknowledged.

## rabbitmq.overview.messages.ack.rate

The rate of messages delivered to clients and acknowledged per second.

## rabbitmq.overview.messages.confirm.count

The number of messages confirmed.

## rabbitmq.overview.messages.confirm.rate

The rate of messages confirmed per second.

## rabbitmq.overview.messages.deliver_get.count

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.overview.messages.deliver_get.rate

The rate per second of the sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.overview.messages.publish_in.count

The number of messages published from channels into this overview.

## rabbitmq.overview.messages.publish_in.rate

The rate of messages published from channels into this overview per second.

## rabbitmq.overview.messages.publish_out.count

The number of messages published from this overview into queues.

## rabbitmq.overview.messages.publish_out.rate

The rate of messages published from this overview into queues per second.

## rabbitmq.overview.messages.publish.count

The number of messages published.

## rabbitmq.overview.messages.publish.rate

The rate of messages published per second.

## rabbitmq.overview.messages.redeliver.count

The number of subset of messages in `deliver_get` which had the redelivered flag set.

## rabbitmq.overview.messages.redeliver.rate

The rate of subset of messages in `deliver_get` which had the redelivered flag set per second.

## rabbitmq.overview.messages.return_unroutable.count

The number of messages returned to publisher as unroutable.

## rabbitmq.overview.messages.return_unroutable.rate

The rate of messages returned to publisher as unroutable per second.

## rabbitmq.overview.object_totals.channels

The total number of channels.

## rabbitmq.overview.object_totals.connections

The total number of connections.

## rabbitmq.overview.object_totals.consumers

The total number of consumers.

## rabbitmq.overview.object_totals.queues

The total number of queues.

The number of messages ready for delivery.

The rate of messages ready for delivery.

## rabbitmq.overview.queue_totals.messages_unacknowledged.count

The number of unacknowledged messages.

## rabbitmq.overview.queue_totals.messages_unacknowledged.rate

The rate of unacknowledged messages.

## rabbitmq.overview.queue_totals.messages.count

The total number of messages (ready plus unacknowledged).

## rabbitmq.overview.queue_totals.messages.rate

The rate of messages (ready plus unacknowledged).

## rabbitmq.queue.active_consumers

The number of active consumers, consumers that can immediately receive any messages sent to the queue.

## rabbitmq.queue.bindings.count

The number of bindings for a specific queue.

## rabbitmq.queue.consumer_utilisation

The ratio of time that a queue’s consumers can take new messages.

## rabbitmq.queue.consumers

The number of consumers.

## rabbitmq.queue.memory

The number of bytes of memory consumed by the Erlang process associated with the queue, including stack, heap and internal structures.

## rabbitmq.queue.messages

The total number of messages in the queue.

The number of messages ready to be delivered to clients.

The number of messages ready to be delivered to clients per second.

## rabbitmq.queue.messages_unacknowledged

The number of messages delivered to clients but not yet acknowledged.

## rabbitmq.queue.messages_unacknowledged.rate

The number of messages delivered to clients but not yet acknowledged per second.

## rabbitmq.queue.messages.ack.count

The number of messages delivered to clients and acknowledged.

## rabbitmq.queue.messages.ack.rate

The number of messages delivered to clients and acknowledged per second.

## rabbitmq.queue.messages.deliver_get.count

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get.

## rabbitmq.queue.messages.deliver_get.rate

The sum of messages delivered in acknowledgement mode to consumers, in no-acknowledgement mode to consumers, in acknowledgement mode in response to basic.get, and in no-acknowledgement mode in response to basic.get per second.

## rabbitmq.queue.messages.deliver.count

The number of messages delivered in acknowledgement mode to consumers.

## rabbitmq.queue.messages.deliver.rate

The number of messages delivered in acknowledgement mode to consumers.

## rabbitmq.queue.messages.publish.count

The number of messages published.

## rabbitmq.queue.messages.publish.rate

The rate of messages published per second.

## rabbitmq.queue.messages.rate

The total number of messages in the queue per second.

## rabbitmq.queue.messages.redeliver.count

The number of subset of messages in `deliver_get` which had the redelivered flag set.

## rabbitmq.queue.messages.redeliver.rate

The rate per second of subset of messages in `deliver_get` which had the redelivered flag set.

# Supervisord Metrics

## supervisord.process.count

The number of supervisord monitored processes.

## supervisord.process.uptime

The process uptime.

# Supervisord Metrics

## supervisord.process.count

The number of supervisord monitored processes.

## supervisord.process.uptime

The process uptime.

# TCP Metrics

## network.tcp.response_time

The response time of a given host and TCP port.

# TCP Metrics

See TCP integration information.

## network.tcp.response_time

The response time of a given host and TCP port.

# Varnish Metrics

All Varnish metrics have the type gauge except `varnish.n_purgesps,` which has the type rate.

## varnish.accept_fail

Accept failures. This metric is only provided by varnish 3.x.

## varnish.backend_busy

Maximum number of connections to a given backend.

## varnish.backend_conn

Successful connections to a given backend.

## varnish.backend_fail

Failed connections for a given backend.

## varnish.backend_recycle

Backend connections with keep-alive that are returned to the pool of connections.

## varnish.backend_req

Backend requests.

## varnish.backend_retry

Backend connection retries.

## varnish.backend_reuse

Recycled connections that has were reused.

## varnish.backend_toolate

Backend connections closed because they were idle too long.

## varnish.backend_unhealthy

Backend connections not tried because the backend was unhealthy.

## varnish.bans

Bans in system, including bans superseded by newer bans and bans already checked by the ban-lurker. This metric is only provided by varnish 4.x.

Bans added to ban list. This metric is only provided by varnish 4.x.

## varnish.bans_completed

Bans which are no longer active, either because they got checked by the ban-lurker or superseded by newer identical bans. This metric is only provided by varnish 4.x.

## varnish.bans_deleted

Bans deleted from ban list. This metric is only provided by varnish 4.x.

## varnish.bans_dups

Bans replaced by later identical bans. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_contention

Times the ban-lurker waited for lookups. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_obj_killed

Objects killed by ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_tested

Bans and objects tested against each other by the ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_tests_tested

Tests and objects tested against each other by the ban-lurker. ‘ban req.url == foo && req.http.host == bar’ counts as one in ‘bans_tested’ and as two in ‘bans_tests_tested’. This metric is only provided by varnish 4.x.

## varnish.bans_obj

Bans which use obj.* variables. These bans can possibly be washed by the ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_obj_killed

Objects killed by bans during object lookup. This metric is only provided by varnish 4.x

## varnish.bans_persisted_bytes

Bytes used by the persisted ban lists. This metric is only provided by varnish 4.x.

## varnish.bans_persisted_fragmentation

Extra bytes accumulated through dropped and completed bans in the persistent ban lists. This metric is only provided by varnish 4.x.

## varnish.bans_req

Bans which use req.* variables. These bans can not be washed by the ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_tested

Bans and objects tested against each other during hash lookup. This metric is only provided by varnish 4.x.

## varnish.bans_tests_tested

Tests and objects tested against each other during lookup. ‘ban req.url == foo && req.http.host == bar’ counts as one in ‘bans_tested’ and as two in ‘bans_tests_tested’. This metric is only provided by varnish 4.x.

## varnish.busy_sleep

Requests sent to sleep without a worker thread because they found a busy object. This metric is only provided by varnish 4.x.

## varnish.busy_wakeup

Requests taken off the busy object sleep list and and rescheduled. This metric is only provided by varnish 4.x.

## varnish.cache_hit

Requests served from the cache.

## varnish.cache_hitpass

Requests passed to a backend where the decision to pass them found in the cache.

## varnish.cache_miss

Requests fetched from a backend server.

## varnish.client_conn

Client connections accepted. This metric is only provided by varnish 3.x.

## varnish.client_drop

Client connection dropped, no session. This metric is only provided by varnish 3.x.

## varnish.client_drop_late

Client connection dropped late. This metric is only provided by varnish 3.x.

## varnish.client_req

Parseable client requests seen.

## varnish.client_req_400

Requests that were malformed in some drastic way. This metric is only provided by varnish 4.x.

## varnish.client_req_411

Requests that were missing a Content-Length: header. This metric is only provided by varnish 4.x.

## varnish.client_req_413

Requests that were too big. This metric is only provided by varnish 4.x.

## varnish.client_req_417

Requests with a bad Expect: header. This metric is only provided by varnish 4.x.

## varnish.dir_dns_cache_full

DNS director full DNS cache. This metric is only provided by varnish 3.x.

## varnish.dir_dns_failed

DNS director failed lookup. This metric is only provided by varnish 3.x.

## varnish.dir_dns_hit

DNS director cached lookup hit. This metric is only provided by varnish 3.x.

## varnish.dir_dns_lookups

DNS director lookups. This metric is only provided by varnish 3.x.

## varnish.esi_errors

Edge Side Includes (ESI) parse errors.

## varnish.esi_warnings

Edge Side Includes (ESI) parse warnings.

## varnish.exp_mailed

Objects mailed to expiry thread for handling. This metric is only provided by varnish 4.x.

Objects received by expiry thread for handling. This metric is only provided by varnish 4.x.

## varnish.fetch_1xx

Back end response with no body because of 1XX response (Informational).

## varnish.fetch_204

Back end response with no body because of 204 response (No Content).

## varnish.fetch_304

Back end response with no body because of 304 response (Not Modified).

Back end response’s body length could not be determined and/or had bad headers.

## varnish.fetch_chunked

Back end response bodies that were chunked.

## varnish.fetch_close

Fetch wanted close.

## varnish.fetch_eof

Back end response bodies with EOF.

## varnish.fetch_failed

Back end response fetches that failed.

Back end HEAD requests.

## varnish.fetch_length

Back end response bodies with Content-Length.

Back end fetches that failed because no thread was available. This metric is only provided by varnish 4.x.

## varnish.fetch_oldhttp

Number of responses served by backends with http < 1.1

## varnish.fetch_zero

Number of responses that have zero length.

## varnish.hcb_insert

HCB inserts.

## varnish.hcb_lock

HCB lookups with lock.

## varnish.hcb_nolock

HCB lookups without lock.

## varnish.LCK.backend.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.backend.creat

Created locks.

## varnish.LCK.backend.destroy

Destroyed locks.

## varnish.LCK.backend.locks

Lock operations.

## varnish.LCK.ban.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.ban.creat

Created locks.

## varnish.LCK.ban.destroy

Destroyed locks.

## varnish.LCK.ban.locks

Lock operations.

## varnish.LCK.busyobj.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.busyobj.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.busyobj.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.cli.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.cli.creat

Created locks.

## varnish.LCK.cli.destroy

Destroyed locks.

## varnish.LCK.cli.locks

Lock operations.

## varnish.LCK.exp.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.exp.creat

Created locks.

## varnish.LCK.exp.destroy

Destroyed locks.

## varnish.LCK.exp.locks

Lock operations.

## varnish.LCK.hcb.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.hcb.creat

Created locks.

## varnish.LCK.hcb.destroy

Destroyed locks.

## varnish.LCK.hcb.locks

Lock operations.

## varnish.LCK.hcl.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.hcl.creat

Created locks.

## varnish.LCK.hcl.destroy

Destroyed locks.

## varnish.LCK.hcl.locks

Lock operations.

## varnish.LCK.herder.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.herder.creat

Created locks.

## varnish.LCK.herder.destroy

Destroyed locks.

## varnish.LCK.herder.locks

Lock operations.

## varnish.LCK.hsl.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.hsl.creat

Created locks.

## varnish.LCK.hsl.destroy

Destroyed locks.

## varnish.LCK.hsl.locks

Lock operations.

## varnish.LCK.lru.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.lru.creat

Created locks.

## varnish.LCK.lru.destroy

Destroyed locks.

## varnish.LCK.lru.locks

Lock operations.

## varnish.LCK.mempool.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.mempool.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.mempool.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.nbusyobj.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.nbusyobj.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.nbusyobj.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.objhdr.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.objhdr.creat

Created locks.

## varnish.LCK.objhdr.destroy

Destroyed locks.

## varnish.LCK.objhdr.locks

Lock operations.

## varnish.LCK.pipestat.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.pipestat.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.pipestat.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.sess.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.sess.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.sess.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.sessmem.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.sessmem.creat

Created locks.

## varnish.LCK.sessmem.destroy

Destroyed locks.

## varnish.LCK.sessmem.locks

Lock operations.

## varnish.LCK.sma.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.sma.creat

Created locks.

## varnish.LCK.sma.destroy

Destroyed locks.

## varnish.LCK.sma.locks

Lock operations.

## varnish.LCK.smf.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.smf.creat

Created locks.

## varnish.LCK.smf.destroy

Destroyed locks.

## varnish.LCK.smf.locks

Lock operations.

## varnish.LCK.smp.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.smp.creat

Created locks.

## varnish.LCK.smp.destroy

Destroyed locks.

## varnish.LCK.smp.locks

Lock operations.

## varnish.LCK.sms.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.sms.creat

Created locks.

## varnish.LCK.sms.destroy

Destroyed locks.

## varnish.LCK.sms.locks

Lock operations.

## varnish.LCK.stat.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.stat.creat

Created locks. This metric is only provided by varnish 3.x.

## varnish.LCK.stat.destroy

Destroyed locks. This metric is only provided by varnish 3.x.

## varnish.LCK.stat.locks

Lock operations. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.creat

Created locks. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.destroy

Destroyed locks. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.locks

Lock operations. This metric is only provided by varnish 3.x.

## varnish.LCK.vbp.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.vbp.creat

Created locks.

## varnish.LCK.vbp.destroy

Destroyed locks.

## varnish.LCK.vbp.locks

Lock operations.

## varnish.LCK.vcapace.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vcapace.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vcapace.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.vcl.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.vcl.creat

Created locks.

## varnish.LCK.vcl.destroy

Destroyed locks.

## varnish.LCK.vcl.locks

Lock operations.

## varnish.LCK.vxid.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vxid.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vxid.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.wq.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.wq.creat

Created locks.

## varnish.LCK.wq.destroy

Destroyed locks.

## varnish.LCK.wq.locks

Lock operations.

## varnish.LCK.wstat.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.wstat.creat

Created locks.

## varnish.LCK.wstat.destroy

Destroyed locks.

## varnish.LCK.wstat.locks

Lock operations.

## varnish.MEMPOOL.busyobj.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MGT.child_died

Child processes that died due to signals. This metric is only provided by varnish 4.x.

## varnish.MGT.child_dump

Child processes that produced core dumps. This metric is only provided by varnish 4.x.

## varnish.MGT.child_exit

Child processes the were cleanly stopped. This metric is only provided by varnish 4.x.

## varnish.MGT.child_panic

Child processes that panicked. This metric is only provided by varnish 4.x.

## varnish.MGT.child_start

Child processes that started. This metric is only provided by varnish 4.x.

## varnish.MGT.child_stop

Child processes that exited with an unexpected return code. This metric is only provided by varnish 4.x.

## varnish.MGT.uptime

This metric is only provided by varnish 4.x.

## varnish.n_backend

Number of backends.

## varnish.n_ban

Active bans. This metric is only provided by varnish 3.x.

New bans added. This metric is only provided by varnish 3.x.

## varnish.n_ban_dups

Duplicate bans removed. This metric is only provided by varnish 3.x.

## varnish.n_ban_obj_test

Objects tested. This metric is only provided by varnish 3.x.

## varnish.n_ban_re_test

Regexps tested against. This metric is only provided by varnish 3.x.

## varnish.n_ban_retire

Old bans deleted. This metric is only provided by varnish 3.x.

## varnish.n_expired

Objects that expired from cache because of TTL.

## varnish.n_gunzip

Gunzip operations.

## varnish.n_gzip

Gzip operations.

varnish.n_lru_moved

Move operations done on the LRU list.

## varnish.n_lru_nuked

Objects forcefully evicted from storage to make room for new objects.

## varnish.n_obj_purged

Purged objects. This metric is only provided by varnish 4.x.

## varnish.n_objoverflow

Objects overflowing workspace. This metric is only provided by varnish 3.x.

## varnish.n_objsendfile

Objects sent with sendfile. This metric is only provided by varnish 3.x.

## varnish.n_objwrite

Objects sent with write. This metric is only provided by varnish 3.x.

## varnish.n_purges

Purges executed. This metric is only provided by varnish 4.x.

## varnish.n_sess

sess structs made. This metric is only provided by varnish 3.x.

## varnish.n_sess_mem

sess_mem structs made. This metric is only provided by varnish 3.x.

## varnish.n_vampireobject

Unresurrected objects.

## varnish.n_vbc

vbc structs made. This metric is only provided by varnish 3.x.

## varnish.n_vcl_avail

Available VCLs.

## varnish.n_wrk

Worker threads. This metric is only provided by varnish 3.x.

## varnish.n_wrk_create

Worker threads created. This metric is only provided by varnish 3.x.

## varnish.n_wrk_drop

Dropped work requests. This metric is only provided by varnish 3.x.

## varnish.n_wrk_failed

Worker threads not created. This metric is only provided by varnish 3.x.

## varnish.n_wrk_lqueue

Work request queue length. This metric is only provided by varnish 3.x.

## varnish.n_wrk_max

Worker threads limited. This metric is only provided by varnish 3.x.

## varnish.n_wrk_queued

Queued work requests. This metric is only provided by varnish 3.x.

## varnish.pools

Thread pools. This metric is only provided by varnish 4.x.

## varnish.s_bodybytes

Total body size. This metric is only provided by varnish 3.x.

## varnish.s_fetch

Backend fetches.

## varnish.s_hdrbytes

Total header size. This metric is only provided by varnish 3.x.

## varnish.s_pass

Passed requests.

## varnish.s_pipe

Pipe sessions seen.

## varnish.s_pipe_hdrbytes

Total request bytes received for piped sessions. This metric is only provided by varnish 4.x.

## varnish.s_pipe_in

Total number of bytes forwarded from clients in pipe sessions. This metric is only provided by varnish 4.x.

## varnish.s_pipe_out

Total number of bytes forwarded to clients in pipe sessions. This metric is only provided by varnish 4.x.

## varnish.s_req

Requests.

## varnish.s_req_bodybytes

Total request body bytes received. This metric is only provided by varnish 4.x.

## varnish.s_req_hdrbytes

Total request header bytes received. This metric is only provided by varnish 4.x.

## varnish.s_resp_bodybytes

Total response body bytes transmitted. This metric is only provided by varnish 4.x.

## varnish.s_resp_hdrbytes

Total response header bytes transmitted. This metric is only provided by varnish 4.x.

## varnish.s_sess

Client connections.

## varnish.s_synth

Synthetic responses made. This metric is only provided by varnish 4.x.

## varnish.sess_closed

Client connections closed.

## varnish.sess_conn

Client connections accepted. This metric is only provided by varnish 4.x.

## varnish.sess_drop

Client connections dropped due to lack of worker thread. This metric is only provided by varnish 4.x.

## varnish.sess_dropped

Client connections dropped due to a full queue. This metric is only provided by varnish 4.x.

## varnish.sess_fail

Failures to accept a TCP connection. Either the client changed its mind, or the kernel ran out of some resource like file descriptors. This metric is only provided by varnish 4.x.

## varnish.sess_herd varnish.sess_linger

This metric is only provided by varnish 3.x.

## varnish.sess_pipe_overflow

This metric is only provided by varnish 4.x.

## varnish.sess_pipeline varnish.sess_queued

Client connections queued to wait for a thread. This metric is only provided by varnish 4.x.

SHM MTX contention.

## varnish.shm_cycles

SHM cycles through buffer.

## varnish.shm_flushes

SHM flushes due to overflow.

## varnish.shm_records

SHM records.

## varnish.shm_writes

SHM writes.

## varnish.SMA.s0.c_bytes

Total space allocated by this storage.

## varnish.SMA.s0.c_fail

Times the storage has failed to provide a storage segment.

## varnish.SMA.s0.c_freed

Total space returned to this storage.

## varnish.SMA.s0.c_req

Times the storage has been asked to provide a storage segment.

## varnish.SMA.s0.g_alloc

Storage allocations outstanding.

## varnish.SMA.s0.g_bytes

Space allocated from the storage.

## varnish.SMA.s0.g_space

Space left in the storage.

## varnish.SMA.Transient.c_bytes

Total space allocated by this storage.

## varnish.SMA.Transient.c_fail

Times the storage has failed to provide a storage segment.

## varnish.SMA.Transient.c_freed

Total space returned to this storage.

## varnish.SMA.Transient.c_req

Times the storage has been asked to provide a storage segment.

## varnish.SMA.Transient.g_alloc

Storage allocations outstanding.

## varnish.SMA.Transient.g_bytes

Space allocated from the storage.

## varnish.SMA.Transient.g_space

Space left in the storage.

## varnish.sms_balloc

SMS space allocated.

## varnish.sms_bfree

SMS space freed.

## varnish.sms_nbytes

SMS outstanding space.

## varnish.sms_nobj

SMS outstanding allocations.

## varnish.sms_nreq

SMS allocator requests.

Length of session queue waiting for threads. This metric is only provided by varnish 4.x.

Number of threads. This metric is only provided by varnish 4.x.

Threads created. This metric is only provided by varnish 4.x.

Threads destroyed. This metric is only provided by varnish 4.x.

Threads that failed to get created. This metric is only provided by varnish 4.x.

Threads that were needed but couldn’t be created because of a thread pool limit. This metric is only provided by varnish 4.x.

## varnish.vmods

Loaded VMODs. This metric is only provided by varnish 4.x.

## varnish.vsm_cooling

Space which will soon (max 1 minute) be freed in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_free

Free space in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_overflow

Data which does not fit in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_overflowed

Total data which did not fit in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_used

Used space in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.n_purgesps

Purges executed. This metric is only provided by varnish 4.x.

# Varnish Metrics

All Varnish metrics have the type gauge except `varnish.n_purgesps,` which has the type rate.

## varnish.accept_fail

Accept failures. This metric is only provided by varnish 3.x.

## varnish.backend_busy

Maximum number of connections to a given backend.

## varnish.backend_conn

Successful connections to a given backend.

## varnish.backend_fail

Failed connections for a given backend.

## varnish.backend_recycle

Backend connections with keep-alive that are returned to the pool of connections.

## varnish.backend_req

Backend requests.

## varnish.backend_retry

Backend connection retries.

## varnish.backend_reuse

Recycled connections that has were reused.

## varnish.backend_toolate

Backend connections closed because they were idle too long.

## varnish.backend_unhealthy

Backend connections not tried because the backend was unhealthy.

## varnish.bans

Bans in system, including bans superseded by newer bans and bans already checked by the ban-lurker. This metric is only provided by varnish 4.x.

Bans added to ban list. This metric is only provided by varnish 4.x.

## varnish.bans_completed

Bans which are no longer active, either because they got checked by the ban-lurker or superseded by newer identical bans. This metric is only provided by varnish 4.x.

## varnish.bans_deleted

Bans deleted from ban list. This metric is only provided by varnish 4.x.

## varnish.bans_dups

Bans replaced by later identical bans. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_contention

Times the ban-lurker waited for lookups. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_obj_killed

Objects killed by ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_tested

Bans and objects tested against each other by the ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_lurker_tests_tested

Tests and objects tested against each other by the ban-lurker. ‘ban req.url == foo && req.http.host == bar’ counts as one in ‘bans_tested’ and as two in ‘bans_tests_tested’. This metric is only provided by varnish 4.x.

## varnish.bans_obj

Bans which use obj.* variables. These bans can possibly be washed by the ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_obj_killed

Objects killed by bans during object lookup. This metric is only provided by varnish 4.x

## varnish.bans_persisted_bytes

Bytes used by the persisted ban lists. This metric is only provided by varnish 4.x.

## varnish.bans_persisted_fragmentation

Extra bytes accumulated through dropped and completed bans in the persistent ban lists. This metric is only provided by varnish 4.x.

## varnish.bans_req

Bans which use req.* variables. These bans can not be washed by the ban-lurker. This metric is only provided by varnish 4.x.

## varnish.bans_tested

Bans and objects tested against each other during hash lookup. This metric is only provided by varnish 4.x.

## varnish.bans_tests_tested

Tests and objects tested against each other during lookup. ‘ban req.url == foo && req.http.host == bar’ counts as one in ‘bans_tested’ and as two in ‘bans_tests_tested’. This metric is only provided by varnish 4.x.

## varnish.busy_sleep

Requests sent to sleep without a worker thread because they found a busy object. This metric is only provided by varnish 4.x.

## varnish.busy_wakeup

Requests taken off the busy object sleep list and and rescheduled. This metric is only provided by varnish 4.x.

## varnish.cache_hit

Requests served from the cache.

## varnish.cache_hitpass

Requests passed to a backend where the decision to pass them found in the cache.

## varnish.cache_miss

Requests fetched from a backend server.

## varnish.client_conn

Client connections accepted. This metric is only provided by varnish 3.x.

## varnish.client_drop

Client connection dropped, no session. This metric is only provided by varnish 3.x.

## varnish.client_drop_late

Client connection dropped late. This metric is only provided by varnish 3.x.

## varnish.client_req

Parseable client requests seen.

## varnish.client_req_400

Requests that were malformed in some drastic way. This metric is only provided by varnish 4.x.

## varnish.client_req_411

Requests that were missing a Content-Length: header. This metric is only provided by varnish 4.x.

## varnish.client_req_413

Requests that were too big. This metric is only provided by varnish 4.x.

## varnish.client_req_417

Requests with a bad Expect: header. This metric is only provided by varnish 4.x.

## varnish.dir_dns_cache_full

DNS director full DNS cache. This metric is only provided by varnish 3.x.

## varnish.dir_dns_failed

DNS director failed lookup. This metric is only provided by varnish 3.x.

## varnish.dir_dns_hit

DNS director cached lookup hit. This metric is only provided by varnish 3.x.

## varnish.dir_dns_lookups

DNS director lookups. This metric is only provided by varnish 3.x.

## varnish.esi_errors

Edge Side Includes (ESI) parse errors.

## varnish.esi_warnings

Edge Side Includes (ESI) parse warnings.

## varnish.exp_mailed

Objects mailed to expiry thread for handling. This metric is only provided by varnish 4.x.

Objects received by expiry thread for handling. This metric is only provided by varnish 4.x.

## varnish.fetch_1xx

Back end response with no body because of 1XX response (Informational).

## varnish.fetch_204

Back end response with no body because of 204 response (No Content).

## varnish.fetch_304

Back end response with no body because of 304 response (Not Modified).

Back end response’s body length could not be determined and/or had bad headers.

## varnish.fetch_chunked

Back end response bodies that were chunked.

## varnish.fetch_close

Fetch wanted close.

## varnish.fetch_eof

Back end response bodies with EOF.

## varnish.fetch_failed

Back end response fetches that failed.

Back end HEAD requests.

## varnish.fetch_length

Back end response bodies with Content-Length.

Back end fetches that failed because no thread was available. This metric is only provided by varnish 4.x.

## varnish.fetch_oldhttp

Number of responses served by backends with http < 1.1

## varnish.fetch_zero

Number of responses that have zero length.

## varnish.hcb_insert

HCB inserts.

## varnish.hcb_lock

HCB lookups with lock.

## varnish.hcb_nolock

HCB lookups without lock.

## varnish.LCK.backend.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.backend.creat

Created locks.

## varnish.LCK.backend.destroy

Destroyed locks.

## varnish.LCK.backend.locks

Lock operations.

## varnish.LCK.ban.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.ban.creat

Created locks.

## varnish.LCK.ban.destroy

Destroyed locks.

## varnish.LCK.ban.locks

Lock operations.

## varnish.LCK.busyobj.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.busyobj.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.busyobj.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.cli.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.cli.creat

Created locks.

## varnish.LCK.cli.destroy

Destroyed locks.

## varnish.LCK.cli.locks

Lock operations.

## varnish.LCK.exp.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.exp.creat

Created locks.

## varnish.LCK.exp.destroy

Destroyed locks.

## varnish.LCK.exp.locks

Lock operations.

## varnish.LCK.hcb.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.hcb.creat

Created locks.

## varnish.LCK.hcb.destroy

Destroyed locks.

## varnish.LCK.hcb.locks

Lock operations.

## varnish.LCK.hcl.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.hcl.creat

Created locks.

## varnish.LCK.hcl.destroy

Destroyed locks.

## varnish.LCK.hcl.locks

Lock operations.

## varnish.LCK.herder.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.herder.creat

Created locks.

## varnish.LCK.herder.destroy

Destroyed locks.

## varnish.LCK.herder.locks

Lock operations.

## varnish.LCK.hsl.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.hsl.creat

Created locks.

## varnish.LCK.hsl.destroy

Destroyed locks.

## varnish.LCK.hsl.locks

Lock operations.

## varnish.LCK.lru.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.lru.creat

Created locks.

## varnish.LCK.lru.destroy

Destroyed locks.

## varnish.LCK.lru.locks

Lock operations.

## varnish.LCK.mempool.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.mempool.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.mempool.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.nbusyobj.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.nbusyobj.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.nbusyobj.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.objhdr.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.objhdr.creat

Created locks.

## varnish.LCK.objhdr.destroy

Destroyed locks.

## varnish.LCK.objhdr.locks

Lock operations.

## varnish.LCK.pipestat.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.pipestat.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.pipestat.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.sess.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.sess.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.sess.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.sessmem.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.sessmem.creat

Created locks.

## varnish.LCK.sessmem.destroy

Destroyed locks.

## varnish.LCK.sessmem.locks

Lock operations.

## varnish.LCK.sma.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.sma.creat

Created locks.

## varnish.LCK.sma.destroy

Destroyed locks.

## varnish.LCK.sma.locks

Lock operations.

## varnish.LCK.smf.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.smf.creat

Created locks.

## varnish.LCK.smf.destroy

Destroyed locks.

## varnish.LCK.smf.locks

Lock operations.

## varnish.LCK.smp.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.smp.creat

Created locks.

## varnish.LCK.smp.destroy

Destroyed locks.

## varnish.LCK.smp.locks

Lock operations.

## varnish.LCK.sms.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.sms.creat

Created locks.

## varnish.LCK.sms.destroy

Destroyed locks.

## varnish.LCK.sms.locks

Lock operations.

## varnish.LCK.stat.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.stat.creat

Created locks. This metric is only provided by varnish 3.x.

## varnish.LCK.stat.destroy

Destroyed locks. This metric is only provided by varnish 3.x.

## varnish.LCK.stat.locks

Lock operations. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.creat

Created locks. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.destroy

Destroyed locks. This metric is only provided by varnish 3.x.

## varnish.LCK.vbe.locks

Lock operations. This metric is only provided by varnish 3.x.

## varnish.LCK.vbp.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.vbp.creat

Created locks.

## varnish.LCK.vbp.destroy

Destroyed locks.

## varnish.LCK.vbp.locks

Lock operations.

## varnish.LCK.vcapace.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vcapace.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vcapace.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.vcl.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.vcl.creat

Created locks.

## varnish.LCK.vcl.destroy

Destroyed locks.

## varnish.LCK.vcl.locks

Lock operations.

## varnish.LCK.vxid.creat

Created locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vxid.destroy

Destroyed locks. This metric is only provided by varnish 4.x.

## varnish.LCK.vxid.locks

Lock operations. This metric is only provided by varnish 4.x.

## varnish.LCK.wq.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.wq.creat

Created locks.

## varnish.LCK.wq.destroy

Destroyed locks.

## varnish.LCK.wq.locks

Lock operations.

## varnish.LCK.wstat.colls

Collisions. This metric is only provided by varnish 3.x.

## varnish.LCK.wstat.creat

Created locks.

## varnish.LCK.wstat.destroy

Destroyed locks.

## varnish.LCK.wstat.locks

Lock operations.

## varnish.MEMPOOL.busyobj.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.busyobj.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req0.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.req1.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess0.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.sess1.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.allocs

Allocations. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.frees

Frees. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.live

In use. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.pool

In pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.randry

Pool ran dry. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.recycle

Recycled from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.surplus

Too many for pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.sz_needed

Size allocated. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.sz_wanted

Size requested. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.timeout

Timed out from pool. This metric is only provided by varnish 4.x.

## varnish.MEMPOOL.vbc.toosmall

Too small to recycle. This metric is only provided by varnish 4.x.

## varnish.MGT.child_died

Child processes that died due to signals. This metric is only provided by varnish 4.x.

## varnish.MGT.child_dump

Child processes that produced core dumps. This metric is only provided by varnish 4.x.

## varnish.MGT.child_exit

Child processes the were cleanly stopped. This metric is only provided by varnish 4.x.

## varnish.MGT.child_panic

Child processes that panicked. This metric is only provided by varnish 4.x.

## varnish.MGT.child_start

Child processes that started. This metric is only provided by varnish 4.x.

## varnish.MGT.child_stop

Child processes that exited with an unexpected return code. This metric is only provided by varnish 4.x.

## varnish.MGT.uptime

This metric is only provided by varnish 4.x.

## varnish.n_backend

Number of backends.

## varnish.n_ban

Active bans. This metric is only provided by varnish 3.x.

New bans added. This metric is only provided by varnish 3.x.

## varnish.n_ban_dups

Duplicate bans removed. This metric is only provided by varnish 3.x.

## varnish.n_ban_obj_test

Objects tested. This metric is only provided by varnish 3.x.

## varnish.n_ban_re_test

Regexps tested against. This metric is only provided by varnish 3.x.

## varnish.n_ban_retire

Old bans deleted. This metric is only provided by varnish 3.x.

## varnish.n_expired

Objects that expired from cache because of TTL.

## varnish.n_gunzip

Gunzip operations.

## varnish.n_gzip

Gzip operations.

varnish.n_lru_moved

Move operations done on the LRU list.

## varnish.n_lru_nuked

Objects forcefully evicted from storage to make room for new objects.

## varnish.n_obj_purged

Purged objects. This metric is only provided by varnish 4.x.

## varnish.n_objoverflow

Objects overflowing workspace. This metric is only provided by varnish 3.x.

## varnish.n_objsendfile

Objects sent with sendfile. This metric is only provided by varnish 3.x.

## varnish.n_objwrite

Objects sent with write. This metric is only provided by varnish 3.x.

## varnish.n_purges

Purges executed. This metric is only provided by varnish 4.x.

## varnish.n_sess

sess structs made. This metric is only provided by varnish 3.x.

## varnish.n_sess_mem

sess_mem structs made. This metric is only provided by varnish 3.x.

## varnish.n_vampireobject

Unresurrected objects.

## varnish.n_vbc

vbc structs made. This metric is only provided by varnish 3.x.

## varnish.n_vcl_avail

Available VCLs.

## varnish.n_wrk

Worker threads. This metric is only provided by varnish 3.x.

## varnish.n_wrk_create

Worker threads created. This metric is only provided by varnish 3.x.

## varnish.n_wrk_drop

Dropped work requests. This metric is only provided by varnish 3.x.

## varnish.n_wrk_failed

Worker threads not created. This metric is only provided by varnish 3.x.

## varnish.n_wrk_lqueue

Work request queue length. This metric is only provided by varnish 3.x.

## varnish.n_wrk_max

Worker threads limited. This metric is only provided by varnish 3.x.

## varnish.n_wrk_queued

Queued work requests. This metric is only provided by varnish 3.x.

## varnish.pools

Thread pools. This metric is only provided by varnish 4.x.

## varnish.s_bodybytes

Total body size. This metric is only provided by varnish 3.x.

## varnish.s_fetch

Backend fetches.

## varnish.s_hdrbytes

Total header size. This metric is only provided by varnish 3.x.

## varnish.s_pass

Passed requests.

## varnish.s_pipe

Pipe sessions seen.

## varnish.s_pipe_hdrbytes

Total request bytes received for piped sessions. This metric is only provided by varnish 4.x.

## varnish.s_pipe_in

Total number of bytes forwarded from clients in pipe sessions. This metric is only provided by varnish 4.x.

## varnish.s_pipe_out

Total number of bytes forwarded to clients in pipe sessions. This metric is only provided by varnish 4.x.

## varnish.s_req

Requests.

## varnish.s_req_bodybytes

Total request body bytes received. This metric is only provided by varnish 4.x.

## varnish.s_req_hdrbytes

Total request header bytes received. This metric is only provided by varnish 4.x.

## varnish.s_resp_bodybytes

Total response body bytes transmitted. This metric is only provided by varnish 4.x.

## varnish.s_resp_hdrbytes

Total response header bytes transmitted. This metric is only provided by varnish 4.x.

## varnish.s_sess

Client connections.

## varnish.s_synth

Synthetic responses made. This metric is only provided by varnish 4.x.

## varnish.sess_closed

Client connections closed.

## varnish.sess_conn

Client connections accepted. This metric is only provided by varnish 4.x.

## varnish.sess_drop

Client connections dropped due to lack of worker thread. This metric is only provided by varnish 4.x.

## varnish.sess_dropped

Client connections dropped due to a full queue. This metric is only provided by varnish 4.x.

## varnish.sess_fail

Failures to accept a TCP connection. Either the client changed its mind, or the kernel ran out of some resource like file descriptors. This metric is only provided by varnish 4.x.

## varnish.sess_herd varnish.sess_linger

This metric is only provided by varnish 3.x.

## varnish.sess_pipe_overflow

This metric is only provided by varnish 4.x.

## varnish.sess_pipeline varnish.sess_queued

Client connections queued to wait for a thread. This metric is only provided by varnish 4.x.

SHM MTX contention.

## varnish.shm_cycles

SHM cycles through buffer.

## varnish.shm_flushes

SHM flushes due to overflow.

## varnish.shm_records

SHM records.

## varnish.shm_writes

SHM writes.

## varnish.SMA.s0.c_bytes

Total space allocated by this storage.

## varnish.SMA.s0.c_fail

Times the storage has failed to provide a storage segment.

## varnish.SMA.s0.c_freed

Total space returned to this storage.

## varnish.SMA.s0.c_req

Times the storage has been asked to provide a storage segment.

## varnish.SMA.s0.g_alloc

Storage allocations outstanding.

## varnish.SMA.s0.g_bytes

Space allocated from the storage.

## varnish.SMA.s0.g_space

Space left in the storage.

## varnish.SMA.Transient.c_bytes

Total space allocated by this storage.

## varnish.SMA.Transient.c_fail

Times the storage has failed to provide a storage segment.

## varnish.SMA.Transient.c_freed

Total space returned to this storage.

## varnish.SMA.Transient.c_req

Times the storage has been asked to provide a storage segment.

## varnish.SMA.Transient.g_alloc

Storage allocations outstanding.

## varnish.SMA.Transient.g_bytes

Space allocated from the storage.

## varnish.SMA.Transient.g_space

Space left in the storage.

## varnish.sms_balloc

SMS space allocated.

## varnish.sms_bfree

SMS space freed.

## varnish.sms_nbytes

SMS outstanding space.

## varnish.sms_nobj

SMS outstanding allocations.

## varnish.sms_nreq

SMS allocator requests.

Length of session queue waiting for threads. This metric is only provided by varnish 4.x.

Number of threads. This metric is only provided by varnish 4.x.

Threads created. This metric is only provided by varnish 4.x.

Threads destroyed. This metric is only provided by varnish 4.x.

Threads that failed to get created. This metric is only provided by varnish 4.x.

Threads that were needed but couldn’t be created because of a thread pool limit. This metric is only provided by varnish 4.x.

## varnish.vmods

Loaded VMODs. This metric is only provided by varnish 4.x.

## varnish.vsm_cooling

Space which will soon (max 1 minute) be freed in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_free

Free space in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_overflow

Data which does not fit in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_overflowed

Total data which did not fit in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.vsm_used

Used space in the shared memory used to communicate with tools like varnishstat, varnishlog etc. This metric is only provided by varnish 4.x.

## varnish.n_purgesps

Purges executed. This metric is only provided by varnish 4.x.

# Benchmarks and Compliance

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible one. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between legacy Sysdig and Prometheus naming conventions.

Compliance metrics are generated from scheduled CIS Benchmark scans that occur in Sysdig Secure. These metrics cover aggregate results of the various CIS Benchmark sections, as well as granular details about how many running containers are failing specific run-time compliance checks.

# Docker/CIS Benchmarks

## compliance.docker-bench.container-images-and-build-file.pass_pct

The percentage of successful Docker benchmark tests run on the container images and build files.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-images-and-build-file.tests_fail

The number of failed Docker benchmark tests run against the container images and build file.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-images-and-build-file.tests_pass

The number of successful Docker benchmark tests run against the container images and build file.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-images-and-build-file.tests_total

The total number of tests run against the container images and build file.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-runtime.pass_pct

The percentage of successful container runtime Docker benchmark tests.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-runtime.tests_fail

The number of failed container runtime benchmark tests.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-runtime.tests_pass

The number of successful container runtime Docker benchmark tests.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.container-runtime.tests_total

The total number of Docker benchmark tests run against container runtimes.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of containers running without kernel restrictions in place.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-maxretry-not-set

The number of containers configured to not limit installation retries if the initial attempt fails.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-mount-prop-shared

The number of containers that use mount propagation.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-networking-host

The number of containers that share the host’s network namespace.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-apparmor

The number of containers running without an AppArmor profile.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-cpu-limits

The number of containers running with no CPU limits configured.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-health-check

The number of containers that have no `HEALTHCHECK` instruction configured.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-mem-limits

The number of containers configured to run without memory limitations.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-pids-cgroup-limit

The number of containers that do not use a `cgroup` for PIDs.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-restricted-privs

The number of containers running that can have additional privileges configured.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-seccomp

The number of containers that disable the default `seccomp` profile.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-securityopts

The number of containers running without SELinux options configured.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-no-ulimit-override

The number of containers running that override the default `ulimit`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-privileged-ports

The number of containers that have privileged ports mapped into them.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-root-mounted-rw

The number of containers that mount the host’s root filesystem with read/write privileges.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-running-privileged

The number of containers running with the `--privileged` configuration option set.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sensitive-dirs

The number of containers that have mounted a sensitive directory from the host.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sharing-docker-sock

The number of containers that share the host’s docker socket.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sharing-host-devs

The number of containers that share one or more host devices.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sharing-host-ipc-ns

The number of containers that share the host’s IPC namespace.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sharing-host-pid-ns

The number of containers that share the host’s PID namespace.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sharing-host-user-ns

The number of containers that share the host’s user namespace.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sharing-host-uts-ns

The number of containers that share the host’s UTS namespace.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-sshd-docker-exec-failures

The number of containers running an SSH daemon.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-unexpected-cgroup

The number of containers running without a dedicated `cgroup` configured.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-using-docker0-net

The number of containers using the default docker bridge network `docker0`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.c-wildcard-bound-port

The number of containers that do not bind incoming traffic to a specific interface.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration.pass_pct

The percentage of successful Docker benchmark tests run against the Docker daemon configuration.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration.tests_fail

The number of benchmark tests run against the Docker daemon configuration that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration.tests_pass

The number of benchmark tests run against the Docker daemon configuration that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration.tests_total

The total number of benchmark tests run against the Docker daemon configuration.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration-files.pass_pct

The percentage of successful Docker benchmark tests run against the Docker daemon configuration files.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration-files.tests_fail

The number of benchmark tests run against the Docker daemon configuration files that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration-files.tests_pass

The number of benchmark tests run against the Docker daemon configuration files that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-daemon-configuration-files.tests_total

The total number of benchmark tests run against the Docker daemon configuration files.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-security-operations.pass_pct

The percentage of benchmark tests run against Docker security operations that were successful.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-security-operations.tests_fail

The number of benchmark tests run against Docker security operations that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-security-operations.tests_pass

The number of benchmark tests run against Docker security operations that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-security-operations.tests_total

The total number of benchmark tests run against Docker security operations.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-swarm-configuration.pass_pct

The percentage of benchmark tests run against the Docker swarm configuration that were successful.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-swarm-configuration.tests_fail

The number of benchmark tests run against the Docker swarm configuration that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Maxv

## compliance.docker-bench.docker-swarm-configuration.tests_pass

The number of benchmark tests run against the Docker swarm configuration that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-swarm-configuration.tests_total

The total number of benchmark tests run against the Docker swarm configuration.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.docker-users

The number of user accounts with permission to access the Docker daemon socket.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.host-configuration.pass_pct

The percentage of benchmark tests run against the host configuration that were successful.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.host-configuration.tests_fail

The number of benchmark tests run against the host configuration that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.host-configuration.tests_pass

The number of benchmark tests run against the host configuration that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.host-configuration.tests_total

The total number of benchmark tests run against the host configuration.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of images that use the `COPY` function rather than the `ADD` function in Dockerfile.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.img-no-healthcheck

The number of images with no `HEALTHCHECK` instruction configured.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.img-running-root

The number of images that use the root user.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.img-update-insts-found

The number of images that run a package update step without a package installation step.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.pass_pct

The percentage of Docker benchmark tests run that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.score

The current pass/fail score for Docker benchmark tests run. The value of this metric is calculated by starting at zero, and incrementing once for every successful test, and decrementing once for every test that returns a `WARN` result or worse.

Metric TypeCounter
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.tests_fail

The total number of Docker benchmark tests that have failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.tests_pass

The total number of Docker benchmark tests that have passed

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.docker-bench.tests_total

The total number of Docker benchmark tests that have been run.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Kubernetes Benchmarks

## compliance.k8s-bench.api-server.pass_pct

The percentage of Kubernetes benchmark tests run on the API server that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.api-server.tests_fail

The number of Kubernetes benchmark tests run on the API server that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.api-server.tests_pass

The number of Kubernetes benchmark tests run on the API server that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.api-server.tests_total

The total number of Kubernetes benchmark tests run on the API server.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.api-server.tests_warn

The number of Kubernetes benchmark tests run on the API server that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configuration-files.pass_pct

The percentage of Kubernetes benchmark tests run on the configuration files of non-master nodes that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configuration-files.tests_fail

The number of Kubernetes benchmark tests run on the configuration files of non-master nodes that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configuration-files.tests_pass

The number of Kubernetes benchmark tests run on the configuration files that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configuration-files.tests_total

The total number of Kubernetes benchmark tests run on the configuration files of non-master nodes.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configuration-files.tests_warn

The number of Kubernetes benchmark tests run on the configuration files of non-master nodes that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configure-files.pass_pct

The percentage of Kubernetes benchmark tests run on the master node configuration files that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configure-files.tests_fail

The number of Kubernetes benchmark tests run on the master node configuration files that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configure-files.tests_pass

The number of Kubernetes benchmark tests run on the master node configuration files that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configure-files.tests_total

The total number of Kubernetes benchmark tests run on the master node configuration files.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.configure-files.tests_warn

The number of Kubernetes benchmark tests run on the master node configuration files that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.controller-manager.pass_pct

The percentage of Kubernetes benchmark tests run on the controller manager that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.controller-manager.tests_fail

The number of Kubernetes benchmark tests run on the controller manager that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.controller-manager.tests_pass

The number of Kubernetes benchmark tests run on the controller manager that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.controller-manager.tests_total

The total number of Kubernetes benchmark tests run on the controller manager.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.controller-manager.tests_warn

The number of Kubernetes benchmark tests run on the controller manager that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.etcd.pass_pct

The percentage of Kubernetes benchmark tests run on the etcd key value store that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.etcd.tests_fail

The number of Kubernetes benchmark tests run on the etcd key value store that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.etcd.tests_pass

The number of Kubernetes benchmark tests run on the etcd key value store that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.etcd.tests_total

The total number of Kubernetes benchmark tests run on the etcd key value store.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.etcd.tests_warn

The number of Kubernetes benchmark tests run on the etcd key value store that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.general-security-primitives.pass_pct

The percentage of Kubernetes benchmark tests run on the security primitives that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.general-security-primitives.tests_fail

The number of Kubernetes benchmark tests run on the security primitives that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.general-security-primitives.tests_pass

The number of Kubernetes benchmark tests run on the security primitives that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.general-security-primitives.tests_total

The total number of Kubernetes benchmark tests run on the security primitives.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.general-security-primitives.tests_warn

The number of Kubernetes benchmark tests run on the security primitives that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.kubelet.pass_pct

The percentage of Kubernetes benchmark tests run on the non-master node Kubernetes agent that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.kubelet.tests_fail

The number of Kubernetes benchmark tests run on the non-master node Kubernetes agent that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.kubelet.tests_pass

The number of Kubernetes benchmark tests run on the non-master node Kubernetes agent that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.kubelet.tests_total

The total number of Kubernetes benchmark tests run on the non-master node Kubernetes agent.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.kubelet.tests_warn

The number of Kubernetes benchmark tests run on the non-master node Kubernetes agent that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.pass_pct

The percentage of Kubernetes benchmark tests that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.scheduler.pass_pct

The percentage of Kubernetes benchmark tests run on the scheduler that passed.

Metric TypeGauge
Value Type%
Segment ByContainer
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.scheduler.tests_fail

The number of Kubernetes benchmark tests run on the scheduler that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.scheduler.tests_pass

The number of Kubernetes benchmark tests run on the scheduler that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.scheduler.tests_total

The total number of Kubernetes benchmark tests run on the scheduler.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.scheduler.tests_warn

The number of Kubernetes benchmark tests run on the scheduler that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.tests_fail

The number of Kubernetes benchmark tests that failed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.tests_pass

The number of Kubernetes benchmark tests that passed.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.tests_total

The total number of Kubernetes benchmark tests run.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## compliance.k8s-bench.tests_warn

The number of Kubernetes benchmark tests that returned a result of `WARN`.

Metric TypeGauge
Value TypeInteger
Segment ByContainer
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Containers

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

This topic introduces you to the Container metrics.

## container.count

The number of containers in the infrastructure.

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## container.id

The container’s identifier.

For Docker containers, this value is a 12 digit hex number.

Metric TypeGauge
Value TypeString
Segment ByContainer
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Time Aggregation FormatsN/A

## container.image

The name of the image used to run the container.

Metric TypeGauge
Value TypeString
Segment ByContainer
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Time Aggregation FormatsN/A

## container.name

The name of the container.

Metric TypeGauge
Value TypeString
Segment ByContainer
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Time Aggregation FormatsN/A

## container.type

The type of container (for example, Docker, LXC, or Mesos).

Metric TypeGauge
Value TypeString
Segment ByContainer
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Time Aggregation FormatsN/A

## cpu.quota.used.percent

The percentage of CPU quota a container actually used over a defined period of time.

CPU quotas are a common way of creating a CPU limit for a container. A container can only spend its quota of time on CPU cycles across a given time period. The default time period is 100ms.

Unlike CPU shares, CPU quota is a hard limit for the amount of CPU the container can use. For this reason, the CPU quota should not exceed 100% for an extended period of time. For a shorter time, containers are allowed to consume higher than the CPU quota.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## cpu.shares.count

The amount of CPU shares assigned to the container’s cgroup. CPU shares represent a relative weight used by the kernel to distribute CPU cycles across different containers. Each container receives its own allocation of CPU cycles, based on the ratio of share allocation for the container versus the total share allocation for all containers. For example, if an environment has three containers, each with 1024 shares, then each will receive 1/3 of the CPU cycles.

The default value for a container is 1024.

Defining a CPU shares count is a common way to create a CPU limit for a container.

The CPU shares count is not a hard limit. A container can consume more than its allocation, as long as the CPU has cycles that are not being consumed by the container they were originally allocated to.

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## cpu.shares.used.percent

The percentage of a container’s allocated CPU shares that are used. CPU shares are a common way of creating a CPU limit for a container, as they represent a relative weight used by the kernel to distribute CPU cycles across different containers. Each container receives its own allocation of CPU cycles, according to the ratio of share count vs the total number of shares claimed by all containers. For example, in an infrastructure with three containers, each with 1024 shares, each container receives 1/3 of the CPU cycles.

A container can use more CPU cycles than allocated if the CPU has cycles that are not being consumed by the container they were originally allocated to. This means that the value of `cpu.shares.used.percent` can exceed 100%.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## memory.limit.bytes

The RAM limit assigned to a container. The default value is 0.

Metric TypeGauge
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## memory.limit.used.percent

The percentage of the memory limit used by a container.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## swap.limit.bytes

The swap limit assigned to a container.

Metric TypeGauge
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

## swap.limit.used.percent

The percentage of swap limit used by the container.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Time Aggregation FormatsAvg, Sum, Min, Max

# Cloud Provider

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

At this time, all cloudProvider metrics are AWS-related.

## cloudProvider.account.id

The cloud provider instance account number.

This metric is useful if there are multiple accounts linked with Sysdig Monitor.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.availabilityZone

The AWS Availability Zone where the entity or entities are located. Each availability zone is an isolated subsection of an AWS region. See `cloudProvider.region`.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.host.ip.private

The private IP address allocated by the cloud provider for the instance. This address can be used for communication between instances in the same network.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.host.ip.public

Public IP address of the selected host.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.host.name

The name of the host as reported by the cloud provider.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.id

The ID number as assigned and reported by the cloud provider.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.instance.type

The type of instance (for example, AWS or Rackspace).

This metric is extremely useful to segment instances and compare their resource usage and saturation. You can use it as a grouping criteria for the explore table to quickly explore AWS usage on a per-instance-type basis. You can also use it to compare things like CPU usage, number of requests or network utilization for different instance types.

Use this grouping criteria in conjunction with the host.count metric to easily create a report on how many instances of each type you have.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.name

The name of the instance (for example, AWS or Rackspace).

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.region

The region the cloud provider host (or group of hosts) is located in.

Use this grouping criteria in conjunction with the host.count metric to easily create a report on how many instances you have in each region.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.resource.endPoint

The DNS name for which the resource can be accessed.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.resource.name

The cloud provider service name (for example, Amazon EC2 or Amazon ELB).

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.resource.type

The cloud provider service type (for example, `INSTANCE`, `LOAD_BALANCER`, `DATABASE`).

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cloudProvider.status

Resource status.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

# AWS

For information about how Sysdig licensing affects the AWS metrics displayed in the Monitor UI, see About AWS Cloudwatch Licensing .

# Elasticache

Amazon ElastiCache is a cloud-caching service that increases the performance, speed, and redundancy with which applications can retrieve data by providing an in-memory database caching system.

## aws.elasticache.CPUUtilization

The percentage of CPU utilization.

When reaching high utilization and your main workload is from read requests, scale your cache cluster out by adding read replicas. If the main workload is from write requests, scale up by using a larger cache instance type.

For more information, refer to the ElastiCache documentation.

Metric TypeGauge
Value Type%
Segment ByCloudProvider
Default Time AggregationAverave
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elasticache.FreeableMemory

The amount of memory considered free, or that could be made available, for use by the node.

For more information, refer to the ElastiCache documentation.

Metric TypeGauge
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elasticache.NetworkBytesIn

The number of bytes the host has read from the network.

For more information, refer to the ElastiCache documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elasticache.NetworkBytesOut

The number of bytes the host has written to the network.

For more information, refer to the ElastiCache documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elasticache.SwapUsage

The amount of swap space used on the host.

If swap is being utilized, the node probably needs more memory than is available and cache performance may be negatively impacted. Consider adding more nodes or using larger ones to reduce or eliminate swapping.

For more information, refer to the ElastiCache documentation.

Metric TypeGauge
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Elastic Application Load Balancing (ALB)

Application Load Balancer is best suited for load balancing of HTTP and HTTPS traffic and provides advanced request routing targeted at the delivery of modern application architectures, including microservices and containers. For more information, refer to the Elastic Application Load Balancer documentation.

## aws.alb.ActiveConnectionCount

The total number of concurrent TCP connections active from clients to the load balancer and from the load balancer to the targets.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.ClientTLSNegotiationErrorCount

The number of TLS connections initiated by the client that did not establish a session with the load balancer.

Possible causes include a mismatch of ciphers or protocols.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.ConsumedLCUs

The number of load balancer capacity units (LCU) used by the load balancer.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HTTPCode_ELB_4XX_Count

The number of HTTP 4XX client error codes that originate form the load balancer. Client errors are generated when requests are malformed or incomplete. These requests have not been received by the target.

This count does not include any response codes generated by the targets.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HTTPCode_ELB_5XX_Count

The number of HTTP 5XX server error codes that originate from the load balancer. Server errors are generated when requests are malformed or incomplete. These requests have not been received by the target.

This count does not include any response codes generated by the targets.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HTTPCode_Target_2XX_Count

The number of HTTP 2XX response codes generated by the target.

This count does not include any response codes generated by the load balancer.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HTTPCode_Target_3XX_Count

The number of HTTP 3XX response codes generated by the target.

This count does not include any response codes generated by the load balancer.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HTTPCode_Target_4XX_Count

The number of HTTP 4XX response codes generated by the target.

This count does not include any response codes generated by the load balancer.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HTTPCode_Target_5XX_Count

The number of HTTP 5XX response codes generated by the target.

This count does not include any response codes generated by the load balancer.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.HealthyHostCount

The number of targets that are considered healthy.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.IPv6ProcessedBytes

The total number of bytes processed by the load balancer over IPv6.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.IPv6RequestCount

The total number of data requested by the load balancer over IPv6.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.NewConnectionCount

The total number of new TCP connections established from clients to the load balancer and from the load balancer to targets.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.ProcessedBytes

The total number of bytes processed by the load balancer over IPv4 and IPv6.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.RejectedConnectionCount

The number of connections that were rejected because the load balancer had reached its maximum number of connections.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.RequestCount

The number of requests processed over IPv4 and IPv6. This count only includes the requests with a response generated by a target of the load balancer.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.RequestCountPerTarget

The average number of requests received by each target in a target group.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.RuleEvaluations

The number of rules processed by the load balancer given a request rate averaged over an hour.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.TargetConnectionErrorCount

The number of connections that were not successfully established between the load balancer and target.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.TargetResponseTime

The time elapsed, in seconds, after the request leaves the load balancer until a response from the target is received.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.TargetTLSNegotiationErrorCount

The number of TLS connections initiated by the load balancer that did not establish a session with the target.

Possible causes include a mismatch of ciphers or protocols.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.alb.UnHealthyHostCount

The number of targets that are considered unhealthy.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Elastic Cloud Compute (EC2)

Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers.

## aws.ec2.CPUCreditBalance

The CPU credit balance of an instance, based on what has accrued since it started. For more information, refer to the Elastic Compute Cloud metric definition table.

Metric TypeGauge
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.ec2.CPUCreditUsage

The CPU credit usage by the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

Metric TypeGauge
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.ec2.CPUUtilization

The percentage of allocated EC2 compute units currently in use on the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

This metric identifies the processing power required to run an application upon a selected instance.

Metric TypeGauge
Value Type%
Segment ByCloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The total bytes read from all ephemeral disks available to the instance. This metric is used to determine the volume of the data the application reads from the disk and can be used to determine the speed of the application.

The number reported is the number of bytes received during a specified period. For a basic (five-minute) monitoring, divide this number by 300 to find Bytes/second. For a detailed (one-minute) monitoring, divide it by 60.

For more information, refer to the Elastic Compute Cloud metric definition documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

Total completed read operations from all ephemeral disks available to the instance in a specified period of time. For more information, refer to the Elastic Compute Cloud metric definition documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.ec2.DiskWriteBytes

It is the total bytes written to all ephemeral disks available to the instance. This metric is used to determine the volume of the data the application writes to the disk and can be used to determine the speed of the application.

The number reported is the number of bytes received during a specified period. For a basic (five-minute) monitoring, divide this number by 300 to find Bytes/second. For a detailed (one-minute) monitoring, divide it by 60.

For more information, refer to the Elastic Compute Cloud metric definition documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.ec2.DiskWriteOps

The completed write operations to all ephemeral disks available to the instance in a specified period of time. If your instance uses Amazon EBS volumes, see Amazon EBS Metrics. For more information, refer to the Elastic Compute Cloud metric definition documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.ec2.NetworkIn

The number of bytes received on all network interfaces by the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.ec2.NetworkOut

The number of bytes sent out on all network interfaces by the instance. For more information, refer to the Elastic Compute Cloud metric definition documentation.

This metric identifies the volume of outgoing network traffic to an application on a single instance.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Elastic Container Service (ECS)

Amazon Elastic Container Service (Amazon ECS) is a highly scalable, high-performance container orchestration service that supports Docker containers and allows you to easily run and scale containerized applications on AWS. Amazon ECS eliminates the need for you to install and operate your own container orchestration software, manage and scale a cluster of virtual machines, or schedule containers on those virtual machines.

## ecs.clusterName

The name of the cluster. For more information, refer to the AWS CloudFormation documentation.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## ecs.serviceName

The name of the Elastic Container Service (Amazon ECS) service. For more information, refer to the AWS CloudFormation documentation.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

The name of the task definition family. For more information, refer to the AWS CloudFormation documentation.

Metric TypeGauge
Value TypeString
Segment ByCloudProvider
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

# Elastic Load Balancing (ELB)

Elastic Load Balancing automatically distributes incoming application traffic across multiple targets, such as Amazon EC2 instances, containers, IP addresses, and Lambda functions.

## aws.elb.BackendConnectionErrors

The number of errors encountered by the load balancer while attempting to connect to your application.

For high error counts, look for network related issues or check that your servers are operating correctly. The ELB is having problems connecting to them.

For more information, refer to the Elastic Load Balancing documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HealthyHostCount

A count of the number of healthy instances that are bound to the load balancer.

Hosts are declared healthy if they meet the threshold for the number of consecutive health checks that are successful. Hosts that have failed more health checks than the value of the unhealthy threshold are considered unhealthy. If cross-zone is enabled, the count of the number of healthy instances is calculated for all Availability Zones.

For more information, refer to the Elastic Load Balancing documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HTTPCode_Backend_2XX

The count of the number of HTTP 2XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer.

The 2XX class status codes represent successful actions (e.g., 200-OK, 201-Created, 202-Accepted, 203-Non-Authoritative Info).

For more information, refer to the Elastic Load Balancing documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HTTPCode_Backend_3XX

The count of the number of HTTP 3XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer.

The 3XX class status code indicates that the user agent requires action (e.g., 301-Moved Permanently, 302-Found, 305-Use Proxy, 307-Temporary Redirect).

For more information, refer to the Elastic Load Balancing documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HTTPCode_Backend_4XX

The count of the number of HTTP 4XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer. For more information, refer to the Elastic Load Balancing documentation.

The 4XX class status code represents client errors (e.g., 400-Bad Request, 401-Unauthorized, 403-Forbidden, 404-Not Found).

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HTTPCode_Backend_5XX

The count of the number of HTTP 5XX response codes generated by back-end instances. This metric does not include any response codes generated by the load balancer. For more information, refer to the Elastic Load Balancing documentation.

The 5XX class status code represents back-end server errors e.g., 500-Internal Server Error, 501-Not implemented, 503-Service Unavailable).

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HTTPCode_ELB_4XX

The count of the number of HTTP 4XX client error codes generated by the load balancer when the listener is configured to use HTTP or HTTPS protocols. For more information, refer to the Elastic Load Balancing documentation.

Client errors are generated when a request is malformed or is incomplete.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.HTTPCode_ELB_5XX

The count of the number of HTTP 5XX server error codes generated by the load balancer when the listener is configured to use HTTP or HTTPS protocols. This metric does not include any responses generated by back-end instances.For more information, refer to the Elastic Load Balancing documentation.

The metric is reported if there are no back-end instances that are healthy or registered to the load balancer, or if the request rate exceeds the capacity of the instances or the load balancers.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.Latency

A measurement of the time backend requests require to process. For more information, refer to the Elastic Load Balancing documentation.

Latency metrics from the ELB are good indicators of the overall performance of your application.

Metric TypeCounter
Value TyperelativeTime
Segment ByCloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.RequestCount

The number of requests handled by the load balancer. For more information, refer to the Elastic Load Balancing documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.SpilloverCount

A count of the total number of requests that were rejected due to the queue being full. For more information, refer to the Elastic Load Balancing documentation.

Positive numbers indicate some requests are not being forwarded to any server. Clients are not notified that their request was dropped.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.SurgeQueueLength

A count of the total number of requests that are pending submission to a registered instance. For more information, refer to the Elastic Load Balancing documentation.

Positive numbers indicate clients are waiting for their requests to be forwarded to a server for processing.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.elb.UnHealthyHostCount

The count of the number of unhealthy instances that are bound to the load balancer. For more information, refer to the Elastic Load Balancing documentation.

Hosts are declared healthy if they meet the threshold for the number of consecutive health checks that are successful. Hosts that have failed more health checks than the value of the unhealthy threshold are considered unhealthy.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# DynamoDB

DynamoDB is a fully managed proprietary NoSQL database service that supports key-value and document data structures and is offered by Amazon as part of the Amazon Web Services portfolio. Amazon CloudWatch aggregates the DynamoDB metrics at one-minute intervals.

In DynamoDB, provisioned throughput requirements are specified in terms of capacity units: Read Capacity unit and Write Capacity unit. A unit of read capacity represents one strongly consistent read per second for items up to 4 KB in size. OneÂ write capacity unitÂ represents one write per second for items up to 1 KB in size. Larger items will require more capacity. You can calculate the number of units of read and write capacity by estimating the number of reads or writes required per second and multiplying by the size of the items rounded up to the nearest KB.

For more information, see the Amazon DynamoDB documentation.

## aws.dynamodb.ConditionalCheckFailedRequests

The number of failed attempts to perform conditional writes.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The amount of read capacity units consumed over the defined time period. Amazon CloudWatch aggregates the metrics at one-minute intervals. Use theÂ SumÂ aggregation to calculate the consumed throughput. For example, get theÂ SumÂ value over a span of one minute, and divide it by the number of seconds in a minute (60) to calculate the average `ConsumedReadCapacityUnits` per second.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.ConsumedWriteCapacityUnits

The amount of write capacity units consumed over the specified time interval. Amazon CloudWatch aggregates the metrics at one-minute intervals. Use theÂ SumÂ aggregation to calculate the consumed throughput. For example, get theÂ SumÂ value over a span of one minute, and divide it by the number of seconds in a minute (60) to calculate the average `ConsumedWriteCapacityUnits` per second.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of read capacity units provisioned for a table or a global secondary index.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.ProvisionedWriteCapacityUnits

The number of write capacity units provisioned for a table or global secondary table.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of DynamoDB requests that exceed the amount of read capacity units provisioned.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.ReturnedBytes.GetRecords

The number of bytes returned by `GetRecords` operation during the specified time period.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.ReturnedItemCount

The number of items returned by query or scan operations during the specified time period.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.ReturnedRecordsCount.GetRecords

The number of stream records returned by the GetRecords operations during the specific period.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.SuccessfulRequestLatency

The number of successful requests to DynamoDB or Amazon DynamoDB Streams during the specified time period. The time period is in milliseconds.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.SystemErrors

The number of requests made to DynamoDB or Amazon DynamoDB Streams that resulted in an HTTP 500 status code during the specified time period. HTTP 500 usually indicates an internal service error.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.ThrottledRequests

The number of requests to DynamoDB that exceed the provisioned throughput limits on a resource, such as a table or an index. ThrottledRequestsÂ is incremented by one if any event within a request exceeds a provisioned throughput limit.

If any individual request for read or write events within the batch is throttled, `ReadThrottleEvents` metrics or `WriteThrottleEvents` metrics is incremented respectively.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.UserErrors

The number of requests to DynamoDB or Amazon DynamoDB Streams that returned an HTTP 400 status code during the specified time period. HTTP 400 usually indicates a client-side error.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.dynamodb.WriteThrottleEvents

The number of requests to DynamoDB that exceed the provisioned write capacity units for a table or a global secondary index.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Relational Database Service (RDS)

Amazon Relational Database Service (Amazon RDS) is a managed SQL database service provided by Amazon Web Services (AWS). Amazon RDS supports an array of database engines to store and organize data and helps with database management tasks, such as migration, backup, recovery, and patching.

## aws.rds.BinLogDiskUsage

The amount of disk space occupied by binary logs on the master. Applies to MySQL read replicas.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.CPUUtilization

The percentage of CPU utilization.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeGauge
Value Type%
Segment ByCloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.DatabaseConnections

The number of database connections in use.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.DiskQueueDepth

The number of outstanding I/Os (read/write requests) waiting to access the disk.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.FreeableMemory

The amount of available random access memory, in megabytes.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeGauge
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.FreeStorageSpace

The amount of available storage space in bytes.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeGauge
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The incoming (Receive) network traffic on the DB instance, including both customer database traffic and Amazon RDS traffic used for monitoring and replication. The metric is measured in bytes per second.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.NetworkTransmitThroughput

The outgoing (Transmit) network traffic on the DB instance, including both customer database traffic and Amazon RDS traffic used for monitoring and replication. The metric is measured in bytes per second.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The average number of read I/O operations per second.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The average amount of seconds taken per read I/O operation.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TyperelativeTime
Segment ByCloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The average number of bytes read from disk per second.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.ReplicaLag

The amount of time, in nanoseconds, a Read Replica DB instance lags behind the source DB instance.

This metric applies to MySQL read replicas.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TyperelativeTime
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.SwapUsage

The amount of swap space used by the database, measured in megabytes.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeGauge
Value TypeByte
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.WriteIOPS

The average number of write I/O operations per second.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.WriteLatency

The average amount of time taken per write I/O operation.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TyperelativeTime
Segment ByCloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.rds.WriteThroughput

The average number of bytes written to disk per second.

For more information, refer to the Amazon Relational Database (RDS) documentation.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Simple Queue Service (SQS)

Amazon Simple Queue Service (Amazon SQS) is a pay-per-use web service for storing messages in transit between computers. Developers use SQS to build distributed applications with decoupled components without having to deal with the overhead of creating and maintaining message queues.

Amazon Simple Queue Service (Amazon SQS) is a pay-per-use web service for storing messages in transit between computers. Developers use SQS to build distributed applications with decoupled components without having to deal with the overhead of creating and maintaining message queues. For more information, see Amazon SQS Resources.

## aws.sqs.ApproximateNumberOfMessagesDelayed

The number of messages in the queue that are delayed or currently unavailable for reading. Messages are stuck like this when the queue is configured as a delay queue or when a message has been sent with a delay parameter.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAvg
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.sqs.ApproximateNumberOfMessagesNotVisible

The number of undelivered messages. These messages are still in the queue, on their way to a client (in flight), but have not yet been deleted or have not yet reached the destination.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAvg
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.sqs.ApproximateNumberOfMessagesVisible

The number of messages available for retrieval from the queue. These are the messages which have not yet been locked by an SQS worker.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAvg
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of `ReceiveMessage` API calls that did not return a message. This metric is populated every 5 minutes.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.sqs.NumberOfMessagesDeleted

The number of messages deleted from the queue. Amazon SQS considers every successful deletion that uses a valid receipt handle, including duplicate deletions, to generate the `NumberOfMessagesDeleted` metric. Therefore, this number could include duplicate deletions.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of messages returned by calls to the `ReceiveMessage` API action.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.sqs.NumberOfMessagesSent

The number of messages added to a queue.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## aws.sqs.SentMessageSize

The size of messages in bytes added to a queue. The `SentMessageSize` does not display as an available metric in the CloudWatch console until at least one message is sent to the corresponding queue.

Metric TypeCounter
Value TypeInteger
Segment ByCloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# File

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## file.bytes.in

The number of bytes read from the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.bytes.out

The number of bytes written to the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.bytes.total

The total number of bytes written to, and read from, the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.error.open.count

The number of errors that occurred when opening files. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.error.total.count

The number of errors encountered by file system calls, such as `open()`, `close()`, and `create()`. By default, this metric displays the total value for the defined scope. For example, if the scope is defined as a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.iops.in

The number of file read operations per second. This metric is calculated by measuring the actual number of read requests made by a process. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

The value of `file.iops.in` can differ from the value other tools show, as they are usually based on interpolating this value from the number of bytes read and written to the file system.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.iops.out

The number of file write operations per second. This metric is calculated by measuring the actual number of write requests made by a process. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

The value of `file.iops.out` can differ from the value other tools show, as they are usually based on interpolating this value from the number of bytes read and written to the file system.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.iops.total

The number of file read and write operations per second. This metric is calculated by measuring the actual number of read/write requests made by a process. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

The value of `file.iops.total` can differ from the value other tools show, as they are usually based on interpolating this value from the number of bytes read and written to the file system.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.name

The name of the file.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## file.open.count

The number of times the file has been opened.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.time.in

The time spent reading the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.time.out

The time spent writing in the file. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## file.time.total

The time spent during file I/O. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

# File System

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## fs.used.percent

Specifies what percentage of the file system has been used.

Description

Metric Type

Gauge

Value Type

Percent

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.free.percent

Specifies what percentage of the file system is free.

Description

Metric Type

Gauge

Value Type

Percent

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.bytes.free

The number of bytes free in the file system.

Description

Metric Type

gauge

Value Type

Byte

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.bytes.used

The number of bytes used in the file system.

Description

Metric Type

Gauge

Value Type

Byte

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.bytes.total

The size of the file system.

Description

Metric Type

Gauge

Value Type

Byte

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

## fs.inodes.total.count

The number of inodes in the file system.

Description

Metric Type

Gauge

Value Type

Integer

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.inodes.used.count

The number of inodes used in the file system.

Description

Metric Type

Gauge

Value Type

Integer

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.inodes.used.percent

Percentage of filesystem inodes usage.

Description

Metric Type

Gauge

Value Type

Percent

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.root.used.percent

Percentage of root filesystem usage.

Description

Metric Type

Gauge

Value Type

Percent

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

## fs.largest.used.percent

Percentage of the largest filesystem.

Description

Metric Type

Gauge

Value Type

Percent

Scope

Host, Container

Segment By

agent.tag

cloudProvider.account.id

cloudProvider.availabilityZone

cloudProvider.region

cloudProvider.tag

container.id

container.image

container.name

ecs.clusterName

ecs.serviceName

fs.device

fs.mountDir

fs.type

host.hostName

host.mac

Default Time Aggregation

Average

Available Time Aggregation Formats

Average, Rate, Sum, Minimum, Maximum

Default Group Aggregation

Average

Available Group Aggregation Formats

Average, Sum, Minimum, Maximum

# Host

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## agent.id

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## agent.mode

For more information on agent modes, see Configure Agent Modes.

Metric TypeString
Value TypeString
Segment ByHost
Default Time Aggregationconcat
Available Time Aggregation Formatsconcat, distinct, count
Default Group Aggregationconcat
Available Group Aggregation Formatsconcat, distinct, count

## agent.version

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## cpu.core

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.container.mappings

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.count

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## host.domain

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.hostName

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.ip.all

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.ip.private

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.ip.public

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.isClientServer

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.isInstrumented

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.isInternal

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.mac

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.procList.main

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## host.uname

`host.uname` provides the following system information:

• kernel name

• kernel release number

• kernel version

• machine hardware name

Agents send this metric along with a number of `labels` that map with the `uname` information. `host.uname` is supported on agent versions 10.1 and above.

### Metrics Details

Metric TypeGauge
Value TypeInteger
Segment BySee Segmentation Details.
Default Time AggregationAverage
Available Time AggregationAverage, Rate, Sum, Min, Max, Rate of Change
Default Group AggregationAverage
Available Group RollupAverage, Sum, Min, Max

### Segmentation Details

The labels are given below:

LabelDescriptionMapping to the uname toolingExample
`host.uname.kernel.name`The kernel name`uname -s`Linux
`host.uname.kernel.release`The kernel release`uname -r`5.4.0-31-generic
`host.uname.kernel.version`The kernel version`uname -v`#35-Ubuntu SMP Thu May 7 20:20:34 UTC 2020
`host.machine`The hardware name of the machine`uname -m`x86_64

### Example: Kernel Versions in the Infrastructure

The image depicts `host.uname` being segmented by `host.uname.kernel.version`. The resulting dashboard gives the distribution of kernel versions in the infrastructure.

## Count Limits StasD Metrics

The count limits metrics report the upper limit of the number of metrics of the same type. The values the metrics report can be changed by modifying the `dragent.yaml` file.

Metric NameConfiguration Parameter in the `dragent.yaml` fileDefault Value
`metricCount.limit.appCheck``app_checks_limit`500
`metricCount.limit.statsd``statsd.limit`100
`metricCount.limit.jmx``jmx.limit`500
`metricCount.limit.prometheus``prometheus.max+metrics`3000

### metricCount.appCheck

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

### metricCount.jmx

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

### metricCount.statsd

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

### metricCount.prometheus

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

# JVM

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

The number of classes currently loaded in the JVM. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

## jvm.gc.ConcurrentMarkSweep.count

The number of times the Concurrent Mark-Sweep garbage collector has run.

## jvm.gc.ConcurrentMarkSweep.time

The total time the Concurrent Mark-Sweep garbage collector has run.

## jvm.gc.global.time

The total time the garbage collection has run.

## jvm.gc.PS_MarkSweep.count

The number of times the parallel scavenge Mark-Sweep old generation garbage collector has run.

## jvm.gc.PS_MarkSweep.time

The total time the parallel scavenge Mark-Sweep old generation garbage collector has run.

## jvm.gc.PS_Scavenge.count

The number of times the parallel eden/survivor space garbage collector has run.

## jvm.gc.PS_Scavenge.time

The total time the parallel eden/survivor space garbage collector has run.

## jvm.gc.ParNew.count

The number of times the parallel garbage collector has run.

## jvm.gc.ParNew.time

The total time the parallel garbage collector has run.

## jvm.gc.scavenge.time

The total time the scavenge collector has run.

## jvm.heap.committed

The amount of memory that is currently allocated to the JVM for heap memory. Heap memory is the storage area for Java objects. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

The JVM may release memory to the system and Heap Committed could decrease below Heap Init; but Heap Committed can never increase above Heap Max.

## jvm.heap.init

The initial amount of memory that the JVM requests from the operating system for heap memory during startup (defined by the â€“Xms option).The value of Heap Init may be undefined. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

The JVM may request additional memory from the operating system and may also release memory to the system over time.

## jvm.heap.max

The maximum size allocation of heap memory for the JVM (defined by the â€“Xmx option). By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

Any memory allocation attempt that would exceed this limit will cause an OutOfMemoryError exception to be thrown.

## jvm.heap.used

The amount of allocated heap memory (ie Heap Committed) currently in use. The number of classes currently loaded in the JVM. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

Heap memory is the storage area for Java objects.

An object in the heap that is referenced by another object is ’live’, and will remain in the heap as long as it continues to be referenced. Objects that are no longer referenced are garbage and will be cleared out of the heap to reclaim space.

## jvm.heap.used.percent

The ratio between Heap Used and Heap Committed. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

## jvm.nonHeap.committed

The amount of memory that is currently allocated to the JVM for non-heap memory. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

Non-heap memory is used by Java to store loaded classes and other meta-data.

The JVM may release memory to the system and Non-Heap Committed could decrease below Non-Heap Init; but Non-Heap Committed can never increase above Non-Heap Max.

## jvm.nonHeap.init

The initial amount of memory that the JVM requests from the operating system for non-heap memory during startup. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

The value of Non-Heap Init may be undefined.

The JVM may request additional memory from the operating system and may also release memory to the system over time.

## jvm.nonHeap.max

The maximum size allocation of non-heap memory for the JVM. This memory is used by Java to store loaded classes and other meta-data. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

## jvm.nonHeap.used

The amount of allocated non-heap memory (Non-Heap Committed) currently in use. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

Non-heap memory is used by Java to store loaded classes and other meta-data.

## jvm.nonHeap.used.percent

The ratio between Non-Heap Used and Non-Heap Committed. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

The current number of live daemon and non-daemon threads. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

The current number of live daemon threads. By default, this metric shows the total value of the selected scope. For example, if applied to a group of machines, the value will be the total value for the whole group.

Daemon threads are used for background supporting tasks and are only needed while normal threads are executing.

# Kubernetes

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

# Kubernetes State

## kubernetes.hpa.replicas.min

The lower limit for the number of pods that can be set by the Horizontal Pod Autoscaler. The default value is 1.

The lower limit determines the minimum number of replicas that the autoscaler can periodically adjust in a replication controller or deployment to the target specified by the user in order to match the observed average CPU utilization.

Metric Type: Gauge

Segmented by:

• `kubernetes.hpa.name`

• `kubernetes.cluster.id`

• `kubernetes.cluster.name`

• `kubernetes.namespace.name`

## kubernetes.hpa.replicas.max

The upper limit for the number of pods that can be set by the Horizontal Pod Autoscaler. This value cannot be smaller than that of `kubernetes.hpa.replicas.min`.

The upper limit determines the maximum number of replicas that the autoscaler can periodically adjust in a replication controller or deployment to the target specified by the user in order to match the observed average CPU utilization .

Metric Type: Gauge

Segmented by:

• `kubernetes.hpa.name`

• `kubernetes.cluster.id`

• `kubernetes.cluster.name`

• `kubernetes.namespace.name`

## kubernetes.hpa.replicas.current

The current number of replicas of pods managed by the Horizontal Pod Autoscaler.

Metric Type: Gauge

Segmented by:

• `kubernetes.hpa.name`

• `kubernetes.cluster.id`

• `kubernetes.cluster.name`

• `kubernetes.namespace.name`

## kubernetes.hpa.replicas.desired

The desired number of replicas of pods managed by the Horizontal Pod Autoscaler.

Metric Type: Gauge

Segmented by:

• `kubernetes.hpa.name`

• `kubernetes.cluster.id`

• `kubernetes.cluster.name`

• `kubernetes.namespace.name`

## kubernetes.resourcequota.configmaps.hard

The number of config maps that can be created in each Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.configmaps.used

The current number of config maps in each Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.limits.cpu.hard

The total CPU limit across all pods in a non-terminal state in the cluster, determined by adding each pod’s CPU limit together.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.limits.cpu.used

The current amount of CPU used across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.limits.memory.hard

The total memory limit across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.limits.memory.used

The current amount of memory used across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.persistentvolumeclaims.hard

The maximum number of persistent volume claims that can exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.persistentvolumeclaims.used

The current number of persistent volume claims that exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.cpu.hard

The maximum number of CPU cores assigned in the namespace or at the resource quota scope level. Across all the pods in a non-terminal state, the sum of CPU requests cannot exceed this value.

Metric Type: Gauge - Integer

Segmented by:

• kubernetes.cluster

• kubernetes.namespace

• kubernetes.resourcequota

## kubernetes.resourcequota.memory.hard

The maximum memory assigned in the namespace or at the resource quota scope level. Across all the pods in a non-terminal state, the sum of memory requests cannot exceed this value

Metric Type: Gauge - Integer

Segmented by:

• kubernetes.cluster

• kubernetes.namespace

• kubernetes.resourcequota

## kubernetes.resourcequota.pods.hard

The maximum number of pods in a non-terminal state that can exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.pods.used

The current number of pods in a non-terminal state that exists in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.replicationcontrollers.hard

The maximum number of replication controllers that can exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.replicationcontrollers.used

The current number of replication controllers that can exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.requests.cpu.hard

The maximum number of CPU requests allowed across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.requests.cpu.used

The current number of CPU requests across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.requests.memory.hard

The maximum number of memory requests allowed across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.requests.memory.used

The current total number of memory requests across all cluster pods in a non-terminal state.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.requests.storage.hard

The maximum number of storage requests allowed across all persistent volume claims in the cluster.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.requests.storage.used

The current total number of storage requests across all persistent volume claims.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.resourcequotas.hard

The maximum number of resource quotas that can exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.resourcequotas.used

The current number of resource quotas that exist in the Kubernetes namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.secrets.hard

The maximum number of secrets that can exist in the namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.secrets.used

The current number of secrets that exist in the namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.services.hard

The maximum number of services that can exist in the namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.services.used

The current number of services that exist in the namespace.

Metric Type: Gauge - Integer

The maximum number of load balancer services that can exist in the namespace.

Metric Type: Gauge - Integer

The current number of load balancer services that exist in the namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.services.nodeports.hard

The maximum number of node port services that can exist in the namespace.

Metric Type: Gauge - Integer

## kubernetes.resourcequota.services.nodeports.used

The current number of node port services that exist in the namespace.

Metric Type: Gauge - Integer

## kubernetes.daemonSet.pods.desired

The number of nodes that should be running the daemon pod.

## kubernetes.daemonSet.pods.misscheduled

The number of nodes running a daemon pod but are not supposed to.

The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.

## kubernetes.daemonSet.pods.scheduled

The number of nodes that running at least one daemon pod and are supposed to.

## kubernetes.deployment.replicas.available

The number of available pods per deployment.

## kubernetes.deployment.replicas.desired

The number of desired pods per deployment.

## kubernetes.deployment.replicas.paused

The number of paused pods per deployment. These pods will not be processed by the deployment controller.

## kubernetes.deployment.replicas.running

The number of running pods per deployment.

## kubernetes.deployment.replicas.unavailable

The number of unavailable pods per deployment.

## kubernetes.deployment.replicas.updated

The number of updated pods per deployment.

## kubernetes.job.completions

The desired number of successfully finished pods that the job should be run with.

## kubernetes.job.numFailed

The number of pods which reached Phase Failed.

## kubernetes.job.numSucceeded

The number of pods which reached Phase Succeeded.

## kubernetes.job.parallelism

The maximum desired number of pods that the job should run at any given time.

## kubernetes.job.status.active

The number of actively running pods.

## kubernetes.namespace.count

The number of namespaces.

## kubernetes.namespace.deployment.count

The number of deployments per namespace.

## kubernetes.namespace.job.count

The number of jobs per namespaces.

## kubernetes.namespace.pod.status.count

Supported by Sysdig Agent 9.5.0 and above.

The metric gives the number of pods in each aggregate state per Namespace. This is the value that the `kubectl get pods` command returns in the `STATUS` column. This metric does not represent the podÂ conditionÂ or the podÂ phase.

Segmentable by `kubernetes.namespace.name` and `kubernetes.namespace.pod.status.name`.

Due to performance implications, Sysdig Monitor shows only a subset of the pod aggregate statuses. The statuses displayed on the UI are:

• Evicted

• Error

• ContainerCreating

• CrashLoopBackOff

• Pending

• Running

To view other statuses, override the default list by adding the following property in `dragent.yaml`

``````k8s_pod_status_reason_strings:
- Pending
- ImagePullBackOff
``````

## kubernetes.namespace.pod.running.count

Required: agent 9.6.0+

The number of all the running pods in a Namespace. The metric takes free pods also into account, that is, pods that do not belong to any controller. Therefore, its value is not the sum of `(statefulset|daemonset|deployment).pod.running.count`.

`kubernetes.namespace.pod.running.count` is supported by Agent v9.6.0 and above.

Metric Type: Gauge

Segmented by: Namespace

## kubernetes.namespace.replicaSet.count

The number of replicaSets per namespace.

## kubernetes.namespace.service.count

The number of services per namespace.

## kubernetes.node.allocatable.cpuCores

The CPU resources of a node that are available for scheduling.

## kubernetes.node.allocatable.memBytes

The memory resources of a node that are available for scheduling.

## kubernetes.node.allocatable.pods

The pod resources of a node that are available for scheduling.

## kubernetes.node.capacity.cpuCores

The maximum CPU resources of the node.

## kubernetes.node.capacity.memBytes

The maximum memory resources of the node.

## kubernetes.node.capacity.pods

The maximum number of pods of the node.

## kubernetes.node.diskPressure

The number of nodes with disk pressure.

## kubernetes.node.memoryPressure

The number of nodes with memory pressure.

## kubernetes.node.networkUnavailable

The number of nodes with network unavailable.

## kubernetes.node.outOfDisk

The number of nodes that are out of disk space.

The number of nodes that are ready.

## kubernetes.node.unschedulable

The number of nodes unavailable to schedule new pods.

## kubernetes.pod.containers.waiting

The number of containers waiting for a pod.

## kubernetes.pod.resourceLimits.cpuCores

The limit on CPU cores to be used by a container.

## kubernetes.pod.resourceLimits.memBytes

The limit on memory to be used by a container in bytes.

## kubernetes.pod.resourceRequests.cpuCores

The number of CPU cores requested by containers in the pod.

## kubernetes.pod.resourceRequests.memBytes

The number of memory bytes requested by containers in the pod.

The number of pods ready to serve requests.

## kubernetes.replicaSet.replicas.fullyLabeled

The number of fully labeled pods per ReplicaSet.

The number of ready pods per ReplicaSet.

## kubernetes.statefulset.replicas

The desired number of pods per StatefulSet.

## kubernetes.statefulset.status.replicas

The total number of pods created by the StatefulSet.

## kubernetes.statefulset.status.replicas.current

The number of pods created by the current version of the StatefulSet.

The number of ready pods created by this StatefulSet.

## kubernetes.statefulset.status.replicas.updated

The number of pods updated to the new version of this StatefulSet.

# Resource Usage

## Compatibility Mapping

Before using Kubernetes resource metrics, review their compatibility with Sysdig components. The newly supported Kubernetes metrics are not available to older versions of Sysdig Agent.

Note also that you must edit the agent config file, dragent.yaml, to enable these metrics. See Enable Kube State Metrics Collection with K8s_extra_resources.

Metric NameAgentPlatform
PVC metrics0.89.3 and beyondRelease 2172
Resource Quota metrics0.87.1 and beyondRelease 2172
HPA metrics0.79.0 and beyondRelease 2172

## Kubernetes Resource Metrics

Metric Name

Metric Description

Metric Type

Segment By

kubernetes.persistentvolumeclaim.storage

The storage capacity requested by the persistent volume claim.

`kubernetes.persistentvolumeclaim.storage` provides Sysdig users with a single overarching metric for persistent volume claims (PVCs), rather than a series of metrics that often repeat/duplicate information. Each Kubernetes PVC metric is mapped to a `kubernetes.persistentvolumeclaim.storage` label, which can then be used to segment the overarching metric.

See Grouping, Scoping, and Segmenting Metrics for more information on segmenting metrics.

Gauge

• kubernetes.namespace.name

• kubernetes.persistentvolumeclaim.label.accessmode

• kubernetes.persistentvolumeclaim.label.app

• kubernetes.persistentvolumeclaim.label.status.phase

• kubernetes.persistentvolumeclaim.label.storage

• kubernetes.persistentvolumeclaim.label.storageclassname

• kubernetes.persistentvolumeclaim.label.volumename

kubernetes.pod.restart.count

The cumulative number of container restarts for the pod over its lifetime.

This metric is not useful for alerts. Sysdig recommends using `kubernetes.pod.restart.rate` instead.

Counter - Integer

Kubernetes

kubernetes.pod.restart.rate

The number of container restarts for the pod within the defined scope/time period.

Gauge - Integer

Kubernetes

kubernetes.replicaSet.replicas.desired

The number of replica pods the replicaSet is configured to maintain.

Gauge - Integer

Kubernetes

kubernetes.replicaSet.replicas.running

The current number of replica pods running in the replicaSet.

Gauge - Integer

Kubernetes

kubernetes.replicationController.replicas.desired

The number of replica pods the replicationController is configured to maintain.

Gauge - Integer

Kubernetes

kubernetes.replicationController.replicas.running

The current number of replica pods running in the replication controller.

Gauge - Integer

Kubernetes

# Network

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## net.bytes.in

Inbound network bytes. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.bytes.out

Outbound network bytes. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.bytes.total

Total network bytes. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.client.ip

The client IP address.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.connection.count.in

The number of currently established client (inbound) connections.

This metric is especially useful when segmented by port, process, or protocol.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Protocol, Port, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.connection.count.out

The number of currently established server (outbound) connections.

This metric is especially useful when segmented by port, process, or protocol.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Port, Protocol, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.connection.count.total

The number of currently established connections. This value may exceed the sum of the inbound and outbound metrics since it represents client and server inter-host connections as well as internal only connections.

This metric is especially useful when segmented by port, process, or protocol.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Port, Protocol, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.error.count

The number of errors encountered by network system calls, such as `connect()`, `send()`, and `recv()`. By default, this metric displays the total value for the defined scope. For example, if the scope is defined as a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.http.error.count

`net.http.error.count` is a heuristic metric.

The number of failed HTTP requests, determined by the total number of 4xx/5xx status codes.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.http.method

The HTTP request method.

Metric TypeGauge
Value TypeString
Segment Byhost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.http.request.count

`net.http.request.count` is a heuristic metric.

HTTP request count.

Metric TypeCounter
Value TypeInteger
Segment ByHost
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.http.request.time

`net.http.request.time` is a heuristic metric.

Average HTTP request time.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.http.request.time.worst

The maximum time for HTTP requests.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.http.statusCode

The HTTP response status code.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.http.url

The HTTP request URL.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

The number of bytes passing through the link from client to server.

Metric TypeCounter
Value TypeByte
Segment ByHost
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

Average delay in the network link per request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The number of bytes passing through the link from server to client.

Metric TypeCounter
Value TypeByte
Segment ByHost
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.local.endpoint

The local endpoint for a connection. This metric is resolved to a user-friendly host name, if available.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.local.service

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.mongodb.collection

The MongoDB collection.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.mongodb.error.count

`net.mongodb.error.count` is a heuristic metric.

The number of Failed MongoDB requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.mongodb.operation

The MongoDB operation.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.mongodb.request.count

`net.mongodb.request.count` is a heuristic metric.

The total number of MongoDB requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.mongodb.request.time

`net.mongodb.request.time` is a heuristic metric.

The average time to complete a MongoDB request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.mongodb.request.time.worst (deprecated)

The maximum time to complete a MongoDB request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.protocol

The network protocol of a request (for example, HTTP or MySQL).

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.remote.endpoint

The remote endpoint of a connection. This metric automatically resolves as a user-friendly host name, if available.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.remote.service

Service (port number) of a remote node.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.request.count

`net.request.count` is a heuristic metric.

Total number of network requests.

This value may exceed the sum of inbound and outbound requests, because this count includes requests over internal connections.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.count.in

`net.request.count.in` is a heuristic metric.

Number of inbound network requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.count.out

Number of outbound network requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time

`net.request.time` is a heuristic metric.

A measure of response time which includes app + network latency. For server side it is purely a measure of app latency. This is calculated by measuring when we see the arrival of the last request buffer to when we see the departure of the first response buffer.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.file (deprecated)

The amount of time for serving a request that is spent doing file I/O. See also net.request.time.net (network I/O time) and net.request.time.processing (CPU processing time).

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.file.percent

`net.request.time.file.percent` is a heuristic metric.

The percentage of time for serving a request that is spent doing file I/O. See also `net.request.time.net` (network I/O time) and `net.request.time.processing` (CPU processing time).

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.in

`net.request.time.in` is a heuristic metric.

Average time to serve an inbound request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.local (deprecated)

Average per request delay introduced by this node when it serves requests coming from the previous tiers. In other words, this is the time spent serving incoming requests minus the time spent waiting for outgoing requests to complete.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.local.percent

`net.request.time.local.percent` is a heuristic metric.

The percentage of time spent in the local node versus the next tiers, when serving requests that come from previous tiers.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.net (deprecated)

The amount of time for serving a request that is spent doing network I/O. See also `net.request.time.file` (file I/O time) and `net.request.time.processing` (CPU processing time).

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.net.percent

`net.request.time.net.percent` is a heuristic metric.

The percent of time for serving a request that is spent doing network I/O. See also `net.request.time.file` (file I/O time) and `net.request.time.processing` (CPU processing time).

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.nextTiers (deprecated)

Delay introduced by the successive tiers when serving requests.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.nextTiers.percent

`net.request.time.nextTiers.percent` is a heuristic metric.

The percentage of time spent in the next tiers versus the local node, when serving requests that come from previous tiers.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.out

`net.request.time.out` is a heuristic metric.

Average time spent waiting for an outbound request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.processing (deprecated)

The amount of time for serving a request that is spent doing CPU processing. See also `net.request.time.fil`e (file I/O time) and `net.request.time.net` (network I/O time).

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.processing.percent

`net.request.time.processing.percent` is a heuristic metric.

The percent of time for serving a request that is spent doing CPU processing. See also `net.request.time.file` (file I/O time) and `net.request.time.net` (network I/O time).

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.worst.in

`net.request.time.worst.in` is a heuristic metric.

Maximum time to serve an inbound request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.request.time.worst.out

`net.request.time.worst.out` is a heuristic metric.

Maximum time spent waiting for an outbound request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.role

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

net.server.ip

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

net.server.port

TCP/UDP Server port number.

Metric TypeGauge
Value TypeInteger
Segment ByHost
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.sql.error.count

`net.sql.error.count` is a heuristic metric.

The number of Failed SQL requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.sql.query

The full SQL query. If the query string is longer than 512 characters, it will be truncated to 512 characters.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.sql.query.type

The SQL query type (for example, `SELECT`, `INSERT`, or `DELETE`).

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.sql.request.count

`net.sql.request.count` is a heuristic metric.

The number of SQL requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.sql.request.time

`net.sql.request.time` is a heuristic metric.

Average time to complete an SQL request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.sql.request.time.worst (deprecated)

Maximum time to complete a SQL request.

Metric TypeCounter
Value TyperelativeTime
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## net.sql.table

The SQL query table name.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## net.tcp.queue.len

The length of the TCP request queue.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Process

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## fd.used.percent

The percentage of used file descriptors out of the maximum available. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

This metric should be monitored carefully, and used for alerts, as when a process reaches its file descriptor limit, the process will stop operating correctly, and potentially crash.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## proc.commandLine

Command line used to start the process.

Metric TypeGauge
Value TypeString
Segment ByProcess
Default Time AggregationN/a
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## proc.count

The number of processes on host or container, excluding any processes that do not have `.exe` or command line parameters in the process table. These processes typically are kernel or system level, and are typically identified by square brackets (for example, `[kthreadd]`).

As some processes are excluded, the host level `proc.count` value will be lower than the value reported by the `ps -ef` command on the host.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## proc.name

Name of the process.

Metric TypeGauge
Value TypeString
Segment ByProcess
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## proc.name.client

Name of the Client process.

Metric TypeGauge
Value TypeString
Segment ByProcess
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## proc.name.server

Name of the server process.

Metric TypeGauge
Value TypeString
Segment ByProcess
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## proc.start.count

Number of process starts on host or container.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

# RedisDB Metrics

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## redis.aof.buffer_length

The size of the AOF buffer.

## redis.aof.last_rewrite_time

The duration of the last AOF rewrite.

## redis.aof.rewrite

A flag indicating that a AOF rewrite operation is on-going.

## redis.clients.biggest_input_buf

The biggest input buffer among current client connections.

## redis.clients.blocked

The number of connections waiting on a blocking call.

## redis.clients.longest_output_list

The longest output list among current client connections.

## redis.command.calls

The number of times a redis command has been called. The commands are tagged with `command` (for example, `command:append`).

## redis.command.usec_per_call

The CPU time consumed per redis command call. The commands are tagged with `command` (for example, `command:append`).

## redis.cpu.sys

The system CPU consumed by the Redis server.

## redis.cpu.sys_children

The system CPU consumed by the background processes.

## redis.cpu.user

The user CPU consumed by the Redis server.

## redis.cpu.user_children

The user CPU consumed by the background processes.

## redis.expires

The number of keys that have expired.

## redis.expires.percent

The percentage of total keys that have been expired.

## redis.info.latency_ms

The latency of the redis INFO command.

## redis.key.length

The number of elements in a given key. Each element is tagged by `key` (for example, `key:mykeyname`).

## redis.keys

The total number of keys.

## redis.keys.evicted

The total number of keys evicted due to the maxmemory limit.

## redis.keys.expired

The total number of keys expired from the database.

## redis.mem.fragmentation_ratio

The ratio between `used_memory_rss` and `used_memory`.

## redis.mem.lua

The amount of memory used by the Lua engine.

## redis.mem.maxmemory

The maximum amount of memory allotted to the RedisDB system.

Sum of all the overheads allocated by Redis for managing its internal data structures.

Supported by Sysdig Agent v9.7.0 and above.

## redis.mem.peak

The peak amount of memory used by Redis.

## redis.mem.startup

Amount of memory consumed by Redis while initializing.

Supported by Sysdig Agent v9.7.0 and above.

The amount of memory that Redis allocated as seen by the operating system.

## redis.mem.used

The amount of memory allocated by Redis.

## redis.net.clients

The number of connected clients (excluding slaves).

## redis.net.commands

The number of commands processed by the server.

## redis.net.commands.instantaneous_ops_per_sec

The number of commands processed by the server per second.

## redis.net.rejected

The number of rejected connections.

## redis.net.slaves

The number of connected slaves.

## redis.perf.latest_fork_usec

The duration of the latest fork.

## redis.persist

The number of keys persisted. The formula for this metric is `redis.keys` - `redis.expires`.

## redis.persist.percent

Percentage of total keys that are persisted.

## redis.pubsub.channels

The number of active pubsub channels.

## redis.pubsub.patterns

The number of active pubsub patterns.

## redis.rdb.bgsave

Determines whether a bgsave is in progress. The value is one if a bgsave is in progress, and zero at all other times.

## redis.rdb.changes_since_last

The number of changes since the last background save.

## redis.rdb.last_bgsave_time

The duration of the last `bg_save` operation.

## redis.replication.backlog_histlen

The amount of data in the backlog sync buffer.

## redis.replication.delay

The replication delay in offsets.

## redis.replication.last_io_seconds_ago

The amount of time since the last interaction with master.

The amount of time that the master link has been down.

## redis.replication.master_repl_offset

The replication offset reported by the master.

## redis.replication.slave_repl_offset

The replication offset reported by the slave.

## redis.replication.sync

Determines whether a sync is in progress. The value is one if a sync is in progress, and zero at all other times.

## redis.replication.sync_left_bytes

The amount of data left before syncing is complete.

## redis.slowlog.micros.95percentile

The 95th percentile of the duration of queries reported in the slow log.

## redis.slowlog.micros.avg

The average duration of queries reported in the slow log.

## redis.slowlog.micros.count

The rate of queries reported in the slow log.

## redis.slowlog.micros.max

The maximum duration of queries reported in the slow log.

## redis.slowlog.micros.median

The median duration of queries reported in the slow log.

## redis.stats.keyspace_hits

The total number of successful lookups in the database.

## redis.stats.keyspace_misses

The total number of missed lookups in the database.

# Security Policy Metrics

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

Metrics

Description

Type

Segmented by

Minimum Agent Version

`security.evts.k8s_audit`

The total number of policy events from a Kubernetes audit policy.

Gauge

`host.mac`

`host.hostname`

0.86.0

`security.policy_evts.syscall`

The total number of policy events from a syscall policy.

`security.policies.enabled`

The number of security policies enabled for a user.

`security.policies.total`

The number of security policies that exist for a user.

`security.policy_evts.container`

The total number of policy events from a container policy.

`security.policy_evts.falco`

The total number of policy events from a Falco policy.

`security.policy_evts.filesystem`

The total number of policy events from a filesystem policy.

`security.policy_evts.high`

The number of policy events from a policy with high severity.

`security.policy_evts.low`

The number of policy events from a policy with low severity.

`security.policy_evts.medium`

The number of policy events from a policy with medium severity.

`security.policy_evts.network`

The total number of policy events from a network policy.

`security.policy_evts.process`

The total number of policy events from a process policy.

`security.policy_evts.total`

The total number of policy events across all policy types.

`security_policy_evts.by_name`

The number of events triggered with segment `name` available.

`name`

`host.mac`

`host.hostname`

# System

Note: Sysdig follows the Prometheus-compabtible naming convention for both metrics and labels as opposed to the previous statsd-compatible, legacy Sysdig naming convention. However, this page still shows metrics in the legacy Sysdig naming convention. Until this page is updated, see Metrics and Label Mapping for the mapping between Sysdig legacy and Prometheus naming conventions.

## capacity.estimated.request.stolen.count (deprecated)

The number of requests the node cannot serve due to CPU steal time. This metric is calculated by measuring the current number of requests the machine is serving, and calculating how many more requests could be served if there was no steal time.

This metric can be used to understand how steal time impacts the ability to serve user requests.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Process
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## capacity.estimated.request.total.count (deprecated)

The estimated number of requests the node serves at full capacity. This metric is calculated by measuring the number of requests that a machine is serving, and the resources each request is using, and combining the values to project how many requests the machine can serve.

This metric can help users determine if/when the infrastructure capacity should be increased.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Process
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## capacity.stolen.percent (deprecated)

The lost service request capacity due to stolen CPU. This metric reflects the impact on other resource usage capabilities, including disk I/O and network I/O.

capacity.stolen.percent is non-zero only if cpu.stolen.percent is also non-zero.

Metric TypeGauge
Value Type%
Segment ByHost, Process
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## capacity.total.percent (deprecated)

The estimated current capacity usage, based on CPU and disk/network utilization, with CPU stolen time added back in.

`capacity.total.percent` can be used to show how the system would perform with dedicated CPU usage.

Metric TypeGauge
Value Type%
Segment ByHost, Process
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## capacity.used.percent (deprecated)

The estimated current capacity usage, based on CPU and disk/network utilization. This metric is calculated by adding the value of how many resources each request coming to the machine is using, creating a score that indicates how saturates the machine resources are.

Metric TypeGauge
Value Type%
Segment ByHost, Process
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.idle.percent

The percentage of time that the CPU/s were idle and the system did not have an outstanding disk I/O request. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.iowait.percent

The percentage of time that the CPU/s were idle during which the system had an outstanding disk I/O request. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.nice.percent

The percentage of CPU utilization that occurred while executing at the user level with `Nice` priority. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.stolen.percent

Measures the percentage of time that a virtual machine’s CPU is in a state of involuntary wait due to the fact that the physical CPU is shared among virtual machines. In calculating steal time, the operating system kernel detects when it has work available but does not have access to the physical CPU to perform that work.

If the percent of steal time is consistently high, you may want to stop and restart the instance (since it will most likely start on different physical hardware) or upgrade to a virtual machine with more CPU power. Also see `capacity.total.percent` to see how steal time directly impacts the number of server requests that could not be handled. On AWS EC2, steal time does not depend on the activity of other virtual machine neighbors. EC2 is simply making sure your instance is not using more CPU cycles than paid for.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.system.percent

The percentage of CPU utilization that occurred while executing at the system level (kernel). By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.cores.used

The CPU core usage of each container is obtained from cgroups, and is equal to the number of cores used by the container. For example, if a container uses two of an available four cores, the value of `cpu.cores.used` will be two.

Metric TypeGauge
Value TypeNumber
Segment ByHost, Container
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max, RateofChange
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.cores.used.percent

The CPU core usage percent for each container is obtained from cgroups, and is equal to the number of cores multiplied by 100. For example, if a container uses three cores, the value of `cpu.cores.used.percent` would be 300%.

This metric is comparable to the CPU usage metric in docker stats.

Metric TypeGauge
Value Type%
Segment ByHost, Container
Default Time AggregationAverage
Available Time Aggregation FormatsAverage, Rate, Sum, Min, Max, rateOfChange
Default Group AggregationAverage
Available Group Aggregation FormatsAverage, Sum, Min, Max

## cpu.used.percent

### Containers

The CPU usage for each container is obtained from cgroups, and normalized by dividing by the number of cores to determine an overall percentage.

For example, if the environment contains six cores on a host, and the container or processes are assigned two cores, Sysdig will report CPU usage as:

``````2/6 * 100% = 33.33%
``````

By comparison, the docker stats command would report the CPU usage as 200%, as each individual core is assigned a value of 100%.

For service or orchestrator constructs, the container CPU is aggregated based on container labels.

### Hosts

The CPU usage for each host is obtained from /proc, and measured as the sum of the CPU usage of all cores, normalized by dividing by the number of cores.

The CPU usage for each host is the sum of cpu.user.percent, cpu.nice.percent, cpu.stolen.percent, and cpu.system.percent.

The Linux command top can be used to review these values as well.

### Processes

The CPU usage for each process is obtained from /proc, and normalized by dividing by the number of cores.

When cpu.used.percent is segmented by process at the host level, the sum of the CPU usage of each process may not always add up to the CPU usage of the host. The most common reasons for this are:

• There are short-lived processes that spike for less than two seconds.

• Granular data is retained for the highest CPU usage processes, rather than all processes.

• The kernel thread CPU usage is not reported as a process.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## cpu.user.percent

The percentage of CPU utilization that occurred while executing at the user level (application). By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.bytes.free

Available filesystem space.

Metric TypeGauge
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.bytes.total

Total filesystem size.

Metric TypeGauge
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.bytes.used

Used filesystem space.

Metric TypeGauge
Value TypeByte
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.device

Filesystem device.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## fs.free.percent

The percentage of free filesystem space.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.inodes.total.count

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.inodes.used.count

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.inodes.used.percent

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.largest.used.percent

The percentage of filesystem space used by the largest filesystem.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.mountDir

The filesystem mount directory.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## fs.root.used.percent

The percentage of root filesystem space used.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## fs.type

Filesystem type.

Metric TypeGauge
Value TypeString
Segment ByHost
Default Time AggregationN/A
Available Time Aggregation FormatsN/A
Default Group AggregationN/A
Available Group Aggregation FormatsN/A

## fs.used.percent

The amount of space written by a single container instance. This value is provided by the container engine and is not supported for some versions of CRIO. For example, CRIO-1.15 which is used in Openshift 4.2. `crictl stats` not showing the size indicates that this feature is not supported.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## host.error.count

The number of system call errors. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

The 15 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 15 minutes for all cores. The value should correspond to the third (and last) load average value displayed by the ‘uptime’ command.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The 1 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 1 minute for all cores. The value should correspond to the third (and last) load average value displayed by the ‘uptime’ command.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The 5 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 5 minutes for all cores. The value should correspond to the third (and last) load average value displayed by the ‘uptime’ command.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The 15 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 15 minutes, divided by number of system CPUs.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The 1 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 1 minute, divided by number of system CPUs.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

The 5 minute system load average represents the average number of jobs in (1) the CPU run queue or (2) waiting for disk I/O averaged over 5 minutes, divided by number of system CPUs.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.bytes.available

The amount of available memory. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

An estimate of how much memory is available for starting new applications, without swapping.

`memory.bytes.available` may not be directly available on older systems using kernel versions older than 3.14. In these instances, the metric is an approximate value, determined by adding the free and cached memory values.

Metric TypeGauge
Value TypeByte
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.bytes.total

The total memory of a host, in bytes. This value is obtained from /proc. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value TypeByte
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.bytes.used

The amount of physical memory currently in use. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

The formula for determining `memory.bytes.used` is slightly different depending on whether you are examining processes or containers. For containers, the formula is `rss+cache-inactive_file`. This means that the total amount of page cache memory (inactive_file) is subtracted from the total number of bytes of page cache memory, and the total number of bytes of anonymous and swap cache memory, combined.

This is different to the `docker stats` approach, and may result in different results.

For processes, the formula is the total value of the size of the resident anonymous memory, the size of the resident file mappings, and the size of the resident shared memory.

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.bytes.virtual

The virtual memory size of the process, in bytes. This value is obtained from Sysdig events. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value TypeByte
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.pageFault.major

A count of the condition that occurs when a program accesses a memory page that is mapped in the virtual address space, but not loaded in physical memory. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

A major or ‘hard’ page fault is handled by using a disk I/O operation (e.g., memory mapped file or page replacement causing a page swapping). For instance, when starting an application, the Linux kernel will search physical memory and the CPU cache, and, if data does not exist, a major page fault occurs. Generally, adjusting application source code or making more physical memory available reduces major page faults.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.pageFault.minor

A count of the condition in which a memory page had been loaded in memory at the time the page fault was generated, but was not marked in the memory management unit as being loaded in memory. By default, this metric displays the total value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the total value for the whole group.

If the page is loaded in memory at the time the fault is generated, but is not marked in the memory management unit as being loaded in memory, then it is called a minor or ‘soft’ page fault. A minor page fault is handled without using a disk I/O operation (e.g., allocated by malloc().). The effect of minor page faults depends on system load and other factors, but are typically short and have very little impact.

Metric TypeCounter
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationSum
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.swap.bytes.available

The swap memory available. This metric is determined by the sum of the free and cached swap memory. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value TypeByte
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.swap.bytes.total

The total amount of swap memory. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value TypeByte
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.swap.bytes.used

The amount of swap memory used. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value TypeByte
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.swap.used.percent

The percentage of swap memory used. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Metric TypeGauge
Value Type%
Segment ByHost, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## memory.used.percent

The percentage of physical memory in use. By default, this metric displays the average value for the defined scope. For example, if the scope is set to a group of machines, the metric value will be the average value for the whole group.

Refer to `memory.bytes.used` for information on the calculation formulas.

Metric TypeGauge
Value Type%
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationAverage
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## system.uptime

The system uptime.

Metric TypeGauge
Value TypeInteger
Segment ByHost, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

## uptime

The percentage of time one or more selected entities were up over the defined time window.

While this metric is a percentage value, the value is presented as an integer between 0 and 1, rather than a percentage between 0% and 100%.

Metric TypeGauge
Value TypeInteger
Segment ByHost, Container, Process, Kubernetes, Mesos, Swarm, CloudProvider
Default Time AggregationRate
Available Time Aggregation FormatsAvg, Rate, Sum, Min, Max
Default Group AggregationAverage
Available Group Aggregation FormatsAvg, Sum, Min, Max

# Heuristic and Deprecated Metrics

## Heuristic Metrics

Various network-related metrics reported by Sysdig, including response times, are calculated at the kernel level by measuring latency between systems calls. In an effort to ensure Sysdig remains the trusted source of infrastructure insights, moving forward we will be labeling some network related metrics as heuristic and are tagging with the symbol in the application.

Existing alerts using these metrics will not be modified or disabled. However, these alerts will not be able to be updated.

Additional heuristic metric details are listed below:

net.http.request.timeYes
net.http.request.countYes
net.http.error.countYes
net.sql.request.timeYes
net.sql.request.countYes
net.sql.error.countYes
net.mongodb.request.timeYes
net.mongodb.request.countYes
net.mongodb.error.countYes
net.request.time.file.percentYes
net.request.time.local.percentYes
net.request.time.net.percentYes
net.request.time.nextTiers.percentYes
net.request.time.processing.percentYes
net.request.timeNo
net.request.time.inNo
net.request.time.outNo
net.request.time.worst.inNo
net.request.time.worst.outNo
net.request.countNo
net.request.count.inNo

## Deprecated Metrics:

Based on low usage patterns, Sysdig has decided to deprecate the following metrics on August 1, 2018. Users will continue to have the ability to collect similar data using Prometheus, or another method of code instrumentation (i.e. StatsD or JMX for Java applications).

The table below shows the current metrics and options for similar functionality.

Current MetricAlternative Starting August 1, 2018
capacity.estimated.request.stolen.countCreate your application metrics using Prometheus, StatsD or JMX for Java applications.
capacity.estimated.request.total.count
capacity.stolen.percent
capacity.total.percent
capacity.used.percent
net.request.time.file
net.request.time.local
net.request.time.net
net.request.time.nextTiers
net.request.time.processing
net.sql.request.time.worstMax aggregation (net.sql.request.time)
net.mongodb.request.time.worstMax aggregation (net.mongodb.request.time)
net.http.request.time.worstMax aggregation (net.http.request.time)